Senior Data Scientist
v2.1.1World-class senior data scientist skill specialising in statistical modeling, experiment design, causal inference, and predictive analytics. Covers A/B testi...
⭐ 6· 3.2k·21 current·23 all-time
byAlireza Rezvani@alirezarezvani
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
OpenClaw
Benign
medium confidencePurpose & Capability
Name/description (A/B test design, feature engineering, model evaluation) match the included guidance, reference docs, and Python helper scripts. The provided scripts and checklists align with the stated data-science purpose.
Instruction Scope
SKILL.md contains code examples and checklists that stay within the domain of experiment design and ML pipelines. It references use of Python, R, SQL and MLflow; however, the distributed code files are Python-only and do not implement MLflow or R/SQL-specific behavior. Instructions do not request or access system secrets or external endpoints directly.
Install Mechanism
No install spec (instruction-only) and included scripts are plain Python files. There is no download-from-URL or package installation specified by the skill itself, which minimizes install-time risk. The code does reference third-party Python libraries (scikit-learn, xgboost, mlflow) but the skill does not attempt to install them.
Credentials
The skill declares no required environment variables or credentials, which is proportional to the static code (no network/auth usage). One caveat: SKILL.md mentions MLflow (which in real use often requires a tracking URI/credentials) and R/SQL usage, but the package does not declare or request any MLflow or database credentials—users will need to provide these if they integrate tracking or data sources themselves.
Persistence & Privilege
Skill is not always-enabled and is user-invocable; it does not request elevated privileges or modify other skills/config. The scripts are simple command-line tools that read input/output paths provided at runtime; they do not install persistent agents or write system-wide configuration.
Assessment
This skill appears to be a legitimate senior data-science helper, but review a few practical points before installing or running it:
- Review the included Python scripts: they are simple, well-logged stubs whose _execute() functions currently return success and contain placeholders rather than full production logic. Expect to supply your own implementation or data-processing logic.
- Dependencies: the docs mention NumPy, Pandas, Scikit-learn, XGBoost and MLflow. The skill does not install these; ensure your environment has the required packages before using the code.
- MLflow & external endpoints: SKILL.md mentions MLflow tracking. If you enable MLflow tracking in your workflow, you will need to configure tracking URIs and any server credentials yourself — that could send metrics to an external server, so only point it at a trusted tracking server.
- R/SQL references: the documentation mentions R and SQL use-cases, but there are no R scripts or SQL connectors included. If you expect R/DB integration, you will need to supply those components and any database credentials locally.
- File access: the scripts accept --input and --output paths and will read whatever files you point them at. Run them in a controlled environment with data you trust.
If you need stronger assurance, ask the author for: (1) a complete implementation (not placeholder stubs), (2) a list of required Python package versions, and (3) explicit instructions for MLflow/database configuration and any network endpoints the skill will talk to.Like a lobster shell, security has layers — review code before you run it.
latestvk973cg9qtkg68j656mm15asb7582kch3
License
MIT-0
Free to use, modify, and redistribute. No attribution required.
