Spark Engineer
v0.1.0
Use when building Apache Spark applications or distributed data processing pipelines, or when optimizing big data workloads. Invoke for the DataFrame API, Spark SQL, RDD operations, performance tuning, and streaming analytics.
⭐ 1 · 1.8k · 4 current · 4 all-time
by Veera (@veeramanikandanr48)
MIT-0
License: MIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
OpenClaw
Benign
high confidence
Purpose & Capability
Name/description match the content: all required files and instructions are Spark-focused (DataFrame API, RDDs, partitioning, tuning, streaming). No unrelated binaries, environment variables, or external services are declared as required.
Instruction Scope
SKILL.md and reference files contain only Spark code examples, configuration recommendations, and monitoring guidance. They reference typical cluster endpoints and storage (S3, HDFS, Kafka) as examples for normal Spark usage, but do not instruct the agent to read local system secrets/configuration or to exfiltrate data to unexpected endpoints.
Install Mechanism
No install spec or code files with executable install steps are present — this is instruction-only, so nothing is downloaded or written to disk by the skill itself.
Credentials
The skill declares no required environment variables or credentials. Example snippets show connecting to typical data systems (S3, Kafka, HDFS) which would need credentials when actually run, but the skill itself does not request or embed secrets.
Persistence & Privilege
Skill is not always-included, does not request persistent privileges, and is user-invocable only. There is no behavior that modifies other skills or global agent settings.
Assessment
This skill is an offline reference and looks internally consistent with its Spark-focused purpose. Before running any provided code in your environment: 1) review and supply only the credentials your cluster/storage requires (the skill does not request any itself), 2) avoid running example collect() calls or large broadcasts on production data without safeguards, and 3) inspect any mapPartitions/foreachPartition code that opens external DB/HTTP connections to ensure it uses approved endpoints and secure credentials. If you plan to let an agent execute code from this skill automatically, ensure the agent does not have unrestricted access to production cluster credentials or sensitive storage buckets.
Like a lobster shell, security has layers: review code before you run it.
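One possible form of the collect() safeguard mentioned in the assessment is a row cap applied before results reach the driver. The sketch below is illustrative only and is not part of the skill: `safe_collect` and `max_rows` are hypothetical names, and `df` is assumed to be a PySpark DataFrame (any object exposing `limit(n)` and `collect()` behaves the same way).

```python
def safe_collect(df, max_rows=10_000):
    """Collect at most max_rows rows to the driver, or raise if df has more.

    Assumption: df is a pyspark.sql.DataFrame (only limit() and collect()
    are used). safe_collect and max_rows are hypothetical helper names,
    not part of Spark or of the skill being reviewed.
    """
    if max_rows < 0:
        raise ValueError("max_rows must be non-negative")
    # Request one extra row so an oversized result can be detected
    # without pulling the whole dataset onto the driver.
    rows = df.limit(max_rows + 1).collect()
    if len(rows) > max_rows:
        raise RuntimeError(
            f"result exceeds {max_rows} rows; filter, aggregate, "
            "or write to storage instead of collecting"
        )
    return rows
```

Calling `safe_collect(df, max_rows=1000)` in place of `df.collect()` fails fast when a result is larger than expected. The threshold is an arbitrary default and would need tuning to the driver's memory.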
latest: vk977rafqqm0crkyx8mq31tsetd80876s
