Aws Emr Skill

Security checks across malware telemetry and agentic risk

Overview

The skill is mostly a coherent AWS EMR management tool, but one result-retrieval path can return the newest shared S3 result instead of the requested job's result.

Install only if you are comfortable granting the skill AWS permissions for EMR and S3 operations. Use a least-privilege AWS role, avoid shared S3 result prefixes for sensitive workloads, and treat `get_job_result` as unsafe in multi-user or concurrent-job environments until results are bound to the specific job run ID.

SkillSpector

By NVIDIA

Vulnerability Patterns

MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (3)

Intent-Code Divergence

Low

Confidence: 97% confidence
Finding: The file defines a `_mask_secrets` helper specifically intended to redact credential-like values from log text, but `get_step_log()` and `_read_s3_gzip_log()` return raw log lines without invoking it. EMR step logs commonly contain command lines, Spark/Hive configs, environment variables, stack traces, and accidental credential material, so returning them unredacted can expose secrets to downstream users or agents.

Intent-Code Divergence

Medium

Confidence: 97% confidence
Finding: This is a true vulnerability: the module defines a `_mask_secrets` helper and documents secret masking behavior, but `get_job_log` and `_read_s3_gzip_log` return raw log lines without applying any masking. EMR/Spark driver logs commonly contain credentials, tokens, JDBC URLs, command-line arguments, environment values, and stack traces, so exposing unredacted logs through the skill can leak sensitive data to users or downstream systems.

Intent-Code Divergence

High

Confidence: 98% confidence
Finding: `get_job_result` checks that the requested `job_run_id` succeeded, but then ignores that identifier when reading from S3 and instead returns the newest `part-00000` object under the shared `skill-results/` prefix. In a multi-user or multi-job environment, this can cause result confusion or unauthorized disclosure by returning another job's output, which is especially dangerous because this skill is explicitly built to run EMR SQL/Hive/Spark jobs that may process sensitive data.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal