audio-enhancement-engine

Security checks across malware telemetry and agentic risk

Overview

This appears to be a real audio enhancement skill, but running it can automatically download code and modify Python environments beyond what the skill clearly discloses.

Install only if you are comfortable with first-run network downloads and Python environment changes. Prefer running it in an isolated container or disposable virtual environment, review and pin dependency sources first, and use it only on audio files or folders you explicitly intend to process.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Behavioral ASTexec() Call, eval() Call, Dynamic Import
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Findings (33)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
"""专门为老项目(使用 pkg_resources 的 setup.py)修复 setuptools 版本"""
    logger.info("🔧 正在修复 setuptools 版本(兼容旧 GitHub 包构建)...")
    try:
        subprocess.check_call([
            sys.executable, "-m", "pip", "install",
            "--quiet", "--force-reinstall", "setuptools<=81.2.0", "wheel"
        ])
Confidence
95% confidence
Finding
subprocess.check_call([ sys.executable, "-m", "pip", "install", "--quiet", "--force-reinstall", "setuptools<=81.2.0", "wheel" ])

subprocess module call

Medium
Category
Dangerous Code Execution
Content
cmd.extend(["-i", "https://pypi.tuna.tsinghua.edu.cn/simple"])

    try:
        subprocess.check_call(cmd)
        logger.info(f"✅ {spec} 安装/升级完成!")
        
    except subprocess.CalledProcessError as e:
Confidence
98% confidence
Finding
subprocess.check_call(cmd)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
logger.warning(f"🔧 正在安装 {install_str} ...")

    try:
        subprocess.check_call([
            sys.executable, "-m", "pip", "install",
            install_str,
            "-i", "https://pypi.tuna.tsinghua.edu.cn/simple",
Confidence
97% confidence
Finding
subprocess.check_call([ sys.executable, "-m", "pip", "install", install_str, "-i", "https://pypi.tuna.tsinghua.edu.cn/simple", "--quiet"

subprocess module call

Medium
Category
Dangerous Code Execution
Content
sys.executable, "-m", "pip", "install",
                        "--upgrade", fallback_zip, "--quiet"
                    ]
                    subprocess.check_call(cmd_fallback)
                    logger.info(f"✅ 使用本地包 {fallback_zip} 安装成功!")
                    return
                except subprocess.CalledProcessError as e2:
Confidence
96% confidence
Finding
subprocess.check_call(cmd_fallback)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
logger.info("虚拟环境创建成功")

        logger.info("正在升级 pip...")
        subprocess.check_call([str(venv_python), "-m", "pip", "install", "--upgrade", "pip"])

    # ==================== 检查 PyTorch GPU 是否已安装 ====================
    if Path(venv_python).exists() and is_torch_gpu_installed(venv_python):
Confidence
94% confidence
Finding
subprocess.check_call([str(venv_python), "-m", "pip", "install", "--upgrade", "pip"])

subprocess module call

Medium
Category
Dangerous Code Execution
Content
# 安装 PyTorch
        logger.info("正在安装 PyTorch(~2-3GB,请耐心等待)...")
        subprocess.check_call([
            str(venv_python), "-m", "pip", "install", "torch", "torchvision", "torchaudio",
            "--index-url", index_url
        ])
Confidence
95% confidence
Finding
subprocess.check_call([ str(venv_python), "-m", "pip", "install", "torch", "torchvision", "torchaudio", "--index-url", index_url ])

subprocess module call

Medium
Category
Dangerous Code Execution
Content
logger.info("安装 audio-separator CPU 版 + librosa...")
            subprocess.check_call([str(venv_python), "-m", "pip", "install", "audio-separator[cpu]", "librosa"])

        subprocess.check_call([str(venv_python), "-m", "pip", "install", "pydub"])
        subprocess.check_call([str(venv_python), "-m", "pip", "install", "huggingface-hub[tqdm]"])
        
        logger.info("✅ 虚拟环境及所有依赖安装完成!")
Confidence
90% confidence
Finding
subprocess.check_call([str(venv_python), "-m", "pip", "install", "pydub"])

subprocess module call

Medium
Category
Dangerous Code Execution
Content
subprocess.check_call([str(venv_python), "-m", "pip", "install", "audio-separator[cpu]", "librosa"])

        subprocess.check_call([str(venv_python), "-m", "pip", "install", "pydub"])
        subprocess.check_call([str(venv_python), "-m", "pip", "install", "huggingface-hub[tqdm]"])
        
        logger.info("✅ 虚拟环境及所有依赖安装完成!")
Confidence
92% confidence
Finding
subprocess.check_call([str(venv_python), "-m", "pip", "install", "huggingface-hub[tqdm]"])

subprocess module call

Medium
Category
Dangerous Code Execution
Content
# 安装 audio-separator + librosa(你提到的)
        if use_gpu:
            logger.info("安装 audio-separator GPU 版 + librosa...")
            subprocess.check_call([str(venv_python), "-m", "pip", "install", "audio-separator[gpu]", "librosa"])
        else:
            logger.info("安装 audio-separator CPU 版 + librosa...")
            subprocess.check_call([str(venv_python), "-m", "pip", "install", "audio-separator[cpu]", "librosa"])
Confidence
95% confidence
Finding
subprocess.check_call([str(venv_python), "-m", "pip", "install", "audio-separator[gpu]", "librosa"])

subprocess module call

Medium
Category
Dangerous Code Execution
Content
subprocess.check_call([str(venv_python), "-m", "pip", "install", "audio-separator[gpu]", "librosa"])
        else:
            logger.info("安装 audio-separator CPU 版 + librosa...")
            subprocess.check_call([str(venv_python), "-m", "pip", "install", "audio-separator[cpu]", "librosa"])

        subprocess.check_call([str(venv_python), "-m", "pip", "install", "pydub"])
        subprocess.check_call([str(venv_python), "-m", "pip", "install", "huggingface-hub[tqdm]"])
Confidence
95% confidence
Finding
subprocess.check_call([str(venv_python), "-m", "pip", "install", "audio-separator[cpu]", "librosa"])

Lp3

Medium
Category
MCP Least Privilege
Confidence
91% confidence
Finding
The skill declares itself as a local audio-processing tool but appears to invoke shell, inspect environment state, and use networked installation flows without declaring permissions. This creates a trust-boundary violation: invoking the skill may trigger code execution and dependency retrieval far beyond simple file processing, increasing supply-chain and host-impact risk.

Tp4

High
Category
MCP Tool Poisoning
Confidence
97% confidence
Finding
This is a substantial description-behavior mismatch: the skill claims to only process local audio, but the analyzed behavior includes virtualenv creation, GPU/CUDA probing, downloading packages from pip/GitHub, and executing upgrade scripts. Such hidden setup and remote code acquisition materially expand the attack surface and can lead to unreviewed code execution on the host.

Description-Behavior Mismatch

Medium
Confidence
92% confidence
Finding
The README states that VoiceFixer and AudioSR will automatically download models on first run, which contradicts the skill metadata's claim that it runs 'completely locally.' This hidden network behavior can violate user expectations, create supply-chain risk, and bypass environments that only permit offline/local processing.

Description-Behavior Mismatch

Medium
Confidence
98% confidence
Finding
The script advertises fully local audio processing, yet it performs a runtime install from a remote GitHub repository. Pulling and executing code from an external URL during normal operation creates a supply-chain risk: a compromised repo, tag, branch, or transport path could result in arbitrary code execution on the user's machine.

Context-Inappropriate Capability

Medium
Confidence
97% confidence
Finding
Automatically installing packages from PyPI and GitHub at runtime gives the skill unnecessary code-fetching and code-execution capability unrelated to processing a provided audio file. Any dependency compromise, typosquatting, or malicious update could execute arbitrary installer or import-time code in the host environment.

Description-Behavior Mismatch

High
Confidence
94% confidence
Finding
The file presents itself as part of an audio-enhancement skill, but its actual behavior is a generalized package installer. This mismatch is dangerous because users and reviewers may authorize an audio-processing capability while unknowingly granting code that changes the Python environment and installs arbitrary software.

Context-Inappropriate Capability

High
Confidence
99% confidence
Finding
This module provides arbitrary installation from PyPI, git repositories, and local zip/wheel files, which is not justified by an audio-only enhancement skill. In context, this is more dangerous because the declared skill domain would not reasonably require broad software installation capabilities, increasing the likelihood of hidden supply-chain or execution abuse.

Context-Inappropriate Capability

High
Confidence
99% confidence
Finding
The module changes the host environment on import by force-reinstalling core packaging components. Import-time side effects are especially dangerous because simply loading the skill triggers system modification, making review, containment, and user consent much harder.

Intent-Code Divergence

Medium
Confidence
90% confidence
Finding
The documentation and code behavior are inconsistent: the comments describe an audio-enhancement skill, while the implementation is a dependency installer. This discrepancy undermines transparency and can conceal risky behavior from users, auditors, and automated policy controls.

Context-Inappropriate Capability

Medium
Confidence
97% confidence
Finding
This is a valid scope-creep finding: the bootstrap installs audio-separator, which introduces source-separation functionality not described in the manifest focused on enhancement, denoising, and super-resolution. In a security review, hidden or unnecessary capabilities matter because they increase attack surface and can violate user expectations about what the skill can do.

Description-Behavior Mismatch

High
Confidence
98% confidence
Finding
This skill is declared as a local audio-enhancement tool, yet the script performs network downloads, package bootstrapping, and forceful PyTorch installation into a Python environment. That behavior materially exceeds the skill's stated purpose and creates supply-chain and environment-integrity risk, especially because it fetches and installs large binary artifacts from the network without any hash or signature verification.

Context-Inappropriate Capability

High
Confidence
97% confidence
Finding
The script traverses parent directories to locate a venv, searches /usr/local for shared libraries, changes LD_LIBRARY_PATH, uninstalls existing torch packages, and reinstalls replacements. For an audio-enhancement skill, these host-level discovery and mutation actions are unnecessary and dangerous because they can alter unrelated software environments and make execution depend on attacker-influenced filesystem state.

Intent-Code Divergence

Medium
Confidence
84% confidence
Finding
The script logs that it is a 'PyTorch 2.11.0 CUDA 13.0' downloader while the actual wheel URLs target torch 2.9.1/cu129. This inconsistency is a security-relevant trust issue because it obscures what software is really being installed, making operator review and incident response harder and increasing the chance that unexpected binaries are accepted.

Description-Behavior Mismatch

High
Confidence
98% confidence
Finding
The skill advertises itself as running completely locally, but it performs runtime dependency installation from PyPI and GitHub and may also trigger model downloads on first use. This creates a supply-chain and unexpected network-execution risk: remote code or artifacts can be fetched and executed in the user's environment without prior review, which is especially dangerous for a skill whose expected capability is only local audio processing.

Context-Inappropriate Capability

High
Confidence
96% confidence
Finding
Installing packages at runtime from external sources is an unnecessary capability expansion for an audio enhancement skill, because it allows arbitrary third-party code to enter the execution environment at the moment the skill runs. The direct install from a GitHub repository further increases risk because it bypasses normal review expectations and can change over time, enabling supply-chain compromise or silent behavior drift.

VirusTotal

67/67 vendors flagged this skill as clean.

View on VirusTotal