LangSmith CLI

Security checks across malware telemetry and agentic risk

Overview

This is a disclosed LangSmith trace-analysis CLI, but users should treat its trace output as potentially sensitive.

Install only if you intend for your agent and terminal session to inspect LangSmith traces. Use a least-privilege LangSmith API key, keep project/time-window limits narrow, avoid running it in shared logs or transcripts, and prefer session-only environment variables or a secret manager over putting the key in ~/.zshrc.

SkillSpector

By NVIDIA

Vulnerability Patterns

Rogue AgentSelf-Modification, Session Persistence
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands

Findings (5)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 85% confidence
Finding: The skill declares sensitive capabilities in metadata requirements (environment secret access and outbound network use) but does not present an explicit permission model in the skill definition. That creates a governance gap: users or orchestrators may invoke a networked, secret-consuming skill without clear consent boundaries, increasing the chance of unintended secret use or data egress.

Description-Behavior Mismatch

Low

Confidence: 88% confidence
Finding: The prompt-diff command retrieves arbitrary runs by ID and prints system prompts and outputs verbatim, which can expose secrets, internal instructions, PII, or proprietary prompt logic contained in trace data. In the context of a trace-analysis skill, access to prompts/outputs is plausible, but the unrestricted raw disclosure makes this more dangerous because LangSmith traces often contain sensitive application and user content.

Vague Triggers

Medium

Confidence: 79% confidence
Finding: The trigger phrases are broad enough to match ordinary troubleshooting or analysis requests, which can cause the skill to activate unexpectedly. Because the skill can access an API key and fetch trace data, accidental invocation could expose sensitive operational data to the agent context when the user did not clearly intend to use LangSmith.

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The ask and replay flows print trace-derived inputs, outputs, and metadata directly to stdout without warning or redaction. Because observability traces commonly contain API keys, user data, prompts, and model outputs, this can lead to unintended disclosure in terminals, logs, transcripts, or downstream agent context.

Session Persistence

Medium

Category: Rogue Agent
Content: ## Auth Setup ```bash export LANGSMITH_API_KEY=<your-key> # or add to ~/.zshrc ``` Test with: `python3 scripts/langsmith.py runs <project> --limit 3`
Confidence: 88% confidence
Finding: add to ~/.zshrc

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal