arXiv Master Search

Security checks across malware telemetry and agentic risk

Overview

This appears to be a coherent arXiv research helper with expected network and PDF-processing dependencies, but its dependency hygiene needs attention.

Install only in an environment where you are comfortable processing remote PDFs, and prefer a version that pins and updates dependencies such as requests, urllib3, PyYAML, PyPDF2, and pdfminer.six. Treat this as dependency risk to manage, not as evidence that the skill is malicious.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Findings (18)

Unpinned Dependencies

Low
Category
Supply Chain
Content
# arXiv Search Skill Dependencies

# 核心 arXiv API
arxiv>=1.4.8

# HTTP 请求
requests>=2.28.0
Confidence
91% confidence
Finding
arxiv>=1.4.8

Unpinned Dependencies

Low
Category
Supply Chain
Content
arxiv>=1.4.8

# HTTP 请求
requests>=2.28.0
urllib3>=1.26.0

# 数据处理
Confidence
93% confidence
Finding
requests>=2.28.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
# HTTP 请求
requests>=2.28.0
urllib3>=1.26.0

# 数据处理
pandas>=1.5.0
Confidence
93% confidence
Finding
urllib3>=1.26.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
urllib3>=1.26.0

# 数据处理
pandas>=1.5.0
numpy>=1.23.0

# YAML 配置
Confidence
86% confidence
Finding
pandas>=1.5.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
# 数据处理
pandas>=1.5.0
numpy>=1.23.0

# YAML 配置
PyYAML>=6.0
Confidence
86% confidence
Finding
numpy>=1.23.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
numpy>=1.23.0

# YAML 配置
PyYAML>=6.0

# 并行处理
tqdm>=4.64.0
Confidence
94% confidence
Finding
PyYAML>=6.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
PyYAML>=6.0

# 并行处理
tqdm>=4.64.0

# PDF 处理
PyPDF2>=3.0.0
Confidence
84% confidence
Finding
tqdm>=4.64.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
tqdm>=4.64.0

# PDF 处理
PyPDF2>=3.0.0
pdfminer.six>=20221105

# 日期处理
Confidence
94% confidence
Finding
PyPDF2>=3.0.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
pdfminer.six>=20221105

# 日期处理
python-dateutil>=2.8.0

# 日志
colorlog>=6.7.0
Confidence
82% confidence
Finding
python-dateutil>=2.8.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
python-dateutil>=2.8.0

# 日志
colorlog>=6.7.0

# 命令行界面
click>=8.1.0
Confidence
81% confidence
Finding
colorlog>=6.7.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
colorlog>=6.7.0

# 命令行界面
click>=8.1.0

# 可选 - 用于总结功能 (需要额外配置)
# openai>=1.0.0
Confidence
82% confidence
Finding
click>=8.1.0

Known Vulnerable Dependency: requests — 10 advisory(ies): CVE-2014-1830 (Exposure of Sensitive Information to an Unauthorized Actor in Requests); CVE-2024-47081 (Requests vulnerable to .netrc credentials leak via malicious URLs); CVE-2024-35195 (Requests `Session` object does not verify requests after making first request wi) +7 more

High
Category
Supply Chain
Confidence
95% confidence
Finding
requests

Known Vulnerable Dependency: urllib3 — 10 advisory(ies): CVE-2025-66471 (urllib3 streaming API improperly handles highly compressed data); CVE-2024-37891 (urllib3's Proxy-Authorization request header isn't stripped during cross-origin ); CVE-2026-21441 (Decompression-bomb safeguards bypassed when following HTTP redirects (streaming ) +7 more

High
Category
Supply Chain
Confidence
95% confidence
Finding
urllib3

Known Vulnerable Dependency: numpy — 10 advisory(ies): CVE-2014-1859 (Numpy arbitrary file write via symlink attack); CVE-2021-41495 (NumPy NULL Pointer Dereference); CVE-2021-33430 (NumPy Buffer Overflow (Disputed)) +7 more

Critical
Category
Supply Chain
Confidence
70% confidence
Finding
numpy

Known Vulnerable Dependency: PyYAML — 8 advisory(ies): CVE-2019-20477 (Deserialization of Untrusted Data in PyYAML); CVE-2020-1747 (Improper Input Validation in PyYAML); CVE-2020-14343 (Improper Input Validation in PyYAML) +5 more

Critical
Category
Supply Chain
Confidence
97% confidence
Finding
PyYAML

Known Vulnerable Dependency: tqdm — 3 advisory(ies): CVE-2024-34062 (tqdm CLI arguments injection attack); CVE-2016-10075 (TDQM Arbitrary Code Execution); CVE-2016-10075 (The tqdm._version module in tqdm versions 4.4.1 and 4.10 allows local users to e)

High
Category
Supply Chain
Confidence
72% confidence
Finding
tqdm

Known Vulnerable Dependency: PyPDF2 — 5 advisory(ies): CVE-2023-36464 (pypdf and PyPDF2 possible Infinite Loop when a comment isn't followed by a chara); CVE-2023-36807 (PyPDF2 vulnerable to possible Infinite Loop when reading malformed objects); CVE-2023-36810 (PyPDF2 quadratic runtime with malformed PDF missing xref marker) +2 more

High
Category
Supply Chain
Confidence
97% confidence
Finding
PyPDF2

Known Vulnerable Dependency: pdfminer.six — 2 advisory(ies): CVE-2025-70559 (Insecure Deserialization (pickle) in pdfminer.six CMap Loader — Local Privesc); CVE-2025-64512 (Arbitrary Code Execution in pdfminer.six via Crafted PDF Input)

High
Category
Supply Chain
Confidence
96% confidence
Finding
pdfminer.six

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal