douyin-keyword-search
v1.0.0自动化抓取抖音搜索结果页面的文章数据,支持关键词搜索、格式输出和数量限制,需安装Playwright和Chromium浏览器。
Security Scan
OpenClaw
Benign
high confidencePurpose & Capability
Name/description claim browser-automated Douyin keyword scraping; included files (Python script, shell wrapper, requirements) and SKILL.md all implement and document Playwright + Chromium usage. No unrelated services, credentials, or binaries are requested.
Instruction Scope
SKILL.md instructs installing Playwright and Chromium and how to run the script. The runtime code only visits Douyin pages, extracts page elements, optionally saves a screenshot to the script directory for debugging, and prints/writes results. It does not read unrelated system files or send collected data to external endpoints beyond the target site.
Install Mechanism
There is no automated install spec in the registry metadata; SKILL.md asks the user to run pip and playwright install manually. The repository contains requirements.txt and the script can optionally run pip install via os.system if the user consents. No suspicious external download URLs or archive extraction are used.
Credentials
The skill requires no environment variables, credentials, or config paths. It does check for local Chrome/Edge executable paths on Windows to optionally reuse a local browser, which is reasonable for a browser automation tool.
Persistence & Privilege
Skill does not request persistent 'always' inclusion, does not modify other skills or global agent settings, and does not claim elevated agent privileges.
Assessment
This skill appears to do what it says: automated scraping of Douyin search results using Playwright. Before installing/running, consider: (1) running the code in an isolated environment (virtualenv) because it may install Playwright and download Chromium; (2) the script may prompt you to log in in a non-headless browser and will save a debug screenshot in the script directory if selectors fail; (3) scraping may violate Douyin's terms of service—respect robots.txt and rate limits; (4) inspect the included Python file yourself if you have concerns (it does not exfiltrate data to remote servers). If you do not want the script to auto-install packages, decline the install prompt and run dependency installation manually in a controlled environment.Like a lobster shell, security has layers — review code before you run it.
latest
抖音关键词搜索抓取技能
通过浏览器自动化抓取抖音搜索结果页面的文章数据。
文件说明
| 文件 | 说明 |
|---|---|
douyin_keyword_search.py | Python 主脚本 |
douyin_keyword_search.sh | Shell 包装脚本 |
requirements.txt | Python 依赖列表 |
安装依赖
pip install -r requirements.txt
python -m playwright install chromium
使用方法
命令行调用
# 基本使用
python douyin_keyword_search.py --keyword "关键词"
# 指定输出格式和数量
python douyin_keyword_search.py -k "科技新闻" -o json -l 30
# 保存到文件
python douyin_keyword_search.py -k "美食" -o csv -f results.csv
# 无头模式(后台运行)
python douyin_keyword_search.py -k "新闻" --headless
参数说明
| 参数 | 说明 | 默认值 |
|---|---|---|
--keyword, -k | 搜索关键词(必填) | - |
--output, -o | 输出格式:json/csv/txt | json |
--limit, -l | 抓取数量限制 | 20 |
--headless | 无头模式,不显示浏览器窗口 | false |
--output-file, -f | 输出文件路径 | 控制台输出 |
Claude Code 技能调用
在 Claude Code 中配置后可使用:
/skill douyin-keyword-search --keyword "人工智能"
输出示例
[
{
"title": "视频标题",
"author": "作者名称",
"url": "https://www.douyin.com/video/xxx",
"stats": ["10万点赞", "5000评论"],
"keyword": "搜索关键词",
"crawl_time": "2024-01-01T12:00:00"
}
]
注意事项
- 首次使用需要安装 Playwright 和 Chromium 浏览器
- 抖音需要登录才能查看完整搜索结果,建议首次运行时不使用
--headless参数 - 请遵守抖音的使用条款和 robots.txt 规则
- 建议控制抓取频率,避免对服务器造成压力
许可证
MIT License
Comments
Loading comments...
