Install
openclaw skills install @docsor1212/paper-polisher-proDetect and quantify AI writing traces, rewrite to remove AI style and reduce similarity while preserving terminology, data, and academic rigor in bilingual p...
openclaw skills install @docsor1212/paper-polisher-pro🎯 AI Detection · De-AI Rewriting · Paraphrase · Terminology Check · Quality Report Bilingual (CN/EN) · 100% Local · Zero Upload · Zero Setup
Most AI detectors only catch ChatGPT. This is the only open-source tool that fingerprints Chinese LLMs — DeepSeek V4, GLM-5/5.1, Qwen 3.5/3.6, Kimi K2.5/K2.6, MiniMax M2.5, and Step — alongside GPT, Claude, and Gemini.
Benchmark (2026 flagship models, long-form academic text):
"polish paper", "deai", "reduce ai detection", "check ai writing", "paper polish", "rewrite paper", "humanize paper", "AI paper detector", "academic writing assistant", "AI writing checker", "remove AI traces", "humanize AI text", "thesis polishing", "dissertation polish", "detect AI writing", "AI writing score", "SCI paper editing", "manuscript polishing"
python3 {{SKILL_DIR}}/scripts/ai_detector.py your_paper.txt --lang auto
Output:
📄 your_paper.txt
AI Risk Score: 68.5 / 100 🔴 HIGH
Paragraphs analyzed: 12
Top patterns: "it is worth noting" (×3), "plays a crucial role" (×2)
Suggestion: Rewrite paragraphs 2, 5, 8
python3 {{SKILL_DIR}}/scripts/term_check.py your_paper.txt
python3 {{SKILL_DIR}}/scripts/quality_report.py your_paper.txt --format json
| Layer | Weight | What it detects |
|---|---|---|
| Pattern matching | 50 | 1,002 model-specific AI patterns (ZH 663 + EN 339) |
| Burstiness | 20 | 3-component variance analysis (human writing is uneven) |
| TTR (Type-Token Ratio) | 15 | Vocabulary diversity |
| Perplexity | 15 | Statistical predictability |
| Info density | 15 | Content density vs filler ratio |
| Sentence templates | 10 | Formulaic sentence structures |
| Opener patterns | 10 | Predictable paragraph starters |
| Length distribution | 5 | Unnatural uniformity |
| RLHF alignment | +bonus | Sycophantic/aligned phrasing |
Chinese LLMs (exclusive fingerprinting): DeepSeek V4 · V3.2 · R1 · GLM-5 · GLM-5.1 · GLM-4.7 · Qwen 3.5-397B · Qwen 3.6 · Kimi K2.5 · K2.6 · MiniMax M2.5 · Step
International models: ChatGPT · Claude · Gemini
| Language | Patterns | Categories |
|---|---|---|
| Chinese | 663 | 18 categories (transition words, structural markers, medical templates, RLHF alignment, Kimi/DeepSeek/GLM fingerprints, etc.) |
| English | 339 | 9 categories (academic formal, filler phrases, markdown artifacts, RLHF alignment, etc.) |
Plus: 501 synonym groups, 25 Chinese sentence pattern templates.
python3 {{SKILL_DIR}}/scripts/ai_detector.py <file> --lang auto --format json --output report.json
Returns 0–100 AI risk score, risk level, matched patterns per paragraph.
Chinese: Remove filler ("值得注意的是", "综上所述"), break symmetrical structures, replace vague praise with data, vary sentence length, use content-based transitions.
English: Ban AI tells (plays a crucial role, has gained significant attention, delve into, myriad, plethora), use active voice, mix sentence lengths 5–30 words.
2,255 authoritative terms (National Terms Commission + MeSH 2026 + Drug Terminology). Flags non-standard usage and suggests corrections.
Before/after comparison: AI score change, per-paragraph breakdown, terminology rate, readability metrics.
paper-polisher/
├── SKILL.md ← This file (English, for ClawHub)
├── SKILL_ZH.md ← Chinese documentation (for SkillHub)
├── scripts/
│ ├── ai_detector.py ← AI detection engine (v1.1.0, 9 layers, 1002 patterns)
│ ├── perplexity.py ← Perplexity scoring module
│ ├── term_check.py ← Terminology standardization (2,255 terms)
│ ├── ngram_similarity.py ← N-gram repetition analysis
│ └── quality_report.py ← Comprehensive quality report
├── references/
│ ├── ai_patterns_zh.json ← Chinese AI patterns (663 rules, 18 categories)
│ ├── ai_patterns_en.json ← English AI patterns (339 rules, 9 categories)
│ ├── synonyms_general.json ← Synonym dictionary (501 groups)
│ └── sentence_patterns_zh.json ← Chinese sentence templates (25 patterns)
├── data/
│ └── terminology.json ← Standard terminology database (2,255 terms)
└── templates/ ← Prompt templates for rewriting