Install
openclaw skills install agentsop-bio-fraud-forensicsScreens biomedical / life-science papers for signs of data fabrication, image manipulation, and statistical anomalies, using the detection techniques distilled from the field's canonical exposure platforms (PubPeer, Data Colada, Science Integrity Digest, For Better Science) and tools (ImageTwin/Proofig, statcheck, GRIM/GRIMMER, Problematic Paper Screener, Seek & Blastn). Use when asked to check a paper/figure for image duplication, blot splicing, impossible statistics, paper-mill or tortured-phrase signals, research integrity, or "is this data faked"; or when a user shares a figure, Western blot, supplementary dataset, or DOI and asks whether it looks manipulated. Reports observable anomalies as questions for clarification — it never accuses anyone of fraud.
openclaw skills install agentsop-bio-fraud-forensicsA screening methodology for life-science papers. It reverse-engineers how real cases were caught — the exact panels compared, the transform applied, the statistic recomputed — and turns that into a reproducible per-paper checklist. It is a detective's lens, not a verdict machine: every output stays at "observed anomaly" or "question for the authors," because red flag ≠ proof and an accusation can end a career.
Trigger when:
.xlsx, or a DOI and asks if it's trustworthy.Do NOT trigger when:
Run this as a chain-of-steps. Cheapest, fastest signals first; the expensive image/stat forensics last (they tell you where to dig is often answered for free by the cheap checks).
Step 1 — Scope & status. Identify the input: single figure, full paper, supplementary dataset, or a batch. Run the status cascade in parallel (it's free and may hand you the answer): Retraction Watch Database → PubMed retraction banner → Crossref/Crossmark notice → PubPeer (search DOI/author) → ORI case index (only if adjudicated US PHS misconduct is the question). Note what already exists; your job may shift to verifying/extending a prior flag.
Step 2 — Ordered screen. Walk the pipeline, recording each hit; do not stop at the first:
.xlsx → calcChain.Step 3 — Match a model & classify. For each hit, Read references/sop_models.md, match the
operation model (M1–M7), and name the sub-type + Bik category. Confirm image matches by
performing the transform yourself (flip/rotate/overlay) and including the result; confirm any
tool flag by human inspection — a large share of automated image hits are benign reuse, so treat
none as a finding until you have reproduced it by hand.
Step 4 — Benign-explanation gate (mandatory before any escalation). Run the benign-explanation checklist in M6. Record which innocent causes were excluded and why (disclosed splice, JPEG block, same-experiment loading-control reuse, tiling overlap, figure-assembly slip). No "looks suspicious → flag." Apply the honest-error discriminators from M1 (directionality, recurrence, sophistication, provenance, disclosure).
Step 5 — Grade & document. Default every finding to Tier 1 (observed anomaly). Escalate to Tier 2 (question for authors) only after Step 4, using the disclosed-evidence + hedge + named-alternative formula. Never originate Tier 3 (adjudicated misconduct) — cite the body that ruled. Write each finding in the reproducible annotation format (M7) and pick an Output Mode.
| # | Model | Core proposition | Main source |
|---|---|---|---|
| M1 | FFP Taxonomy & Honest-Error Discriminators | Classify the anomaly (fabrication/falsification + sub-types); separate honest error from misconduct via 5 tests; only ever assert the "significant departure," never intent. | ORI/42 CFR 93; Bik mBio 2016 |
| M2 | Image Forensics | Every band/field is a fingerprint; catch by eye, confirm by flip/rotate/overlay-Difference; correlated background texture (not band shape) is decisive; Bik Type I/II/III drives escalation. | Bik; ASM/ImageTwin pilot; Proofig |
| M3 | Statistical Forensics | Consistency tests (GRIM/GRIMMER/statcheck) prove impossibility from the text alone; distributional tests (digit/uniformity/duplication) raise flags; .xlsx calcChain exposes moved rows. | Data Colada [98],[109]; Brown & Heathers; Nuijten |
| M4 | Exposure-Site Method Mining + Verification Routing | Treat PubPeer/blog threads as worked detection recipes to replay; map each red flag to the platform that confirms/contextualizes it. | PubPeer; Data Colada; For Better Science |
| M5 | Paper-Mill & Systemic Signals | The fingerprint is recurrence across a batch: tortured phrases, wrong gene reagents (Seek & Blastn), templated "too-clean" figures, sold-authorship network shape. | Cabanac/Labbé; Byrne; Bik Tadpole mill |
| M6 | Graded-Evidence & Red-Line Discipline | Three-tier language with a banned-word filter; mandatory benign-explanation gate; the Data Colada disclosed-facts+hedge+alternative formula is both the ethics and the legal safe harbor. | COPE; Gino v. Data Colada; Sarkar v. Doe |
| M7 | Reproducible Screening Workflow & Annotation | Cheapest-signal-first ordering; a finding is real only if a stranger with the PDF can repeat your exact check; 7-field annotation (locator+comparison+transform+result+category+exclusions+neutral wording). | Bik; PubPeer FAQ; STM Integrity Hub |
Full cards (inputs, action steps, evidence, failure modes, boundaries, confidence) live in
references/sop_models.md. Read the matching card before acting; do not paste the card back to the user.
| Mode | Trigger | Output structure |
|---|---|---|
| Figure check | One figure/blot/panel shared | Per-panel: observation → transform performed + result → Bik category → benign causes excluded → tier + neutral wording |
| Full-paper screen | A paper/DOI to screen | Status-cascade result, then ordered-pipeline findings by layer, a triage summary, and an overall "monitor / clarify / already-flagged" disposition |
| Stats recompute | Means/SDs/p-values or .xlsx | Per-stat: test (GRIM/GRIMMER/statcheck/SPRITE/calcChain) → input → verdict (impossible/consistent/implausible) → cannot-prove line |
| Paper-mill / batch | "Is this a mill?" / multiple papers | Per-layer firing (text/reagent/image/network) + recurrence/batch evidence + advisory composite, human-review gate |
| Verification routing | "Where do I check this?" | The red-flag → platform routing table: which site, how to query, what it confirms |
| Annotation draft | "Write a PubPeer-grade comment" | The 7-field reproducible annotation, neutral and hedged, with the transform result attached |
| File | What | When to read |
|---|---|---|
references/sop_models.md | Full M1–M7 operation cards: inputs, action steps, evidence, failure modes, boundaries, confidence | Step 3 — read the matching card before acting |
references/research_notes.md | Human-readable evidence summary + the red-flag→platform routing table + tortured-phrase / banned-word seed lists | When you need the routing table or a source citation |
references/R01..R07-*.md | Primary research dossiers with real cases and URLs (audit trail) | When you need to trace a claim to its source case |
examples/demo_screening.md | Worked screening transcripts (figure check, stats recompute, boundary refusal) | To see the expected output shape |