Amazing PsyCoder

Workflows

Orchestrates psychological experiment creation by enforcing a 3-step chain: design via 5-phase programming, code generation, and mandatory code review for Ps...

Install

openclaw skills install amazing-psycoder

Amazing PsyCoder — System Orchestrator

Version

v1.2 — stable, 2026-06-06.

Purpose

This is the single entry point for the Amazing PsyCoder experimental skill system. It orchestrates a mandatory sequential chain of three sub-skills that convert psychological experiment ideas into audited, production-quality code.

This skill does NOT generate code itself. It routes the user through the correct sequence and enforces the chain: Programming → Coder → Reviewer. No step may be skipped.

Platforms: Claude Code / Codex / Hermes / OpenClaw — follows the agentskills.io open standard. See PLATFORMS.md for platform-specific installation and tool mapping.

System Architecture

User describes experiment (English / 中文)
       │
       ▼
┌──────────────────────────────────────┐
│ ① psych-experiment-programming      │  Orchestration layer
│   Input: natural language description│  5-phase design workflow
│   Output: config YAML + conditions   │  Design Decision Registry
│   Gates: Gate 1→2→3→4→5             │  Progressive confirmation
└──────────────┬───────────────────────┘
               │ config YAML (internal artifact, not shown to user)
               ▼
┌──────────────────────────────────────┐
│ ② psych-experiment-coder            │  Code generation layer
│   Input: config YAML + conditions    │  12-step code template
│   Output: runnable code + README     │  4-layer priority architecture
│   Gate: 9-item post-generation check │  Canonical Code Skeleton
└──────────────┬───────────────────────┘
               │ Runnable experiment code
               ▼
┌──────────────────────────────────────┐
│ ③ psych-experiment-code-reviewer    │  Audit layer (final mandatory gate)
│   Input: code / config / plan        │  5 review modes
│   Output: audit report + readiness   │  Platform-aware checks
│   Gate: 0 Critical + 0 Major         │  Severity grading
└──────────────────────────────────────┘

All three steps are mandatory — none can be skipped. Programming → Coder → Reviewer. No experiment code is considered complete without passing reviewer audit with ready_for_collection or ready_after_minor_fixes.

How to Use This Skill

When a user invokes this skill, determine where they are in the pipeline and route accordingly:

Routing Decision Tree

User request
  │
  ├─ "I want to build an experiment" / "我要做一个…实验"
  │    → Invoke psych-experiment-programming (start Phase 1)
  │
  ├─ "Generate code for this config" / "生成代码"
  │    → Check: has Programming Gate 5 passed?
  │       Yes → Invoke psych-experiment-coder
  │       No  → Route back to psych-experiment-programming first
  │
  ├─ "Review this experiment code" / "审计这个实验"
  │    → Invoke psych-experiment-code-reviewer
  │
  ├─ "I got an error" / "代码报错"
  │    → Check the error type:
  │       Design error → psych-experiment-programming
  │       Code error   → psych-experiment-coder (see Debugging & Iteration Loop)
  │
  ├─ "Review found issues" / "审计不通过" / "代码有问题需要修"
  │    → Check the severity:
  │       Critical/Major issues → Fix in psych-experiment-coder, re-audit after
  │       Minor only → User can collect data; fix when convenient
  │
  ├─ "What paradigms are available?" / "有哪些范式"
  │    → Show [Paradigm Coverage Matrix](#paradigm-coverage-matrix)
  │
  └─ "How does this system work?" / "这个系统怎么用"
       → Show system overview + three-step chain

Mandatory Execution Order

  1. First: Invoke psych-experiment-programming — guide user through 5-phase design. Output: complete config YAML (internal, not shown to user)
  2. Second: Invoke psych-experiment-coder — generate platform code from config. Output: .py/.js/.m + README
  3. Third: Invoke psych-experiment-code-reviewer — audit the generated code. Output: audit report + readiness label

Never skip a step. Never generate code before the trial window timeline is confirmed.


Design Principles (System-Wide)

#PrincipleDescriptionApplies to
1Output is deliverableEvery phase produces complete, usable artifactsAll
2Progressive confirmationDesign decisions confirmed phase by phase; defaults flagged [ASSUMED]Programming
3Decision traceabilityDesign Decision Registry records source of every decisionProgramming
4Skeleton-first generationAll code generation MUST start from platform spec Canonical Code SkeletonCoder
5Paradigms provide logic, skeleton provides APIParadigm files define experiment logic; API patterns from spec skeletonCoder
6Anti-pattern zero-tolerancetime.sleep(), event.getKeys(maxWait=), KbCheck for RT, jsPsych.init(), WaitSecs — blockedCoder, Reviewer
7Your experiment, our standardsUser owns the experimental design; system guarantees code qualityAll
8Window timeline before codeNo code generation before trial window timeline is validatedProgramming, Coder
9Post-generation audit mandatoryAll generated code must pass through Reviewer before data collectionCoder, Reviewer
10Input bounds outputReviewer's conclusions cannot exceed what the input supportsReviewer

Red Lines (System-Wide Absolute Prohibitions)

These rules span all three skills. Violation is never acceptable:

#RuleOwned byConsequence of violation
R1No code generation before trial window timeline is completeProgrammingStructural errors, expensive late-stage fixes
R2No assumed response mappingProgrammingGuessing key mapping invalidates accuracy data
R3No time.sleep() in experiment codeCoderBlocks event loop, Escape unresponsive
R4No event.getKeys(maxWait=...)CoderBlocks event loop
R5No data saved only at experiment endCoderCrash = all data lost
R6No silent filling of [MISSING] valuesProgrammingEvery gap must be resolved by asking user or offering flagged default
R7No Chinese/CJK text without explicit fontCoderPsychoPy default font lacks CJK glyphs; text renders as tofu (□□□)
R8No skipping paradigm Must-Confirm itemsProgrammingUnconfirmed items produce broken experiments
R9No rt_onset omitted on response windowsCoderMissing/incorrect RT onset invalidates all reaction time data
R10No code delivery without Reviewer passAllAll code must pass reviewer audit before data collection

Code Generation Architecture (4-Layer Priority)

All platforms use the same 4-layer priority stack. When layers conflict, higher always wins:

Layer 1: spec/          ← Highest: API spec, anti-patterns, Canonical Code Skeleton
Layer 2: mapping/       ← Structural: config YAML fields → platform code structures
Layer 3: paradigms/      ← Paradigm logic: window sequence, accuracy rules, condition structure
Layer 4: demo/          ← Lowest: raw demo code, logic reference only, never API reference

Code generation priority: spec canonical skeleton > spec anti-patterns > config→code mapping > paradigm logic > demos

Platform Layer Fill Status

LayerPsychoPyjsPsychPsychtoolbox
L1 spec/
L2 mapping/
L3 paradigms/✅ 27 paradigms✅ 25 paradigms✅ 5 paradigms
L4 demo/_raw/✅ 45 .py✅ 23 .js✅ 100 .md

Code Template (Overview)

All generated code follows a 12-step structure: imports → parameters → display setup → stimulus preloading → condition loading → helpers → instructions → practice → main loop (blocks → randomization → per-trial windows) → incremental data save → cleanup → package with README. The full template with detailed per-step requirements is in the coder SKILL.md.


Post-Generation Quality Gate (Overview)

Before delivery, all generated code must pass the 9-item Quality Gate defined in the coder SKILL.md. Covers: spec skeleton compliance, anti-pattern scan, API patterns, parameter placement, escape handling, RT source, incremental save, preloading, and CJK font config. Any failure = fix before delivery.


Review Modes and Readiness Labels

Review Modes (auto-selected by Reviewer)

ModeInputMaximum label
code-auditComplete experiment codeready_for_collection
config-auditConfig YAML / trial timelinepre_code_ready
implementation-plan-reviewPseudocode / architecture planpre_code_ready
triage-onlyNatural-language descriptionNone (missing-info list only)
blockedInsufficient inputNone (state what's needed)

Readiness Labels

LabelMeaning
ready_for_collectionZero Critical + zero Major — can collect data
ready_after_minor_fixesOnly Minor issues remain
not_ready_for_collectionCritical or Major issues exist — do NOT collect
pre_code_readyDesign complete, ready for code generation
needs_experiment_infoKey design information missing
blockedInput insufficient for any review

Severity Classification

SeverityDefinitionCan collect data?
CriticalInvalidates all data; must fix before any collectionNo
MajorDegrades data quality; fix before formal collectionNo
MinorDoes not affect data quality; fix when convenientYes

Paradigm Coverage Matrix

Core Paradigms (14) — Full Programming-Layer Spec

All 10 required sections filled (When to Use, Core Logic, Must Confirm, Do Not Assume, Condition File Columns, Trial Window Timeline, Data Output Columns, Randomization Checks, Common Failure Modes, Example):

Go/No-go · Navon · Priming · Stroop · Eriksen Flanker · Simon · Rating · Stop-signal · IAT · N-back · Dot-probe · Visual Search · Task Switching · EAST

Extended Paradigms (24) — Programming-Layer Reference Descriptions

Antisaccade · ANT · BART · Bilingual Stroop · Change Detection · Children Flanker · Choice RT · Climate Reflection · CPT · Corsi Blocks · Cyberball · Delay Discounting · Drag and Drop · Mental Rotation · Multisensory Nature · Numerical Stroop · Phone a Friend · Posner Cuing · Psychophysics Staircase · Rating to Choice · Sternberg · Ultimatum Game · WCST · Writing Distraction

Paradigm Count Explained

LayerCountWhat they are
Programming paradigms/38 (14 core + 24 extended)Design references for the 5-phase workflow. Apply to ALL platforms
Coder psychopy/paradigms/27PsychoPy code references with platform-specific generation logic
Coder jspsych/paradigms/25jsPsych code references with timeline plugins
Coder psychtoolbox/paradigms/5PTB code references; most paradigms use 12-step template for manual adaptation

Cross-Platform Mandatory API Quick Reference

DimensionPsychoPyjsPsych (7.x)Psychtoolbox
Keyboardkeyboard.Keyboard(backend='ptb')Plugin class referencesKbQueueCreate + KbQueueCheck
RT sourcekey.rt (USB HID async)data.rt (automatic)firstPress - VBLTimestamp
RT originwin.callOnFlip(kb.clock.reset)Stimulus onset (automatic)VBLTimestamp = Screen('Flip') return
Timing loopCountdownTimertrial_duration parametervbl + (waitframes-0.5)*ifi
Data savetry/finally + per-trial flushon_finish callbacktry/catch + fopen/fprintf/fclose
QuitEscape check in every loop'escape' in choicesKbCheck(KbName('ESCAPE'))
Cleanupwin.close() + core.quit()jsPsych.endCurrentTimeline()sca + Priority(0) + ShowCursor
CJK fontFONT_CONFIG toggle + TextBox2CSS font-familyScreen('TextFont') + TextStyle

Inter-Skill Communication Protocol

Programming → Coder

  • Artifact: Complete config.yaml (internal, never shown to user)
  • Precondition: Gate 5 passed (user confirmed full Design Decision Registry)
  • Coder's duty: Load config → select platform → copy skeleton → map code → Quality Gate → deliver

Coder → Reviewer

  • Artifact: Generated experiment code (.py / .js / .m) + condition files + README
  • Precondition: Coder's Post-Generation Quality Gate passed
  • Reviewer's duty: Detect platform → load corresponding spec → audit each dimension → output graded report + readiness label

Shared Artifacts

ArtifactProducerConsumerFormat
config YAMLProgrammingCoder.yaml (internal, not displayed)
Condition filesProgrammingCoder.xlsx / .csv
Experiment codeCoderReviewer, User.py / .js / .m
Experiment READMECoderReviewer, User.md (alongside code)
Audit reportReviewerUserMarkdown (graded + readiness label)

Code Output Specification

Deliverables

Every code generation produces two files:

FileFormatContent
Platform experiment file.py / .js / .mRunnable code, all parameters at top, FONT_CONFIG toggle if CJK used
Experiment README.mdWindow sequence diagram, condition/block structure, response rules, data columns, how to run, parameter line numbers, known limitations

Language Consistency

Code comments and README language MUST match the user's language:

  • 中文用户 → 中文 README + 中文代码注释
  • English user → English README + English code comments

File Structure

amazing-psycoder-skills/
├── amazing-psycoder/                       ← Entry orchestrator (this skill)
│   ├── SKILL.md
│   ├── PLATFORMS.md                        ← Platform adapter reference
│   ├── install.sh                          ← Cross-platform installer
│   ├── psych-experiment-programming/        ← ① Orchestration layer
│   │   ├── SKILL.md                         ← 5-phase workflow + 10 red lines
│   │   ├── README.md
│   │   ├── paradigms/                       ← 38 paradigm reference files
│   │   └── references/                      ← Design references (config-schema, timing, etc.)
│   ├── psych-experiment-coder/              ← ② Code generation layer
│   │   ├── SKILL.md                         ← Generation flow + 4-layer arch + 9-item gate
│   │   ├── README.md
│   │   ├── psychopy/                        ← PsychoPy (full auto)
│   │   │   ├── spec/README.md               ← Canonical Skeleton + anti-patterns
│   │   │   ├── mapping/README.md            ← Config→code mapping
│   │   │   ├── paradigms/                    ← 27 paradigm references
│   │   │   └── demo/_raw/                   ← 45 demo .py files
│   │   ├── jspsych/                         ← jsPsych
│   │   │   ├── spec/README.md               ← Canonical Skeleton + anti-patterns
│   │   │   ├── mapping/README.md            ← Config→timeline + migration table
│   │   │   ├── paradigms/                    ← 25 paradigm references
│   │   │   └── demo/_raw/                   ← 23 demo .js files
│   │   └── psychtoolbox/                    ← PTB
│   │       ├── spec/README.md               ← Canonical Skeleton + anti-patterns
│   │       ├── mapping/README.md            ← Config→MATLAB + frame loops
│   │       ├── paradigms/                    ← 5 paradigm references
│   │       └── demo/_raw/                     ← 100 API demos by category
│   └── psych-experiment-code-reviewer/      ← ③ Audit layer
│       ├── SKILL.md                         ← 5 review modes + platform-aware audit
│       └── README.md
├── docs/                                    ← Multi-language READMEs
│   ├── README_EN.md
│   ├── README_ZH-HANT.md
│   ├── README_JA.md
│   ├── README_DE.md
│   └── README_FR.md
└── README.md                                ← Main README (简体中文)

When NOT to Use This Skill

  • Quick PsychoPy/jsPsych/PTB API questions: Answer directly; don't invoke the full workflow
  • General Python/JavaScript/MATLAB questions: Answer directly
  • Non-experiment programming tasks: Not in scope