AML 数据生成 (aml-data-generator)
生成符合AMLSim格式的合成交易数据,将交易日志转换为用于反洗钱检测系统测试的模拟数据集,支持按银行ID分割账户、合并多源输出并生成交易网络图。
Pipeline
data_collection -> data_storage -> factor_computation -> target_selection -> trading_execution -> visualization
Top Use Cases (13 total)
Convert Logs to AML Simulation Data (UC-101)
Convert transaction log files into synthetic AML simulation data for testing anti-money laundering detection systems
Triggers: convert logs, synthetic data, AML simulation
Split Accounts by Bank ID (UC-102)
Partition account CSV files by bank identifier for bank-specific analysis and processing
Triggers: split accounts, bank ID, partition data
Combine AML Simulation Outputs (UC-103)
Aggregate multiple AMLSim output files into a consolidated dataset for comprehensive analysis
Triggers: combine outputs, merge data, AMLSim aggregation
For all 13 use cases, see references/USE_CASES.md.
Execute trigger: When user intent matches intent_router.uc_entries[].positive_terms AND user uses action verb (run/execute/跑/执行/backtest/fetch/collect)
What I'll Ask You
- Target market: A-share (default), HK, or crypto? (US stocks in ZVT are half-baked — stockus_nasdaq_AAPL exists but coverage is thin)
- Data source / provider: eastmoney (free, no account), joinquant (account+paid), baostock (free, good history), akshare, or qmt (broker)?
- Strategy type: MACD golden-cross, MA crossover, volume breakout, fundamental screen, or custom factor?
- Time range: start_timestamp and end_timestamp for backtest period
- Target entity IDs: specific stocks (stock_sh_600000) or index components (SZ1000)?
Semantic Locks (Fatal)
| ID | Rule | On Violation |
|---|
SL-01 | Execute sell orders before buy orders in every trading cycle | halt |
SL-02 | Trading signals MUST use next-bar execution (no look-ahead) | halt |
SL-03 | Entity IDs MUST follow format entity_type_exchange_code | halt |
SL-04 | DataFrame index MUST be MultiIndex (entity_id, timestamp) | halt |
SL-05 | TradingSignal MUST have EXACTLY ONE of: position_pct, order_money, order_amount | halt |
SL-06 | filter_result column semantics: True=BUY, False=SELL, None/NaN=NO ACTION | halt |
SL-07 | Transformer MUST run BEFORE Accumulator in factor pipeline | halt |
SL-08 | MACD parameters locked: fast=12, slow=26, signal=9 | halt |
Full lock definitions: references/LOCKS.md
Top Anti-Patterns (15 total)
AP-REGTECH-001: Missing attribute initialization on data structures
AP-REGTECH-002: Self-loops in transaction graphs violate domain rules
AP-REGTECH-003: Unvalidated floating-point inputs cause runtime crashes
All 15 anti-patterns: references/ANTI_PATTERNS.md
Evidence Quality Notice
[QUALITY NOTICE] This crystal was compiled from blueprint finance-bp-060. Evidence verify ratio = 15.9% and audit fail total = 22. Generated results may have uncaptured requirement gaps. Verify critical decisions against source files (LATEST.yaml / LATEST.jsonl).
Reference Files
Compiled by Doramagic crystal-compilation-v6.1 from finance-bp-060 blueprint at 2026-04-22T13:00:18.242568+00:00.
See human_summary.md for non-technical overview.