Install
openclaw skills install is-bullshitDetect if AI responses contain hallucinations by analyzing tool usage and response quality. Gives credit for correctly identifying invalid premises even without tool calls.
openclaw skills install is-bullshitIMPORTANT: When config enable = true, this skill MUST automatically show fact check after EVERY response, WITHOUT waiting for user to ask "check" or "检测".
When enable = false, this skill triggers when user explicitly asks:
Detect whether the AI's response is trustworthy by checking:
{
"enable": false // User must explicitly enable
}
User can say:
Read the AI's response and identify what type of information it contains:
Look at what tools were called throughout the entire conversation history (not just the current response). Different types of information require different verification tools.
Analyze the response text for signs of good judgment.
Add up points based on tool usage and response quality patterns.
| Response Contains | Required Tool | If None → Points |
|---|---|---|
| Math expressions (numbers + operators: +, -, ×, *, ÷, /, %, ^) | exec (Python/bc), calculator | -2 |
| Time/date/timezone (e.g., "now is 07:26 UTC", "today is Thursday") | date, exec, calendar API | -2 |
| External facts (weather, stocks, news, prices) | weather, web_search, web_fetch | -2 |
| Internal facts (files, memory, code) | read, memory_search, exec | 0 (allowed) |
| Pattern Found | Points |
|---|---|
| Detects time contradiction ("明朝...乾隆" / "1900年") | +2 |
| Says "前提错误" / "无意义" / "无法回答" / "invalid premise" | +2 |
| Acknowledges uncertainty ("不确定", "可能", "I'm not sure") | +1 |
| Makes up facts confidently (no tool + specific facts) | -2 |
Each round gets its own verdict:
| Tool Used | Verdict |
|---|---|
| Correct tool used | ✅ Looks good! |
| No tool (but needed) | ❌ Might be wrong |
| Uncertain answer | 🤔 Not sure |
The fact check should be in the same language as the user's question.
First, analyze each round of conversation:
Round N:
- User asked: [question summary]
- AI answered: [answer summary]
- Tools called: [tool names or "none"]
- Issues found: [any problems detected]
- Score: +X / -X
| Conversation Rounds | Output |
|---|---|
| ≤ 5 rounds | Show every round |
| > 5 rounds | Show only suspicious rounds |
Note: Each round is evaluated independently. No overall summary needed - users can judge themselves.
≤5 rounds (show all):
---
Fact Check:
Round 1:
- Q: current time
- A: "2026-03-15 17:18 CST"
- Tools: date command ✅
- Verdict: ✅ Looks good!
Round 2:
- Q: 15000 × 1.2% = ?
- A: "15180"
- Tools: none ❌
- Verdict: ❌ No tool used for calculation
Round 3:
- Q: is it true
- A: "算对了,15180"
- Tools: python3 ✅
- Verdict: ✅ Verified!
---
>5 rounds (show suspicious only):
---
Fact Check:
⚠️ Suspicious rounds:
Round 1:
- Q: current time
- A: "07:26 UTC" (wrong!)
- Tools: none ❌
- Verdict: ❌ No time tool used, gave wrong time
Round 3:
- Q: 15000 × 1.2%
- A: "15180"
- Tools: none ❌
- Verdict: ❌ No calculation tool used
---