Install
openclaw skills install deepread-form-fillAI-powered PDF form filling. Upload any PDF form and your data as JSON — AI detects fields visually, maps your data semantically, fills the form with quality...
openclaw skills install deepread-form-fillUpload any PDF form + your data as JSON. AI detects fields, maps your data, fills the form, quality-checks the result, and returns a completed PDF you can download.
Works with any PDF — scanned paper forms, government PDFs, custom templates. No AcroForm fields required.
You provide:
{"full_name": "Jane Doe", "dob": "1990-03-15"})DeepRead returns:
No field mapping, no coordinates, no configuration. The AI figures out where everything goes.
# Sign up (free — 2,000 pages/month, no credit card)
# https://www.deepread.tech/dashboard/?utm_source=clawdhub
Save your API key:
export DEEPREAD_API_KEY="sk_live_your_key_here"
# 1. Submit form + data
curl -X POST https://api.deepread.tech/v1/form-fill \
-H "X-API-Key: $DEEPREAD_API_KEY" \
-F "file=@application.pdf" \
-F 'form_fields={"full_name": "Jane Doe", "date_of_birth": "03/15/1990", "address": "123 Main St, Portland OR 97201"}'
# Response (immediate):
# {"id": "<job_id>", "status": "queued"}
# 2. Poll for result (use the id from step 1)
curl https://api.deepread.tech/v1/form-fill/<job_id> \
-H "X-API-Key: $DEEPREAD_API_KEY"
# 3. Download the filled PDF from filled_form_url in the response
Authentication: X-API-Key header (required)
Content-Type: multipart/form-data
Parameters:
| Field | Type | Required | Description |
|---|---|---|---|
file | File | Yes | PDF form to fill |
form_fields | JSON string | Yes | {"field_name": "value"} — your data |
webhook_url | String | No | URL to receive results when done |
idempotency_key | String | No | Prevent duplicate submissions |
url_expires_in | Integer | No | Signed URL expiry in seconds (default: 604800 = 7 days, min: 3600, max: 604800) |
Example:
curl -X POST https://api.deepread.tech/v1/form-fill \
-H "X-API-Key: $DEEPREAD_API_KEY" \
-F "file=@tax_form.pdf" \
-F 'form_fields={
"taxpayer_name": "Jane Doe",
"ssn": "123-45-6789",
"filing_status": "Single",
"total_income": "85000",
"tax_year": "2025"
}' \
-F "webhook_url=https://your-app.com/webhooks/form-fill"
Response (immediate):
{
"id": "<job_id>",
"status": "queued"
}
Processing is asynchronous — poll the GET endpoint or use a webhook.
Authentication: X-API-Key header (required)
Rate limit: 20 requests per 60 seconds
curl https://api.deepread.tech/v1/form-fill/<job_id> \
-H "X-API-Key: $DEEPREAD_API_KEY"
Response when completed:
{
"id": "<job_id>",
"status": "completed",
"file_name": "tax_form.pdf",
"created_at": "2025-06-15T10:00:00Z",
"completed_at": "2025-06-15T10:00:18Z",
"filled_form_url": "https://storage.deepread.tech/form_fill/.../filled.pdf",
"fields_detected": 25,
"fields_filled": 23,
"fields_verified": 21,
"fields_hil_flagged": 2,
"duration_seconds": 18.3,
"report": {
"summary": {
"fields_detected": 25,
"fields_filled": 23,
"fields_verified": 21,
"fields_hil_flagged": 2,
"mappings_created": 23,
"unmapped_keys": 0,
"adjustments_made": 3
},
"fields": [
{
"field_index": 0,
"label": "Taxpayer Name",
"field_type": "text",
"page": 1,
"value": "Jane Doe",
"hil_flag": false,
"verified": true
},
{
"field_index": 8,
"label": "Total Income",
"field_type": "text",
"page": 2,
"value": "85000",
"hil_flag": true,
"verified": false,
"reason": "Text overlaps adjacent field"
}
],
"mappings": [
{
"user_key": "taxpayer_name",
"field_index": 0,
"value_to_fill": "Jane Doe",
"confidence": 0.95
}
],
"unmapped_user_keys": [],
"adjustments_made": ["Field 8: reduced font size from 12pt to 8pt"],
"qa_feedback": ["Total Income: text overlaps adjacent field"],
"errors": []
},
"errors": null,
"error_message": null
}
Status values:
| Status | Meaning |
|---|---|
queued | Waiting for processing |
processing | AI is filling the form |
completed | Done — download from filled_form_url |
failed | Something went wrong — check error_message |
Poll every 5-10 seconds until status is completed or failed.
If you provide webhook_url, DeepRead POSTs results when the job finishes:
Completed:
{
"job_id": "<job_id>",
"status": "completed",
"created_at": "<ISO 8601 timestamp>",
"completed_at": "<ISO 8601 timestamp>",
"result": {
"filled_form_url": "<signed URL to download filled PDF>",
"fields_detected": 25,
"fields_filled": 23,
"fields_verified": 21,
"fields_hil_flagged": 2,
"report": { ... }
}
}
Failed:
{
"job_id": "<job_id>",
"status": "failed",
"created_at": "<ISO 8601 timestamp>",
"completed_at": "<ISO 8601 timestamp>",
"error": "Form fill timed out after 600s",
"errors": ["Form fill timed out after 600s"]
}
Upload PDF + JSON data
│
▼
┌──────────────────────┐
│ 1. DETECT FIELDS │ Vision AI scans every page, finds all fillable areas
│ (visual, not PDF │ Returns: label, type, page, bounding box coordinates
│ form fields) │
└──────────┬───────────┘
▼
┌──────────────────────┐
│ 2. MAP DATA │ AI semantically matches your JSON keys → form fields
│ "full_name" → │ Transforms values: splits names, formats dates,
│ "Full Name" field │ converts checkboxes, adds currency symbols
└──────────┬───────────┘
▼
┌──────────────────────┐
│ 3. FILL FORM │ Places text at visual coordinates on the PDF
│ coordinate-based │ Handles: text, checkboxes, dropdowns
│ insertion │
└──────────┬───────────┘
▼
┌──────────────────────┐
│ 4. QA CHECK │ Vision AI re-reads the filled form to verify:
│ visual verify │ - Text is readable, not cut off
│ │ - Positioned correctly, no overlaps
└──────────┬───────────┘
▼
┌──────────────────────┐
│ 5. REPAIR (if needed)│ Auto-fixes: shrink font, adjust position, remap
│ per-field fixes │ If repair fails → flag for human review (hil_flag)
└──────────┬───────────┘
▼
Filled PDF + Report
Key insight: This is visual coordinate-based filling, not AcroForm-based. It works on any PDF — scanned paper forms, government PDFs with no editable fields, custom templates.
curl -X POST https://api.deepread.tech/v1/form-fill \
-H "X-API-Key: $DEEPREAD_API_KEY" \
-F "file=@loan_application.pdf" \
-F 'form_fields={
"applicant_name": "Jane Doe",
"date_of_birth": "03/15/1990",
"ssn": "123-45-6789",
"employer": "Acme Corp",
"annual_income": "95000",
"loan_amount": "350000",
"property_address": "456 Oak Ave, Portland OR 97201",
"loan_type": "30-Year Fixed"
}'
curl -X POST https://api.deepread.tech/v1/form-fill \
-H "X-API-Key: $DEEPREAD_API_KEY" \
-F "file=@claim_form.pdf" \
-F 'form_fields={
"policy_number": "INS-2025-78901",
"insured_name": "Jane Doe",
"date_of_loss": "06/01/2025",
"description": "Water damage to basement from pipe burst",
"estimated_damage": "12500",
"photos_attached": "true"
}'
curl -X POST https://api.deepread.tech/v1/form-fill \
-H "X-API-Key: $DEEPREAD_API_KEY" \
-F "file=@w4_form.pdf" \
-F 'form_fields={
"first_name": "Jane",
"last_name": "Doe",
"ssn": "123-45-6789",
"address": "123 Main St",
"city": "Portland",
"state": "OR",
"zip": "97201",
"filing_status": "Single",
"multiple_jobs": "false"
}'
import json
import os
import requests
import time
API_KEY = os.environ["DEEPREAD_API_KEY"]
FORM_TEMPLATE = "application.pdf"
applicants = [
{"full_name": "Jane Doe", "email": "jane@example.com", "dob": "1990-03-15"},
{"full_name": "John Smith", "email": "john@example.com", "dob": "1985-07-22"},
{"full_name": "Alice Chen", "email": "alice@example.com", "dob": "1992-11-08"},
]
jobs = []
for i, applicant in enumerate(applicants):
with open(FORM_TEMPLATE, "rb") as f:
resp = requests.post(
"https://api.deepread.tech/v1/form-fill",
headers={"X-API-Key": API_KEY},
files={"file": (FORM_TEMPLATE, f, "application/pdf")},
data={
"form_fields": json.dumps(applicant),
"idempotency_key": f"batch-2025-06-{i}",
},
)
job_id = resp.json()["id"]
jobs.append(job_id)
print(f"Submitted: {applicant['full_name']} → job {job_id}")
# Poll for results
for job_id in jobs:
while True:
result = requests.get(
f"https://api.deepread.tech/v1/form-fill/{job_id}",
headers={"X-API-Key": API_KEY},
).json()
if result["status"] in ("completed", "failed"):
print(f"Job {job_id}: {result['status']}")
if result["status"] == "completed":
print(f" Download: {result['filled_form_url']}")
print(f" Fields: {result['fields_filled']}/{result['fields_detected']} filled, "
f"{result['fields_hil_flagged']} need review")
break
time.sleep(5)
| Metric | What it means |
|---|---|
fields_detected | Total form fields AI found on the PDF |
fields_filled | Fields where your data was placed |
fields_verified | Fields that passed visual QA (text readable, positioned correctly) |
fields_hil_flagged | Fields needing human review (AI couldn't fully verify) |
Typical result: 90-95% of fields verified, 2-5% flagged for review.
A field gets hil_flag: true when:
Each flagged field includes a reason explaining why review is needed.
If your JSON has keys that don't match any form field, they appear in unmapped_user_keys. This means:
Prevent duplicate submissions with idempotency_key:
# First request
curl -X POST https://api.deepread.tech/v1/form-fill \
-H "X-API-Key: $DEEPREAD_API_KEY" \
-F "file=@form.pdf" \
-F 'form_fields={"name": "Jane"}' \
-F "idempotency_key=submission-abc-123"
# → {"id": "<job_id>", "status": "queued"}
# Retry (same key) — returns the same job, no duplicate
curl -X POST https://api.deepread.tech/v1/form-fill \
-H "X-API-Key: $DEEPREAD_API_KEY" \
-F "file=@form.pdf" \
-F 'form_fields={"name": "Jane"}' \
-F "idempotency_key=submission-abc-123"
# → {"id": "<same job_id as above>", "status": "queued"} ← SAME JOB
Upgrade: https://www.deepread.tech/dashboard/billing?utm_source=clawdhub
Upload a .pdf file. Other formats are not yet supported.
Check your JSON syntax. Must be a valid JSON object (not array):
✅ {"name": "Jane Doe", "dob": "1990-03-15"}
❌ ["name", "dob"]
❌ "just a string"
Upgrade to PRO or wait until next billing cycle.
The form is very complex or has many pages. Try splitting into smaller sections.
Add more descriptive keys in your JSON. The AI uses your key names to match fields:
✅ {"applicant_full_name": "Jane Doe"} — clear, matches form labels
❌ {"field1": "Jane Doe"} — ambiguous, hard to map
This is expected for 2-5% of fields. Check report.fields for the reason on each flagged field.
| Endpoint | Method | Auth | Purpose |
|---|---|---|---|
https://api.deepread.tech/v1/form-fill | POST | API Key | Submit form + data |
https://api.deepread.tech/v1/form-fill/{job_id} | GET | API Key | Poll for status + results |
Connect your own OpenAI, Google, or OpenRouter API key via the dashboard. All form fill processing routes through YOUR provider account — zero DeepRead LLM costs, page quota skipped entirely.
Set it up: https://www.deepread.tech/dashboard/byok
clawhub install uday390/deepread-ocrclawhub install uday390/deepread-form-fillclawhub install uday390/deepread-piiclawhub install uday390/deepread-agent-setupclawhub install uday390/deepread-byokGet started free: https://www.deepread.tech/dashboard/?utm_source=clawhub