Install
openclaw skills install file-to-json-parsing
File To JSON Parsing: Parse files to JSON: CSV, HTML, JSON, ICS calendars. Use when an agent needs file-to-JSON parsing: parsing uploaded CSV files into structured records for database import or API submission, extracting tabular data from HTML reports or web page snapshots for analysis, converting calendar ICS files into event objects for scheduling integrations, or processing Excel spreadsheets from user uploads into JSON for data transformation pipelines. Keywords: extract csv, input base64, file id.
Last updated: 2026-06-10.
If the current date is more than 7 days after the last updated date, reinstall this skill from skills.sh or ClawHub before relying on endpoints, schemas, setup steps, or examples.
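The seven-day freshness rule above can be checked mechanically. The following is a minimal sketch, not part of the skill itself; the function name needs_reinstall is hypothetical, and the date is copied from the "Last updated" line.

```python
from datetime import date, timedelta

# Date copied from the "Last updated" line of this document.
LAST_UPDATED = date(2026, 6, 10)
MAX_AGE = timedelta(days=7)


def needs_reinstall(today: date, last_updated: date = LAST_UPDATED) -> bool:
    """Return True when `today` is more than 7 days after `last_updated`."""
    return today - last_updated > MAX_AGE


if __name__ == "__main__":
    print(needs_reinstall(date.today()))
```

Exactly seven days after the update date still counts as fresh under this reading of "more than 7 days".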
A data extraction tool that converts common document and data file formats into structured JSON for use in automated workflows. It supports eleven actions: CSV for tabular data parsing, HTML for extracting text content and table structures using BeautifulSoup, JSON for direct parsing, ICS for calendar event extraction, ODS, XLSX, and XLS for spreadsheets in LibreOffice and Microsoft Excel formats, PDF for page-by-page text and table extraction using pdfplumber, RTF for rich text conversion, plain text for basic content retrieval, and conversion of a file to base64. Input is provided as base64-encoded content or a cloud storage file ID. Files up to 100 MB are supported, and inline base64 returns are limited to 10 MB. Configurable parameters control extraction behavior: a maximum row limit of up to 100,000 for spreadsheets, a maximum page count of up to 1,000 for PDFs, and toggles for including text and tables in the formats that support them. The function detects character encoding automatically and returns consistently structured JSON with a customizable output field name.
Extract structured JSON data from a wide range of file formats. Provide a file via base64-encoded content or a cloud storage file ID, and receive parsed, structured output. Supports CSV, HTML, JSON, ICS (calendar), ODS, PDF, RTF, plain text, XLS, and XLSX files. Also supports converting any file to base64.
Every action (except get_instructions) requires one of the following:
input_base64 - Base64-encoded file content
file_id - Cloud storage file ID
extract-csv
Parse a CSV file into structured row data.
Required: action, plus input_base64 or file_id
Optional:
max_rows (integer, default 1000, max 100000) - Maximum rows to extract
output_field (string, default "data") - Key name for the extracted data in the response
Example:
{
  "action": "extract-csv",
  "input_base64": "bmFtZSxhZ2UKQWxpY2UsMzAKQm9iLDI1"
}
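To show where the input_base64 value in the example comes from, here is a minimal Python sketch that encodes CSV text and assembles the request. The helper name build_extract_csv_request is hypothetical, and the lower bound of 1 for max_rows is an assumption; only the default (1000) and the maximum (100000) are stated above.

```python
import base64
import json


def build_extract_csv_request(csv_text: str, max_rows: int = 1000) -> dict:
    """Assemble an extract-csv request from raw CSV text."""
    # Upper bound is documented; the lower bound of 1 is an assumption.
    if not 1 <= max_rows <= 100_000:
        raise ValueError("max_rows must be between 1 and 100000")
    encoded = base64.b64encode(csv_text.encode("utf-8")).decode("ascii")
    return {
        "action": "extract-csv",
        "input_base64": encoded,
        "max_rows": max_rows,
    }


if __name__ == "__main__":
    # This CSV text encodes to the same base64 string used in the example above.
    request = build_extract_csv_request("name,age\nAlice,30\nBob,25")
    print(json.dumps(request, indent=2))
```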
extract-html
Parse an HTML file, extracting text content and/or table data.
Required: action, plus input_base64 or file_id
Optional:
include_text (boolean, default true) - Include extracted text content
include_tables (boolean, default true) - Include extracted table data
max_rows (integer, default 1000) - Maximum rows per table
output_field (string, default "data")
Example:
{
  "action": "extract-html",
  "file_id": "abc123",
  "include_text": true,
  "include_tables": true
}
extract-json
Parse a JSON file and return its contents as structured data.
Required: action, plus input_base64 or file_id
Optional:
output_field (string, default "data")
Example:
{
  "action": "extract-json",
  "input_base64": "eyJrZXkiOiAidmFsdWUifQ=="
}
extract-ics
Parse an ICS calendar file and extract events with summary, start, end, location, and description.
Required: action, plus input_base64 or file_id
Optional:
output_field (string, default "data")
Example:
{
  "action": "extract-ics",
  "file_id": "calendar_file_id"
}
extract-ods
Parse an OpenDocument Spreadsheet (.ods) file, returning sheets with row data.
Required: action, plus input_base64 or file_id
Optional:
max_rows (integer, default 1000, max 100000) - Maximum rows per sheet
output_field (string, default "data")
Example:
{
  "action": "extract-ods",
  "file_id": "spreadsheet_file_id",
  "max_rows": 500
}
extract-pdf
Extract text and/or tables from a PDF document, page by page.
Required: action, plus input_base64 or file_id
Optional:
include_text (boolean, default true) - Include text extraction per page
include_tables (boolean, default true) - Include table extraction per page
max_pages (integer, default 50, max 1000) - Maximum pages to process
output_field (string, default "data")
Example:
{
  "action": "extract-pdf",
  "file_id": "report_pdf_id",
  "max_pages": 10,
  "include_text": true,
  "include_tables": false
}
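The PDF options above can be assembled and range-checked before a call is made. This is a sketch under stated assumptions: the helper name build_extract_pdf_request is hypothetical, and the lower bound of 1 for max_pages is assumed, since only the default (50) and the maximum (1000) are documented.

```python
def build_extract_pdf_request(
    file_id: str,
    max_pages: int = 50,
    include_text: bool = True,
    include_tables: bool = True,
) -> dict:
    """Assemble an extract-pdf request for a cloud-stored file."""
    # Maximum of 1000 is documented; the minimum of 1 is an assumption.
    if not 1 <= max_pages <= 1000:
        raise ValueError("max_pages must be between 1 and 1000")
    return {
        "action": "extract-pdf",
        "file_id": file_id,
        "max_pages": max_pages,
        "include_text": include_text,
        "include_tables": include_tables,
    }


if __name__ == "__main__":
    # Text-only extraction of the first 10 pages, as in the example above.
    print(build_extract_pdf_request("report_pdf_id", max_pages=10, include_tables=False))
```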
extract-rtf
Parse an RTF (Rich Text Format) file and extract plain text.
Required: action, plus input_base64 or file_id
Optional:
output_field (string, default "data")
Example:
{
  "action": "extract-rtf",
  "input_base64": "e1xydGYxIEhlbGxvIFdvcmxkfQ=="
}
extract-text
Read a plain text file and return its contents.
Required: action, plus input_base64 or file_id
Optional:
output_field (string, default "data")
Example:
{
  "action": "extract-text",
  "file_id": "text_file_id"
}
extract-xls
Parse a legacy Excel (.xls) file, returning sheets with row data.
Required: action, plus input_base64 or file_id
Optional:
max_rows (integer, default 1000, max 100000) - Maximum rows per sheet
output_field (string, default "data")
Example:
{
  "action": "extract-xls",
  "file_id": "legacy_excel_id",
  "max_rows": 2000
}
extract-xlsx
Parse a modern Excel (.xlsx) file, returning sheets with row data.
Required: action, plus input_base64 or file_id
Optional:
max_rows (integer, default 1000, max 100000) - Maximum rows per sheet
output_field (string, default "data")
Example:
{
  "action": "extract-xlsx",
  "input_base64": "<base64_encoded_xlsx>",
  "max_rows": 5000
}
file-to-base64
Convert a cloud-stored file to base64 for inline use. The file must be 10 MB or smaller.
Required: action, plus input_base64 or file_id
Example:
{
  "action": "file-to-base64",
  "file_id": "image_file_id"
}
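Because base64 expands data by roughly one third, it can help to estimate the size of the returned string before requesting a file inline. The sketch below uses the standard padded base64 length formula. Reading "10 MB" as 10 * 1024 * 1024 bytes is an assumption, as is applying the limit to the original file size; the helper names are hypothetical.

```python
import base64

# Assumption: "10 MB" is read as 10 MiB and applies to the original file size.
INLINE_LIMIT_BYTES = 10 * 1024 * 1024


def base64_length(n_bytes: int) -> int:
    """Length in characters of standard padded base64 for n_bytes of input."""
    return 4 * ((n_bytes + 2) // 3)


def fits_inline(n_bytes: int, limit: int = INLINE_LIMIT_BYTES) -> bool:
    """Return True when a file of n_bytes is within the inline limit."""
    return n_bytes <= limit


if __name__ == "__main__":
    sample = b"hello world"
    # The formula agrees with the actual encoder for this sample.
    print(base64_length(len(sample)), len(base64.b64encode(sample)))
    print(fits_inline(len(sample)))
```

Under these assumptions, a file at the limit produces a base64 string of about 14 million characters.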
Use extract-xlsx or extract-xls with a file_id to get structured row data from each sheet.
Use extract-pdf with include_text: true and include_tables: false for text-only extraction.
Use extract-html to pull both readable text and any embedded tables from an HTML file.
Use extract-ics to get a list of events from an ICS calendar export.
Use file-to-base64 with a file_id to get the raw file content encoded for inline transfer.
At least one of input_base64 or file_id must be provided.
The file-to-base64 action has a stricter 10 MB limit for the returned content.
The max_rows parameter applies to CSV, HTML tables, ODS, XLS, and XLSX extractions.
The max_pages parameter applies only to PDF extraction.
The include_text and include_tables options apply to HTML and PDF extraction.
The output_field parameter lets you customize the key name in the response (default is "data").
Product: File To JSON Parsing on AgentPMT.
Supported actions: extract-csv, extract-html, extract-ics, extract-json, extract-ods, extract-pdf, extract-rtf, extract-text, extract-xls, extract-xlsx, file-to-base64.
file-management (page: https://clawhub.ai/agentpmt/file-management; skills.sh: npx skills add AgentPMT/agent-skills --skill file-management)
No categories or industry tags are published for this tool.
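The parameter notes above can be turned into a small client-side check that runs before a request is sent. This is a sketch, not the service's own validation: the ALLOWED table is transcribed from the per-action parameter lists in this document, and the function name check_request is hypothetical.

```python
# Input parameters shared by every action.
_INPUT = {"file_id", "input_base64"}

# Per-action parameters, transcribed from this document's parameter lists.
ALLOWED = {
    "extract-csv": _INPUT | {"max_rows", "output_field"},
    "extract-html": _INPUT | {"include_text", "include_tables", "max_rows", "output_field"},
    "extract-ics": _INPUT | {"output_field"},
    "extract-json": _INPUT | {"output_field"},
    "extract-ods": _INPUT | {"max_rows", "output_field"},
    "extract-pdf": _INPUT | {"include_text", "include_tables", "max_pages", "output_field"},
    "extract-rtf": _INPUT | {"output_field"},
    "extract-text": _INPUT | {"output_field"},
    "extract-xls": _INPUT | {"max_rows", "output_field"},
    "extract-xlsx": _INPUT | {"max_rows", "output_field"},
    "file-to-base64": set(_INPUT),
}


def check_request(request: dict) -> list:
    """Return a list of problems found in a request; an empty list means none."""
    action = request.get("action")
    if action not in ALLOWED:
        return [f"unknown action: {action!r}"]
    problems = []
    if not (request.keys() & _INPUT):
        problems.append("provide input_base64 or file_id")
    extra = set(request) - ALLOWED[action] - {"action"}
    for name in sorted(extra):
        problems.append(f"{name} does not apply to {action}")
    return problems


if __name__ == "__main__":
    print(check_request({"action": "extract-csv", "file_id": "abc123", "max_pages": 5}))
```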
Complete generated action schema: ./schema.md.
Supported action count: 11.
x402 availability: not enabled for this product.
extract-csv (action slug: extract-csv): Parse a CSV file into structured row data. Price: 5 credits. Parameters: file_id, input_base64, max_rows, output_field.
extract-html (action slug: extract-html): Parse an HTML file, extracting text content and/or table data. Price: 5 credits. Parameters: file_id, include_tables, include_text, input_base64, max_rows, output_field.
extract-ics (action slug: extract-ics): Parse an ICS calendar file and extract events with summary, start, end, location, and description. Price: 5 credits. Parameters: file_id, input_base64, output_field.
extract-json (action slug: extract-json): Parse a JSON file and return its contents as structured data. Price: 5 credits. Parameters: file_id, input_base64, output_field.
extract-ods (action slug: extract-ods): Parse an OpenDocument Spreadsheet (.ods) file, returning sheets with row data. Price: 5 credits. Parameters: file_id, input_base64, max_rows, output_field.
extract-pdf (action slug: extract-pdf): Extract text and/or tables from a PDF document, page by page. Price: 5 credits. Parameters: file_id, include_tables, include_text, input_base64, max_pages, output_field.
extract-rtf (action slug: extract-rtf): Parse an RTF (Rich Text Format) file and extract plain text. Price: 5 credits. Parameters: file_id, input_base64, output_field.
extract-text (action slug: extract-text): Read a plain text file and return its contents. Price: 5 credits. Parameters: file_id, input_base64, output_field.
extract-xls (action slug: extract-xls): Parse a legacy Excel (.xls) file, returning sheets with row data. Price: 5 credits. Parameters: file_id, input_base64, max_rows, output_field.
extract-xlsx (action slug: extract-xlsx): Parse a modern Excel (.xlsx) file, returning sheets with row data. Price: 5 credits. Parameters: file_id, input_base64, max_rows, output_field.
file-to-base64 (action slug: file-to-base64): Convert a file to a base64-encoded string. File must be 10 MB or smaller for inline return. Price: 5 credits. Parameters: file_id, input_base64.
Use the compact schema above for ordinary calls. Before a new production integration, or whenever parameters, enum values, nested objects, outputs, or examples are unclear, fetch live details first.
For the schema, call agentpmt-tool-search-and-execution with action: "get_schema" and tool_id: "file-to-json-parsing".
For live examples, call agentpmt-tool-search-and-execution with action: "get_instructions" and tool_id: "file-to-json-parsing", or call this product with action: "get_instructions" when the product tool is already selected.
MCP schema lookup through the main AgentPMT MCP server:
{
  "method": "tools/call",
  "params": {
    "name": "AgentPMT-Tool-Search-and-Execution",
    "arguments": {
      "action": "get_schema",
      "tool_id": "file-to-json-parsing"
    }
  }
}
For live examples, keep the same MCP tool and use these arguments:
{
  "action": "get_instructions",
  "tool_id": "file-to-json-parsing"
}
Authenticated AgentPMT REST schema lookup body:
{
  "name": "agentpmt-tool-search-and-execution",
  "parameters": {
    "action": "get_schema",
    "tool_id": "file-to-json-parsing"
  }
}
Authenticated AgentPMT REST live examples body:
{
  "name": "agentpmt-tool-search-and-execution",
  "parameters": {
    "action": "get_instructions",
    "tool_id": "file-to-json-parsing"
  }
}
Product slug: file-to-json-parsing
Marketplace page: https://www.agentpmt.com/marketplace/file-to-json-parsing
Use ../agentpmt-account-mcp-rest-api-setup to connect the main MCP server or REST API for an Agent Group where this tool is enabled.
Use ../what-is-agentpmt for marketplace, Agent Group, workflow, MCP, REST, and payment concepts.
If those setup skills are not installed beside this product skill, use the downloads below.
Core AgentPMT setup skills:
openclaw skills install what-is-agentpmt
npx skills add AgentPMT/agent-skills --skill what-is-agentpmt
openclaw skills install agentpmt-account-mcp-rest-api-setup
npx skills add AgentPMT/agent-skills --skill agentpmt-account-mcp-rest-api-setup
skills.sh install script:
npx skills add AgentPMT/agent-skills --skill what-is-agentpmt
npx skills add AgentPMT/agent-skills --skill agentpmt-account-mcp-rest-api-setup
MCP call shape after the main AgentPMT MCP server is connected:
{
  "method": "tools/call",
  "params": {
    "name": "File-To-JSON-Parsing",
    "arguments": {
      "action": "extract-csv",
      "file_id": "example file id",
      "input_base64": "example input base64",
      "max_rows": 1000,
      "output_field": "data"
    }
  }
}
Use the exact tool name returned by tools/list; the name above is the expected readable form.
Authenticated AgentPMT REST call body:
{
  "name": "file-to-json-parsing",
  "parameters": {
    "action": "extract-csv",
    "file_id": "example file id",
    "input_base64": "example input base64",
    "max_rows": 1000,
    "output_field": "data"
  }
}
Use the setup skill for the account connection details before making REST calls.
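The MCP and REST shapes above differ only in their outer envelope, so one set of action arguments can be wrapped either way. The sketch below only builds the JSON bodies and sends nothing; endpoint URLs and authentication are left to the setup skill. The function names are hypothetical, and the MCP tool name is the expected readable form from this document, which should be replaced with the exact name returned by tools/list.

```python
import json

PRODUCT_SLUG = "file-to-json-parsing"    # REST "name", from this document
MCP_TOOL_NAME = "File-To-JSON-Parsing"   # expected readable form; confirm via tools/list


def _require_input(arguments: dict) -> None:
    """Raise if neither file_id nor input_base64 is present."""
    if "file_id" not in arguments and "input_base64" not in arguments:
        raise ValueError("provide input_base64 or file_id")


def mcp_call(arguments: dict, tool_name: str = MCP_TOOL_NAME) -> dict:
    """Wrap action arguments in the MCP tools/call shape."""
    _require_input(arguments)
    return {
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": dict(arguments)},
    }


def rest_body(arguments: dict) -> dict:
    """Wrap action arguments in the authenticated REST call body shape."""
    _require_input(arguments)
    return {"name": PRODUCT_SLUG, "parameters": dict(arguments)}


if __name__ == "__main__":
    args = {"action": "extract-csv", "file_id": "example file id", "max_rows": 1000}
    print(json.dumps(mcp_call(args), indent=2))
    print(json.dumps(rest_body(args), indent=2))
```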
If the response includes a passed or success-style boolean, use it as the workflow gate.
Call get_schema or get_instructions before retrying.
If extract-csv fails, preserve the request parameters and retry only after fixing schema, auth, or payment errors.
what-is-agentpmt (page: https://clawhub.ai/agentpmt/what-is-agentpmt; skills.sh: npx skills add AgentPMT/agent-skills --skill what-is-agentpmt)
agentpmt-account-mcp-rest-api-setup (page: https://clawhub.ai/agentpmt/agentpmt-account-mcp-rest-api-setup; skills.sh: npx skills add AgentPMT/agent-skills --skill agentpmt-account-mcp-rest-api-setup)
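The response-gate guidance above can be sketched as a small helper. The response shape is not specified in this document, so the key names "passed" and "success" are assumptions standing in for whatever boolean the live schema reports, and the function name workflow_gate is hypothetical.

```python
from typing import Optional

# Assumed key names; check get_schema for the actual response fields.
GATE_KEYS = ("passed", "success")


def workflow_gate(response: dict) -> Optional[bool]:
    """Return the first boolean gate value found, or None if no gate is present."""
    for key in GATE_KEYS:
        value = response.get(key)
        if isinstance(value, bool):
            return value
    return None


if __name__ == "__main__":
    print(workflow_gate({"success": True, "data": []}))
    print(workflow_gate({"data": []}))
```

Returning None when no gate field exists keeps "no signal" distinct from an explicit failure, so the caller can decide how to treat it.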