Overview
Features
recognition_text
Text recognition from images, Word documents, and PDFs; returns extracted text.
doc_to_markdown
Converts images, PDFs, and Word documents to Markdown; returns Markdown text.
general_information_extration
Automatically identifies and extracts key information from documents or user-specified fields; returns a JSON with the extracted data; supports PDF, Word/Excel, and images.
Who Is This For?
- Developers:Integrate OCR, text extraction, and Markdown conversion into applications via the MCP server.
- Data engineers:Extract structured data from documents and generate Markdown representations within pipelines.




