OCR and Documents
Extract text and structure from PDFs, scanned documents, DOCX files, and images — ready for summarization, search, or downstream analysis.
npx clawhub@latest install ocr-docu
OCR and Documents gives your AI assistant the ability to read and extract usable text from a wide variety of document formats, including born-digital PDFs, scanned paper documents, and DOCX files. Whether you need raw text, structured markdown, or specific fields pulled from an invoice or report, this skill preprocesses documents into clean output that other skills and workflows can act on.
How It Works
The skill selects the right extraction strategy based on the document type. Text-based PDFs are processed quickly using libraries like PyMuPDF or pdfminer. Scanned documents and image-heavy files are routed through an OCR pipeline (e.g. Tesseract or a compatible OCR service). DOCX files are parsed using python-docx. The extracted content is then normalized into plain text or structured markdown, ready for summarization, indexing, archival, or further analysis by downstream skills.
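The routing described above can be sketched roughly as follows. This is an illustrative outline, not the skill's actual code: the function names (choose_strategy, extract_text, normalize_text), the list of image suffixes, and the MIN_CHARS_PER_PAGE threshold for deciding that a PDF has no usable text layer are all assumptions made for the example. The third-party libraries (PyMuPDF, pytesseract with Pillow, python-docx) are imported lazily, so the routing logic itself runs without them installed.

```python
import io
import re
from pathlib import Path

# Assumed for illustration: which suffixes count as images, and how few
# characters per page mean "this PDF is probably a scan".
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}
MIN_CHARS_PER_PAGE = 20


def choose_strategy(suffix, chars_per_page=0.0):
    """Pick an extraction route from the file suffix and, for PDFs,
    the average number of characters found in the text layer."""
    suffix = suffix.lower()
    if suffix == ".docx":
        return "docx"
    if suffix in IMAGE_SUFFIXES:
        return "ocr"
    if suffix == ".pdf":
        return "pdf_text" if chars_per_page >= MIN_CHARS_PER_PAGE else "ocr"
    raise ValueError("unsupported document type: " + suffix)


def normalize_text(text):
    """Collapse runs of spaces/tabs, trim line ends, and squeeze blank lines."""
    text = re.sub(r"[ \t]+", " ", text)
    text = "\n".join(line.rstrip() for line in text.splitlines())
    text = re.sub(r"\n{3,}", "\n\n", text)
    return text.strip()


def _ocr_image(image):
    import pytesseract  # requires the Tesseract binary to be installed
    return pytesseract.image_to_string(image)


def extract_text(path):
    """Route a file to the matching extractor and return normalized text."""
    suffix = Path(path).suffix.lower()

    if suffix == ".pdf":
        import fitz  # PyMuPDF
        with fitz.open(path) as doc:
            pages = [page.get_text() for page in doc]
            avg = sum(len(t.strip()) for t in pages) / max(len(pages), 1)
            if choose_strategy(suffix, avg) == "ocr":
                from PIL import Image
                # No usable text layer: render each page and OCR the image.
                pages = [
                    _ocr_image(Image.open(io.BytesIO(
                        page.get_pixmap(dpi=300).tobytes("png"))))
                    for page in doc
                ]
        return normalize_text("\n\n".join(pages))

    strategy = choose_strategy(suffix)
    if strategy == "docx":
        import docx  # python-docx
        paragraphs = docx.Document(path).paragraphs
        return normalize_text("\n".join(p.text for p in paragraphs))

    from PIL import Image
    return normalize_text(_ocr_image(Image.open(path)))
```

Calling extract_text("report.pdf") (a hypothetical file name) would try the text layer first and fall back to OCR only when the pages come back nearly empty, which keeps born-digital PDFs on the fast path.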
Key Features
Requirements
Use Cases
How to Install
npx clawhub@latest install ocr-docu