OCR and Documents
Extract text and structure from PDFs, scanned documents, DOCX files, and images — ready for summarization, search, or downstream analysis.
npx clawhub@latest install ocr-docu
OCR and Documents gives your AI assistant the ability to read and extract usable text from a wide variety of document formats, including born-digital PDFs, scanned paper documents, and DOCX files. Whether you need raw text, structured markdown, or specific fields pulled from an invoice or report, this skill preprocesses documents into clean output that other skills and workflows can act on.
How It Works
The skill selects the right extraction strategy based on the document type. Text-based PDFs are processed quickly using libraries like PyMuPDF or pdfminer. Scanned documents and image-heavy files are routed through an OCR pipeline (e.g. Tesseract or a compatible OCR service). DOCX files are parsed using python-docx. The extracted content is then normalized into plain text or structured markdown, ready for summarization, indexing, archival, or further analysis by downstream skills.
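The routing described above can be sketched roughly as follows. This is an illustration only, not the skill's actual code: the function names (choose_strategy, normalize, extract), the 20-character threshold for deciding that a PDF has a usable text layer, and the use of pytesseract with Pillow as the Tesseract binding are all assumptions made for the example. Third-party imports are kept inside the functions that need them, so the routing and normalization logic can be read and run on its own.

```python
import re
from pathlib import Path

# Hypothetical set of image types sent straight to OCR.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def choose_strategy(path: str, pdf_text_chars: int = 0, min_chars: int = 20) -> str:
    """Pick an extraction route from the file type and, for PDFs,
    from how much text the embedded text layer yielded.
    min_chars is an assumed threshold, not a value from the skill."""
    suffix = Path(path).suffix.lower()
    if suffix == ".docx":
        return "docx"
    if suffix in IMAGE_SUFFIXES:
        return "ocr"
    if suffix == ".pdf":
        return "pdf_text" if pdf_text_chars >= min_chars else "ocr"
    raise ValueError(f"unsupported document type: {suffix or path}")


def normalize(text: str) -> str:
    """Normalize line endings and collapse stray whitespace."""
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    text = re.sub(r"[ \t]+", " ", text)      # runs of spaces/tabs -> one space
    text = re.sub(r" ?\n ?", "\n", text)     # trim spaces around newlines
    text = re.sub(r"\n{3,}", "\n\n", text)   # at most one blank line in a row
    return text.strip()


def _ocr_image(image) -> str:
    import pytesseract  # assumed Tesseract binding
    return pytesseract.image_to_string(image)


def _ocr_pdf_page(page, dpi: int = 300) -> str:
    from PIL import Image
    pix = page.get_pixmap(dpi=dpi)  # render the page to an RGB bitmap
    image = Image.frombytes("RGB", (pix.width, pix.height), pix.samples)
    return _ocr_image(image)


def extract(path: str) -> str:
    """Extract normalized plain text from a PDF, DOCX, or image file."""
    suffix = Path(path).suffix.lower()
    if suffix == ".pdf":
        import fitz  # PyMuPDF
        with fitz.open(path) as doc:
            text = "\n".join(page.get_text() for page in doc)
            # Fall back to OCR when the text layer is missing or nearly empty.
            if choose_strategy(path, len(text.strip())) == "ocr":
                text = "\n".join(_ocr_pdf_page(page) for page in doc)
        return normalize(text)
    if choose_strategy(path) == "docx":
        import docx  # python-docx
        document = docx.Document(path)
        return normalize("\n".join(p.text for p in document.paragraphs))
    from PIL import Image
    return normalize(_ocr_image(Image.open(path)))
```

Checking the text layer first keeps born-digital PDFs on the fast path and reserves OCR, which is slower and less exact, for pages that actually need it. A real pipeline would likely make that decision per page rather than per document.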
Key Features
Requirements
Use Cases
Installation
npx clawhub@latest install ocr-docu