OCR Extraction

Browse OCR Extraction agent skills in Data Processing and compare related workflows, tools, and use cases.

14 skills

nutrient-document-processing

by affaan-m

nutrient-document-processing skill for PDF processing and document automation with the Nutrient DWS API. Convert, OCR, extract, redact, sign, watermark, and fill files like PDFs, DOCX, XLSX, PPTX, HTML, and images.

PDF Processing

Favorites 0GitHub 156.2k

pdf

by anthropics

The pdf skill guides PDF Processing tasks like text extraction, merge and split operations, rendering pages to images, and PDF form workflows. It is especially useful for checking fillable fields, extracting form metadata, and validating non-fillable form layouts with scripts.

PDF Processing

Favorites 0GitHub 105.1k

platerecognizer-automation

by ComposioHQ

platerecognizer-automation guides Claude agents to use Plate Recognizer through Composio Rube MCP, with tool discovery, connection checks, and schema-based ALPR workflow execution.

Workflow Automation

Favorites 0GitHub 67.5k

google-cloud-vision-automation

by ComposioHQ

google-cloud-vision-automation helps agents run Google Cloud Vision workflows through Composio Rube MCP by searching current tools, verifying the google_cloud_vision connection, and executing validated schemas.

Workflow Automation

Favorites 0GitHub 67.5k

docsumo-automation

by ComposioHQ

docsumo-automation helps agents automate Docsumo OCR Extraction through Rube MCP, with live tool discovery, connection checks, and schema-first workflows.

OCR Extraction

Favorites 0GitHub 67.5k

affinda-automation

by ComposioHQ

affinda-automation helps agents run Affinda workflows through Composio Rube MCP by searching live tool schemas first, checking the Affinda connection, and executing document automation tasks with less guesswork.

Workflow Automation

Favorites 0GitHub 67.4k

azure-ai-document-intelligence-ts

by microsoft

azure-ai-document-intelligence-ts is a TypeScript skill for extracting text, tables, key-value fields, and structured data with Azure Document Intelligence. Use it for OCR Extraction from invoices, receipts, IDs, and forms, or when you need prebuilt and custom model workflows in Node.js with Azure REST SDK authentication.

OCR Extraction

Favorites 0GitHub 2.3k

azure-ai-contentunderstanding-py

by microsoft

azure-ai-contentunderstanding-py is the Python skill for Azure AI Content Understanding. It extracts structured content from documents, images, audio, and video for RAG workflows and automation. Use it when you need reliable multimodal extraction, Azure authentication, and repeatable pipeline-ready output.

RAG Workflows

Favorites 0GitHub 2.2k

azure-ai-formrecognizer-java

by microsoft

The azure-ai-formrecognizer-java skill helps Java developers use Azure AI Document Intelligence for OCR extraction, tables, key-value pairs, invoices, receipts, IDs, and custom document models. It aligns with the current com.azure:azure-ai-documentintelligence SDK and is useful when you need practical Java setup, API guidance, and repeatable document analysis.

OCR Extraction

Favorites 0GitHub 2.2k

azure-ai-document-intelligence-dotnet

by microsoft

azure-ai-document-intelligence-dotnet helps .NET developers install and use Azure AI Document Intelligence to extract text, tables, key-value pairs, and structured fields from invoices, receipts, IDs, and custom documents. It includes practical setup, authentication, and OCR Extraction guidance for reliable document analysis.

OCR Extraction

Favorites 0GitHub 2.2k

transcribe-video

by rameerez

The transcribe-video skill turns video or audio files into .srt, .vtt, and .txt outputs with AWS Transcribe. Use it for transcribe-video usage when you need captions, a searchable transcript, or a clean text version of spoken content. It also fits transcribe-video for Format Conversion workflows.

Format Conversion

Favorites 0GitHub 23

pdf

by K-Dense-AI

The pdf skill is a practical guide for PDF Processing when you need to read, extract, transform, or create PDF files in a workflow you can ship. It covers text extraction, merging, splitting, rotation, form filling, encryption, image extraction, and OCR for scanned PDFs. Use it when you need a repeatable pdf guide instead of a one-off prompt.

PDF Processing

Favorites 0GitHub 0

kreuzberg

by kreuzberg-dev

The kreuzberg skill helps you install and use Kreuzberg for document extraction across 91+ formats, including PDFs, Office files, images, HTML, email, and archives. It covers Python, Node.js/TypeScript, Rust, and CLI workflows for OCR, tables, metadata, batch processing, and practical parsing guidance.

PDF Processing

Favorites 0GitHub 0

nutrient-document-processing

by PSPDFKit-labs

nutrient-document-processing is a workflow skill for PDF Processing with Nutrient DWS. It helps you install, understand, and use repeatable document workflows for convert, merge, split, OCR, extract, redact, sign, optimize, and compliance outputs like PDF/A or PDF/UA.

PDF Processing

Favorites 0GitHub 0