markitdown

by K-Dense-AI

markitdown converts files and office documents to Markdown for easier reading, chunking, search, and LLM workflows. The skill supports PDF, DOCX, PPTX, XLSX, HTML, CSV, JSON, XML, ZIP, EPUB, images with OCR, and audio transcription.

Stars: 0

Favorites: 0

Comments: 0

Added: May 14, 2026

Category: Format Conversion

Install Command

npx skills add K-Dense-AI/claude-scientific-skills --skill markitdown

Curation Score

This skill scores 78/100, which means it is a solid directory listing candidate: users get a clear purpose, real workflow content, and enough operational detail to decide whether to install it for document-to-Markdown conversion. It is useful, though the install decision should account for missing support files and limited external guidance.

Strengths

Explicitly scoped conversion task: files and office documents to Markdown, including PDF, DOCX, PPTX, XLSX, images/OCR, audio/transcription, HTML, CSV, JSON, XML, ZIP, YouTube URLs, and EPUBs.
Substantial workflow content in SKILL.md with valid frontmatter, long body text, many headings, and no placeholder markers, suggesting real operational guidance rather than a stub.
Agent-friendly tool access is declared with Read, Write, Edit, and Bash, which supports a practical conversion workflow instead of a generic prompt-only skill.

Cautions

The skill repository itself provides no install command, scripts, or support files, so users may need to infer setup and runtime details from the prose alone.
The repository has limited auxiliary documentation and references, so edge cases, prerequisites, and validation steps may not be immediately obvious.

Tags: Markdown, PDF, DOCX, PPTX, XLSX, OCR, Audio Transcription

Overview of markitdown skill

What markitdown does

The markitdown skill converts source files into Markdown that is easier to read, chunk, search, and feed into LLM workflows. It is best for users who need reliable format conversion across office docs, PDFs, slides, spreadsheets, web pages, archives, and some media inputs without hand-cleaning the output.

Who should install it

Install the markitdown skill if you routinely turn documents into prompts, notes, summaries, knowledge-base pages, or downstream agent inputs. It is especially useful for analysts, researchers, and content ops teams that want consistent Markdown extraction instead of ad hoc copy-paste or generic OCR.

What makes it worth using

The main value is practical conversion coverage: markitdown supports formats like DOCX, PPTX, XLSX, PDF, HTML, CSV, JSON, XML, ZIP, EPUB, images with OCR, and audio with transcription. That makes it a strong choice when your input mix is messy and you want one tool for common file-to-text jobs.
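As a rough illustration of that coverage, here is a minimal sketch that assumes the skill wraps the open-source `markitdown` Python package (installed separately, for example with `pip install 'markitdown[all]'`). The helper names `is_supported` and `convert_to_markdown` are hypothetical, and the extension set only mirrors the formats listed above; image and audio inputs are left out because they need optional OCR and transcription dependencies.

```python
from pathlib import Path

# Extensions mirror the format list above (illustrative, not exhaustive).
SUPPORTED_EXTENSIONS = {
    ".pdf", ".docx", ".pptx", ".xlsx", ".html",
    ".csv", ".json", ".xml", ".zip", ".epub",
}


def is_supported(path: str) -> bool:
    """Return True when the file extension is in the illustrative set."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def convert_to_markdown(path: str) -> str:
    """Convert one file to Markdown text (hypothetical helper)."""
    if not is_supported(path):
        raise ValueError(f"Extension not in the illustrative set: {path}")
    # Imported lazily so the extension check works without the package.
    # Assumption: the `markitdown` package is installed in this environment.
    from markitdown import MarkItDown

    result = MarkItDown().convert(path)
    return result.text_content
```

Calling `convert_to_markdown("report.docx")` would return the Markdown text as a string, which you can then write to a file or pass to a later step.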

How to Use markitdown skill

Install and confirm the skill path

Install the skill through the directory’s install flow, then confirm the skill files under scientific-skills/markitdown. The repo’s core entry point is SKILL.md, and there are no helper scripts or reference folders to browse, so the decision surface is narrow and quick to inspect.

Turn a rough task into a usable prompt

The best markitdown usage starts with a clear conversion target, not just “convert this file.” State the source type, desired output shape, and any special handling. For example: “Convert this scanned PDF to clean Markdown, preserve headings and lists, ignore page numbers, and keep table structure where possible.” That gives the skill the constraints it needs to make good tradeoffs.
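The three ingredients above (source type, output shape, special handling) can be assembled mechanically. The sketch below is only an illustration: `build_conversion_prompt` is a hypothetical helper, not part of the skill.

```python
def build_conversion_prompt(
    source_type: str, output_shape: str, handling: list[str]
) -> str:
    """Join the conversion target and handling rules into one sentence."""
    clauses = [f"Convert this {source_type} to {output_shape}", *handling]
    if len(clauses) == 1:
        return clauses[0] + "."
    if len(clauses) == 2:
        return f"{clauses[0]} and {clauses[1]}."
    # Three or more clauses: comma-separated with a final "and".
    return ", ".join(clauses[:-1]) + ", and " + clauses[-1] + "."


prompt = build_conversion_prompt(
    "scanned PDF",
    "clean Markdown",
    [
        "preserve headings and lists",
        "ignore page numbers",
        "keep table structure where possible",
    ],
)
print(prompt)
```

Keeping the handling rules in a list makes it easy to add or drop one constraint at a time when the first output is not quite right.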

Read the files that matter first

Start with SKILL.md to understand supported formats, output expectations, and any workflow notes. Then check the frontmatter at the top of the skill file for scope clues such as description, allowed tools, and license. Because the skill tree is minimal, there is little hidden behavior to discover elsewhere.
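If you want to check those scope clues programmatically, a small frontmatter reader is enough. This is a sketch under assumptions: `read_frontmatter` is a hypothetical helper, it only handles flat `key: value` lines between `---` markers, and the sample text (including the `allowed-tools` key and the license value) is a made-up stand-in rather than the real SKILL.md.

```python
def read_frontmatter(skill_text: str) -> dict[str, str]:
    """Collect flat key/value pairs from a leading '---' block."""
    lines = skill_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    fields: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            break  # end of the frontmatter block
        # Indented (nested) lines are skipped in this simplified reader.
        if ":" in line and not line.startswith((" ", "\t")):
            key, _, value = line.partition(":")
            fields[key.strip()] = value.strip()
    return fields


# Hypothetical sample; the real file's fields and values may differ.
SAMPLE_SKILL = """---
name: markitdown
description: Convert files and office documents to Markdown.
allowed-tools: Read Write Edit Bash
license: example-license
---
# markitdown
"""

fields = read_frontmatter(SAMPLE_SKILL)
print(fields["description"])
```

For real use, a YAML parser would be more robust than this line-based reader, since frontmatter can contain nested or multi-line values.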

Use the right input for the right format

markitdown works best when the source is already structurally meaningful: Office docs with real headings, PDFs with selectable text, CSVs with clear columns, and HTML with semantic markup. For image scans, noisy screenshots, or audio, expect more variance and provide context about what must be preserved, such as speaker labels, table cells, or figure captions.
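One way to act on this is to flag inputs that are likely to need extra context before conversion. The sketch below is an assumption-laden illustration: `expected_variance` is a hypothetical helper, the image and audio extensions are example choices, and a PDF cannot be classified by extension alone because it may or may not carry selectable text.

```python
from pathlib import Path

# Structurally meaningful sources named in the paragraph above.
LOW_VARIANCE = {".docx", ".pptx", ".xlsx", ".csv", ".html"}
# Assumed example extensions for image and audio inputs.
HIGH_VARIANCE = {".png", ".jpg", ".jpeg", ".mp3", ".wav"}


def expected_variance(path: str) -> str:
    """Give a rough label for how much the output may vary."""
    suffix = Path(path).suffix.lower()
    if suffix in LOW_VARIANCE:
        return "low"
    if suffix in HIGH_VARIANCE:
        return "high"
    if suffix == ".pdf":
        # Selectable text converts well; scans rely on OCR.
        return "depends on text layer"
    return "unknown"


for name in ["budget.xlsx", "scan.jpg", "paper.pdf"]:
    print(name, "->", expected_variance(name))
```

Inputs labeled "high" are the ones where it pays to spell out what must be preserved, such as speaker labels, table cells, or figure captions.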

markitdown skill FAQ

Is markitdown only for documents?

No. The markitdown skill is broader than plain document conversion and is meant for mixed file-to-Markdown workflows. It is a good fit when you need one conversion path for docs, slides, spreadsheets, web content, archives, and some media sources.

Do I need it if I can just ask an AI to summarize files?

Yes, if you care about repeatable extraction first. A normal prompt can summarize a file, but markitdown is aimed at producing a cleaner Markdown base layer that other prompts, agents, or indexing steps can reuse. That usually improves consistency and reduces formatting loss.

Is it beginner friendly?

Mostly yes. The skill is useful even if you are not technical, as long as you can name the file type and the output goal. Beginners should keep requests concrete and avoid asking for too many transformations at once; convert first, then summarize or rewrite second.

When should I not use markitdown?

Do not use it as a replacement for domain-specific parsing when you need perfect layout reconstruction, legally exact pagination, or specialized data extraction from complex spreadsheets. If your job is true document forensics or pixel-faithful reproduction, a generic Markdown conversion layer may not be enough.

How to Improve markitdown skill

Give the converter less room to guess

The biggest quality gains come from telling markitdown what matters: headings, tables, speaker turns, code blocks, captions, or links. If the source is messy, add short instructions like “preserve table rows,” “drop boilerplate navigation,” or “keep only the main article text.”

Use format-specific instructions

Strong inputs mention the source and the desired handling. Example: “Convert this PPTX into Markdown with one section per slide, keep slide titles as H2s, and summarize bullet-heavy slides into concise bullets.” That is better than a generic conversion request because it matches the document structure.

Watch for common failure modes

The main risks are over-retained noise, collapsed tables, weak OCR on scans, and uneven treatment of mixed-media inputs. If the first output is too literal, ask for cleanup rules in the next pass; if it is too aggressive, ask to preserve more structure and source wording.

Iterate in two passes

For better results, extract faithfully first, then refine. Use the first pass to get a clean Markdown version, and the second to normalize headings, trim boilerplate, or prepare the text for RAG, notes, or publishing. That workflow usually yields better results than asking for extraction and rewriting in one step.
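The second pass can be partly mechanical. Here is a minimal sketch of a refinement step that runs on already-extracted Markdown; `refine_markdown` is a hypothetical helper, and its rules (drop bare page-number lines, tidy heading spacing, collapse blank runs) are example cleanup choices rather than anything the skill prescribes.

```python
import re


def refine_markdown(markdown: str) -> str:
    """Apply simple cleanup rules to extracted Markdown."""
    cleaned = []
    for line in markdown.splitlines():
        stripped = line.strip()
        # Drop bare page numbers such as "12" or "Page 12".
        # Caution: this also removes legitimate lines that are only digits.
        if re.fullmatch(r"(?i)(page\s+)?\d+", stripped):
            continue
        # Normalize headings to exactly one space after the hashes.
        heading = re.match(r"^(#{1,6})[ \t]+(\S.*)$", stripped)
        if heading:
            line = f"{heading.group(1)} {heading.group(2)}"
        cleaned.append(line.rstrip())
    text = "\n".join(cleaned)
    # Collapse runs of blank lines into a single blank line.
    return re.sub(r"\n{3,}", "\n\n", text).strip() + "\n"


raw = "##   Title\n\n\n\nBody text\nPage 3\n12\nMore"
print(refine_markdown(raw))
```

Keeping this step separate from extraction means the faithful first-pass output stays available if a cleanup rule turns out to be too aggressive.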

More skills

kreuzberg

by kreuzberg-dev

The kreuzberg skill helps you install and use Kreuzberg for document extraction across 91+ formats, including PDFs, Office files, images, HTML, email, and archives. It covers Python, Node.js/TypeScript, Rust, and CLI workflows for OCR, tables, metadata, batch processing, and practical parsing guidance.

PDF Processing

Favorites 0 | GitHub 0

xlsx

by anthropics

The xlsx skill helps agents read, edit, repair, create, and convert .xlsx, .xlsm, .csv, and .tsv files when the required deliverable is a spreadsheet. It is strongest for template-preserving updates, formula-safe workbook edits, messy tabular cleanup, and practical spreadsheet workflows backed by repo scripts for packing, validation, and recalculation.

Spreadsheet Workflows

Favorites 0 | GitHub 105.1k

pdf

by anthropics

The pdf skill guides PDF Processing tasks like text extraction, merge and split operations, rendering pages to images, and PDF form workflows. It is especially useful for checking fillable fields, extracting form metadata, and validating non-fillable form layouts with scripts.

PDF Processing

Favorites 0 | GitHub 105.1k

baoyu-youtube-transcript

by JimLiu

baoyu-youtube-transcript helps extract YouTube transcripts, subtitles, and cover images from a URL or video ID. It supports language selection, translation, markdown or SRT output, cached reformatting, and a fallback from InnerTube API to yt-dlp for more reliable transcript retrieval.

Format Conversion

Favorites 0 | GitHub 13.2k

baoyu-url-to-markdown

by JimLiu

baoyu-url-to-markdown converts live URLs to Markdown with a vendored baoyu-fetch CLI using Chrome CDP, site adapters, and generic fallback. Review Bun runtime needs, first-time EXTEND.md setup, and usage for X, YouTube, Hacker News, and rendered pages.

Format Conversion

Favorites 0 | GitHub 13.2k

pymatgen

by K-Dense-AI

pymatgen is a Python materials science toolkit for crystal structures, phase diagrams, electronic structure, and file conversion. This pymatgen skill helps with scientific workflows using CIF, POSCAR, VASP, and Materials Project data.

Scientific

Favorites 0 | GitHub 0

minimax-xlsx

by MiniMax-AI

The minimax-xlsx skill helps create, read, edit, validate, and format Excel workbooks with an Excel-first workflow. Use minimax-xlsx for Spreadsheet Workflows when you need structured files that preserve formulas, styles, sheet layout, and workbook behavior. It supports .xlsx, .xlsm, .csv, and .tsv tasks, including analysis, new workbook creation, minimal-invasive edits, formula repair, and validation. The minimax-xlsx guide is designed for real workbook handoff, not flat tables.

Spreadsheet Workflows

Favorites 0 | GitHub 0

baoyu-format-markdown

by JimLiu

baoyu-format-markdown formats plain text or messy Markdown into cleaner, publishable Markdown while preserving meaning. It repairs frontmatter, headings, lists, code blocks, quotes, and CJK spacing, making it useful for Format Conversion without rewriting content.

Format Conversion

Favorites 0 | GitHub 13.2k

baoyu-danger-x-to-markdown

by JimLiu

baoyu-danger-x-to-markdown converts X posts, threads, and some articles into Markdown with YAML front matter. It uses scripts in `scripts/` with `bun` or `npx -y bun`, supports cookie-based access and consent flow, and fits repeatable Format Conversion workflows better than a generic prompt.

Format Conversion

Favorites 0 | GitHub 13.2k

baoyu-markdown-to-html

by JimLiu

baoyu-markdown-to-html converts Markdown into styled HTML for WeChat-style publishing. It supports themes, code highlighting, math, PlantUML, footnotes, image handling, and optional link citations, with runtime execution through bun or npx -y bun.

Format Conversion

Favorites 0 | GitHub 13.2k

nutrient-document-processing

by affaan-m

The nutrient-document-processing skill handles PDF processing and document automation with the Nutrient DWS API. It can convert, OCR, extract, redact, sign, watermark, and fill files such as PDFs, DOCX, XLSX, PPTX, HTML, and images.

PDF Processing

Favorites 0 | GitHub 156.2k

speech-to-text

by NoizAI

The speech-to-text skill transcribes supported audio files into plain text, with options for timestamps, speaker labels, and JSON output. It is designed for practical speech-to-text usage in repeatable workflows, including interviews, meetings, podcasts, lectures, and automation tasks where consistent transcription matters.

Workflow Automation

Favorites 0 | GitHub 498

transcribe-video

by rameerez

The transcribe-video skill turns video or audio files into .srt, .vtt, and .txt outputs with AWS Transcribe. Use it when you need captions, a searchable transcript, or a clean text version of spoken content. It also fits Format Conversion workflows.

Format Conversion

Favorites 0 | GitHub 23

pdf

by openai

Use the pdf skill for PDF Processing tasks where layout, pagination, and rendered output matter. It helps you read, create, edit, and review PDFs with a visual-first workflow: render pages, inspect the result, then adjust. Use it when document accuracy matters.

PDF Processing

Favorites 0 | GitHub 0

web-to-markdown

by softaworks

web-to-markdown is a Format Conversion skill that turns live web pages into clean Markdown through the local web2md CLI, using a Chromium-family browser for JS-rendered pages, interactive flows, and batch URL conversion. It only runs when explicitly invoked by name.

Format Conversion

Favorites 0 | GitHub 1.3k

defuddle

by kepano

defuddle extracts clean markdown from web pages with the Defuddle CLI, removing clutter for research, docs, and articles. Use it for standard HTML pages, install with npm, and skip URLs ending in .md.

Web Research

Favorites 0 | GitHub 19.7k