docx

by anthropics

The docx skill helps agents create, inspect, convert, and edit .docx files with practical workflows for pandoc, unpack/repack, comments, tracked changes, and LibreOffice-based conversion.

Stars0

Favorites1

Comments0

AddedMar 29, 2026

CategoryDOCX Workflows

Install Command

npx skills add anthropics/skills --skill docx

Curation Score

This skill scores 84/100, which means it is a solid directory listing candidate: agents get strong trigger cues, real executable workflows, and meaningful leverage beyond a generic prompt, though adopters should expect some environment setup and low-level DOCX/XML handling.

84/100

Strengths

Very clear trigger scope in frontmatter, including when to use it for creating, editing, extracting, tracked changes, comments, and DOCX-specific deliverables.
Backed by substantial operational assets: 59 scripts plus concrete tools for unpacking, repacking, validating, commenting, accepting changes, and LibreOffice conversion.
SKILL.md gives task-to-approach guidance and workflow patterns such as convert .doc to .docx, read with pandoc, and edit via unpack → XML edit → repack.

Cautions

No explicit install command in SKILL.md, and key workflows depend on external tools like LibreOffice, pandoc, and likely other local utilities.
Some editing paths require direct XML manipulation and pre-escaped content, which raises adoption complexity for users expecting a purely high-level document API.

Anthropic DOCX Workflow Documentation

Overview

Overview of docx skill

What the docx skill is for

The docx skill helps an agent create, inspect, and modify Microsoft Word .docx files with fewer blind spots than a generic prompt. It is best for users who need real DOCX workflows: generating a polished Word deliverable, extracting content for review, editing existing files, handling comments or tracked changes, or repairing package-level issues by working directly with the Office XML structure.

Who should install docx

Install this docx skill if you regularly need to:

produce Word documents, not just plain text
edit an existing .docx without manually clicking through Word
preserve document structure such as headings, comments, and revisions
convert legacy .doc files before further processing
inspect package contents when normal text extraction is not enough

It is especially useful for AI-assisted document operations where the output must remain a usable .docx, not merely a markdown draft.

What makes docx different from a normal prompt

The main differentiator is workflow specificity. The skill does not treat DOCX as “just text.” It recognizes that a .docx is a ZIP archive of XML files and points the agent toward the right path for each job:

pandoc for text-focused reading and extraction
unpack/edit/repack for structural edits
LibreOffice automation for some format conversions and accepting tracked changes
validation and repair steps when XML edits risk breaking the file

That makes docx more credible for DOCX workflows than a generic “write a report” instruction.

Best-fit jobs to be done

Use docx when the real job is one of these:

“Create a Word report with sections and professional formatting.”
“Read this .docx and summarize it, including tracked changes.”
“Replace or restructure content inside an existing Word file.”
“Add comments or process revisions programmatically.”
“Convert .doc to .docx so it can be edited safely.”

Important limits before you adopt

This skill is not a universal office suite. It is strongest when the task clearly targets .docx. It is a weaker fit for:

PDFs
Google Docs-native collaboration
spreadsheet-heavy workflows
visual layout perfection that requires manual Word desktop review
users who need zero local tooling such as pandoc or LibreOffice

The practical tradeoff: docx gives more control, but package-level editing requires care.

How to Use docx skill

Start with the install context, not just a command

The repository does not advertise a single formal docx install command inside SKILL.md, so treat it as a skill you add from the Anthropic skills repository, then use with local helper scripts and external tools. In practice, users evaluating docx usage should assume they may need:

Python
pandoc for reading and conversion-oriented extraction
LibreOffice soffice for .doc conversion and accepting changes
a shell environment that can run the included Python scripts

If your environment blocks GUI-ish office tooling or native subprocess calls, check that first. That is often the real adoption blocker.

Read these files first

For fastest orientation, read in this order:

skills/docx/SKILL.md
skills/docx/scripts/office/unpack.py
skills/docx/scripts/office/pack.py
skills/docx/scripts/accept_changes.py
skills/docx/scripts/comment.py
skills/docx/scripts/office/soffice.py

This path tells you the actual operating model of the docx skill: read, unpack, edit, validate, repack, and use LibreOffice only where XML-only editing is the wrong tool.

Choose the right workflow for the task

The best docx guide starts by selecting the right lane:

Read or analyze content: use pandoc or inspect unpacked XML
Create a new document: use the document-generation route referenced in SKILL.md
Edit an existing document: unpack → modify XML/assets → repack
Convert .doc to .docx: use LibreOffice conversion first
Accept tracked changes: use the provided LibreOffice macro helper
Add comments: use the comment script plus correct XML markers

If you skip this decision and jump straight to editing, quality drops fast.

Inputs the docx skill needs for good output

For reliable docx usage, give the agent more than “make a Word doc.” Strong inputs usually include:

the source file path, if editing
the desired output file path
whether the task is create, read, convert, annotate, or revise
formatting requirements such as headings, page numbers, TOC, tables, letterhead
whether tracked changes or comments must be preserved
whether the document contains images, tables, or templates that must survive intact

A weak prompt:

“Edit this Word document.”

A stronger prompt:

“Open contract_review.docx, preserve tracked changes, summarize all comments, then create a new executive_summary.docx with H1/H2 headings, a short risks table, and a final recommendations section.”

Practical commands users actually care about

A few high-value operations surfaced directly in the repository:

Convert legacy .doc before anything else:

python scripts/office/soffice.py --headless --convert-to docx document.doc

Extract text while keeping revision context:

pandoc --track-changes=all document.docx -o output.md

Unpack a DOCX for XML-level edits:

python scripts/office/unpack.py document.docx unpacked/

Repack after editing:

python scripts/office/pack.py unpacked/ output.docx --original document.docx

These commands reflect the actual docx for DOCX Workflows value: not just writing text, but operating on Word packages safely.

How to prompt the agent so docx triggers correctly

The skill is more likely to activate well when your request names the file type and operation clearly. Include:

.docx
the desired end state
whether the work is on an existing file or from scratch
what must be preserved

Good trigger examples:

“Create a polished .docx board memo from these notes.”
“Read this .docx and extract text including tracked changes.”
“Unpack and update the title page, then repack the .docx.”
“Add review comments to specific paragraphs in this Word document.”

Avoid vague phrasing like “make a document better” if you actually need package-safe editing.

When to use pandoc versus unpacking XML

This is one of the biggest practical decisions.

Use pandoc when you want:

readable text extraction
markdown conversion
easier review of tracked changes
content analysis more than layout surgery

Use unpack/edit/repack when you need:

comments
tracked-change-aware structural edits
image or package part replacement
low-level fixes inside word/ XML and relationships

If your goal is semantic reading, XML editing is overkill. If your goal is exact DOCX mutation, plain text extraction is not enough.

Special handling for tracked changes and comments

The repository includes unusually practical support here:

scripts/accept_changes.py accepts tracked changes using LibreOffice
scripts/comment.py helps insert comments into unpacked documents
helper code in scripts/office/helpers/ addresses run merging and redline simplification

This matters because revisions make raw DOCX XML much messier. If your documents involve legal review, editorial comments, or negotiated drafts, the docx skill is more compelling than a basic document generator.

Watch for XML-specific quality traps

Some failure modes are easy to miss:

comment markers must be placed correctly in document.xml
text for comments should be XML-escaped
DOCX edits can break relationships or schema validity
run fragmentation can make search/replace unreliable
tracked changes can distort the apparent text flow

The included pack/validate path reduces risk, but it does not remove the need for careful task framing.

Environment details that can block adoption

A practical blocker in docx install decisions is office automation. The repository’s soffice.py includes logic for sandboxed environments where Unix sockets may fail and an LD_PRELOAD shim may be needed. That is a strong signal that the authors expect real environment friction.

If your deployment cannot run LibreOffice, some workflows still work, but:

.doc conversion becomes harder
accepting tracked changes via the provided script is not available
some “just automate Word-like behavior” requests may need a different toolchain

Suggested workflow for consistent results

A good default docx guide workflow is:

Confirm whether the source is .doc or .docx.
Convert .doc to .docx first if needed.
Decide whether the task is text extraction or package editing.
Unpack only when structure-level edits are required.
Make targeted changes, not broad regex-style XML rewrites.
Repack with validation against the original when possible.
Open the output in Word or LibreOffice for final visual smoke testing.

That workflow prevents the most common corruption and mismatch issues.

docx skill FAQ

Is docx good for beginners?

Yes, if your need is clear and limited, such as converting, extracting, or making small edits. But advanced docx usage quickly becomes package-level XML work. Beginners can use it successfully if they stay within a guided workflow and do not treat Word files like plain text blobs.

When is docx better than an ordinary writing prompt?

Use docx when the output must be an actual Word file or when you must preserve structure from an existing .docx. A normal writing prompt can draft content, but it usually does not tell the agent how to convert, unpack, validate, or handle comments and revisions safely.

Can the docx skill create new documents from scratch?

Yes, but the repository’s strongest evidence is around practical file operations and editing workflows, not just prose generation. If your main need is “write content,” many tools can do that. If your need is “deliver or edit a usable .docx,” this skill is a better fit.

Does docx work for old .doc files?

Indirectly. Legacy .doc files should be converted first using LibreOffice. That is an important boundary: the docx skill is for .docx workflows, not native .doc editing.

Is docx suitable for legal or review-heavy documents?

Potentially yes, because tracked changes, comments, and validation are first-class concerns in the repository. That said, review-heavy documents should always be opened after generation to confirm visible behavior in Word-compatible editors.

When should I not use docx?

Skip this docx skill if:

you only need plain text output
the destination is PDF rather than Word
the workflow is Google Docs-first
you cannot run the local tools it depends on
pixel-perfect desktop publishing matters more than editable DOCX structure

How to Improve docx skill

Give docx explicit output constraints

The fastest way to improve docx results is to specify the finished artifact, not just the topic. Include:

target filename
source filename
preserve vs rewrite instructions
required sections
style constraints
whether comments, revisions, images, or tables must remain intact

This reduces tool-choice mistakes and prevents the agent from defaulting to a text-only path.

Ask for a workflow choice before execution

For better docx usage, prompt the agent to state which path it will use:

pandoc
unpack/edit/repack
LibreOffice conversion
comment or revision tooling

Example:

“Before editing, tell me whether this task should use pandoc extraction or unpack/repack, and why.”

That simple step catches many wrong-turn decisions early.

Improve search and replace tasks with structure hints

If you need replacements, identify where the content lives:

body text
headers/footers
comments
tables
title page
specific section headings

Why this helps: DOCX text is often split across many runs. A vague “replace all mentions” request can miss matches or break formatting.

Be careful with comments and XML escaping

When adding comments, provide clean, XML-safe text. The repository explicitly notes that comment text should be pre-escaped. If your comment includes ampersands, smart quotes, or special symbols, mention that they must be escaped or normalized.

This is a small detail, but it materially affects whether the resulting file opens cleanly.

Use original-file validation whenever possible

When repacking, include --original if you have the source file. That gives the validator more context and makes the docx skill safer for edits to existing documents. It is one of the highest-value habits in this skill’s workflow.

Iterate after first output with file-aware feedback

Do not stop at “looks wrong.” Better follow-up feedback is:

“The document opens, but comments do not appear in Word.”
“Tracked changes were flattened; preserve them instead.”
“The body text updated, but header branding stayed old.”
“The XML packed, but formatting broke in the table section.”

That kind of feedback helps the agent choose the next repair step instead of retrying blindly.

Common failure modes to catch early

Watch for these before scaling up a workflow:

output opens but comments are missing
tracked changes are unintentionally accepted or lost
edits affect only visible text, not headers/footers
smart quotes or symbols break XML
repacked files pass zipping but fail in Word

A quick smoke test on a small document is worth it before processing a large batch.

How to get better results on complex docx files

For complex docx for DOCX Workflows, split the task:

extract and inspect
decide edit points
apply one class of change at a time
repack and validate
visually verify

This is slower than a one-shot prompt, but much more reliable for templates, contracts, reports, and revision-heavy files.

What to improve if you extend the docx skill

If you are evaluating how to improve the docx skill itself, the most valuable additions would be:

clearer documented entrypoints for common tasks
example prompts mapped to each workflow lane
a tighter install/prerequisites checklist
more explicit guidance on creating new documents versus editing existing ones
end-to-end examples for comments, redlines, and image replacement

Those improvements would reduce adoption friction more than adding generic prose.

Ratings & Reviews

No ratings yet

Share your review

0/10000

Latest reviews

Saving...

more skill

minimax-docx

by MiniMax-AI

minimax-docx is a DOCX-focused skill for creating, editing, and formatting Word documents with OpenXML SDK and .NET. It supports three paths: create from scratch, edit existing content, and apply template formatting with XSD validation. Use it when you need a real .docx with structure, style preservation, and fewer layout surprises.

DOCX Workflows

Favorites 0GitHub 11.7k

google-workspace-ops

by affaan-m

google-workspace-ops helps you work across Google Drive, Docs, Sheets, and Slides as one connected workflow. Use it to find, inspect, summarize, edit, migrate, or clean up Workspace assets with less guesswork, and follow the google-workspace-ops guide for safer in-place changes.

Office Documents

Favorites 0GitHub 156.2k

frontend-design

by anthropics

frontend-design helps you turn vague UI ideas into distinctive, production-grade interfaces with real frontend code, strong aesthetic direction, and less generic AI styling.

UI Design

Favorites 1GitHub 105.2k

create-colleague

by titanwings

create-colleague turns coworker docs, chats, emails, screenshots, Feishu, and DingTalk data into an editable AI skill with separate work and persona outputs, plus update flows for ongoing refinement.

Skill Authoring

Favorites 1GitHub 747

hyperframes

by heygen-com

hyperframes is a workflow skill for building HTML-based video compositions in HyperFrames. Use it for title cards, overlays, captions, voiceovers, audio-reactive motion, and scene transitions when you need structured, code-first hyperframes for Video Editing. It favors layout, timing, and animation decisions over generic prompt-only video requests.

Video Editing

Favorites 0GitHub 2.7k

kreuzberg

by kreuzberg-dev

The kreuzberg skill helps you install and use Kreuzberg for document extraction across 91+ formats, including PDFs, Office files, images, HTML, email, and archives. It covers Python, Node.js/TypeScript, Rust, and CLI workflows for OCR, tables, metadata, batch processing, and practical parsing guidance.

PDF Processing

Favorites 0GitHub 0

skill-creator

by anthropics

skill-creator is a Skill Authoring meta-skill for drafting new skills, revising existing SKILL.md files, running evals, comparing variants, and improving trigger descriptions with repository scripts and review tools.

Skill Authoring

Favorites 2GitHub 105.1k

azure-identity-py

by microsoft

azure-identity-py helps set up Azure authentication in Python with Microsoft Entra ID. Use it to choose DefaultAzureCredential, managed identity, or service principal auth, configure environment variables, and troubleshoot access control and credential chain issues. Install guidance, usage patterns, and practical setup notes are based on the repo skill file.

Access Control

Favorites 0GitHub 2.2k

claude-api

by anthropics

claude-api is a practical skill for installing and using the Claude API and Anthropic SDKs. It helps developers choose the right SDK or raw HTTP path, detect language-specific docs, and implement streaming, tool use, files, batches, and error handling with less guesswork.

API Development

Favorites 0GitHub 105k

wrangler

by cloudflare

The wrangler skill helps you find correct CLI commands, config shapes, and deployment steps for Cloudflare Workers. Use it for wrangler usage, wrangler install checks, and a practical wrangler guide when building or shipping Workers for Backend Development.

Backend Development

Favorites 0GitHub 1.3k

clickhouse-best-practices

by ClickHouse

clickhouse-best-practices is a ClickHouse best practices skill for Database Engineering. It guides schema design, query tuning, insert strategy, and agent connectivity with rule-based recommendations, making clickhouse-best-practices usage easier to trigger, review, and cite in ClickHouse workflows.

Database Engineering

Favorites 0GitHub 412

clickhouse-architecture-advisor

by ClickHouse

clickhouse-architecture-advisor helps design ClickHouse workloads with workload-aware decisions for ingestion, partitioning, joins, dictionaries, upserts, and pre-aggregation. It is especially useful for Backend Development, observability, SIEM, product analytics, IoT telemetry, and financial pipelines. The skill labels guidance as official, derived, or field.

Backend Development

Favorites 0GitHub 412

figma-generate-library

by figma

figma-generate-library helps you build or update a Figma design system from a codebase with an ordered workflow for tokens, component libraries, documentation, and light/dark theming. Use the figma-generate-library skill when you need a practical guide for Design Systems, not a one-off mockup. It complements figma-use for Plugin API calls.

Design Systems

Favorites 0GitHub 0

winui-app

by openai

The winui-app skill helps you bootstrap, build, and troubleshoot WinUI 3 desktop apps with C# and the Windows App SDK. Use it for environment readiness, new app setup, shell and navigation choices, XAML controls, theming, accessibility, deployment, and launch-fix workflows for Frontend Development.

Frontend Development

Favorites 0GitHub 0

speech

by openai

Use the speech skill to turn text into spoken audio for narration, voiceover, IVR prompts, accessibility reads, and batch speech generation. It uses the OpenAI Audio API with built-in voices, a bundled CLI, and `OPENAI_API_KEY` for live runs. Custom voice creation is out of scope.

Design Implementation

Favorites 0GitHub 0

security-threat-model

by openai

Repository-grounded security-threat-model skill for AppSec threat modeling. It maps trust boundaries, assets, attacker goals, abuse paths, and mitigations into a concise Markdown threat model. Use it when you need security-threat-model for Threat Modeling on a specific repo or path, not a generic architecture review or code check.

Threat Modeling

Favorites 0GitHub 0