K

defuddle

by kepano

defuddle extracts clean markdown from web pages with the Defuddle CLI, removing clutter for research, docs, and articles. Use it for standard HTML pages, install with npm, and skip URLs ending in .md.

Stars19.7k
Favorites0
Comments0
AddedApr 5, 2026
CategoryWeb Research
Install Command
npx skills add kepano/obsidian-skills --skill defuddle
Curation Score

This skill scores 76/100, which means it is a solid directory listing candidate: agents get a clear trigger, a simple command pattern, and a concrete reason to use it instead of a generic web fetch for normal web pages. Directory users can make a credible install decision, though they should expect a lightweight wrapper around an external CLI rather than a deeply guided workflow.

76/100
Strengths
  • Strong triggerability: it explicitly says to use Defuddle when a user provides a standard web URL to read or analyze, and not for URLs ending in .md.
  • Operationally clear: the skill gives install guidance plus concrete commands for markdown extraction, file output, and metadata retrieval.
  • Good agent leverage: it explains the practical benefit of removing navigation, ads, and clutter to reduce token usage versus raw page fetching.
Cautions
  • Limited edge-case guidance: beyond excluding .md URLs, it does not explain handling failures, unsupported pages, auth walls, or dynamic sites.
  • Minimal supporting material: there are no scripts, references, or examples showing expected outputs, so adoption relies on the short SKILL.md alone.
Overview

Overview of defuddle skill

What the defuddle skill does

The defuddle skill turns a normal web page into clean, readable markdown with much less clutter than a raw fetch. It is built for pages like articles, docs, guides, blog posts, and other HTML pages where menus, ads, sidebars, and navigation waste tokens and distract analysis.

Best fit for Web Research

Use defuddle for Web Research when your real goal is to read, summarize, compare, quote, or analyze page content rather than inspect site chrome or raw HTML. The main value is cleaner input for downstream reasoning. If a user gives you a standard page URL and wants the content, defuddle usage is usually a better starting point than a generic web fetch.

Key limits and when not to use it

The biggest boundary is simple: do not use defuddle on URLs ending in .md. Those pages are already markdown, so a direct fetch is cleaner and avoids unnecessary transformation. It is also a weak fit when you need exact page structure, interactive elements, scripts, or full DOM fidelity.

Why users choose defuddle

The practical differentiator is not “can it fetch a page,” but “can it give me the main text in a token-efficient format quickly.” That makes the defuddle skill attractive for research pipelines, note capture, article summarization, and documentation reading where cleaner markdown materially improves output quality.

How to Use defuddle skill

defuddle install and basic command

For defuddle install, the repository points to the Defuddle CLI itself:

npm install -g defuddle

Core command:

defuddle parse <url> --md

Use --md consistently. That is the recommended output for most research and analysis workflows because it removes visual noise while preserving readable structure.

Inputs the defuddle skill needs

The defuddle skill needs a page URL and, ideally, a clear intent. Good input looks like:

  • the exact URL
  • what you need from it
  • whether you want full markdown, saved output, or only metadata

Examples:

  • “Read this article and summarize the main argument: <url>
  • “Extract clean markdown from this docs page and save it to content.md: <url>
  • “Get only the page title and description for <url>

Useful commands:

defuddle parse <url> --md -o content.md
defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain

Turn a rough goal into a strong defuddle prompt

Weak request: “Look at this URL.”

Better defuddle guide prompt:

  • “Use defuddle on <url> with markdown output. Ignore site navigation. Then summarize the key points in 5 bullets and quote the most important section.”
  • “Use defuddle for this documentation page: <url>. Extract markdown, identify setup steps, prerequisites, and caveats, then rewrite them as a checklist.”
  • “Pull only metadata from <url> first. If the title and description match the topic, then extract full markdown.”

This works better because it tells the agent both how to call defuddle and what to do with the cleaned content afterward.

Suggested workflow and what to read first

This skill is intentionally small. Read skills/defuddle/SKILL.md first because it contains the whole usable workflow: install, parsing command, output formats, and metadata properties. In practice, use this sequence:

  1. Check whether the URL is a normal HTML page or a .md file.
  2. Run defuddle parse <url> --md.
  3. If you need a reusable artifact, add -o content.md.
  4. If you only need routing or validation, query metadata with -p.
  5. Pass the cleaned markdown into summarization, extraction, or note-taking.

defuddle skill FAQ

Is defuddle better than an ordinary prompt plus fetch?

Usually yes for article-style pages. A normal fetch often includes headers, footers, cookie notices, and navigation. defuddle usage improves signal-to-noise before analysis starts, which can lower token cost and reduce summarization errors caused by irrelevant page elements.

When should I not use the defuddle skill?

Skip defuddle for .md URLs, raw files, or cases where you need exact HTML, embedded media behavior, page scripts, or layout details. It is a content-extraction tool, not a browser automation or DOM inspection tool.

Is the defuddle skill beginner-friendly?

Yes. The command surface is very small: install once, then use defuddle parse <url> --md. That makes the defuddle skill easy to adopt even if you only want cleaner source text for research or note capture.

What outputs can defuddle return?

You can get markdown with --md, JSON with --json, HTML by default, or specific metadata using -p <name>. For most reading and research tasks, markdown is the best default; metadata mode is useful for quick validation and routing.

How to Improve defuddle skill

Give defuddle a precise page target

The easiest way to improve defuddle results is to supply the canonical content page, not a homepage, search page, or listing page. Article URLs and single-doc pages produce cleaner markdown than hubs full of navigation and repeated links.

Ask for the downstream task in the same request

The defuddle skill is stronger when extraction is paired with a concrete next step. Instead of only saying “parse this,” ask for:

  • summary
  • key claims
  • setup steps
  • FAQs
  • quotes
  • comparison points

That reduces handoff ambiguity and helps the agent structure output around your real job-to-be-done.

Use metadata mode before full extraction when uncertain

If the URL may redirect, be low quality, or be the wrong page, start with:

defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain

This is a simple but effective defuddle guide tactic: validate relevance first, then spend effort on full markdown extraction.

Common failure modes and how to iterate

If output feels thin or oddly structured, the issue is often the source page, not the CLI. Try a more specific URL, switch from a category page to an article page, or save markdown to inspect it manually. If your first result is too broad, rerun defuddle for Web Research with a narrower instruction like “extract setup steps only” or “quote sections about authentication only.”

Ratings & Reviews

No ratings yet
Share your review
Sign in to leave a rating and comment for this skill.
G
0/10000
Latest reviews
Saving...