diagnose

by mattpocock

diagnose is a structured debugging workflow for hard bugs, flaky tests, and performance regressions. It helps you reproduce the issue, minimise the failing case, form one hypothesis at a time, add instrumentation, fix the root cause, and lock it in with a regression test. Use the diagnose guide when “debug this” is not enough.

Stars: 66k

Added: May 8, 2026

Category: Debugging

Install Command

npx skills add mattpocock/skills --skill diagnose

Curation Score

This skill scores 74/100, which means it is worth listing for users who need a disciplined bug-diagnosis workflow, but it is not yet a highly polished install decision page. The repository gives enough concrete process guidance for agents to use it with less guesswork than a generic prompt, especially around building a deterministic feedback loop and choosing reproduction methods.

Strengths

Explicit trigger and scope in the frontmatter: hard bugs, thrown errors and failures, and performance regressions.
Strong operational guidance: Reproduce → minimise → hypothesise → instrument → fix → regression-test, with concrete ways to build a pass/fail loop.
Includes a runnable human-in-the-loop shell template, which improves agent-triggerability for interactive reproduction workflows.

Cautions

The visible evidence is skewed toward diagnosis methodology; the excerpt does not show the full end-to-end workflow, so install users may need to fill in some execution details.
The experimental/test signal and the absence of an install command in SKILL.md may make adoption feel less turnkey than with more mature skills.

Tags: Scripts, Testing, Playwright, Browser Automation, CLI

Overview of diagnose skill

What diagnose is for

The diagnose skill is a structured debugging workflow for cases where a bug is hard to pin down, a test is flaky, or performance has regressed and you need a reliable way to isolate the cause. It is best for agents and developers who want more than a generic “debug this” prompt: they need a repeatable path from symptom to reproduction, then to hypothesis, instrumentation, fix, and regression test.

Who should install it

Install the diagnose skill if you often work on codebases where failures are intermittent, environment-dependent, or only visible in the UI or production-like flows. It is especially useful in projects where a quick code skim is not enough and you need a disciplined way to create a pass/fail signal before touching the implementation.

What makes it different

The diagnose skill is centered on building a fast feedback loop first. That is the main differentiator: it prioritizes reproducibility and observability over premature code changes. It also encourages using the project’s glossary and ADRs so the agent can align with domain language instead of guessing module intent.
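As a rough illustration of what a fast pass/fail loop can look like, here is a minimal sketch. It assumes the reproduction can be expressed as a command whose exit status signals pass (0) or fail (non-zero). The names run_once and repro_loop are hypothetical and are not taken from the skill's files.

```python
import subprocess
import sys
from typing import List, Sequence


def run_once(cmd: Sequence[str], timeout_s: float = 60.0) -> bool:
    """Return True when the command exits with status 0 (pass), False otherwise."""
    try:
        result = subprocess.run(cmd, capture_output=True, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        return False  # treat a hang as a failure so the signal stays pass/fail
    return result.returncode == 0


def repro_loop(cmd: Sequence[str], runs: int = 10) -> List[bool]:
    """Run the same command repeatedly and record each pass/fail outcome."""
    return [run_once(cmd) for _ in range(runs)]


if __name__ == "__main__":
    # Hypothetical stand-in for a real reproduction command such as a test runner.
    always_fails = [sys.executable, "-c", "raise SystemExit(1)"]
    outcomes = repro_loop(always_fails, runs=3)
    print(f"{outcomes.count(False)}/{len(outcomes)} runs failed")
```

In practice the stand-in command would be replaced with whatever the project already uses to exercise the failing path, for example a single test invocation.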

How to Use diagnose skill

Install diagnose skill

Use the skill install path from the repository, then confirm the skill files are available in your local skills directory. For this repo, the documented install command is:
npx skills add mattpocock/skills --skill diagnose

After install, start with SKILL.md, then inspect the supporting files that shape the workflow. The most relevant repository paths are scripts/hitl-loop.template.sh and any project-specific documentation that explains terms, architecture, or testing boundaries.

Turn a vague bug into a good diagnose prompt

The diagnose skill works best when your input includes a concrete symptom, where it happens, and what success looks like. A weak prompt says “diagnose this.” A stronger prompt says:
“Diagnose why the export button sometimes fails in staging. Reproduce it in the browser, minimize the steps, identify whether the issue is server-side or client-side, and add a regression test if possible.”

For diagnose usage, include:

the observed failure mode
the environment where it happens
any known-good or known-bad examples
whether you can run tests, a dev server, or a browser harness
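One way to make sure none of the items above is forgotten is to collect them in a small structure and render the prompt from it. This is only a sketch: the BugReport class and its field names are hypothetical and are not part of the skill.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class BugReport:
    """Hypothetical container for the details a diagnose prompt should carry."""
    symptom: str                                        # the observed failure mode
    environment: str                                    # where it happens
    examples: List[str] = field(default_factory=list)   # known-good or known-bad cases
    tools: List[str] = field(default_factory=list)      # what the agent may run

    def to_prompt(self) -> str:
        """Render the report as a plain-text prompt, skipping empty sections."""
        lines = [f"Diagnose: {self.symptom}", f"Environment: {self.environment}"]
        if self.examples:
            lines.append("Known examples: " + "; ".join(self.examples))
        if self.tools:
            lines.append("Available tools: " + ", ".join(self.tools))
        return "\n".join(lines)


if __name__ == "__main__":
    report = BugReport(
        symptom="the export button sometimes fails",
        environment="staging",
        tools=["unit tests", "browser harness"],
    )
    print(report.to_prompt())
```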

Tips that materially improve output

If you want better results from diagnose, tell the agent which tools are allowed: unit tests, CLI commands, HTTP requests, browser automation, or replaying captured traces. Also mention whether the bug is deterministic, flaky, or performance-related, because that changes how the loop should be built. The more specific the observable signal, the less time the agent spends guessing.
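If you are unsure whether a bug is deterministic or flaky, repeated runs of the same check can tell you before you write the prompt. The sketch below assumes you already have a list of pass/fail outcomes from rerunning one check. The classify function and its labels are hypothetical, chosen only for illustration.

```python
from typing import Sequence


def classify(outcomes: Sequence[bool]) -> str:
    """Label a bug from repeated outcomes, where True means the check passed."""
    if not outcomes:
        raise ValueError("need at least one run to classify")
    failures = sum(1 for passed in outcomes if not passed)
    if failures == 0:
        return "not reproduced"   # the check never failed in this sample
    if failures == len(outcomes):
        return "deterministic"    # the check failed on every run
    return "flaky"                # the check failed on some runs only


if __name__ == "__main__":
    print(classify([False, False, False]))
    print(classify([True, False, True]))
```

A "not reproduced" result from a small sample does not prove the bug is gone. It usually means the check needs more runs or a different signal.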

diagnose skill FAQ

Is diagnose better than a normal debug prompt?

Usually yes, when the issue is hard to reproduce or spans multiple layers. A normal prompt may jump straight to code changes; diagnose is designed to create evidence first, which is safer for flaky bugs and regressions.

When should I not use diagnose?

Do not use diagnose for straightforward syntax errors, obvious null checks, or tiny one-file fixes where the failure is already fully explained. In those cases, the overhead of the full diagnose workflow may be more than you need.

Is the diagnose skill beginner-friendly?

Yes, if you can describe the symptom clearly and run the suggested checks. It is most helpful when you are unsure where the bug lives, because it gives structure to the investigation instead of requiring deep prior knowledge.

Does diagnose fit every stack?

It fits most stacks that can expose a test, script, browser check, or replayable input. It is less useful when the system has no deterministic way to observe success or failure, since the skill depends on a reliable feedback loop.

How to Improve diagnose skill

Give the skill a stronger starting signal

The biggest improvement comes from better reproduction detail. Instead of “the app is broken,” provide the exact action, data shape, and expected versus actual result. If you have logs, a failing URL, a payload sample, or a minimal fixture, include it up front.
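If a failing input is a list of records or steps, a minimal fixture can often be found mechanically. The sketch below uses a simple greedy reduction that drops one element at a time while the failure still reproduces. This is an illustrative technique, not necessarily how the skill itself minimises a case. The still_fails callback is a hypothetical stand-in for whatever check reproduces your bug.

```python
from typing import Callable, List, TypeVar

T = TypeVar("T")


def minimise(items: List[T], still_fails: Callable[[List[T]], bool]) -> List[T]:
    """Greedily drop elements while the failure still reproduces."""
    if not still_fails(items):
        raise ValueError("the starting input does not reproduce the failure")
    current = list(items)
    changed = True
    while changed:
        changed = False
        for i in range(len(current)):
            candidate = current[:i] + current[i + 1:]
            if still_fails(candidate):
                current = candidate  # keep the smaller failing input
                changed = True
                break                # restart the scan from the front
    return current


if __name__ == "__main__":
    # Hypothetical bug: the failure needs both 3 and 7 to be present.
    def fails(xs: List[int]) -> bool:
        return 3 in xs and 7 in xs

    print(minimise(list(range(10)), fails))
```

The result is minimal only in the sense that removing any single remaining element makes the failure disappear. That is usually enough for a fixture to attach to a report.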

Remove ambiguity before asking for root cause

If there are multiple possible failures, name the one you want diagnosed first. For example, separate “the button does nothing” from “the request returns 500” and from “the page is slow.” Diagnose works best when the initial problem statement maps to one observable failure mode.

Use the first pass to choose the next experiment

After the first output, decide the next step by answering one of three questions: did the reproduction become deterministic, did the hypothesis narrow the search, or do you need a different signal? If the output is still vague, ask for a smaller harness, a different test seam, or a browser/CLI replay path instead of asking for a broad explanation again.
