benchmark

by garrytan

The benchmark skill helps detect performance regressions in web and app workflows. Use it to establish a baseline, compare before and after changes, and track whether a PR made pages slower, heavier, or less stable. It is aimed at performance optimization work involving Core Web Vitals, Lighthouse checks, bundle size, and load time trends.

Stars: 91.8k

Favorites: 0

Comments: 0

Added: May 9, 2026

Category: Performance Optimization

Install Command

npx skills add garrytan/gstack --skill benchmark

Curation Score

This skill scores 67/100, which means it is listable for directory users but with clear caveats: it appears genuinely workflow-oriented for performance regression benchmarking, yet the install decision is weakened by missing supporting assets and some placeholder markers. Users who need automated page-speed regression checks should consider it; users who want a very polished, self-contained install experience may want more documentation first.

Strengths

Specific, actionable purpose: performance regression detection for page load times, Core Web Vitals, and resource sizes.
Good triggerability: explicit use cases and voice aliases such as "speed test" and "check performance" reduce guesswork.
Substantial workflow content in SKILL.md with many headings and code-fenced steps, suggesting real operational guidance rather than a stub.

Cautions

No install command and no supporting scripts/references/resources, so adoption may require more manual setup and inspection.
Placeholder markers are present, which lowers trust that every branch of the workflow is fully finalized.

Tags: Lighthouse, Performance, Web Vitals, Bundle Size, Browser Automation, Frontend

Overview of benchmark skill

What benchmark skill does

The benchmark skill is for performance regression detection in web and app workflows. It helps you establish a baseline, compare before/after changes, and track whether a PR made pages slower, heavier, or less stable. In practice, the benchmark skill is most useful for teams trying to answer one question: did this change improve or harm performance?

Who should use it

Use this benchmark skill if you care about page speed, Core Web Vitals, Lighthouse-style checks, bundle size, or load time trends over time. It is a strong fit for reviewers, frontend engineers, and AI agents that need a repeatable way to evaluate performance changes instead of guessing from a screenshot or a quick manual test.

Why it is different

The benchmark skill is not just a generic “run a test” prompt. It is oriented around before/after comparison, regression detection, and ongoing trend awareness, with workflow guidance tuned for browser-based performance measurement. That makes it more useful for Performance Optimization than a one-off prompt that only asks for “speed issues.”

How to Use benchmark skill

benchmark install and setup

Install the benchmark skill in your Claude skills environment with the repository’s skill command, then open the skill file before using it in a real task. The expected install command is:
npx skills add garrytan/gstack --skill benchmark

After install, confirm the skill is available in the current workspace and that your task is specific enough to measure. The skill works best when the repo under test, the page or route, and the change being evaluated are all known up front.

What to read first

Start with SKILL.md, then inspect SKILL.md.tmpl if you want to understand the generated structure. Because this repository does not expose extra rules/, resources/, or helper scripts for the skill, the main source of truth is the skill file itself. For decision-making, the most important sections are the preamble, plan-mode guidance, and any routing or constraint notes that affect when the benchmark skill should run.

How to write a good prompt

A weak prompt says “check performance.” A stronger prompt names the target, the baseline, and the decision you need:

“Compare /pricing before and after the image compression change and report any regressions in LCP, CLS, and total transfer size.”
“Benchmark the checkout page on mobile emulation and tell me whether the new bundle split improved load time.”
“Run a performance benchmark for the homepage and summarize whether the PR is safe to merge.”

Include the page, device assumptions, and what counts as a failure. That reduces ambiguity and makes the result actionable.
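One way to keep those details from being forgotten is to fill in a small structure before writing the prompt. The sketch below is only an illustration: the `BenchmarkRequest` class, its field names, and the prompt wording are hypothetical and are not part of the skill.

```python
from dataclasses import dataclass


@dataclass
class BenchmarkRequest:
    """Hypothetical container for the details a benchmark prompt should name."""
    route: str            # page or route under test, e.g. "/pricing"
    device: str           # device assumption, e.g. "mobile emulation"
    change: str           # what changed between baseline and candidate
    metrics: list[str]    # metrics to report, e.g. ["LCP", "CLS"]
    failure_rule: str     # what counts as a failure

    def to_prompt(self) -> str:
        """Render the request as one specific, measurable prompt."""
        metric_list = ", ".join(self.metrics)
        return (
            f"Compare {self.route} on {self.device} before and after "
            f"{self.change}. Report {metric_list}. "
            f"Treat it as a failure if {self.failure_rule}."
        )


request = BenchmarkRequest(
    route="/pricing",
    device="mobile emulation",
    change="the image compression change",
    metrics=["LCP", "CLS", "total transfer size"],
    failure_rule="any metric regresses",
)
print(request.to_prompt())
```

If any field is hard to fill in, that is usually a sign the task is not yet specific enough to measure.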

Workflow that produces useful results

Treat the benchmark skill as a repeatable loop: identify the page, establish the baseline, run the comparison, and then interpret the delta against the change you made. If you are working in plan mode, confirm whether the skill should only inspect or should also execute measurements. For best output, keep the test scope narrow; one important route usually beats a whole-site sweep.
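The "interpret the delta" step of that loop can be sketched as a percent-change calculation over two metric snapshots. This is a minimal illustration, not the skill's own implementation: the function names, metric keys, and numbers are all assumptions.

```python
def percent_change(baseline: float, current: float) -> float:
    """Percent change from baseline to current; positive means the value grew."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero to compute a percent change")
    return (current - baseline) / baseline * 100.0


def compare_runs(baseline: dict[str, float], current: dict[str, float]) -> dict[str, float]:
    """Return the percent change for every metric present in both snapshots."""
    shared = baseline.keys() & current.keys()
    return {name: percent_change(baseline[name], current[name]) for name in sorted(shared)}


# Illustrative numbers only, not measurements from a real page.
baseline = {"lcp_ms": 2000.0, "transfer_kb": 800.0}
current = {"lcp_ms": 2200.0, "transfer_kb": 600.0}

deltas = compare_runs(baseline, current)
for name, delta in deltas.items():
    print(f"{name}: {delta:+.1f}%")
# lcp_ms: +10.0%
# transfer_kb: -25.0%
```

A metric whose baseline can legitimately be zero, such as CLS on a stable page, needs an absolute difference rather than a percentage, which is why the sketch raises an error instead of dividing.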

benchmark skill FAQ

Is benchmark skill only for web performance?

It is primarily for browser-visible performance optimization, especially pages, routes, and frontend changes. If your task is backend latency, infra profiling, or database tuning, the benchmark skill may not be the best first choice unless the user-facing page metric is the goal.

Do I need a full prompt, or is the skill enough?

The skill helps structure the work, but it still needs a concrete target. A generic prompt can trigger the benchmark skill, but results are more useful when you provide a route, a change, and a comparison point. The more specific your request, the less the agent has to infer.

Is benchmark good for beginners?

Yes, if you want a guided way to check whether a change made performance worse. It is easier to use than building your own evaluation checklist from scratch, but you still need to know what page or feature you want measured.

When should I not use it?

Do not use the benchmark skill when you only need a qualitative UI review, when the page is too unstable to measure meaningfully, or when your main problem is not performance. If you cannot define a stable before/after comparison, the benchmark result will be noisy.

How to Improve benchmark skill

Give the skill a measurable target

The biggest quality boost comes from specifying exactly what to benchmark and what success looks like. Say which URL, device class, and metric matter most. For Performance Optimization, that often means naming one primary metric, such as LCP or bundle size, instead of asking for “all performance issues.”

Include the change being tested

Benchmarking is strongest when the skill knows what changed: a new image pipeline, a code-splitting refactor, a font swap, or a third-party script removal. That context helps separate normal variance from a real regression and makes the output easier to trust.
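To make "normal variance versus a real regression" concrete, the sketch below compares the shift in medians against the run-to-run spread. The decision rule is an assumption chosen for illustration, not something the skill prescribes, and the function name and sample values are hypothetical.

```python
from statistics import median


def is_real_regression(baseline_runs: list[float], current_runs: list[float]) -> bool:
    """Treat a slowdown as real only if it exceeds the run-to-run spread.

    This rule is an assumption for illustration: the median shift must be
    larger than the widest min-to-max range seen in either set of runs.
    """
    shift = median(current_runs) - median(baseline_runs)
    spread = max(
        max(baseline_runs) - min(baseline_runs),
        max(current_runs) - min(current_runs),
    )
    return shift > spread


# Illustrative LCP samples in milliseconds, not real measurements.
before = [2000.0, 2050.0, 1980.0]
after_noisy = [2030.0, 2060.0, 1990.0]   # shift is within normal variance
after_slower = [2400.0, 2450.0, 2380.0]  # shift is well outside it

print(is_real_regression(before, after_noisy))   # False
print(is_real_regression(before, after_slower))  # True
```

Knowing what changed still matters here: a shift that clears the noise floor is easier to trust when it lines up with a change that could plausibly cause it.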

Ask for the comparison you will act on

If you need a merge decision, say so. If you need remediation ideas, say that too. Useful follow-up prompts include:

“Compare against the last stable build and flag anything above a 5% regression.”
“Benchmark this branch, then tell me the highest-impact fix if results are worse.”
“Rerun the check on mobile and desktop, but prioritize the route with the worst LCP.”
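The first and third prompts above imply two simple rules: flag any metric that regresses by more than a threshold, and look at the route with the worst LCP first. A minimal sketch of both follows. The function names, metric keys, and numbers are hypothetical, and it assumes every metric is lower-is-better.

```python
def flag_regressions(
    baseline: dict[str, float],
    current: dict[str, float],
    threshold_pct: float = 5.0,
) -> list[str]:
    """Names of metrics that got worse by more than threshold_pct.

    Assumes every metric is lower-is-better and every baseline is non-zero.
    """
    flagged = []
    for name in sorted(baseline.keys() & current.keys()):
        change_pct = (current[name] - baseline[name]) / baseline[name] * 100.0
        if change_pct > threshold_pct:
            flagged.append(name)
    return flagged


def worst_lcp_route(lcp_by_route: dict[str, float]) -> str:
    """Route with the highest (worst) LCP, to prioritize first."""
    return max(lcp_by_route, key=lcp_by_route.get)


# Illustrative numbers only, not measurements from a real build.
last_stable = {"lcp_ms": 2000.0, "transfer_kb": 800.0}
branch = {"lcp_ms": 2080.0, "transfer_kb": 900.0}

print(flag_regressions(last_stable, branch))  # ['transfer_kb']
print(worst_lcp_route({"/": 1800.0, "/checkout": 3100.0, "/pricing": 2400.0}))  # /checkout
```

In this example LCP grew by 4%, which stays under the 5% threshold, while transfer size grew by 12.5% and is flagged.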

Iterate on the first run

If the first result is noisy, improve the input before rerunning: narrow the route, remove unrelated changes, or define the test conditions more tightly. The benchmark skill works best as a repeatable check for decision support, not as a single-pass diagnostic for every kind of speed problem.

