web3-testing
by wshobson

The web3-testing skill helps you design and scaffold smart contract test workflows with Hardhat and Foundry, including unit tests, integration coverage, mainnet forking, fuzzing, gas checks, and setup guidance for Solidity and DeFi teams.
This skill scores 68/100, which means it is acceptable to list for directory users who want reusable guidance for smart contract testing, but they should expect a documentation-only skill with some setup guesswork. The repository shows real workflow content around Hardhat/Foundry testing patterns, mainnet forking, gas reporting, coverage, and verification, giving agents more structure than a generic prompt. However, the lack of support files, install steps, and linked references limits confidence and ease of execution.
- Strong triggerability: the description and use cases clearly signal when to invoke it for Solidity testing, fuzzing, gas checks, and mainnet fork scenarios.
- Substantial workflow content: the long SKILL.md includes concrete configuration and code examples for Hardhat-based testing and related tooling.
- Practical agent leverage: it bundles multiple common Web3 testing tasks into one reusable guide instead of requiring ad hoc prompting.
- Documentation-only skill: there are no scripts, references, or companion resources to reduce implementation guesswork.
- Setup clarity is incomplete: SKILL.md includes config examples but no explicit install command or quick-start path for dependencies and execution.
Overview of web3-testing skill
What web3-testing does
The web3-testing skill helps an agent design and scaffold smart contract test workflows using Hardhat and Foundry. It is aimed at teams that need more than a generic “write some Solidity tests” prompt: unit tests, integration coverage, mainnet forking, fuzzing, gas checks, and verification-related setup all appear in scope.
Who should use web3-testing
This web3-testing skill is best for:
- Solidity developers starting or upgrading a test suite
- QA and test automation engineers moving into Web3
- DeFi teams that need realistic fork-based validation
- Auditors or protocol engineers who want structured test ideas fast
It is less useful if you only need a single trivial unit test or if your stack does not use Hardhat or Foundry.
Real job-to-be-done
Most users are not looking for theory. They want to get from “I have contracts and risk areas” to “I have a credible, runnable testing plan and starter tests.” The value of web3-testing is that it pushes the conversation toward concrete test setup and advanced patterns that ordinary prompts often miss, especially forked state, fuzzing, gas reporting, and multi-layer test strategy.
What differentiates this skill
Compared with a generic coding prompt, web3-testing gives stronger guidance for:
- choosing between Hardhat and Foundry workflows
- setting up realistic network and environment configuration
- covering edge cases common in smart contracts
- testing protocol behavior against forked chain state
- adding quality signals such as coverage and gas reporting
What to know before installing
The repository signal is narrow but practical: the skill is primarily a single SKILL.md playbook rather than a larger toolkit with scripts or references. That means adoption is easy, but you should expect guidance and examples, not automation. If you want a prescriptive testing framework with ready-made helpers, this is more of a thinking-and-scaffolding aid than a drop-in package.
How to Use web3-testing skill
Install context for web3-testing
Install the skill from the parent repository:
npx skills add https://github.com/wshobson/agents --skill web3-testing
Because the repo path is plugins/blockchain-web3/skills/web3-testing, you are installing a focused skill document, not a standalone npm test library.
Read this file first
Start with:
SKILL.md
That is the real source of truth here. There are no meaningful support folders in the skill directory, so users should not expect hidden helpers elsewhere.
What input the skill needs from you
The web3-testing skill works best when you provide:
- the contract purpose
- key functions and access controls
- invariants or safety properties
- the toolchain you prefer: Hardhat, Foundry, or both
- external dependencies such as oracles, pools, tokens, or proxy contracts
- whether you need unit tests, integration tests, fork tests, fuzzing, or gas checks
Weak input: “Write tests for my contract.”
Strong input: “Using Foundry, create unit and fuzz tests for an ERC20 staking contract with reward accrual, admin-only parameter updates, pause behavior, and emergency withdrawal. Include revert-path coverage and invariants around total staked balances.”
Turn a rough goal into a usable prompt
A good web3-testing usage prompt usually has four parts:
- stack
- contract surface
- risk areas
- desired output format
Example:
“Use the web3-testing skill to propose a test plan and starter files for a Hardhat project. Contract set: Vault.sol, Strategy.sol, OracleAdapter.sol. Focus on deposit/withdraw accounting, role restrictions, stale oracle handling, slippage boundaries, and upgrade safety. Include unit tests, one mainnet fork scenario, and gas reporter setup.”
That is much better than asking for “comprehensive tests,” because it tells the agent what “comprehensive” means in your case.
Choose Hardhat vs Foundry deliberately
The source material covers both frameworks, so your prompt should state which one to optimize for.
Use Hardhat when you want:
- JavaScript or TypeScript test flows
- plugin-heavy workflows
- coverage and gas reporter setup in a familiar Node environment
- easier integration with broader app tooling
Use Foundry when you want:
- faster Solidity-native tests
- fuzzing and invariant-style workflows
- a tighter smart-contract-focused developer loop
If your team runs both, say so explicitly and ask the skill to split responsibilities instead of blending them loosely.
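To make the Foundry side of that choice concrete, a starter test of the kind the skill can scaffold might look like the sketch below. The Counter contract and its functions are hypothetical placeholders; only the forge-std imports and cheatcodes are real.

```solidity
// SPDX-License-Identifier: MIT
pragma solidity ^0.8.20;

// Hypothetical example: a minimal Foundry unit test showing the
// Solidity-native style. Counter is a placeholder contract.
import {Test} from "forge-std/Test.sol";
import {Counter} from "../src/Counter.sol";

contract CounterTest is Test {
    Counter counter;

    function setUp() public {
        counter = new Counter();
    }

    function test_IncrementUpdatesCount() public {
        counter.increment();
        assertEq(counter.count(), 1);
    }

    function test_RevertWhen_NotOwner() public {
        vm.prank(address(0xBEEF)); // impersonate a non-owner caller
        vm.expectRevert();
        counter.reset();
    }
}
```

Running `forge test` picks up every `test_`-prefixed function; the `test_RevertWhen_` case is the revert-path pattern the strong-input example above asks for.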
Best web3-testing workflow for test automation
For test automation with web3-testing, the strongest workflow is:
- ask for a test matrix first
- review missing failure cases
- ask for setup/config files
- generate starter tests
- refine with real contract code and ABI details
- add fork and fuzz layers last
This sequence prevents the common failure mode where the agent generates runnable-looking tests that do not actually reflect your protocol risks.
What the skill can produce well
In practice, web3-testing is most useful for generating:
- initial hardhat.config.js testing setup
- test category breakdowns
- starter unit tests for standard behaviors
- fork-test ideas for DeFi integrations
- fuzzing targets and edge-case inventories
- gas reporting and coverage suggestions
It is strongest when used as a structured testing guide plus code scaffold generator.
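For the Hardhat side, the initial config it generates tends to resemble the sketch below. This assumes @nomicfoundation/hardhat-toolbox is installed (it bundles the gas reporter and coverage tooling); adjust the Solidity version and optimizer settings to your repository.

```javascript
// Sketch of a hardhat.config.js of the kind the skill proposes.
// Assumes @nomicfoundation/hardhat-toolbox is installed.
require("@nomicfoundation/hardhat-toolbox");

module.exports = {
  solidity: {
    version: "0.8.20",
    settings: { optimizer: { enabled: true, runs: 200 } },
  },
  gasReporter: {
    // opt in via REPORT_GAS=true so default CI runs stay fast
    enabled: process.env.REPORT_GAS === "true",
    currency: "USD",
  },
  mocha: { timeout: 120000 }, // fork tests can be slow
};
```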
What usually blocks good results
The biggest blockers are not installation issues. They are missing protocol context:
- no contract code or function list
- no statement of critical invariants
- no explanation of external integrations
- asking for “full coverage” without priorities
- mixing framework assumptions in one vague request
If you omit these, the output often becomes generic ERC20-style testing advice rather than protocol-specific test automation.
Practical prompt pattern that improves output quality
Use this structure whenever possible:
- Repository context: framework, Solidity version, proxy pattern, package manager
- Contracts in scope: filenames and responsibilities
- Critical behaviors: deposits, liquidations, claims, rebase logic, governance
- Failure conditions: unauthorized access, rounding, reentrancy assumptions, stale data
- Desired artifacts: config, test plan, test file skeletons, mock strategy, fork scenario
- Constraints: keep tests deterministic, avoid external API reliance, target CI runtime under X minutes
This format gives the web3-testing guide enough precision to produce something your team can adapt quickly.
Use fork tests only where realism matters
The skill surfaces mainnet forking as a differentiator, but not every project needs it. Use fork tests when:
- behavior depends on real protocol state
- integrations with DEXes, lending markets, or price feeds matter
- mocks would hide dangerous edge cases
Skip or limit fork tests when:
- CI speed is more important than realism
- your contract is mostly isolated business logic
- reproducibility matters more than ecosystem simulation
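When a fork test is justified, pinning the block keeps it reproducible. A minimal Foundry sketch follows; the "mainnet" RPC alias and block number are assumptions you would set in foundry.toml, while the USDC address is the real Ethereum mainnet token.

```solidity
// SPDX-License-Identifier: MIT
pragma solidity ^0.8.20;

// Sketch of a pinned-block mainnet fork test in Foundry.
import {Test} from "forge-std/Test.sol";

interface IERC20 {
    function totalSupply() external view returns (uint256);
}

contract ForkTest is Test {
    // USDC on Ethereum mainnet
    IERC20 constant USDC = IERC20(0xA0b86991c6218b36c1d19D4a2e9Eb0cE3606eB48);

    function setUp() public {
        // Pin the block so the test is deterministic across runs
        vm.createSelectFork("mainnet", 19_000_000);
    }

    function test_UsdcSupplyExistsAtPinnedBlock() public {
        // Reads real chain state at the pinned block
        assertGt(USDC.totalSupply(), 0);
    }
}
```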
Validate generated output before adopting it
Before merging anything produced with the web3-testing skill, check:
- are revert reasons and access-control assumptions correct?
- do token decimals and rounding assumptions match reality?
- do fork block numbers align with the scenario?
- are gas and coverage plugins compatible with your stack?
- do tests prove invariants, or only happy paths?
This skill can save time, but protocol-specific correctness still depends on your review.
web3-testing skill FAQ
Is web3-testing good for beginners
Yes, if you already understand basic Solidity concepts. The skill can accelerate setup and show what a mature testing stack looks like. Absolute beginners may still need separate help on Solidity syntax, deployment flow, and framework basics.
Is web3-testing only for Hardhat
No. The skill explicitly covers both Hardhat and Foundry. Its fit is strongest when you tell the agent which ecosystem to prioritize rather than leaving it ambiguous.
How is web3-testing different from a normal AI prompt
A normal prompt often returns surface-level unit tests. web3-testing is better oriented toward full smart contract test strategy: fork-based realism, fuzzing, gas checks, coverage, and environment setup. That makes it more useful for real protocol validation, not just demo tests.
Can web3-testing help with DeFi protocols
Yes. This is one of the better-fit use cases, especially if you need integration tests against realistic state. Provide protocol dependencies, expected invariants, and the exact user flows you care about.
When should I not use web3-testing
Do not reach for web3-testing if:
- you only need a one-off assertion
- your project is not Solidity or EVM-focused
- you want a packaged framework with helpers and fixtures included
- you lack enough contract context to specify meaningful test goals
Does web3-testing include executable tooling
Not really. The repository evidence shows a document-first skill with examples, not bundled scripts or reusable assets. Treat it as guidance and generation support, not an installable testing framework.
How to Improve web3-testing skill
Give the skill protocol risks, not just filenames
The fastest way to improve web3-testing usage is to state the failure modes you actually fear:
- accounting drift
- price manipulation
- permission bypass
- bad upgrade initialization
- insolvency after extreme inputs
That changes the output from generic scaffolding to risk-driven test design.
Ask for a test matrix before code
A high-leverage pattern is:
- “List test categories and invariants.”
- “Now generate the highest-priority test skeletons.”
- “Now fill in mocks and edge cases.”
This reduces wasted code and surfaces misunderstandings early.
Provide real contract interfaces
If you paste function signatures, events, custom errors, and storage constraints, the web3-testing skill can generate much stronger tests. Without these, it may invent setup details or rely on broad assumptions.
Separate happy paths from adversarial paths
Ask the skill to organize output into:
- happy-path functionality
- authorization checks
- boundary and rounding cases
- integration failures
- fork-specific scenarios
- fuzz or invariant candidates
This structure makes review easier and improves CI planning.
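The fuzz or invariant bucket above can be sketched in Foundry like this. The Staking contract and its function names are hypothetical; the cheatcodes (`vm.assume`, `vm.deal`) are real forge-std APIs.

```solidity
// SPDX-License-Identifier: MIT
pragma solidity ^0.8.20;

// Sketch of a fuzz test for a hypothetical ETH staking contract.
import {Test} from "forge-std/Test.sol";
import {Staking} from "../src/Staking.sol";

contract StakingFuzzTest is Test {
    Staking staking;

    function setUp() public {
        staking = new Staking();
    }

    // Foundry calls this with many random values for `amount`
    function testFuzz_StakeRecordsExactDeposit(uint96 amount) public {
        vm.assume(amount > 0);
        vm.deal(address(this), amount);       // fund the test contract
        staking.stake{value: amount}();
        // Invariant candidate: recorded stake matches the deposit
        assertEq(staking.stakedBalance(address(this)), amount);
    }
}
```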
Improve mainnet fork prompts with exact state assumptions
For better fork test output, include:
- network name
- RPC environment variable name
- target block number
- contracts to impersonate or interact with
- balances or approvals needed
- expected post-transaction state
Without these, fork suggestions stay conceptual and require more manual cleanup.
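Those state assumptions translate directly into Hardhat's forking settings. A minimal sketch, where the environment variable name and block number are placeholders for your own values:

```javascript
// Sketch of the Hardhat network settings a fork prompt should pin down.
// Export this object from hardhat.config.js (module.exports = config).
const config = {
  networks: {
    hardhat: {
      forking: {
        url: process.env.MAINNET_RPC_URL, // name the RPC variable explicitly
        blockNumber: 19000000,            // pin the block for reproducibility
      },
    },
  },
};
```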
Common failure modes to watch for
The main ways web3-testing output can go wrong:
- unrealistic mocks replacing critical integration behavior
- test coverage that looks broad but misses value-at-risk paths
- framework config that conflicts with your existing repo
- using fork tests where unit tests would be faster and clearer
- overemphasis on setup while under-specifying invariants
Review generated work for risk coverage, not just syntactic correctness.
Iterate on the first draft instead of starting over
When the initial result is close but incomplete, give corrective feedback like:
- “Add revert-path tests for every admin function.”
- “Convert these integration cases into Foundry fuzz tests.”
- “Replace mocks with a fork-based scenario for the oracle dependency.”
- “Prioritize accounting invariants over boilerplate deployment tests.”
This usually yields better results than discarding the first output and prompting from scratch.
Improve web3-testing with repository-specific context
The skill becomes much more useful when you mention:
- current repo layout
- existing fixtures or helper libraries
- CI time limits
- whether you already use forge-std, hardhat-toolbox, or custom deployment scripts
- naming conventions for tests and fixtures
That lets the agent adapt output to your repository instead of generating isolated examples.
What high-quality output from web3-testing looks like
Good output from web3-testing should give you:
- a clear test plan tied to protocol risk
- framework-specific setup that matches your stack
- test skeletons that map to real functions and invariants
- selective use of fork and fuzz testing where they add value
- obvious next steps for turning generated code into a maintainable suite
If the output does not improve decision quality or save implementation time, tighten the input rather than asking for “more comprehensive” results.
