
agentic-engineering

by affaan-m

Learn the agentic-engineering skill for eval-first execution, task decomposition, model routing, and safer workflow automation with regression checks.

Stars: 156k
Favorites: 0
Comments: 0
Added: Apr 15, 2026
Category: Workflow Automation
Install Command
npx skills add affaan-m/everything-claude-code --skill agentic-engineering
Curation Score

This skill scores 68/100, which means it is worth listing for users who want an agentic engineering workflow, but it is not yet a highly guided install. The repository gives enough substance to help an agent act with less guesswork than a generic prompt, especially around eval-first execution and model routing, but directory users should expect a fairly high-level playbook rather than a tightly operational tool.

Strengths
  • Clear use case and trigger: the description and opening guidance explicitly target engineering workflows where AI agents do most implementation work.
  • Practical operating model: it lays out eval-first execution, decomposition into 15-minute units, and model routing by task complexity.
  • Good decision support for agents: it emphasizes completion criteria, regression checks, session strategy, and review priorities like invariants and security assumptions.
Cautions
  • No install command, scripts, or support files, so adoption depends entirely on reading the markdown guidance.
  • Workflow remains fairly abstract: there are no examples, checklists, or repo-linked references to reduce ambiguity for first-time use.
Overview


agentic-engineering is a workflow skill for teams that want AI to do most of the implementation work without losing control of quality, scope, or cost. The agentic-engineering skill is best for engineers who already know how they want to ship, but need a repeatable system for decomposition, evals, and model selection instead of a generic one-shot prompt.

What users usually want from agentic-engineering is not inspiration; it is a practical operating model for AI-assisted delivery. The core job-to-be-done is to turn a vague engineering task into small verifiable units, choose the right model tier for each unit, and validate results with regression checks before moving on.

Why this skill is different

Unlike prompt-only approaches, agentic-engineering bakes in execution discipline: define completion criteria first, break work into agent-sized pieces, and verify against evals. That makes it a stronger fit for multi-step coding work, refactors, and workflow automation than for casual code drafting.
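The eval-first discipline described above can be sketched in a few lines. This is a minimal, hypothetical illustration (the `CompletionCriteria` class and `slugify` stand-in are invented for this example, not part of the skill itself): completion criteria are declared before implementation, then checked after each change.

```python
# Hypothetical sketch of eval-first discipline: completion criteria
# are declared before any code is written, then checked after each
# change instead of trusting the diff by eye.
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class CompletionCriteria:
    description: str
    checks: list[Callable[[], bool]] = field(default_factory=list)

    def satisfied(self) -> bool:
        # All checks must pass; a partial pass still means "not done".
        return all(check() for check in self.checks)

# Stand-in for the unit of work the agent is about to implement.
def slugify(title: str) -> str:
    return "-".join(title.lower().split())

# Criteria are written first, phrased against observable behavior.
criteria = CompletionCriteria(
    description="slugify lowercases and hyphenates titles",
    checks=[
        lambda: slugify("Hello World") == "hello-world",
        lambda: slugify("Agentic Engineering") == "agentic-engineering",
    ],
)
```

The point is the ordering: the checks exist before the implementation is accepted, so "done" is a boolean, not an opinion.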

Best fit for this skill

Use agentic-engineering if you care about:

  • reducing rework on agent-written code
  • keeping AI tasks small enough to review
  • routing simple tasks to cheaper models and hard tasks to stronger ones
  • catching regressions early instead of after merge
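The routing bullet above can be made concrete with a small sketch. The tier names and thresholds here are placeholders chosen for illustration, not real model identifiers or rules from the skill:

```python
# Hypothetical sketch of model routing by task complexity; the tier
# names and thresholds are placeholders, not real model identifiers.
def route_model(task: str, files_touched: int, needs_reasoning: bool) -> str:
    """Pick a model tier for one decomposed unit of work."""
    if needs_reasoning or files_touched > 3:
        return "strong"      # multi-file refactors, uncertain paths
    if files_touched > 1:
        return "balanced"    # small cross-file edits
    return "cheap"           # mechanical single-file changes

tier = route_model("rename a variable", files_touched=1, needs_reasoning=False)
```

The design choice is that routing happens per decomposed unit, not per overall task, so one hard task can mix cheap and strong calls.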

Where it is a poor fit

It is less useful when the task is tiny, purely stylistic, or already fully constrained by tests and lint. If you just need a short code snippet or a single-line fix, the agentic-engineering guide may be more process than you need.

How to Use agentic-engineering skill

Install and open the source

For agentic-engineering install, add the skill and then read the skill file directly:
npx skills add affaan-m/everything-claude-code --skill agentic-engineering

Start with skills/agentic-engineering/SKILL.md. Because this repo does not include extra rule folders or helper scripts, the main value is in the skill body itself, not in a large support tree.

Turn a rough task into a good prompt

The skill works best when your input already states:

  • the goal
  • the expected done condition
  • the main risk
  • the surfaces that may break

A weak request is: “Improve the auth flow.”

A stronger request is: “Refactor the auth flow so login success, token refresh, and expired-session handling are separately testable. Keep the public API stable, add regression checks for token refresh failure, and optimize for low-risk incremental changes.”

That second version gives agentic-engineering the material it needs for decomposition and eval-first execution.
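If you frame tasks often, the four fields above can be templated. This is a hypothetical helper (the `frame_task` function and its field names are invented here to mirror the checklist, not part of the skill):

```python
# Hypothetical helper that turns the four input fields (goal, done
# condition, main risk, surfaces that may break) into one request.
def frame_task(goal: str, done: str, risk: str, surfaces: list[str]) -> str:
    return (
        f"Goal: {goal}\n"
        f"Done when: {done}\n"
        f"Main risk: {risk}\n"
        f"May break: {', '.join(surfaces)}"
    )

prompt = frame_task(
    goal="Refactor the auth flow into separately testable pieces",
    done="login, token refresh, and expired-session handling each have tests",
    risk="token refresh failure regressing silently",
    surfaces=["public auth API", "session middleware"],
)
```

A template like this keeps the strong-request structure from the example above even when you are in a hurry.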

Follow the workflow in the skill

In practice, the agentic-engineering usage pattern is:

  1. define completion criteria
  2. split the task into 15-minute units
  3. pick model tiers by complexity
  4. run baseline checks before changing code
  5. validate each unit with focused tests or evals
  6. re-check regressions before combining work
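The six steps above can be sketched as one loop. Everything here is a stand-in (the `run_unit` stub and the `workflow` orchestration are invented for illustration; a real run would call an agent and a test suite):

```python
# Minimal, hypothetical sketch of the six-step loop; each stub stands
# in for a real agent call or test run.
def run_unit(unit: str, tier: str) -> bool:
    # Placeholder for "have the routed model implement this unit,
    # then validate it with focused tests or evals" (step 5).
    return True

def workflow(task: str) -> bool:
    criteria = [f"{task}: behavior unchanged"]                        # step 1
    units = [f"{task} / part {i}" for i in range(1, 4)]               # step 2: 15-min units
    tiers = {u: ("strong" if u.endswith("1") else "cheap")
             for u in units}                                          # step 3
    baseline_ok = True   # step 4: run checks before changing code
    results = [run_unit(u, tiers[u]) for u in units]                  # step 5
    # step 6: regression gate before combining work
    return baseline_ok and all(results) and bool(criteria)
```

The structure matters more than the stubs: every unit passes through the same gate, so a regression surfaces at the unit where it was introduced.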

This is especially useful for agentic-engineering in Workflow Automation, where the work often spans multiple files, involves fragile edge cases, and includes changes that look correct until a downstream check fails.

What to read first

Read in this order:

  • SKILL.md for the operating model
  • the sections on Operating Principles and Eval-First Loop
  • Task Decomposition for the 15-minute unit rule
  • Model Routing and Review Focus for AI-Generated Code
  • Cost Discipline if you are managing token or model spend

agentic-engineering skill FAQ

Is agentic-engineering only for large projects?

No. It is most valuable on work that has hidden coupling, but it can also help on medium tasks if the risk of regressions is high. If the change can be verified in one quick edit, the overhead may not be worth it.

How is this different from a normal prompt?

A normal prompt asks the model to produce code. The agentic-engineering skill asks the model to work in a controlled loop: define success, decompose, route the right model, and verify with evals. That usually produces better outcomes when the implementation path is uncertain.

Is agentic-engineering beginner friendly?

Yes, if the user can describe a task and recognize a good done condition. It is not a beginner tutorial for coding itself; it is a process skill for making AI coding safer and more predictable.

When should I not use it?

Skip it when your task is trivial, when speed matters more than rigor, or when there is no meaningful way to measure success. It is also a weaker choice if you want pure exploration rather than controlled engineering output.

How to Improve agentic-engineering skill

Give it sharper inputs

The biggest quality gain comes from better task framing. Include acceptance criteria, constraints, and known failure modes up front. For example, mention whether backward compatibility matters, whether tests already exist, and which edge cases are most likely to break.

Use evals that match the real risk

The skill is strongest when your checks reflect actual user impact, not just syntax. If the risk is auth, test refresh and failure paths. If the risk is automation, test retries, idempotency, and state transitions. That is the heart of agentic-engineering improvement.
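As a concrete instance of risk-matched checks, here is a hypothetical idempotency eval for an automation step (the `apply_payment` handler and event ids are invented for this sketch): the check replays the same event twice and verifies state does not double-apply.

```python
# Hypothetical eval matched to real risk: for workflow automation,
# idempotency matters more than syntax, so the check replays the
# same event and asserts it is applied exactly once.
processed: set[str] = set()
balance = 0

def apply_payment(event_id: str, amount: int) -> int:
    """Apply a payment exactly once per event id (idempotent)."""
    global balance
    if event_id not in processed:
        processed.add(event_id)
        balance += amount
    return balance

apply_payment("evt-1", 50)
apply_payment("evt-1", 50)   # replayed event: must not double-apply
```

A syntax-level check would pass a non-idempotent handler; this one fails it, which is exactly the gap the skill's eval guidance is pointing at.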

Iterate after the first pass

Do not treat the first output as final. Ask for a narrower decomposition, a different model routing plan, or a stricter regression gate if the result feels too broad. Good agentic-engineering workflow usually comes from tightening the loop, not from expanding the prompt.

Ratings & Reviews

No ratings yet