skill-comply
by affaan-m

skill-comply is a compliance-testing skill that checks whether an agent follows a skill, rule, or agent definition in real runs. It generates specs from markdown, runs three prompt strictness levels, classifies tool-call timelines, and reports compliance rates with evidence. Useful for compliance review of any markdown-defined behavior.
This skill scores 78/100, which means it is a solid listing candidate for directory users who want an agent to verify whether skills, rules, and agent definitions are actually being followed. The repository provides a concrete workflow, explicit activation cues, and supporting scripts/tests, so users can judge install value with reasonable confidence, though they should expect some operational setup effort.
- Explicitly describes a multi-step compliance workflow: spec generation, three-level scenario generation, trace capture, classification, and reporting.
- Strong triggerability and scope clarity: SKILL.md says when to activate it and which targets it supports (skills, rules, agent definitions).
- Real implementation evidence: multiple scripts, prompts, fixtures, and tests back the documented workflow.
- No install command in SKILL.md, so users must wire it up manually and may need to inspect scripts to run it correctly.
- The repo notes agent-definition workflow verification is not yet fully supported, which limits coverage compared with the broad title.
Overview of skill-comply skill
skill-comply is a compliance-testing skill for checking whether an agent actually follows a skill, rule, or agent definition in real runs. It fits users who need evidence, not assumptions: maintainers validating a workflow rule, authors testing a new skill, or teams asking whether a coding agent obeys TDD, review, or process constraints under different prompt conditions.
What the skill-comply skill does
The skill-comply skill generates an expected behavior spec from a markdown source, creates three prompts with decreasing support, runs the agent, then compares observed tool-call timelines against the spec. That makes it useful for Compliance Review when you care about both presence and order of actions, not just final output.
When skill-comply is a good fit
Use skill-comply when you need to verify that a rule is followed under pressure: supportive prompts, neutral prompts, and competing prompts. It is especially relevant for skills that depend on sequence, such as “test before implementation” or “read the rule before editing.”
What makes it different
Unlike a generic prompt asking “did it follow the rules?”, skill-comply operationalizes the check: it extracts steps, classifies tool calls with an LLM, and evaluates ordering deterministically. The value is in the trace, timeline, and compliance rate, which help you decide whether the skill is reliable enough to keep using.
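The deterministic ordering check can be pictured as a subsequence test over the classified tool-call timeline. The sketch below is illustrative only: the step labels, data shapes, and `ordered_compliance` helper are hypothetical, and the skill's real spec format and classifier output may differ.

```python
def ordered_compliance(expected_steps, observed_timeline):
    """Return the fraction of expected steps that appear, in order,
    within the observed tool-call timeline (hypothetical data shapes)."""
    idx = 0
    for call in observed_timeline:
        if idx < len(expected_steps) and call == expected_steps[idx]:
            idx += 1
    return idx / len(expected_steps)  # compliance rate in [0, 1]

# Example: a "test before implementation" spec checked against a trace.
spec = ["write_test", "run_test", "implement", "run_test"]
trace = ["read_rule", "write_test", "run_test", "implement", "run_test"]
print(ordered_compliance(spec, trace))  # → 1.0
```

The point of evaluating ordering this way is that it is deterministic: only the classification of each tool call involves an LLM, so the same classified trace always yields the same score.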
How to Use skill-comply skill
Install and activate skill-comply
Install the skill-comply skill with:
npx skills add affaan-m/everything-claude-code --skill skill-comply
Then run it against the markdown file you want to verify. The repository’s own usage pattern is centered on CLI execution, so the skill works best when you point it at a single target file and treat the output as a compliance report, not a prose summary.
Read these files first
For the skill-comply install and setup path, start with skills/skill-comply/SKILL.md, then inspect prompts/spec_generator.md, prompts/scenario_generator.md, and prompts/classifier.md. Those three prompts show the real workflow: spec extraction, scenario generation, and trace classification. If you want to understand implementation constraints, skim scripts/run.py, scripts/spec_generator.py, scripts/scenario_generator.py, and scripts/classifier.py.
How to shape a good input
A strong skill-comply usage prompt is a concrete compliance target, not a vague policy. Good inputs name the file and the behavior you want verified, for example: “Check whether rules/common/testing.md is followed during a coding task” or “Measure whether the agent writes tests before implementation in this skill.” Weak inputs like “is this good?” do not give the tool enough behavior to score.
Practical workflow for better results
Use this sequence: choose one rule or skill, generate the spec, review the extracted steps, then run the three scenario levels. The best way to use skill-comply for Compliance Review is to compare the supportive, neutral, and competing runs side by side, because that shows whether the behavior is robust or only appears when the prompt helps.
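One way to read the three runs side by side is to compare per-level compliance rates and look at the gap between the supportive and competing levels. The numbers and the `robustness_gap` helper below are hypothetical, not part of the skill's actual output format.

```python
# Hypothetical per-level compliance rates from three runs of one target.
runs = {"supportive": 1.0, "neutral": 0.75, "competing": 0.25}

def robustness_gap(runs):
    """A large gap means the behavior only holds when the prompt helps."""
    return runs["supportive"] - runs["competing"]

gap = robustness_gap(runs)
print(f"robustness gap: {gap:.2f}")  # → robustness gap: 0.75
if gap > 0.5:
    print("warning: compliance depends heavily on prompt support")
```

A skill that scores well only in the supportive run has not really been verified; the competing run is the one that tells you whether the rule survives pressure.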
skill-comply skill FAQ
Is skill-comply only for coding skills?
No. It is best for coding-agent workflows, but the repository explicitly supports skills, rules, and agent definitions. If your target is a markdown policy with observable actions, skill-comply is a strong fit.
How is this different from a normal prompt test?
A normal prompt test checks whether an answer looks right. skill-comply checks whether the agent’s actions match an expected sequence, including tool-use timing. That matters when compliance is about process, not just output.
Is skill-comply beginner-friendly?
Yes, if you can identify the file being tested and describe the behavior you expect. The harder part is choosing a target with clear observable steps. It is less useful when the policy is vague or mostly human judgment.
When should I not use it?
Do not use skill-comply when the target has no actionable sequence, no meaningful tool calls, or only subjective quality criteria. It is also a poor fit if you need full production observability beyond a single `claude -p` run and trace comparison.
How to Improve skill-comply skill
Give it sharper source material
skill-comply works best when the source markdown states concrete actions, ordering, and exceptions. If your rule says “prefer tests” instead of “write a test before implementation,” the extracted spec will be harder to score and less useful for Compliance Review.
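For example, a scoreable rule names the action and its position in the sequence. The snippet below is illustrative source material, not taken from the repository:

```markdown
<!-- vague: hard to extract an ordered spec from -->
- Prefer tests.

<!-- concrete: yields ordered, checkable steps -->
- Write a failing test before writing any implementation code.
- Run the test suite after each implementation change.
```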
Watch for the main failure modes
The biggest risk is over-trusting an extracted spec that is too broad or too narrow. Another common issue is confusing prompt support with real compliance: a skill may look good in the supportive scenario and fail once the prompt becomes neutral or competing. Use the results across all three levels to check robustness, not just one green run.
Strengthen the first run inputs
Provide a target path, a realistic task, and any setup commands needed to reproduce the behavior under test. If the skill depends on files, commands, or environment assumptions, include those explicitly so the generated scenarios reflect actual use rather than a toy example.
Iterate from trace to spec
After the first run, inspect the generated spec and the tool-call timeline before you change the prompt or skill text. If a step was missed, decide whether the issue is the skill wording, the scenario design, or the detector description. That loop is where skill-comply adds the most value: it turns “did it comply?” into specific edits you can make to the source rule.
