pymc

by K-Dense-AI

PyMC is a Bayesian modeling skill for building, fitting, checking, and comparing probabilistic models in Python. Use pymc for hierarchical regression, multilevel analysis, time series, missing data, measurement error, and model comparison with LOO or WAIC.

Added: May 14, 2026

Category: Data Analysis

Install Command

npx skills add K-Dense-AI/claude-scientific-skills --skill pymc

Curation Score

This skill scores 84/100, which means it is a solid listing candidate for directory users: it is clearly triggerable for Bayesian modeling tasks and provides enough workflow detail to justify installation, though it would benefit from supporting files and more adoption-oriented scaffolding.

Strengths

Explicitly scoped for Bayesian modeling with PyMC 5.x+, including hierarchical models, NUTS sampling, variational inference, and model comparison.
Strong operational guidance: the body lays out a standard Bayesian workflow with data prep, sampling, validation, diagnostics, and model comparison.
Good agent leverage and clarity: concrete use cases and code examples reduce guesswork compared with a generic prompt.

Cautions

SKILL.md itself includes no install command and ships no supporting scripts, references, or resources, so users must rely on its content alone.
The repository appears focused on one long skill file, so some advanced or edge-case adoption paths may still require manual adaptation.

Tags: Python, PyMC, Bayesian Modeling, Probabilistic Programming, MCMC, Variational Inference, Statistics, ArviZ

Overview

Overview of pymc skill

pymc is a Bayesian modeling skill for building, fitting, checking, and comparing probabilistic models in Python. It is best for readers who need real uncertainty estimates, not just point predictions: hierarchical regression, multilevel analysis, time series, missing data, measurement error, and model comparison with LOO or WAIC.

What pymc is for

Use the pymc skill when the job is to turn messy data into a defensible Bayesian model with posterior inference, not to write a generic Python analysis script. It helps you move from “I want to estimate this effect with uncertainty” to a working PyMC model, sampling plan, and validation workflow.

Who should use it

This pymc skill fits data analysts, scientists, and ML practitioners who already know their outcome and predictors but need help expressing the model correctly. It is especially useful for Bayesian workflow decisions: choosing priors, debugging sampler issues, and interpreting posterior diagnostics.

Main differentiators

Compared with a plain prompt, pymc is valuable because it centers the full workflow: data prep, model specification, sampling, checks, and comparison. The practical advantage is less guesswork around NUTS, prior predictive checks, and convergence diagnostics, which are common blockers in PyMC projects.

How to Use pymc skill

Install pymc skill

Install the pymc skill into your skills directory with the install command listed above or your platform’s skill installer. Then confirm the scientific-skills/pymc path is available and open SKILL.md first, because that file defines the intended Bayesian workflow and scope.

Turn a rough goal into a useful prompt

A weak request like “analyze this dataset with pymc” leaves too much unspecified. A stronger prompt says what kind of model you need, the response variable, likely predictors, data size, grouping structure, and what you want out of the analysis, for example: “Build a hierarchical logistic regression in pymc for conversion by user and campaign, include weakly informative priors, explain sampling diagnostics, and show how to compare it to a pooled model.”

What to read first in the repo

Start with SKILL.md, then focus on the sections that describe when to use the skill and the standard Bayesian workflow. If your task is implementation-heavy, read the examples around data preparation, model building, sampling, and posterior checking before you prompt the model to write code.

Workflow details that improve output

For pymc, the shape of the input data matters. Provide variable types, grouping IDs, missingness, and any scaling or categorical encoding already applied. Ask explicitly for priors, sampler settings, and diagnostic output if you need more than a first draft. For data analysis tasks, also specify whether you want interpretation, forecasting, causal comparison, or decision support, because each leads to a different model structure.
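One way to gather those details before prompting is a short pandas profile of the dataset. The sketch below is only an illustration: the column names (campaign, spend, converted) and the tiny table are made up, and the profile keys are an arbitrary choice rather than a format the skill requires.

```python
import pandas as pd

# Hypothetical toy dataset; replace with your own DataFrame.
df = pd.DataFrame(
    {
        "campaign": ["a", "a", "b", "b", "c", "c"],
        "spend": [10.0, 12.5, None, 8.0, 15.0, 9.5],
        "converted": [1, 0, 0, 1, 1, 0],
    }
)

# Summarize the facts a modeling prompt usually needs.
data_profile = {
    "n_rows": len(df),
    "dtypes": df.dtypes.astype(str).to_dict(),
    "n_missing": df.isna().sum().to_dict(),
    "rows_per_group": df.groupby("campaign").size().to_dict(),
}
print(data_profile)
```

Pasting a summary like this into the prompt tells the model the row count, which variables are numeric or categorical, where values are missing, and how the grouping is balanced.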

pymc skill FAQ

Is pymc only for advanced users?

No. Beginners can use the pymc skill if they can describe their data clearly and are willing to review model diagnostics. The harder part is usually modeling judgment, not syntax, so the skill is most useful when you want guidance on structure and validation.

When should I not use pymc?

Do not use pymc if you only need a quick descriptive chart, a simple frequentist test, or a black-box prediction with no need for uncertainty. It is also a poor fit when you cannot describe the data-generating process at all, because PyMC works best when the model assumptions are explicit.

How is pymc different from a generic prompt?

A generic prompt may produce code, but pymc is oriented around the Bayesian workflow and the common failure points that affect model quality. That usually means better priors, better sampling advice, and more attention to diagnostics than an ad hoc prompt would provide.

Does pymc fit the wider Python ecosystem?

Yes. pymc is designed to work with the Python analysis stack, especially NumPy, pandas, ArviZ, and related plotting and data-prep tools. If your workflow already uses Python for analysis, pymc is a natural fit for probabilistic modeling.

How to Improve pymc skill

Give stronger model context

The best way to improve pymc output is to state the model class up front: linear, logistic, hierarchical, time series, missing-data, or measurement-error. Also include the target variable, predictors, grouping levels, and any business or scientific constraint that should shape the model.

Ask for diagnostics, not just code

Many pymc failures come from overly vague priors, badly scaled inputs, or sampler pathologies. Ask for prior predictive checks, posterior predictive checks, effective sample size, R-hat, divergences, and a plan for what to change if sampling struggles. That makes the pymc skill more useful for data analysis work where validation matters.

Provide data shape and comparison goals

If you want a useful first result, tell the model how many rows, which variables are numeric or categorical, and whether there are repeated measures or clusters. If you need model comparison, specify the baseline model and what “better” means so the pymc skill can frame LOO or WAIC appropriately.

Iterate with the first fit

After the first pass, feed back the actual trace issues, posterior plots, or divergence counts instead of asking for a fresh model from scratch. The fastest way to improve pymc output is to refine one assumption at a time: scale inputs, tighten or loosen priors, simplify the hierarchy, then refit and compare.
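Scaling inputs is often the cheapest of these refinements to try first. The snippet below is a minimal NumPy sketch of z-scoring a predictor before it enters a model; the `standardize` helper and the income values are hypothetical.

```python
import numpy as np


def standardize(values):
    """Return values rescaled to mean 0 and standard deviation 1."""
    values = np.asarray(values, dtype=float)
    return (values - values.mean()) / values.std()


# Hypothetical predictor on a large raw scale.
income = np.array([32_000.0, 45_500.0, 51_000.0, 78_250.0, 120_000.0])
income_z = standardize(income)
print(income_z.round(3))
```

With a standardized predictor, a prior such as Normal(0, 1) on its coefficient describes the effect of a one-standard-deviation change, which is usually easier to reason about than an effect per raw unit.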

More skills

clickhouse-best-practices

by ClickHouse

clickhouse-best-practices is a ClickHouse best practices skill for Database Engineering. It guides schema design, query tuning, insert strategy, and agent connectivity with rule-based recommendations, making clickhouse-best-practices usage easier to trigger, review, and cite in ClickHouse workflows.

Database Engineering

Favorites 0 | GitHub 412

chdb-datastore

by ClickHouse

chdb-datastore is a pandas-compatible skill for fast data analysis with a ClickHouse-backed DataStore API. It supports file, database, and cloud connectors, cross-source joins, and minimal code changes for pandas-style workflows. Use this chdb-datastore guide when you want a drop-in analysis layer for larger datasets.

Data Analysis

Favorites 0 | GitHub 0

sympy

by K-Dense-AI

Use the sympy skill for exact symbolic math in Python, including algebra, calculus, matrices, physics formulas, number theory, geometry, and code generation. It helps you keep expressions exact, choose the right SymPy modules, and avoid float-heavy mistakes. Best for users who need a practical sympy guide for symbolic workflows and sympy for Data Analysis.

Data Analysis

Favorites 0 | GitHub 21.4k

interpreting-culture-index

by trailofbits

interpreting-culture-index helps interpret Culture Index surveys, profile exports, and related hiring or coaching notes. Use this interpreting-culture-index skill for role fit, team dynamics, burnout risk, candidate debriefs, onboarding plans, and conflict mediation. It emphasizes arrow-relative reading, anti-pattern checks, and practical outputs for data analysis and decision support.

Data Analysis

Favorites 0 | GitHub 5k

azure-search-documents-py

by microsoft

azure-search-documents-py is the Python Azure AI Search skill for backend development, covering install, auth, index design, vector search, hybrid search, semantic ranking, and agentic retrieval. Use the azure-search-documents-py skill when you need practical guidance from setup to working query patterns.

Backend Development

Favorites 0 | GitHub 2.3k

gget

by K-Dense-AI

gget is a bioinformatics skill for fast, unified access to 20+ genomic databases and analysis tools from CLI or Python. Use it for gene info, BLAST-related lookups, AlphaFold structures, expression data, disease associations, and enrichment-style analysis. It suits quick exploration and gget for Data Analysis workflows.

Data Analysis

Favorites 0 | GitHub 0

torch-geometric

by K-Dense-AI

torch-geometric skill guide for PyTorch Geometric graph neural networks. Use it for torch-geometric install help, torch-geometric usage, graph classification, node classification, link prediction, heterogeneous graphs, custom MessagePassing layers, and scaling GNNs for Machine Learning workflows.

Machine Learning

Favorites 0 | GitHub 21.4k

rdkit

by K-Dense-AI

The rdkit skill helps with precise cheminformatics workflows: parsing SMILES, SDF, MOL, PDB, and InChI; calculating descriptors; generating fingerprints; running substructure search; handling reactions; and building 2D/3D coordinates. Use this rdkit guide for advanced control, custom sanitization, and rdkit for Data Analysis workflows.

Data Analysis

Favorites 0 | GitHub 21.4k

huggingface-vision-trainer

by huggingface

huggingface-vision-trainer helps you install and use a Hugging Face skill for vision training jobs: object detection, image classification, and SAM/SAM2 segmentation. It covers dataset prep, cloud GPU setup, evaluation, Trackio logging, and pushing results to the Hub. Ideal for backend automation and repeatable training workflows.

Backend Development

Favorites 0 | GitHub 10.4k

seo-dataforseo

by AgriciDaniel

seo-dataforseo connects Claude to live SEO data through the DataForSEO MCP server for SERP checks, keyword research, backlinks, on-page analysis, competitor research, business listings, and AI visibility tracking. It is best for data-backed workflows when you need real search evidence, clear install guidance, and practical seo-dataforseo usage.

Keyword Research

Favorites 0 | GitHub 6.2k

pymatgen

by K-Dense-AI

pymatgen is a Python materials science toolkit for crystal structures, phase diagrams, electronic structure, and file conversion. This pymatgen skill helps with scientific workflows using CIF, POSCAR, VASP, and Materials Project data.

Scientific

Favorites 0 | GitHub 0

geopandas

by K-Dense-AI

geopandas skill for Python geospatial vector data analysis, including shapefiles, GeoJSON, and GeoPackage files. Use it to read, clean, join, buffer, clip, reproject, and export spatial data with less guesswork.

Data Analysis

Favorites 0 | GitHub 0

analyzing-threat-intelligence-feeds

by mukul975

Analyzing-threat-intelligence-feeds helps you ingest CTI feeds, normalize indicators, assess feed quality, and enrich IOCs for STIX 2.1 workflows. This analyzing-threat-intelligence-feeds skill is built for threat intel operations and Data Analysis, with practical guidance for TAXII, MISP, and commercial feeds.

Data Analysis

Favorites 0 | GitHub 0

azure-ai-textanalytics-py

by microsoft

azure-ai-textanalytics-py is a skill for Azure AI Text Analytics in Python. It helps with sentiment analysis, entity recognition, key phrase extraction, language detection, PII detection, and healthcare NLP. Use it when you need a fast path to Azure client setup, authentication, and practical text analytics usage for apps, notebooks, or data analysis workflows.

Data Analysis

Favorites 0 | GitHub 0

chdb-sql

by ClickHouse

chdb-sql is a GitHub skill for running ClickHouse SQL in Python without a server. It covers chdb.query(), Session, DB-API connections, table functions like file() and s3(), parametrized queries, and backend development workflows for local files and external data sources.

Backend Development

Favorites 0 | GitHub 0

scvelo

by K-Dense-AI

scvelo is a Python skill for RNA velocity analysis in single-cell RNA-seq data. Use it to estimate cell state transitions from unspliced and spliced mRNA, infer trajectory direction, compute latent time, and identify driver genes. It is especially useful for scvelo for Data Analysis when you need directionality beyond standard clustering or pseudotime.

Data Analysis

Favorites 0 | GitHub 0