pydeseq2

by K-Dense-AI

pydeseq2 is a Python DESeq2 skill for bulk RNA-seq differential gene expression analysis. Use it to compare conditions, fit single- or multi-factor designs, apply Wald tests and FDR correction, and generate volcano or MA plots in pandas and AnnData workflows.

Stars0

Favorites0

Comments0

AddedMay 14, 2026

CategoryData Analysis

Install Command

npx skills add K-Dense-AI/claude-scientific-skills --skill pydeseq2

Curation Score

This skill scores 80/100 and is worth listing. The repository gives directory users enough evidence that an agent can recognize when to use it, follow a real PyDESeq2 differential-expression workflow, and apply it with less guesswork than a generic prompt. It is not perfectly packaged, but it has substantial operational content and clear install-value for bulk RNA-seq analysis users.

80/100

Strengths

Strong triggerability: the frontmatter and "When to Use This Skill" explicitly target DESeq2, differential expression, bulk RNA-seq counts, and PyDESeq2 mentions.
Operational workflow content: the skill body includes a concrete quick-start with pandas, DeseqDataSet, DeseqStats, filtering, Wald tests, and FDR correction.
Good agent leverage: it covers single-factor and multi-factor designs, batch/covariate handling, apeGLM shrinkage, and pandas/AnnData integration.

Cautions

No install command or support files are provided, so users may need to infer environment/setup details themselves.
The repository is marked with an experimental/test signal and appears to be a single SKILL.md without references or auxiliary assets.

Python Pandas Bioinformatics Genomics Dataset

Overview

Overview of pydeseq2 skill

What pydeseq2 is for

pydeseq2 is a Python skill for differential gene expression analysis on bulk RNA-seq count data. It helps you go from raw counts and sample metadata to DE results, fold changes, adjusted p-values, and standard exploratory outputs like volcano and MA plots.

Who should use it

Use the pydeseq2 skill if you want DESeq2-style analysis in Python, need multi-factor designs, or want to fit differential expression into a pandas/AnnData-based workflow. It is a good fit for analysts who already have count matrices and clinical or experimental metadata, not for users looking for a full RNA-seq preprocessing pipeline.

What makes it useful

The main value of pydeseq2 is that it reduces translation friction for Python users who would otherwise jump to R for DESeq2. It supports Wald testing, multiple-testing correction, optional apeGLM shrinkage, and a workflow that is practical for reproducible notebook or pipeline use.

How to Use pydeseq2 skill

Install pydeseq2

Install the skill in your Claude skill set, then open the skill files before prompting:
npx skills add K-Dense-AI/claude-scientific-skills --skill pydeseq2

For pydeseq2 install and setup decisions, verify that your environment already has the RNA-seq count table, sample metadata, and the Python packages required by your workflow. The skill is most useful when you can provide sample-by-gene counts and a design formula or grouping variable.

Start from the right inputs

Strong pydeseq2 usage starts with clean input structure:

a count matrix with samples as rows and genes as columns
metadata indexed by sample ID
a clear condition column, and any batch or covariate columns you want in the model
an explicit comparison target, such as treated vs control

A weak prompt says: “Run differential expression on my RNA-seq data.”
A stronger prompt says: “Use pydeseq2 on a bulk RNA-seq count matrix with 24 samples, compare treated vs control, include batch as a covariate, filter very low-count genes, and return significant genes plus volcano/MA plot code.”

Read these files first

Start with SKILL.md for the workflow and expected analysis steps. Then inspect README.md, AGENTS.md, metadata.json, and any rules/, resources/, references/, or scripts/ folders if present. For this repository, the main practical signal is in SKILL.md, so do not assume extra helper files exist.

Use pydeseq2 well

Treat pydeseq2 as an analysis method, not just a code generator. Tell the model:

what organism and assay you have
how samples are grouped
whether you need single-factor or multi-factor design
whether you want shrinkage, ranking, or visualization
what output format you need, such as a dataframe, notebook cells, or a reusable script

This improves pydeseq2 usage because the model can choose the right design, filtering, and interpretation steps instead of guessing.

pydeseq2 skill FAQ

Is pydeseq2 only for DESeq2 users?

No. It is for anyone who wants DESeq2-like differential expression analysis in Python. It is especially useful if you already work in pandas, scanpy, or AnnData and want to keep the analysis in one stack.

Do I need a perfect prompt to use it?

No, but vague prompts lead to generic analysis code. The pydeseq2 skill works best when you provide the count table shape, comparison of interest, and any known confounders.

Is pydeseq2 beginner-friendly?

It is beginner-friendly if you already understand the basics of RNA-seq counts and experimental design. It is less suitable if you need help with alignment, quantification, or upstream QC before differential expression.

When should I not use pydeseq2?

Do not use it for single-cell differential expression, normalized expression without raw counts, or workflows that need a full end-to-end transcriptomics pipeline. It is also not the right choice if your real need is statistical interpretation without gene-level count data.

How to Improve pydeseq2 skill

Give better biological context

The best pydeseq2 results come from prompts that explain the study design, not just the file names. Include the response variable, control condition, batch effects, replicate count, and whether you want gene ranking, plot code, or interpretation.

Specify the analysis decisions you care about

Tell the skill how to handle low-count genes, whether to use a multi-factor model, and whether you need shrinkage for effect sizes. These choices materially affect pydeseq2 outputs and help avoid generic defaults that may not match your study.

Ask for output you can reuse

Instead of asking only for “results,” request a saved dataframe schema, a plotting snippet, or a notebook-ready workflow. For example: “Return pydeseq2 code that fits the model, extracts adjusted p-values, and writes a CSV of significant genes with log2 fold change and padj.”

Iterate from diagnostics, not just final hits

If the first run looks off, ask for QC-oriented checks: sample clustering, count filtering rationale, the number of genes retained, or whether the design formula is confounded. This is the fastest way to improve pydeseq2 for Data Analysis when results are weak or unexpectedly sparse.

Ratings & Reviews

No ratings yet

Share your review

0/10000

Latest reviews

Saving...

more skill

clickhouse-best-practices

by ClickHouse

clickhouse-best-practices is a ClickHouse best practices skill for Database Engineering. It guides schema design, query tuning, insert strategy, and agent connectivity with rule-based recommendations, making clickhouse-best-practices usage easier to trigger, review, and cite in ClickHouse workflows.

Database Engineering

Favorites 0GitHub 412

chdb-datastore

by ClickHouse

chdb-datastore is a pandas-compatible skill for fast data analysis with a ClickHouse-backed DataStore API. It supports file, database, and cloud connectors, cross-source joins, and minimal code changes for pandas-style workflows. Use this chdb-datastore guide when you want a drop-in analysis layer for larger datasets.

Data Analysis

Favorites 0GitHub 0

sympy

by K-Dense-AI

Use the sympy skill for exact symbolic math in Python, including algebra, calculus, matrices, physics formulas, number theory, geometry, and code generation. It helps you keep expressions exact, choose the right SymPy modules, and avoid float-heavy mistakes. Best for users who need a practical sympy guide for symbolic workflows and sympy for Data Analysis.

Data Analysis

Favorites 0GitHub 21.4k

interpreting-culture-index

by trailofbits

interpreting-culture-index helps interpret Culture Index surveys, profile exports, and related hiring or coaching notes. Use this interpreting-culture-index skill for role fit, team dynamics, burnout risk, candidate debriefs, onboarding plans, and conflict mediation. It emphasizes arrow-relative reading, anti-pattern checks, and practical outputs for data analysis and decision support.

Data Analysis

Favorites 0GitHub 5k

azure-search-documents-py

by microsoft

azure-search-documents-py is the Python Azure AI Search skill for backend development, covering install, auth, index design, vector search, hybrid search, semantic ranking, and agentic retrieval. Use the azure-search-documents-py skill when you need practical guidance from setup to working query patterns.

Backend Development

Favorites 0GitHub 2.3k

gget

by K-Dense-AI

gget is a bioinformatics skill for fast, unified access to 20+ genomic databases and analysis tools from CLI or Python. Use it for gene info, BLAST-related lookups, AlphaFold structures, expression data, disease associations, and enrichment-style analysis. It suits quick exploration and gget for Data Analysis workflows.

Data Analysis

Favorites 0GitHub 0

channel-economics

by alirezarezvani

channel-economics helps RevOps and commercial leaders compare direct, partner, marketplace, reseller, or OEM channels with fully loaded cost-to-serve, ROI lenses, and constrained channel-mix recommendations. Includes Python scripts, data templates, and guidance for channel-economics usage.

Revenue Operations

Favorites 0GitHub 22.1k

torch-geometric

by K-Dense-AI

torch-geometric skill guide for PyTorch Geometric graph neural networks. Use it for torch-geometric install help, torch-geometric usage, graph classification, node classification, link prediction, heterogeneous graphs, custom MessagePassing layers, and scaling GNNs for Machine Learning workflows.

Machine Learning

Favorites 0GitHub 21.4k

rdkit

by K-Dense-AI

The rdkit skill helps with precise cheminformatics workflows: parsing SMILES, SDF, MOL, PDB, and InChI; calculating descriptors; generating fingerprints; running substructure search; handling reactions; and building 2D/3D coordinates. Use this rdkit guide for advanced control, custom sanitization, and rdkit for Data Analysis workflows.

Data Analysis

Favorites 0GitHub 21.4k

huggingface-vision-trainer

by huggingface

huggingface-vision-trainer helps you install and use a Hugging Face skill for vision training jobs: object detection, image classification, and SAM/SAM2 segmentation. It covers dataset prep, cloud GPU setup, evaluation, Trackio logging, and pushing results to the Hub. Ideal for backend automation and repeatable training workflows.

Backend Development

Favorites 0GitHub 10.4k

seo-dataforseo

by AgriciDaniel

seo-dataforseo connects Claude to live SEO data through the DataForSEO MCP server for SERP checks, keyword research, backlinks, on-page analysis, competitor research, business listings, and AI visibility tracking. It is best for data-backed workflows when you need real search evidence, clear install guidance, and practical seo-dataforseo usage.

Keyword Research

Favorites 0GitHub 6.2k

pymc

by K-Dense-AI

PyMC is a Bayesian modeling skill for building, fitting, checking, and comparing probabilistic models in Python. Use pymc for hierarchical regression, multilevel analysis, time series, missing data, measurement error, and model comparison with LOO or WAIC.

Data Analysis

Favorites 0GitHub 0

pymatgen

by K-Dense-AI

pymatgen is a Python materials science toolkit for crystal structures, phase diagrams, electronic structure, and file conversion. This pymatgen skill helps with scientific workflows using CIF, POSCAR, VASP, and Materials Project data.

Scientific

Favorites 0GitHub 0

geopandas

by K-Dense-AI

geopandas skill for Python geospatial vector data analysis, including shapefiles, GeoJSON, and GeoPackage files. Use it to read, clean, join, buffer, clip, reproject, and export spatial data with less guesswork.

Data Analysis

Favorites 0GitHub 0

analyzing-threat-intelligence-feeds

by mukul975

Analyzing-threat-intelligence-feeds helps you ingest CTI feeds, normalize indicators, assess feed quality, and enrich IOCs for STIX 2.1 workflows. This analyzing-threat-intelligence-feeds skill is built for threat intel operations and Data Analysis, with practical guidance for TAXII, MISP, and commercial feeds.

Data Analysis

Favorites 0GitHub 0

azure-ai-textanalytics-py

by microsoft

azure-ai-textanalytics-py is a skill for Azure AI Text Analytics in Python. It helps with sentiment analysis, entity recognition, key phrase extraction, language detection, PII detection, and healthcare NLP. Use it when you need a fast path to Azure client setup, authentication, and practical text analytics usage for apps, notebooks, or data analysis workflows.

Data Analysis

Favorites 0GitHub 0