cellxgene-census

by K-Dense-AI

cellxgene-census skill for querying the CELLxGENE Census programmatically. Use it to explore expression data, metadata, embeddings, and cross-dataset patterns across tissues, diseases, and cell types. Best for population-scale single-cell analysis and reference atlas comparisons; for your own data, use scanpy or scvi-tools.

Stars0

Favorites0

Comments0

AddedMay 14, 2026

CategoryData Analysis

Install Command

npx skills add K-Dense-AI/claude-scientific-skills --skill cellxgene-census

Curation Score

This skill scores 78/100, which means it is a solid listing candidate for directory users who want a focused way to query the CELLxGENE Census. The repository gives enough operational detail to help an agent trigger it correctly and understand its main use cases, though users should still expect some workflow gaps because the evidence shows no supporting scripts or reference files.

78/100

Strengths

Strong triggerability: the description and overview clearly say it is for programmatic queries of the CELLxGENE Census and when to use it.
Good operational scope: it covers population-scale single-cell querying, metadata exploration, and cross-dataset analysis across 61M+ cells.
Useful install guidance: it includes a direct install command (`uv pip install cellxgene-census`) and mentions integration with scanpy and PyTorch workflows.

Cautions

No support files are present (no scripts, references, resources, or rules), so agents may need to infer some usage details from the prose alone.
The excerpt suggests the document is focused on overview and setup rather than a fully opinionated workflow playbook, which may limit turn-key execution for complex tasks.

Python Dataset Bioinformatics Scientific Pytorch Scanpy Machine Learning

Overview

Overview of cellxgene-census skill

The cellxgene-census skill helps you query the CELLxGENE Census programmatically, so you can work with a large, versioned single-cell atlas instead of downloading ad hoc datasets one by one. It is best for researchers and data analysts who need expression data, cell metadata, embeddings, or cross-dataset comparisons at scale. The main job-to-be-done is turning a biological question like “Which cell types express this gene across disease states?” into a reproducible query and analysis workflow.

What this skill is for

Use cellxgene-census for population-scale single-cell analysis: tissue, disease, donor, cell type, and gene-level queries across many curated datasets. It is useful when your output needs to be consistent, filterable, and traceable to a specific Census version.

Where it fits best

This cellxgene-census skill fits data exploration, reference atlas comparison, and model-building workflows. It is a strong choice when you want standardized metadata and programmatic access, not a one-off notebook copied from a tutorial.

When it is not the right tool

Do not use cellxgene-census as a substitute for analyzing your own private dataset end-to-end. If you need local QC, normalization, clustering, or differential expression on your own data, tools like scanpy or scvi-tools are usually the better starting point.

How to Use cellxgene-census skill

Install the skill and confirm the scope

Use the directory install flow, then open the skill entry point first. A practical cellxgene-census install check is to confirm you are working from the skill’s SKILL.md and that your environment can install the Census package before you draft a query-heavy prompt.

Read the right files first

Start with SKILL.md, then inspect README.md, AGENTS.md, metadata.json, and any supporting folders such as rules/, resources/, or scripts/ if they exist. For this repo, SKILL.md is the main source of truth, so your prompt should be derived from its workflow sections rather than from a generic single-cell template.

Turn a vague goal into a usable prompt

A strong cellxgene-census usage prompt names the biological target, the filter dimensions, and the desired output. For example: “Find immune cells in human lung tissue from disease-associated samples, then return a compact table of cell counts, marker genes, and the Census version used.” Better inputs reduce ambiguity about species, tissue, measurement type, and whether you want summary stats or extracted observations.

Practical workflow for better output

Use the skill to answer one question per run: identify the target cohort, define the gene or metadata filters, choose the output shape, then validate the query against the Census version. If you are asking for downstream analysis, specify whether you want Python code, a notebook-style workflow, or a plain-language interpretation of results.

cellxgene-census skill FAQ

Is cellxgene-census good for beginners?

Yes, if you already know basic Python and single-cell concepts. The skill is easier to adopt when you can specify cell type, tissue, and gene targets clearly; it is less beginner-friendly if you want the model to invent an analysis plan from scratch.

How is this different from a generic prompt?

A generic prompt may give you a plausible answer, but cellxgene-census is meant to anchor the work in a versioned atlas, structured metadata, and reproducible queries. That matters when you need consistent cellxgene-census usage across projects or when results must be auditable.

Should I use it for my own data?

Usually not as the primary tool. Use cellxgene-census for reference atlas queries, benchmarking, or comparison against public data; use local analysis tooling for custom preprocessing, clustering, and model training on your own dataset.

How to Improve cellxgene-census skill

Give the skill fewer assumptions to guess

The best cellxgene-census for Data Analysis prompts include species, tissue, disease state, cell class, gene symbols, and the format you want back. “Summarize macrophage-related expression in human lung disease samples” is stronger than “analyze macrophages.”

State the output you actually need

If you want counts, summary statistics, filtered observations, or code, say so explicitly. The quality of cellxgene-census usage improves when you specify whether the deliverable is a query, a notebook snippet, a ranked table, or a short interpretation.

Watch for common failure modes

The most common problem is over-broad querying: too many tissues, no species, or ambiguous gene names. Another failure mode is mixing public atlas queries with private-data analysis in the same request, which makes the result less precise and harder to execute.

Iterate from query to analysis

A good cellxgene-census guide workflow is: first confirm the right cohort and filters, then refine the query, then add analysis steps such as comparison, aggregation, or plotting. If the first result is too broad, narrow by cell class, tissue, or disease before asking for deeper interpretation.

Ratings & Reviews

No ratings yet

Share your review

0/10000

Latest reviews

Saving...

more skill

clickhouse-best-practices

by ClickHouse

clickhouse-best-practices is a ClickHouse best practices skill for Database Engineering. It guides schema design, query tuning, insert strategy, and agent connectivity with rule-based recommendations, making clickhouse-best-practices usage easier to trigger, review, and cite in ClickHouse workflows.

Database Engineering

Favorites 0GitHub 412

chdb-datastore

by ClickHouse

chdb-datastore is a pandas-compatible skill for fast data analysis with a ClickHouse-backed DataStore API. It supports file, database, and cloud connectors, cross-source joins, and minimal code changes for pandas-style workflows. Use this chdb-datastore guide when you want a drop-in analysis layer for larger datasets.

Data Analysis

Favorites 0GitHub 0

sympy

by K-Dense-AI

Use the sympy skill for exact symbolic math in Python, including algebra, calculus, matrices, physics formulas, number theory, geometry, and code generation. It helps you keep expressions exact, choose the right SymPy modules, and avoid float-heavy mistakes. Best for users who need a practical sympy guide for symbolic workflows and sympy for Data Analysis.

Data Analysis

Favorites 0GitHub 21.4k

interpreting-culture-index

by trailofbits

interpreting-culture-index helps interpret Culture Index surveys, profile exports, and related hiring or coaching notes. Use this interpreting-culture-index skill for role fit, team dynamics, burnout risk, candidate debriefs, onboarding plans, and conflict mediation. It emphasizes arrow-relative reading, anti-pattern checks, and practical outputs for data analysis and decision support.

Data Analysis

Favorites 0GitHub 5k

azure-search-documents-py

by microsoft

azure-search-documents-py is the Python Azure AI Search skill for backend development, covering install, auth, index design, vector search, hybrid search, semantic ranking, and agentic retrieval. Use the azure-search-documents-py skill when you need practical guidance from setup to working query patterns.

Backend Development

Favorites 0GitHub 2.3k

gget

by K-Dense-AI

gget is a bioinformatics skill for fast, unified access to 20+ genomic databases and analysis tools from CLI or Python. Use it for gene info, BLAST-related lookups, AlphaFold structures, expression data, disease associations, and enrichment-style analysis. It suits quick exploration and gget for Data Analysis workflows.

Data Analysis

Favorites 0GitHub 0

channel-economics

by alirezarezvani

channel-economics helps RevOps and commercial leaders compare direct, partner, marketplace, reseller, or OEM channels with fully loaded cost-to-serve, ROI lenses, and constrained channel-mix recommendations. Includes Python scripts, data templates, and guidance for channel-economics usage.

Revenue Operations

Favorites 0GitHub 22.1k

torch-geometric

by K-Dense-AI

torch-geometric skill guide for PyTorch Geometric graph neural networks. Use it for torch-geometric install help, torch-geometric usage, graph classification, node classification, link prediction, heterogeneous graphs, custom MessagePassing layers, and scaling GNNs for Machine Learning workflows.

Machine Learning

Favorites 0GitHub 21.4k

rdkit

by K-Dense-AI

The rdkit skill helps with precise cheminformatics workflows: parsing SMILES, SDF, MOL, PDB, and InChI; calculating descriptors; generating fingerprints; running substructure search; handling reactions; and building 2D/3D coordinates. Use this rdkit guide for advanced control, custom sanitization, and rdkit for Data Analysis workflows.

Data Analysis

Favorites 0GitHub 21.4k

huggingface-vision-trainer

by huggingface

huggingface-vision-trainer helps you install and use a Hugging Face skill for vision training jobs: object detection, image classification, and SAM/SAM2 segmentation. It covers dataset prep, cloud GPU setup, evaluation, Trackio logging, and pushing results to the Hub. Ideal for backend automation and repeatable training workflows.

Backend Development

Favorites 0GitHub 10.4k

seo-dataforseo

by AgriciDaniel

seo-dataforseo connects Claude to live SEO data through the DataForSEO MCP server for SERP checks, keyword research, backlinks, on-page analysis, competitor research, business listings, and AI visibility tracking. It is best for data-backed workflows when you need real search evidence, clear install guidance, and practical seo-dataforseo usage.

Keyword Research

Favorites 0GitHub 6.2k

pymc

by K-Dense-AI

PyMC is a Bayesian modeling skill for building, fitting, checking, and comparing probabilistic models in Python. Use pymc for hierarchical regression, multilevel analysis, time series, missing data, measurement error, and model comparison with LOO or WAIC.

Data Analysis

Favorites 0GitHub 0

pymatgen

by K-Dense-AI

pymatgen is a Python materials science toolkit for crystal structures, phase diagrams, electronic structure, and file conversion. This pymatgen skill helps with scientific workflows using CIF, POSCAR, VASP, and Materials Project data.

Scientific

Favorites 0GitHub 0

geopandas

by K-Dense-AI

geopandas skill for Python geospatial vector data analysis, including shapefiles, GeoJSON, and GeoPackage files. Use it to read, clean, join, buffer, clip, reproject, and export spatial data with less guesswork.

Data Analysis

Favorites 0GitHub 0

analyzing-threat-intelligence-feeds

by mukul975

Analyzing-threat-intelligence-feeds helps you ingest CTI feeds, normalize indicators, assess feed quality, and enrich IOCs for STIX 2.1 workflows. This analyzing-threat-intelligence-feeds skill is built for threat intel operations and Data Analysis, with practical guidance for TAXII, MISP, and commercial feeds.

Data Analysis

Favorites 0GitHub 0

azure-ai-textanalytics-py

by microsoft

azure-ai-textanalytics-py is a skill for Azure AI Text Analytics in Python. It helps with sentiment analysis, entity recognition, key phrase extraction, language detection, PII detection, and healthcare NLP. Use it when you need a fast path to Azure client setup, authentication, and practical text analytics usage for apps, notebooks, or data analysis workflows.

Data Analysis

Favorites 0GitHub 0