imaging-data-commons

by K-Dense-AI

imaging-data-commons helps you query and download public cancer imaging data from the NCI Imaging Data Commons (IDC) with idc-index. Use it across CT, MR, PET, and pathology datasets for metadata search, browser preview, licensing checks, and AI training or data analysis workflows. No authentication required.

Added: May 14, 2026

Category: Data Analysis

Install Command

npx skills add K-Dense-AI/claude-scientific-skills --skill imaging-data-commons

Curation Score

This skill scores 82/100, which means it is a solid directory listing for users who need IDC cancer imaging access. The repository gives enough operational detail for an agent to trigger the skill correctly, understand when to use idc-index versus BigQuery/DICOMweb/cloud storage, and execute common workflows with less guesswork than a generic prompt.

Strengths

Strong triggerability: the frontmatter clearly says it is for querying and downloading public cancer imaging data from NCI IDC, with no authentication required.
Good workflow depth: the SKILL.md is large and supported by 10 reference guides covering CLI, clinical data, DICOMweb, BigQuery, cloud storage, pathology, index tables, and SQL patterns.
High practical leverage: includes version pinning and explicit guidance for when to use each access path, reducing agent ambiguity for real tasks.

Cautions

No install command in SKILL.md, so users may need to infer setup steps from the references and code snippets.
The repository is heavily reference-driven rather than script-backed, so some advanced workflows may still require the agent to synthesize steps from multiple docs.

Python Jupyter Scientific Medical Image Datasets Bioinformatics Bigquery

Overview of imaging-data-commons skill

What imaging-data-commons does

The imaging-data-commons skill helps you query and download public cancer imaging data from the NCI Imaging Data Commons using idc-index. It is best for researchers, ML engineers, and analysts who need radiology or pathology cohorts without first building a custom data ingestion stack.

Who should install it

Use the imaging-data-commons skill if you need to find studies by metadata, inspect available collections, check licensing, preview data in a browser, or pull data for AI training and analysis. It is a strong fit when you want public IDC data with no authentication required.

Why it is different

This skill is not just a generic prompt for “find medical images.” It is anchored to IDC’s data model, versioning, and access patterns, so it can guide you toward the right path for CT, MR, PET, and digital pathology. The main value is reducing guesswork around where to query, what to download, and when to use index tables versus broader access methods.

How to Use imaging-data-commons skill

Install imaging-data-commons

Install the imaging-data-commons skill from the directory package first, then open the skill file and follow its linked references:
npx skills add K-Dense-AI/claude-scientific-skills --skill imaging-data-commons

Start with the right inputs

The skill works best when you provide a concrete target, not a vague “help me explore IDC.” Good inputs include the modality, cancer type, collection name, desired output format, and whether you need metadata only or actual file downloads.

Example of a strong prompt:
“Use the imaging-data-commons skill to find public CT lung cancer collections with clinical labels, then show the best collection IDs and the download path for a small pilot cohort.”

Read these files first

For practical execution, read SKILL.md first, then inspect references/use_cases.md, references/cli_guide.md, references/index_tables_guide.md, and the domain guide that matches your task, such as references/digital_pathology_guide.md or references/cloud_storage_guide.md. Those files tell you whether to use the CLI, SQL patterns, index tables, BigQuery, DICOMweb, or direct cloud storage.

Use a decision-first workflow

A good workflow is: identify the data type, choose the least complex access method that fits, confirm collection-level licensing, then query or download only the subset you need. For data extraction tasks, ask the skill to return the exact collection or series filters, the expected file counts, and the recommended access route before you move to download.
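The "filters and expected counts before download" step can be sketched as a small query builder. This is a minimal sketch, not the skill's own code: `CohortRequest`, `sql_literal`, `build_summary_query`, and `run_summary` are hypothetical names, the example collection ID is only illustrative, and the column names (`collection_id`, `Modality`, `license_short_name`, `SeriesInstanceUID`, `series_size_MB`) are assumed to match the `index` table that idc-index exposes through `IDCClient.sql_query`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CohortRequest:
    """Hypothetical container for the filters stated up front."""
    modality: str                         # DICOM Modality code, e.g. "CT", "MR", "PT", "SM"
    collection_id: Optional[str] = None   # optional collection filter


def sql_literal(value: str) -> str:
    """Wrap a value as a SQL string literal, doubling embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def build_summary_query(req: CohortRequest) -> str:
    """Build a query reporting series count, total size, and license per collection."""
    conditions = [f"Modality = {sql_literal(req.modality)}"]
    if req.collection_id is not None:
        conditions.append(f"collection_id = {sql_literal(req.collection_id)}")
    return (
        "SELECT collection_id, license_short_name, "
        "COUNT(DISTINCT SeriesInstanceUID) AS n_series, "
        "SUM(series_size_MB) AS total_mb "
        "FROM index "
        "WHERE " + " AND ".join(conditions) + " "
        "GROUP BY collection_id, license_short_name "
        "ORDER BY n_series DESC"
    )


def run_summary(req: CohortRequest):
    """Run the query through idc-index (assumes `pip install idc-index`)."""
    # Imported lazily so the query builder works without the package installed.
    from idc_index import IDCClient
    return IDCClient().sql_query(build_summary_query(req))


if __name__ == "__main__":
    # Print the query skeleton for review; nothing is downloaded here.
    print(build_summary_query(CohortRequest(modality="CT", collection_id="nsclc_radiomics")))
```

Reviewing the printed query and its per-collection counts first keeps the download step limited to a subset whose size and license are already known.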

imaging-data-commons skill FAQ

Is imaging-data-commons only for radiology?

No. The imaging-data-commons skill covers radiology and pathology workflows, including slide microscopy, segmentations, and related metadata access. If your task is pathology-heavy, use the matching reference guide rather than assuming the same query pattern fits every dataset.

Do I need cloud credentials or special access?

Usually no. The core idc-index workflow is built around public data access, and many common queries do not require authentication. You may need extra setup only for specific paths such as BigQuery or cloud-native workflows.

When should I not use this skill?

Do not use it if you need private hospital data, fully harmonized clinical data across unrelated sources, or a one-line generic image search. It is also a poor fit if you have not decided whether you need metadata discovery, browser visualization, or actual download automation.

Is it beginner friendly?

Yes, if you begin with a concrete objective and let the skill choose the access method. Beginners usually struggle when they ask for “everything in IDC”; they get better results when they specify a disease area, modality, and the intended downstream task.

How to Improve imaging-data-commons skill

Give the skill a tighter target

The fastest way to get better results is to state the cohort boundary and output need upfront. Compare “find IDC data” with “find 50 public PET-CT series for NSCLC, favor collections with clinical labels, and give me a download-ready shortlist.”

Include constraints that change the path

Tell the skill about licensing limits, commercial use restrictions, storage limits, and whether you prefer CLI, Python, SQL, or browser-based inspection. These constraints matter because they determine whether idc-index, BigQuery, DICOMweb, or direct cloud storage is the right route.
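As one illustration of how a stated constraint narrows the candidate list, the sketch below partitions discovery rows by a commercial-use requirement. It is a hedged example under stated assumptions: `allows_commercial_use` and `split_by_license` are hypothetical helpers, the rows are made-up sample data, and license names are assumed to follow Creative Commons short forms such as "CC BY 4.0" and "CC BY-NC 4.0". The actual license terms of each collection remain the authority.

```python
from typing import Dict, Iterable, List, Tuple


def allows_commercial_use(license_short_name: str) -> bool:
    """Return False for Creative Commons NonCommercial ("NC") licenses.

    Assumption: names look like "CC BY 4.0" or "CC BY-NC 4.0".
    """
    tokens = license_short_name.upper().replace("-", " ").split()
    return "NC" not in tokens


def split_by_license(
    rows: Iterable[Dict[str, str]], commercial_use: bool
) -> Tuple[List[str], List[str]]:
    """Partition collection IDs into (allowed, excluded) for the stated constraint."""
    allowed: List[str] = []
    excluded: List[str] = []
    for row in rows:
        # Without a commercial-use requirement every collection stays eligible.
        ok = (not commercial_use) or allows_commercial_use(row["license_short_name"])
        (allowed if ok else excluded).append(row["collection_id"])
    return allowed, excluded


if __name__ == "__main__":
    # Made-up rows shaped like a discovery result; not real IDC data.
    sample = [
        {"collection_id": "example_a", "license_short_name": "CC BY 4.0"},
        {"collection_id": "example_b", "license_short_name": "CC BY-NC 4.0"},
    ]
    print(split_by_license(sample, commercial_use=True))
```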

Ask for a two-step output

For better data analysis results, ask in two steps: first for discovery (the relevant collections and the recommended filters), then for execution details (the exact command or query skeleton). That reduces false starts and makes it easier to validate the first answer before downloading large datasets.
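The handoff from discovery to execution might look like the following sketch. It assumes the discovery step returned rows with `SeriesInstanceUID` and `series_size_MB` fields; `pick_pilot_series` and `download_pilot` are hypothetical helpers, and the download wrapper assumes the `downloadDir`, `seriesInstanceUID`, and `dry_run` parameters of idc-index's `IDCClient.download_from_selection`.

```python
from typing import Dict, List


def pick_pilot_series(rows: List[Dict], budget_mb: float) -> List[str]:
    """Select the smallest series first until the size budget would be exceeded."""
    selected: List[str] = []
    used = 0.0
    for row in sorted(rows, key=lambda r: r["series_size_MB"]):
        if used + row["series_size_MB"] > budget_mb:
            break
        selected.append(row["SeriesInstanceUID"])
        used += row["series_size_MB"]
    return selected


def download_pilot(series_uids: List[str], download_dir: str, dry_run: bool = True) -> None:
    """Hand the validated selection to idc-index (assumes `pip install idc-index`)."""
    # Imported lazily so the selection logic can be checked without the package.
    from idc_index import IDCClient
    IDCClient().download_from_selection(
        downloadDir=download_dir,
        seriesInstanceUID=series_uids,
        dry_run=dry_run,  # default True here: review the plan before transferring files
    )


if __name__ == "__main__":
    # Placeholder UIDs and sizes standing in for a real discovery result.
    discovery = [
        {"SeriesInstanceUID": "uid-a", "series_size_MB": 50.0},
        {"SeriesInstanceUID": "uid-b", "series_size_MB": 10.0},
        {"SeriesInstanceUID": "uid-c", "series_size_MB": 30.0},
    ]
    print(pick_pilot_series(discovery, budget_mb=45.0))
```

Keeping selection separate from download means the shortlist can be validated against the first answer before any files are transferred.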

Iterate with evidence, not guesswork

If the first result is too broad, narrow it by modality, anatomy, license, or collection name, then ask for a smaller cohort or an alternative access path. The best improvement signal is usually not “more detail,” but a better-defined retrieval target and a clearer handoff from discovery to download.
