Data Cleaning

Browse Data Cleaning agent skills in Data Processing and compare related workflows, tools, and use cases.

6 skills
S
data-analyst

by Shubhamsaboo

data-analyst is a minimal GitHub skill that guides agents toward SQL, pandas, and basic statistical analysis for data exploration. Best for users who want code-backed queries, transformations, and interpretations from a single SKILL.md prompt layer.

Data Analysis
Favorites 0GitHub 104.2k
W
data-quality-frameworks

by wshobson

The data-quality-frameworks skill helps teams plan production data validation with dbt tests, Great Expectations, and data contracts. Use it to choose the right checks, map them to a testing pyramid, and guide CI/CD-ready data quality workflows for Data Cleaning and pipeline reliability.

Data Cleaning
Favorites 0GitHub 32.6k
P
dummy-dataset

by phuryn

dummy-dataset generates realistic test data in CSV, JSON, SQL, or Python script form. It helps with mock datasets, demos, database seeding, QA, and data cleaning by letting you define columns, row counts, and constraints for believable sample records.

Data Cleaning
Favorites 0GitHub 11.1k
D
read-file

by duckdb

read-file helps an agent read and inspect CSV, JSON, Parquet, Avro, Excel, SQLite, spatial files, or remote URLs with DuckDB. Use it to preview rows, check schema, profile data, and answer what’s in this file. It’s best for read-file usage on real data artifacts, not source code.

Office Documents
Favorites 0GitHub 443
K
lamindb

by K-Dense-AI

The lamindb skill helps you work with LaminDB, an open-source biology data framework for making data queryable, traceable, reproducible, and FAIR. Use it for lamindb for Data Analysis, metadata curation, ontology-based annotation, schema validation, and lineage-aware workflows across notebooks and pipelines.

Data Analysis
Favorites 0GitHub 0
K
exploratory-data-analysis

by K-Dense-AI

The exploratory-data-analysis skill turns scientific files into format-aware EDA reports. It detects file type, summarizes structure and quality, extracts key metadata, and suggests downstream analysis. Use it for exploratory-data-analysis for Data Analysis across chemistry, bioinformatics, microscopy, spectroscopy, proteomics, metabolomics, and other scientific file formats.

Data Analysis
Favorites 0GitHub 0
Data Cleaning agent skills