
azure-storage-file-datalake-py

by microsoft

azure-storage-file-datalake-py is the Python skill for Azure Data Lake Storage Gen2. It helps backend developers and agents install, authenticate, and use the Azure SDK for hierarchical file system tasks like listing, uploading, downloading, and managing directories and files.

Stars: 2.3k
Favorites: 0
Comments: 0
Added: May 8, 2026
Category: Backend Development
Install Command
npx skills add microsoft/skills --skill azure-storage-file-datalake-py
Curation Score

This skill scores 78/100, solid enough to list in the directory. For users, that means it looks install-worthy for real Azure Data Lake Storage Gen2 work: the trigger terms are explicit, the installation and auth setup is concrete, and the doc appears to cover a usable client hierarchy rather than a placeholder. It is still best suited to users already working with Azure storage rather than those seeking a broadly guided, end-to-end workflow skill.

Strengths
  • Explicit triggerability for ADLS Gen2 terms like DataLakeServiceClient, FileSystemClient, and hierarchical namespace
  • Concrete installation and auth guidance, including pip install and Azure environment variables
  • Real SDK-focused content with substantial body length and no placeholder/demo markers
Cautions
  • Repository evidence shows only one workflow signal and no supporting scripts/references, so advanced usage may require outside documentation
  • Description is very short, so install decision pages may need to infer scope from the body rather than the metadata
Overview

Overview of azure-storage-file-datalake-py skill

azure-storage-file-datalake-py is the Python skill for working with Azure Data Lake Storage Gen2 through the azure-storage-file-datalake SDK. It helps you do real storage work: connect to a DFS endpoint, authenticate safely, and manage file systems, directories, and files in a hierarchical namespace.

This skill is best for backend developers, data platform engineers, and agents that need the azure-storage-file-datalake-py skill for upload/download flows, directory traversal, and storage automation. It is more useful than a generic prompt when you need the correct Azure client hierarchy and authentication pattern, especially for production environments where credential choice matters.

What the skill is for

Use azure-storage-file-datalake-py when the task depends on ADLS Gen2 concepts such as DataLakeServiceClient, FileSystemClient, or DataLakeDirectoryClient. The practical job-to-be-done is not “write Python code,” but “wire the right Azure client to the right storage operation without guessing the API shape.”
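That client wiring can be sketched as follows, assuming the azure-storage-file-datalake and azure-identity packages are installed; the account, file system, and directory names are placeholders:

```python
def dfs_endpoint(account: str) -> str:
    """Build the DFS endpoint URL for an ADLS Gen2 storage account."""
    return f"https://{account}.dfs.core.windows.net"


def get_directory_client(account: str, file_system: str, directory: str):
    """Walk the hierarchy: DataLakeServiceClient -> FileSystemClient -> DataLakeDirectoryClient."""
    # SDK imports kept inside the function so the pure helper above
    # also works in environments without the Azure packages.
    from azure.identity import DefaultAzureCredential
    from azure.storage.filedatalake import DataLakeServiceClient

    service = DataLakeServiceClient(
        account_url=dfs_endpoint(account),
        credential=DefaultAzureCredential(),
    )
    return service.get_file_system_client(file_system).get_directory_client(directory)
```

Each level of the hierarchy scopes the operation: the service client knows the account, the file system client knows the container-level namespace, and the directory client knows the path.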

What makes it different

The key advantage of azure-storage-file-datalake-py is that it centers hierarchical file-system operations rather than flat blob storage patterns. That matters when your workflow includes directories, path semantics, recursive listing, or analytics pipelines that expect ADLS Gen2 behavior.
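One hierarchical operation with no flat-blob equivalent is an atomic directory rename. A minimal sketch (names are placeholders; per the SDK, `rename_directory` expects the destination qualified with its file system name):

```python
def qualified_name(file_system: str, path: str) -> str:
    """rename_directory expects the destination as '<file-system>/<path>'."""
    return f"{file_system}/{path.lstrip('/')}"


def rename_dir(account: str, file_system: str, old_path: str, new_path: str) -> None:
    """Atomically rename a directory and everything under it in one call."""
    from azure.identity import DefaultAzureCredential
    from azure.storage.filedatalake import DataLakeServiceClient

    service = DataLakeServiceClient(
        account_url=f"https://{account}.dfs.core.windows.net",
        credential=DefaultAzureCredential(),
    )
    directory = service.get_file_system_client(file_system).get_directory_client(old_path)
    directory.rename_directory(new_name=qualified_name(file_system, new_path))
```

On flat Blob Storage, the same move would mean copying and deleting every object under the prefix.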

Best-fit and misfit cases

Choose this skill if you are building backend services, ingestion jobs, or admin tooling around Azure Data Lake Storage Gen2. Do not choose it for generic Azure storage advice, non-Python stacks, or plain Blob Storage workflows that do not require a hierarchical namespace.

How to Use azure-storage-file-datalake-py skill

Install the skill in your workflow

For a directory or agent environment, install with:

npx skills add microsoft/skills --skill azure-storage-file-datalake-py

If you are not using the directory installer, the important part is that the azure-storage-file-datalake-py install context includes the skill file plus its supporting repo metadata. The skill has no extra helper scripts, so the main behavior comes from SKILL.md itself.

Read the right files first

Start with SKILL.md, because that is where the usage pattern, auth assumptions, and client hierarchy live. In this repo, there are no rules/, references/, or resources/ folders to rescue missing context, so you should treat SKILL.md as the source of truth.

Give the skill a complete task brief

For strong azure-storage-file-datalake-py usage, do not ask for “help with Data Lake.” Provide:

  • the account type and endpoint form, such as https://<account>.dfs.core.windows.net
  • whether the task is local dev, CI, managed identity, or production service-to-service auth
  • the file operation you need: list, create, upload, rename, delete, or recursive copy
  • the object scope: file system, directory, or file path
  • any constraints such as idempotency, overwrite rules, or large-file handling

A weak prompt is: “Write ADLS code.”
A stronger prompt is: “Using azure-storage-file-datalake-py, generate Python code to list all files under /landing/raw/ in my datalake-prod file system with DefaultAzureCredential, and make it safe to rerun.”
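The stronger prompt above might produce something like this sketch (`datalake-prod` and `/landing/raw/` come from the prompt; listing is read-only, so rerunning it is naturally safe):

```python
def normalize_prefix(prefix: str) -> str:
    """get_paths expects a path without leading or trailing slashes."""
    return prefix.strip("/")


def list_files(account: str, file_system: str, prefix: str) -> list[str]:
    """Recursively list file paths (skipping directories) under prefix."""
    from azure.identity import DefaultAzureCredential
    from azure.storage.filedatalake import DataLakeServiceClient

    service = DataLakeServiceClient(
        account_url=f"https://{account}.dfs.core.windows.net",
        credential=DefaultAzureCredential(),
    )
    fs = service.get_file_system_client(file_system)
    return [
        p.name
        for p in fs.get_paths(path=normalize_prefix(prefix), recursive=True)
        if not p.is_directory
    ]


# e.g. list_files("myaccount", "datalake-prod", "/landing/raw/")
```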

Use the client hierarchy correctly

A good azure-storage-file-datalake-py guide should lead you from service client to file system client to directory or file client. If your output skips that hierarchy, it often becomes brittle or incomplete. Ask for code that shows where each client is created and why, especially when the operation crosses directories or needs path-specific behavior.

azure-storage-file-datalake-py skill FAQ

Is azure-storage-file-datalake-py only for Azure experts?

No. It is usable by beginners who already know they need Azure Data Lake Storage Gen2, but it assumes you can describe your target account, auth method, and operation. If those inputs are vague, the output will also be vague.

How is this different from a normal Python prompt?

A normal prompt may produce generic Azure code that confuses Blob Storage and Data Lake Storage. The azure-storage-file-datalake-py skill is narrower: it pushes the correct SDK package, authentication flow, and hierarchical file-system model.

When should I not use this skill?

Do not use azure-storage-file-datalake-py if you need non-Python implementation, simple blob-object storage, or a tutorial-style explanation unrelated to real backend work. It is also a poor fit if you cannot specify the account URL or auth approach.

Does it help with production-ready auth?

Yes, if you say which auth path you need. The skill’s most valuable decision point is choosing between local development credentials and production credentials such as managed identity or a credential selected via AZURE_TOKEN_CREDENTIALS.

How to Improve azure-storage-file-datalake-py skill

Specify the exact storage shape

The biggest quality gain comes from naming the file system and path structure up front. Tell the model whether you are working at the container, directory, or file level, because azure-storage-file-datalake-py behaves differently depending on where the operation starts and ends.

Tell it which auth path to optimize for

The most common failure mode is mixing local and production authentication in one answer. If you want the azure-storage-file-datalake-py skill to produce useful code, say whether you expect DefaultAzureCredential, managed identity, or another credential class, and note if environment variables must be present.

Ask for output that matches your runtime

If your app is a backend service, ask for reusable functions, explicit client creation, and minimal side effects. If your need is a one-off admin task, ask for a short script instead. The same azure-storage-file-datalake-py usage can produce very different results depending on the target runtime.
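For the backend-service case, "reusable functions with explicit client creation" might look like this sketch: an upload helper whose overwrite behavior is an explicit parameter rather than an implied default (names are placeholders):

```python
def upload_bytes(account: str, file_system: str, path: str,
                 data: bytes, overwrite: bool = False) -> None:
    """Upload data to a file path; fails on an existing file unless overwrite=True."""
    from azure.identity import DefaultAzureCredential
    from azure.storage.filedatalake import DataLakeServiceClient

    service = DataLakeServiceClient(
        account_url=f"https://{account}.dfs.core.windows.net",
        credential=DefaultAzureCredential(),
    )
    file_client = service.get_file_system_client(file_system).get_file_client(path)
    file_client.upload_data(data, overwrite=overwrite)
```

A one-off admin script could inline the same calls at module level; the function form is what a service would want to import and test.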

Iterate on path-specific failures

If the first result is close but not usable, refine the prompt with the exact symptom: authorization failure, missing directory, wrong endpoint, or path encoding issue. That turns the azure-storage-file-datalake-py guide from generic scaffolding into a targeted fix and usually improves the next answer faster than asking for a full rewrite.

Ratings & Reviews

No ratings yet