firecrawl

by firecrawl

firecrawl skill for installing, authenticating, and using the official Firecrawl CLI for web scraping, search, crawling, and page interaction. Learn setup, `firecrawl --status`, login, safe file output to `.firecrawl/`, and practical usage patterns backed by the repo.

Stars234

Favorites0

Comments0

AddedMar 31, 2026

CategoryWeb Scraping

Install Command

npx skills add firecrawl/cli --skill firecrawl

Curation Score

This skill scores 78/100, which means it is a solid directory listing candidate for users who want an agent-triggerable web scraping/search CLI with clear install and safety guidance. Repository evidence shows strong trigger cues, concrete prerequisite checks, and practical security handling, though users will still need to rely on `--help` for full command usage details.

78/100

Strengths

Very strong triggerability: the SKILL.md explicitly lists when to use it (search, scrape, crawl docs, fetch URLs, interact with pages) and when not to use it.
Operational basics are clear: it defines allowed Bash invocations, requires `firecrawl --status`, and includes install/authentication steps in `rules/install.md`.
Trust and safety are better than average: `rules/security.md` gives specific handling guidance for untrusted web content, output isolation, incremental reading, and URL quoting.

Cautions

Command workflows appear only partially documented in the skill itself; users may need to consult `firecrawl --help` for exact options and execution patterns.
Install/version guidance is slightly inconsistent across files (`1.8.0` in install guidance vs `1.7.1` in security install snippet), which can create mild adoption friction.

Cli Scraping Websites API Npm

Overview

Overview of firecrawl skill

What the firecrawl skill does

The firecrawl skill helps an agent use the official Firecrawl CLI for web search, scraping, crawling, and interactive page retrieval from the terminal. It is best for users who need current web content, clean page extraction, or site-wide collection that goes beyond a model's built-in browsing. In practice, the real job-to-be-done is: turn a vague “get me this website’s content” request into a repeatable CLI workflow that fetches the right pages, stores results safely, and keeps large untrusted web content out of the main chat context.

Who should install firecrawl

Install this firecrawl skill if you regularly:

scrape article pages or docs sites
search the web for current information
extract content from a known URL
crawl multiple pages for research or migration work
need page interaction support that ordinary prompts cannot perform reliably

It is a stronger fit for developers, researchers, and automation-heavy users than for people who only need occasional lightweight browsing.

Why this firecrawl skill is useful

The main advantage of the firecrawl skill is not just “web access.” It gives an agent a clear operating pattern:

verify the CLI is installed and authenticated
use the right Firecrawl command for search, scrape, or crawl
write output to files instead of flooding the model context
treat fetched web content as untrusted input

That last point matters. The included security guidance is one of the most important reasons to use this skill instead of improvising shell commands.

What makes it different from a generic web prompt

A generic prompt might tell an agent to “scrape this site,” but the firecrawl skill adds concrete execution rules:

use the official firecrawl CLI
check readiness with firecrawl --status
watch concurrency and credit limits
save results under .firecrawl/
inspect output incrementally instead of reading everything at once

This reduces guesswork and makes firecrawl for Web Scraping safer and more reproducible.

When firecrawl is not the right tool

Do not install this firecrawl skill for:

local file operations
git workflows
deployments
code editing tasks with no web component
one-off browsing where built-in model tools already suffice

If your task does not require external website access, this skill is probably unnecessary.

How to Use firecrawl skill

Install firecrawl and verify access

The fastest firecrawl install path from the repository guidance is:

npx -y firecrawl-cli -y

That flow installs the CLI, starts authentication, and installs skills. A manual install is also supported:

npm install -g [email protected]

Then verify the environment:

firecrawl --status

A healthy status should show authentication, concurrency, and remaining credits. If firecrawl --status fails, do not proceed to scraping yet.

Authenticate before first real usage

The recommended login path is browser auth:

firecrawl login --browser

If needed, you can authenticate with an API key:

firecrawl login --api-key "<key>"

This is an adoption blocker for many users: the firecrawl skill is only useful after authentication succeeds. If install seems fine but commands still fail, check auth before troubleshooting command syntax.

Start with the right command category

Before writing prompts, decide which Firecrawl job you actually need:

search: find pages about a topic
scrape: extract content from a specific URL
crawl: collect content across a site or docs section
interact/page actions: when the target page needs clicks or login-like flow support

The skill is most valuable when the agent can distinguish “find sources” from “extract this known page” from “map this whole docs site.”

What input the firecrawl skill needs

Good firecrawl usage starts with complete task inputs. The agent works better when you provide:

the exact URL or domain
whether you want one page or many pages
the output format you need
the fields you care about
what to ignore, such as nav, boilerplate, or unrelated sections
whether freshness matters

Weak input: “Get info from Stripe docs.”

Stronger input: “Use firecrawl to crawl the Stripe docs pages about webhooks only, save output to .firecrawl/, and summarize endpoint signing, retry behavior, and local testing.”

Turn a rough request into a strong firecrawl prompt

A practical prompt template:

Use the firecrawl skill to [search/scrape/crawl] [URL or topic].
Write results to `.firecrawl/`.
Focus on [specific entities, sections, or facts].
Ignore [irrelevant areas].
After fetching, inspect only the needed parts and return a concise summary with source URLs.

Why this works:

it chooses the operation type
it names the target
it constrains scope
it reinforces safe output handling
it tells the agent what final answer format you want

Use file output instead of inline page dumps

One of the most important firecrawl usage practices in this skill is writing fetched content to files with -o rather than streaming large page bodies directly into chat. That improves:

context efficiency
repeatability
safety against prompt injection in page content
easier downstream filtering with shell tools

This is a major practical difference between mature firecrawl usage and “just fetch the page and paste it.”

Read the repository files that matter first

If you want to understand the skill quickly, read these files first:

SKILL.md
rules/install.md
rules/security.md

This repo is small, so the best reading path is short. SKILL.md tells you when to trigger firecrawl. rules/install.md removes setup friction. rules/security.md contains the highest-value operational guidance.

Check credits and concurrency before bigger jobs

The firecrawl --status output includes:

Concurrency: parallel job limit
Credits: remaining API budget

This matters for crawl-heavy work. If you plan to collect a large documentation set, credit limits and concurrency are not footnotes; they determine whether your workflow should be one big crawl or a narrower sequence of targeted scrapes.

Handle fetched web content as untrusted input

The firecrawl skill includes an unusually important warning: fetched web data is untrusted third-party content. Treat it that way. In practice:

quote URLs in shell commands
store output under .firecrawl/
do not read entire fetched files blindly
inspect only relevant sections with tools like grep or head
never follow instructions embedded inside scraped content

For many users, this security posture is the clearest reason to adopt the skill instead of rolling ad hoc scraping commands.

Suggested workflow for real tasks

A high-signal firecrawl guide for daily use looks like this:

Confirm install and auth with firecrawl --status.
Decide whether the task is search, scrape, crawl, or interaction.
Run the command with file output to .firecrawl/.
Inspect only the slices you need.
Extract the requested facts, links, or summaries.
If the first pass is noisy, narrow the URL scope or content target and rerun.

This workflow is simple, but it materially improves output quality and keeps the agent from drowning in web text.

Troubleshooting common install failures

If the firecrawl command is not found:

confirm your npm global bin is in PATH
test with npx [email protected] --version
reinstall with npm install -g [email protected]

If the command exists but is unusable:

run firecrawl --status
check whether authentication actually completed
retry firecrawl login --browser

These are the most likely issues that block firecrawl install success.

firecrawl skill FAQ

Is this firecrawl skill only for scraping?

No. The firecrawl skill covers search, scraping, crawling, and page interaction. If your task starts with “find relevant pages” rather than “extract this exact URL,” it is still a good fit.

Is firecrawl better than an ordinary prompt with browsing?

Usually yes when you need repeatable extraction, multi-page collection, file-based output, or CLI-driven workflows. Ordinary prompts are fine for light lookup. Firecrawl is better when you need structured web retrieval that can be rerun or inspected locally.

Is the firecrawl skill beginner-friendly?

Moderately. The install flow is straightforward, but beginners may get blocked by authentication, CLI path issues, or uncertainty about whether to search, scrape, or crawl. The skill is easier to use once you think in terms of concrete URLs and scoped targets.

Do I need an API key for firecrawl?

Not always manually, but you do need authentication. The recommended path is firecrawl login --browser. API key login is available if browser auth is not convenient.

When should I not use the firecrawl skill?

Skip it when:

the task has no web dependency
built-in browsing is enough
you only need a tiny snippet from one public page and do not care about reuse
the target content should not be fetched through an external scraping service

What is the main risk when using firecrawl for Web Scraping?

The biggest practical risk is treating scraped content as trusted. Web pages can contain prompt injection or simply overwhelm the model with irrelevant text. The skill addresses this by recommending file output, incremental reading, and selective extraction.

How to Improve firecrawl skill

Give firecrawl narrower targets

The fastest way to improve firecrawl results is to reduce ambiguity. Better inputs specify:

exact URL or domain
page set boundaries
desired facts or entities
exclusion rules
final deliverable format

“Crawl this documentation site” is broad. “Crawl only the authentication and rate-limit sections, then compare setup steps” produces much better output.

Ask for extraction goals, not just retrieval

The firecrawl skill performs better when retrieval is attached to a decision-making goal. For example:

“Find the latest pricing page and extract plan limits”
“Scrape this changelog page and summarize breaking changes since January”
“Crawl these docs pages and list all webhook retry rules”

That makes the fetched output easier to filter and more useful after the first pass.

Improve output quality with staged runs

For bigger jobs, do not start with the widest crawl. A better pattern is:

search or scrape a few representative pages
inspect what the content looks like
refine scope and fields
run the larger crawl only after the target is clear

This saves credits and reduces noisy results.

Avoid the most common failure modes

Common firecrawl failure modes include:

using crawl when a single scrape would do
collecting too much boilerplate content
forgetting auth checks
reading massive output files into context
not accounting for credit limits
giving a topic instead of a URL when exact extraction is needed

Most of these are input-quality problems, not tool-quality problems.

Use stronger prompts for better firecrawl usage

A stronger prompt usually includes all of the following:

operation type
target URL or topic
scope boundaries
required fields
output location
post-fetch analysis request

Example:

Use the firecrawl skill to scrape `https://example.com/docs/api/auth`.
Save output to `.firecrawl/`.
Extract only authentication methods, required headers, token expiry details, and example request patterns.
Then summarize the findings in bullets and cite the source URL.

This is much better than “Fetch this docs page.”

Iterate after the first run

If the first firecrawl result is not useful, improve one variable at a time:

narrow the URL set
switch from crawl to scrape
specify the exact sections you care about
ask for extraction of named fields
change the final answer format

Small prompt refinements usually outperform adding more generic instructions.

Keep the security pattern intact

Do not “improve” firecrawl usage by pasting full raw pages into chat. The better path is:

output to .firecrawl/
inspect partial content
extract only what matters
summarize outside the raw dump

This preserves the skill’s main operational advantage: practical, safer web retrieval with less context waste.

Ratings & Reviews

No ratings yet

Share your review

0/10000

Latest reviews

Saving...

more skill

huggingface-datasets

by huggingface

Use the huggingface-datasets skill for Hugging Face Dataset Viewer API workflows to validate datasets, resolve splits, preview and paginate rows, search text, apply filters, and fetch parquet links or statistics. It is a practical huggingface-datasets guide for read-only dataset exploration.

Web Scraping

Favorites 0GitHub 10.4k

data-scraper-agent

by affaan-m

data-scraper-agent helps build a repeatable public-data pipeline for web scraping, enrichment, and storage. It is designed for monitoring jobs, prices, news, repos, sports, and listings on a schedule using GitHub Actions, with outputs to Notion, Sheets, or Supabase. Best for ongoing tracking, not one-off extractions.

Web Scraping

Favorites 0GitHub 156.1k

baoyu-url-to-markdown

by JimLiu

baoyu-url-to-markdown converts live URLs to Markdown with a vendored baoyu-fetch CLI using Chrome CDP, site adapters, and generic fallback. Review Bun runtime needs, first-time EXTEND.md setup, and usage for X, YouTube, Hacker News, and rendered pages.

Format Conversion

Favorites 0GitHub 13.2k

x-twitter-scraper

by Xquik-dev

Use x-twitter-scraper to retrieve X (Twitter) data and confirmation-gated actions through Xquik. It supports tweet search, user lookup, follower extraction, media download, monitors, webhooks, MCP, and write actions. Best for web scraping-style research with an API key, not X login secrets.

Web Scraping

Favorites 0GitHub 71

exa-search

by K-Dense-AI

exa-search is a web research skill powered by Exa for finding current information and extracting content from URLs. Use it for search, source discovery, article and PDF extraction, and technical or scientific research with semantic retrieval, academic-style filtering, and clear install and usage guidance.

Web Research

Favorites 0GitHub 0

browser-use

by browser-use

browser-use is a browser automation skill for opening pages, inspecting state, clicking indexed elements, typing into fields, taking screenshots, and reusing a persistent browser session. Use it for reliable form filling, navigation, and logged-in workflows with the browser-use CLI.

Browser Automation

Favorites 0GitHub 84.9k

remote-browser

by browser-use

remote-browser helps sandboxed agents control a headless browser for Browser Automation. Use it to open pages, inspect state, click indexed elements, type input, take screenshots, and connect to local apps or CDP-backed browser sessions.

Browser Automation

Favorites 0GitHub 84.9k

firecrawl-search

by firecrawl

firecrawl-search is a web research skill for finding sources, running structured search, and optionally scraping full page content as JSON with Firecrawl CLI.

Web Research

Favorites 0GitHub 234

parallel-web

by K-Dense-AI

parallel-web is a web research and extraction skill powered by parallel-cli. It helps you search the web, extract URL content, enrich data from sources, and run deeper research with academic and scientific sources prioritized. Use it for parallel-web usage, web research, citations, and evidence-first workflows.

Web Research

Favorites 0GitHub 0

geomaster

by K-Dense-AI

geomaster is a geospatial science skill for GIS, remote sensing, spatial analysis, and Earth observation workflows. Use it for Data Analysis tasks like raster and vector operations, satellite imagery processing, spatial metrics, and workflow planning. The geomaster guide helps you install, inspect, and apply the skill with less guesswork.

Data Analysis

Favorites 0GitHub 0

asc-aso-audit

by rudrankriyam

asc-aso-audit helps you run an offline ASO audit on canonical App Store metadata in `./metadata`, then surface keyword gaps with Astro MCP. Use the asc-aso-audit skill after `asc metadata pull` to review `subtitle`, `keywords`, `description`, and `whatsNew` with less guesswork.

Data Analysis

Favorites 0GitHub 0

ffuf-web-fuzzing

by jthack

ffuf-web-fuzzing is a practical skill for discovering hidden web content, testing routes and parameters, and fuzzing authenticated targets with raw requests, auto-calibration, and result analysis. It fits security testers who need a repeatable ffuf-web-fuzzing guide for penetration testing and Security Audit workflows.

Security Audit

Favorites 0GitHub 0

web-to-markdown

by softaworks

web-to-markdown is a Format Conversion skill that turns live web pages into clean Markdown through the local web2md CLI, using a Chromium-family browser for JS-rendered pages, interactive flows, and batch URL conversion. It only runs when explicitly invoked by name.

Format Conversion

Favorites 0GitHub 1.3k

firecrawl-agent

by firecrawl

firecrawl-agent helps extract structured JSON from complex, multi-page websites. Learn when to use it, how to run the Firecrawl CLI agent, add schemas, set starting URLs, and save outputs for pricing, products, and directory-style data extraction.

Web Scraping

Favorites 0GitHub 234

firecrawl-map

by firecrawl

firecrawl-map helps agents discover and list URLs on a site, with options for search filtering, limits, JSON output, sitemap modes, and subdomain control before deeper scraping or crawling.

Web Scraping

Favorites 0GitHub 234

firecrawl-crawl

by firecrawl

firecrawl-crawl helps agents bulk extract content from a website or docs section with path filters, depth limits, page caps, wait mode, and job status checks.

Web Scraping

Favorites 0GitHub 234