Scraping

Browse agent skills tagged with Scraping and compare related workflows across the directory.

15 skills
B
browser-use

by browser-use

browser-use is a browser automation skill for opening pages, inspecting state, clicking indexed elements, typing into fields, taking screenshots, and reusing a persistent browser session. Use it for reliable form filling, navigation, and logged-in workflows with the browser-use CLI.

Browser Automation
Favorites 0GitHub 84.9k
J
baoyu-url-to-markdown

by JimLiu

baoyu-url-to-markdown converts live URLs to Markdown with a vendored baoyu-fetch CLI using Chrome CDP, site adapters, and generic fallback. Review Bun runtime needs, first-time EXTEND.md setup, and usage for X, YouTube, Hacker News, and rendered pages.

Format Conversion
Favorites 0GitHub 13.2k
O
multi-search-engine

by openclaw

multi-search-engine is a web research skill with 17 search engines, advanced operators, time filters, privacy-focused options, and WolframAlpha queries. It helps agents build and run better search URLs without API keys.

Web Research
Favorites 0GitHub 3.8k
S
web-to-markdown

by softaworks

web-to-markdown is a Format Conversion skill that turns live web pages into clean Markdown through the local web2md CLI, using a Chromium-family browser for JS-rendered pages, interactive flows, and batch URL conversion. It only runs when explicitly invoked by name.

Format Conversion
Favorites 0GitHub 1.3k
F
firecrawl-agent

by firecrawl

firecrawl-agent helps extract structured JSON from complex, multi-page websites. Learn when to use it, how to run the Firecrawl CLI agent, add schemas, set starting URLs, and save outputs for pricing, products, and directory-style data extraction.

Web Scraping
Favorites 0GitHub 234
F
firecrawl-browser

by firecrawl

firecrawl-browser is a Firecrawl skill for interactive web automation. It is deprecated as a standalone browser command and now guides users to use firecrawl scrape plus firecrawl interact for clicks, forms, login flows, pagination, and JavaScript-heavy pages.

Browser Automation
Favorites 0GitHub 234
F
firecrawl

by firecrawl

firecrawl skill for installing, authenticating, and using the official Firecrawl CLI for web scraping, search, crawling, and page interaction. Learn setup, `firecrawl --status`, login, safe file output to `.firecrawl/`, and practical usage patterns backed by the repo.

Web Scraping
Favorites 0GitHub 234
F
firecrawl-crawl

by firecrawl

firecrawl-crawl helps agents bulk extract content from a website or docs section with path filters, depth limits, page caps, wait mode, and job status checks.

Web Scraping
Favorites 0GitHub 234
F
firecrawl-download

by firecrawl

firecrawl-download helps you download a site or docs section into organized local files under .firecrawl/. It combines site mapping and scraping, supports markdown, links, and screenshots, and is useful for offline docs copies, bulk page capture, and practical Web Scraping workflows.

Web Scraping
Favorites 0GitHub 234
F
firecrawl-search

by firecrawl

firecrawl-search is a web research skill for finding sources, running structured search, and optionally scraping full page content as JSON with Firecrawl CLI.

Web Research
Favorites 0GitHub 234
F
firecrawl-map

by firecrawl

firecrawl-map helps agents discover and list URLs on a site, with options for search filtering, limits, JSON output, sitemap modes, and subdomain control before deeper scraping or crawling.

Web Scraping
Favorites 0GitHub 234
F
firecrawl-scrape

by firecrawl

firecrawl-scrape helps extract clean, LLM-friendly content from known URLs, including JS-rendered pages. Use it to scrape markdown, links, or page-specific answers with Firecrawl CLI or npx firecrawl.

Web Scraping
Favorites 0GitHub 234
X
x-twitter-scraper

by Xquik-dev

Use x-twitter-scraper to retrieve X (Twitter) data and confirmation-gated actions through Xquik. It supports tweet search, user lookup, follower extraction, media download, monitors, webhooks, MCP, and write actions. Best for web scraping-style research with an API key, not X login secrets.

Web Scraping
Favorites 0GitHub 71
X
tweetclaw

by Xquik-dev

tweetclaw is the installable OpenClaw plugin for structured X/Twitter workflows. This tweetclaw skill covers install, setup, credential boundaries, explicit approval for writes and paid actions, private-data handling, monitor controls, and practical tweetclaw usage for safer Social Media operations.

Social Media
Favorites 0GitHub 37
R
reddit

by ReScienceLab

The reddit skill retrieves Reddit posts, comment threads, subreddit metadata, and user profiles through the public JSON API. It’s built for Reddit research, subreddit scanning, and source-backed web research when you need real posts instead of a generic summary. No API key is required.

Web Research
Favorites 0GitHub 0
Scraping tagged agent skills