Scraping

Browse agent skills tagged with Scraping and compare related workflows across the directory.

15 skills

browser-use

by browser-use

browser-use is a browser automation skill for opening pages, inspecting state, clicking indexed elements, typing into fields, taking screenshots, and reusing a persistent browser session. Use it for reliable form filling, navigation, and logged-in workflows with the browser-use CLI.

Browser Automation

Favorites 0GitHub 84.9k

baoyu-url-to-markdown

by JimLiu

baoyu-url-to-markdown converts live URLs to Markdown with a vendored baoyu-fetch CLI using Chrome CDP, site adapters, and generic fallback. Review Bun runtime needs, first-time EXTEND.md setup, and usage for X, YouTube, Hacker News, and rendered pages.

Format Conversion

Favorites 0GitHub 13.2k

multi-search-engine

by openclaw

multi-search-engine is a web research skill with 17 search engines, advanced operators, time filters, privacy-focused options, and WolframAlpha queries. It helps agents build and run better search URLs without API keys.

Web Research

Favorites 0GitHub 3.8k

web-to-markdown

by softaworks

web-to-markdown is a Format Conversion skill that turns live web pages into clean Markdown through the local web2md CLI, using a Chromium-family browser for JS-rendered pages, interactive flows, and batch URL conversion. It only runs when explicitly invoked by name.

Format Conversion

Favorites 0GitHub 1.3k

firecrawl-agent

by firecrawl

firecrawl-agent helps extract structured JSON from complex, multi-page websites. Learn when to use it, how to run the Firecrawl CLI agent, add schemas, set starting URLs, and save outputs for pricing, products, and directory-style data extraction.

Web Scraping

Favorites 0GitHub 234

firecrawl-browser

by firecrawl

firecrawl-browser is a Firecrawl skill for interactive web automation. It is deprecated as a standalone browser command and now guides users to use firecrawl scrape plus firecrawl interact for clicks, forms, login flows, pagination, and JavaScript-heavy pages.

Browser Automation

Favorites 0GitHub 234

firecrawl

by firecrawl

firecrawl skill for installing, authenticating, and using the official Firecrawl CLI for web scraping, search, crawling, and page interaction. Learn setup, `firecrawl --status`, login, safe file output to `.firecrawl/`, and practical usage patterns backed by the repo.

Web Scraping

Favorites 0GitHub 234

firecrawl-crawl

by firecrawl

firecrawl-crawl helps agents bulk extract content from a website or docs section with path filters, depth limits, page caps, wait mode, and job status checks.

Web Scraping

Favorites 0GitHub 234

firecrawl-download

by firecrawl

firecrawl-download helps you download a site or docs section into organized local files under .firecrawl/. It combines site mapping and scraping, supports markdown, links, and screenshots, and is useful for offline docs copies, bulk page capture, and practical Web Scraping workflows.

Web Scraping

Favorites 0GitHub 234

firecrawl-search

by firecrawl

firecrawl-search is a web research skill for finding sources, running structured search, and optionally scraping full page content as JSON with Firecrawl CLI.

Web Research

Favorites 0GitHub 234

firecrawl-map

by firecrawl

firecrawl-map helps agents discover and list URLs on a site, with options for search filtering, limits, JSON output, sitemap modes, and subdomain control before deeper scraping or crawling.

Web Scraping

Favorites 0GitHub 234

firecrawl-scrape

by firecrawl

firecrawl-scrape helps extract clean, LLM-friendly content from known URLs, including JS-rendered pages. Use it to scrape markdown, links, or page-specific answers with Firecrawl CLI or npx firecrawl.

Web Scraping

Favorites 0GitHub 234

x-twitter-scraper

by Xquik-dev

Use x-twitter-scraper to retrieve X (Twitter) data and confirmation-gated actions through Xquik. It supports tweet search, user lookup, follower extraction, media download, monitors, webhooks, MCP, and write actions. Best for web scraping-style research with an API key, not X login secrets.

Web Scraping

Favorites 0GitHub 71

tweetclaw

by Xquik-dev

tweetclaw is the installable OpenClaw plugin for structured X/Twitter workflows. This tweetclaw skill covers install, setup, credential boundaries, explicit approval for writes and paid actions, private-data handling, monitor controls, and practical tweetclaw usage for safer Social Media operations.

Social Media

Favorites 0GitHub 37

by ReScienceLab

The reddit skill retrieves Reddit posts, comment threads, subreddit metadata, and user profiles through the public JSON API. It’s built for Reddit research, subreddit scanning, and source-backed web research when you need real posts instead of a generic summary. No API key is required.

Web Research

Favorites 0GitHub 0