Web Scraper MCP

Created By

navin4078a year ago

Scrape websites and let them talk to your LLM

Overview

What is MCP Web Scraper?

MCP Web Scraper is a lightweight and efficient web scraping server that allows users to scrape websites and interact with their data using the Model Context Protocol (MCP).

How to use MCP Web Scraper?

To use MCP Web Scraper, you can either automate the setup by cloning the repository and running the setup script, or manually set it up by creating a virtual environment and installing the required dependencies.

Key features of MCP Web Scraper?

Text, link, image, and table data extraction with CSS selectors.
Comprehensive metadata extraction including Open Graph and Twitter Cards.
Integration with Claude Desktop for seamless operation.
Configurable result limits and error handling.

Use cases of MCP Web Scraper?

Extracting text content from various websites.
Gathering headlines and metadata for news articles.
Scraping images and tables for data analysis.

FAQ from MCP Web Scraper?

Can MCP Web Scraper handle all types of websites?

Yes, it can scrape a wide variety of websites as long as they allow it in their robots.txt file.

Is there a limit to the number of results I can scrape?

Yes, you can configure the maximum number of results to prevent overload.

What dependencies does MCP Web Scraper require?

It requires libraries like requests, beautifulsoup4, and lxml for web scraping.

Try in Playground

Server Config

{
  "mcpServers": {
    "web-scraper": {
      "command": "/full/path/to/your/venv/bin/python",
      "args": [
        "/full/path/to/your/app_mcp.py"
      ]
    }
  }
}

Project Info

Created At

a year ago

Updated At

a year ago

Author Name

navin4078

Star

Language

License

Recommend Servers

8 days ago

Automatically create a remote browser to complete your specified tasks, developed based on Browser Use + Sandbox. 自动创建一个远程浏览器，完成你指定的任务，基于Browser Use + Sandbox开发。

a year ago

Greece Payments Mcp

@junter1989k-ai

8 days ago

PostgreSQL

@modelcontextprotocol

Read-only database access with schema inspection

a year ago

Concordance

@matharrismma

Language models generate; Concordance verifies. The verify tool checks a claim deterministically (no model in the loop) and returns HOLDS / BROKEN / INCOMPLETE with the worked reasoning and a sealed receipt (content_hash + cite_url) that re-fetches byte-identical or not at all. Also: ranked search over an ~11k-record library, seal_fetch to re-verify any receipt, redact to strip PII before text travels, and a sealed connection graph. Runs sovereign/offline too (stdlib-first Python). A public false-positive benchmark covers every domain: the engine has never sealed a falsehood. 38 tools live; remote endpoint at https://narrowhighway.com/mcp.

9 days ago

Frontrun

@jongall45

VC follow intelligence for AI agents. Track what top investors follow on X — detect new follows, convergence signals, and trending companies before they're announced.

3 days ago

Algeria Payments Mcp

@junter1989k-ai

8 days ago

Brave Search

@modelcontextprotocol

Web and local search using Brave's Search API

a year ago

Redis

@modelcontextprotocol

A Model Context Protocol server that provides access to Redis databases. This server enables LLMs to interact with Redis key-value stores through a set of standardized tools.

a year ago

Filesystem

Secure file operations with configurable access controls

a year ago

8 days ago

8 days ago

@modelcontextprotocol

Browser automation and web scraping

a year ago

8 days ago

Zhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.

a year ago

8 days ago

9 days ago

8 days ago

South Africa Payments Mcp

@junter1989k-ai

9 days ago

Costa Rica Payments Mcp

@junter1989k-ai

8 days ago

Songcheck Ai Music Detector

@afghanfansmedia-ai

Is this song AI or human? SongCheck detects AI-generated music (Suno, Udio, and more) and media from any AI agent. Point it at an audio file, an image/video, or a whole music folder and it returns a verdict (LIKELY AI-GENERATED / UNCERTAIN / LIKELY HUMAN), an AI-probability score, confidence, and provenance signals (Content Credentials / SynthID watermark, generator hints). Tools: detect_ai_music, detect_ai_media, scan_catalog (audit an entire catalog), and songcheck_health. Free tier is 5 checks/day; a paid key unlocks unlimited catalog scans. Powered by SongCheck (Khaled Media), a self-hosted v9 ensemble detector.

3 days ago

Bulgaria Payments Mcp

@junter1989k-ai

8 days ago

ShareMyPage

@Henning Witzel-Acikgöz

Host the HTML or Markdown pages your AI generates and share each as a link with comments, versioning, and access control. Create, update, and organize pages and read reviewer comments over MCP.

8 days ago

Aiimagemultistyle

@codecraftm

A Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.

a year ago

8 days ago

Create a remote sandbox that can execute code/run commands/upload and download files. 创建远程沙盒，可以执行代码/运行命令/上传下载文件

a year ago