Doc Scraper MCP Server

A Model Context Protocol (MCP) server that provides documentation scraping functionality. This server converts web-based documentation into markdown format using jina.ai's conversion service.

Features

Scrapes documentation from any web URL
Converts HTML documentation to markdown format
Saves the converted documentation to a specified output path
Integrates with the Model Context Protocol (MCP)

Installation

Installing via Smithery

To install Doc Scraper for Claude Desktop automatically via Smithery:

bash

npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claude

1. Clone the repository:

bash

git clone https://github.com/askjohngeorge/mcp-doc-scraper.git
cd mcp-doc-scraper

2. Create and activate a virtual environment:

bash

python -m venv venv
source venv/bin/activate  # On Windows, use: venv\Scripts\activate

3. Install the dependencies:

bash

pip install -e .

Usage

The server can be run using Python:

bash

python -m mcp_doc_scraper

Tool Description

The server provides a single tool:

Name: scrape_docs
Description: Scrape documentation from a URL and save as markdown
Input Parameters:
url: The URL of the documentation to scrape
output_path: The path where the markdown file should be saved

Project Structure

code

doc_scraper/
├── __init__.py
├── __main__.py
└── server.py

Dependencies

aiohttp
mcp
pydantic

Development

To set up the development environment:

1. Install development dependencies:

bash

pip install -r requirements.txt

2. The server uses the Model Context Protocol. Make sure to familiarize yourself with MCP documentation.

License

MIT License

Mcp Doc Scraper

Documentation

Doc Scraper MCP Server

Features

Installation

Installing via Smithery

Usage

Tool Description

Project Structure

Dependencies

Development

License

Similar MCP

Esp Rainmaker Mcp

Personalizationmcp

Fal Mcp Server

Opengenes Mcp

Trending MCP

Playwright Mcp

Serena

Mcp Playwright

Mcp Server Cloudflare

Similar MCP

Esp Rainmaker Mcp

Personalizationmcp

Fal Mcp Server

Opengenes Mcp

Trending MCP

Playwright Mcp

Serena

Mcp Playwright

Mcp Server Cloudflare