Updated May 30, 2025
Doc Scraper MCP Server
A Model Context Protocol (MCP) server that provides documentation scraping functionality. This server converts web-based documentation into markdown format using jina.ai's conversion service.
Features
- Scrapes documentation from any web URL
- Converts HTML documentation to markdown format
- Saves the converted documentation to a specified output path
- Integrates with the Model Context Protocol (MCP)
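The scrape, convert, and save steps listed above can be sketched roughly as follows. This is a minimal sketch, not the server's actual code. It assumes the conversion goes through Jina's Reader endpoint, which returns markdown for a page when the page URL is appended to `https://r.jina.ai/`. The helper names `build_reader_url`, `save_markdown`, and `scrape_to_markdown` are hypothetical, and the standard library's `urllib` stands in for `aiohttp` so the sketch has no third-party dependencies.

```python
from pathlib import Path
from urllib.request import Request, urlopen

# Assumption: Jina's Reader endpoint; the real server may call it differently.
JINA_READER_PREFIX = "https://r.jina.ai/"


def build_reader_url(url: str) -> str:
    """Prefix the target URL with the Reader endpoint."""
    return JINA_READER_PREFIX + url


def save_markdown(markdown: str, output_path: str) -> Path:
    """Write the markdown to output_path, creating parent directories."""
    path = Path(output_path)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(markdown, encoding="utf-8")
    return path


def scrape_to_markdown(url: str, output_path: str, timeout: float = 30.0) -> Path:
    """Fetch the converted markdown for url and save it (needs network access)."""
    request = Request(
        build_reader_url(url),
        headers={"User-Agent": "doc-scraper-sketch"},  # hypothetical agent string
    )
    with urlopen(request, timeout=timeout) as response:
        markdown = response.read().decode("utf-8")
    return save_markdown(markdown, output_path)
```

Calling `scrape_to_markdown("https://example.com/docs", "docs/example.md")` would fetch the converted page and write it to `docs/example.md`; the URL and path here are placeholders.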
Installation
Installing via Smithery
To install Doc Scraper for Claude Desktop automatically via Smithery:
bash
npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claude
1. Clone the repository:
bash
git clone https://github.com/askjohngeorge/mcp-doc-scraper.git
cd mcp-doc-scraper
2. Create and activate a virtual environment:
bash
python -m venv venv
source venv/bin/activate  # On Windows, use: venv\Scripts\activate
3. Install the dependencies:
bash
pip install -e .
Usage
The server can be run using Python:
bash
python -m mcp_doc_scraper
Tool Description
The server provides a single tool:
- Name: scrape_docs
- Description: Scrape documentation from a URL and save as markdown
- Input Parameters:
  - url: The URL of the documentation to scrape
  - output_path: The path where the markdown file should be saved
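MCP tools declare their inputs as a JSON Schema object, so the description above might translate into something like the descriptor below. This is a sketch under assumptions: typing both parameters as strings and marking both as required is a guess, and `SCRAPE_DOCS_TOOL` and `validate_arguments` are hypothetical names rather than part of the server.

```python
# Hypothetical descriptor for the scrape_docs tool; the real server may
# generate an equivalent schema from pydantic models instead.
SCRAPE_DOCS_TOOL = {
    "name": "scrape_docs",
    "description": "Scrape documentation from a URL and save as markdown",
    "inputSchema": {
        "type": "object",
        "properties": {
            "url": {
                "type": "string",
                "description": "The URL of the documentation to scrape",
            },
            "output_path": {
                "type": "string",
                "description": "The path where the markdown file should be saved",
            },
        },
        "required": ["url", "output_path"],  # assumption: both are mandatory
    },
}


def validate_arguments(arguments: dict) -> list[str]:
    """Return a list of problems found in arguments (empty when valid)."""
    schema = SCRAPE_DOCS_TOOL["inputSchema"]
    problems = []
    for name in schema["required"]:
        if name not in arguments:
            problems.append(f"missing required parameter: {name}")
        elif not isinstance(arguments[name], str):
            problems.append(f"parameter {name} must be a string")
    return problems
```

Checking arguments before making the network request lets a malformed call fail fast with a readable message instead of surfacing as an HTTP or file-system error.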
Project Structure
code
doc_scraper/
├── __init__.py
├── __main__.py
└── server.py
Dependencies
- aiohttp
- mcp
- pydantic
Development
To set up the development environment:
1. Install development dependencies:
bash
pip install -r requirements.txt
2. The server uses the Model Context Protocol. Make sure to familiarize yourself with the MCP documentation.
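For orientation while reading the MCP documentation: MCP messages are JSON-RPC 2.0, and a client invokes a tool with a `tools/call` request. The snippet below is a minimal sketch of the request a client might send for `scrape_docs`; the helper name `build_tool_call` and the example URL and path are hypothetical.

```python
import json


def build_tool_call(request_id: int, url: str, output_path: str) -> str:
    """Serialize a JSON-RPC 2.0 tools/call request for scrape_docs."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "scrape_docs",
            "arguments": {"url": url, "output_path": output_path},
        },
    }
    return json.dumps(message)


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(build_tool_call(1, "https://example.com/docs", "docs/example.md"))
```

In practice an MCP client library builds and sends this message for you; writing it out by hand is mainly useful for debugging the server over stdio.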
License
MIT License
