# 🔍 McpDeepResearch

An MCP (Model Context Protocol) Server for Deep Academic Research

---

## English

McpDeepResearch is a lightweight [MCP](https://modelcontextprotocol.io/) (Model Context Protocol) server that helps you discover, retrieve, and read academic papers. It searches **Google Scholar** and returns web pages and papers as clean Markdown.

### ✨ Features

* **search_scholar_papers** – Google Scholar search with an optional year filter and date sort
* **fetch_md** – Convert *any* public web page to clean Markdown
* **fetch_paper** – Auto-detect the paper content (title, abstract, body, references) and strip the rest

### 🛠️ Prerequisites

- Python ≥ 3.10
- Google Chrome/Chromium (for headless fetching via the [Chrome DevTools Protocol](https://chromedevtools.github.io/devtools-protocol/))
- Environment variables

```bash
export CDP_ENDPOINT="http://localhost:9222"   # Chrome debugging port
export GOOGLE_PROXY="http://proxy:port"       # (optional) HTTP(S) proxy
```
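
If you script around the server, it can help to read the same two variables in one place. The following is a minimal sketch, not the server's own code: `Settings` and `load_settings` are hypothetical names, and the fallback to `http://localhost:9222` is an assumption taken from the example value above.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    cdp_endpoint: str            # Chrome DevTools Protocol endpoint
    google_proxy: Optional[str]  # optional HTTP(S) proxy, None when unset


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read CDP_ENDPOINT and GOOGLE_PROXY from a mapping (os.environ by default)."""
    # Assumed fallback: the example debugging port from the README.
    endpoint = env.get("CDP_ENDPOINT", "http://localhost:9222").rstrip("/")
    # Treat an empty string the same as an unset variable.
    proxy = env.get("GOOGLE_PROXY") or None
    return Settings(cdp_endpoint=endpoint, google_proxy=proxy)


if __name__ == "__main__":
    print(load_settings())
```

Passing the environment in as a mapping keeps the helper easy to exercise without touching the real process environment.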

### ⚙️ Quick Start

1. **Install via pip**

   ```bash
   pip install
   ```

2. **(Optional) Launch Chrome manually if it isn't already running**

   The server tries to launch a Chrome instance automatically. If that fails, start one yourself:

   ```bash
   google-chrome --remote-debugging-port=9222 --user-data-dir=/tmp/chrome-profile &
   ```
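
To confirm that Chrome is actually listening before you start the server, you can query the `/json/version` discovery endpoint that Chrome serves on its remote-debugging port. This is a sketch using only the standard library; `version_url` and `chrome_is_reachable` are hypothetical helper names, not part of McpDeepResearch.

```python
import json
import urllib.error
import urllib.request


def version_url(cdp_endpoint: str) -> str:
    """Build the URL of Chrome's /json/version discovery endpoint."""
    return cdp_endpoint.rstrip("/") + "/json/version"


def chrome_is_reachable(cdp_endpoint: str, timeout: float = 2.0) -> bool:
    """Return True if a browser answers on the remote-debugging port."""
    try:
        with urllib.request.urlopen(version_url(cdp_endpoint), timeout=timeout) as resp:
            info = json.load(resp)
    except (urllib.error.URLError, OSError, ValueError):
        # Connection refused, timeout, or a non-JSON reply: treat as not reachable.
        return False
    # A running Chrome reports the WebSocket URL used for CDP sessions.
    return "webSocketDebuggerUrl" in info


if __name__ == "__main__":
    print(chrome_is_reachable("http://localhost:9222"))
```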

3. **Run the MCP server**

   ```bash
   # Launch with SSE
   mcp-deep-research --transport sse --port 8000

   # Launch with STDIO
   mcp-deep-research --transport stdio
   ```

The server exposes three read-only tools to any MCP-capable client.
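
Many MCP clients register STDIO servers through a JSON file containing an `mcpServers` map. The sketch below generates such an entry for the STDIO command shown above. The `deep-research` label, the `client_config` helper, and the assumption that your client reads this layout are all illustrative; check your client's documentation for the exact file and location.

```python
import json


def client_config(cdp_endpoint: str = "http://localhost:9222") -> dict:
    """Build an `mcpServers` entry that starts the server over STDIO."""
    return {
        "mcpServers": {
            "deep-research": {  # hypothetical label; most clients accept any name
                "command": "mcp-deep-research",
                "args": ["--transport", "stdio"],
                "env": {"CDP_ENDPOINT": cdp_endpoint},
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(client_config(), indent=2))
```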

### 🧑‍🎓 Example Workflow in a Chat-UI
1. **Search**  
   *“Find recent papers on diffusion models after 2022.”*  
   → `search_scholar_papers("diffusion models", year=2022, sort_bd=True)`

2. **Fetch**  
   Pick an interesting paper link from the results and call  
   `fetch_paper("https://arxiv.org/abs/2304.12345")`

3. **Read**  
   The cleaned Markdown (title, abstract, full text) appears directly in the chat.
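
Under the hood, an MCP client turns each of these steps into a JSON-RPC 2.0 `tools/call` request. The sketch below builds the two requests from the workflow above. The `year` and `sort_bd` parameters come from the example call; the `query` and `url` argument names and the `tool_call` helper are assumptions, since the tool schemas are not shown here.

```python
import json
from itertools import count

_ids = count(1)  # JSON-RPC request ids should be unique within a session


def tool_call(name: str, **arguments) -> dict:
    """Wrap a tool invocation in an MCP `tools/call` JSON-RPC 2.0 request."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


if __name__ == "__main__":
    # `query` and `url` are assumed argument names for illustration.
    search = tool_call("search_scholar_papers", query="diffusion models", year=2022, sort_bd=True)
    fetch = tool_call("fetch_paper", url="https://arxiv.org/abs/2304.12345")
    print(json.dumps(search, indent=2))
    print(json.dumps(fetch, indent=2))
```

In practice an MCP client library sends these messages for you; the raw shape is mainly useful when debugging a transport.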

### 🔒 Security
- 100% read-only: no writes, no uploads, no local file access.  
- All traffic respects the original site’s robots.txt.  
- Proxies can be configured to stay within institutional or regional firewalls.
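
To illustrate what respecting robots.txt means in practice, here is a small sketch using Python's standard `urllib.robotparser`. It is not the server's actual implementation; the `allowed` helper and the sample rules are hypothetical.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if `robots_txt` permits `user_agent` to fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Sample rules for illustration only.
    rules = "User-agent: *\nDisallow: /private/\n"
    print(allowed(rules, "https://example.org/papers/1"))
    print(allowed(rules, "https://example.org/private/x"))
```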

### 🤝 Contributing
PRs are welcome!  
