
ArXiv MCP Server
Search, download, and read arXiv research papers from your AI assistant via MCP.
Add to your client
Copy the config for your MCP client and paste it into its config file.
uv tool install arxiv-mcp-serverPaste into ~/Library/Application Support/Claude/claude_desktop_config.json
{
"mcpServers": {
"arxiv-mcp-server": {
"command": "uv",
"args": [
"tool",
"run",
"arxiv-mcp-server",
"--storage-path",
"/path/to/paper/storage"
]
}
}
}Step-by-step guides: Add to Claude Desktop · Add to Cursor · Add to Windsurf
Before you start
- Python 3.11+
- uv (Astral) for installation and launching
- Optional: '[pdf]' extra (pymupdf4llm) for older PDF-only papers
- Optional: '[pro]' extra for experimental semantic search and advanced prompts
About ArXiv MCP Server
An MCP server that gives AI assistants programmatic access to arXiv: search papers with filters, download and locally cache them, and read full text in markdown with pagination for large documents. Includes research prompts for structured paper analysis plus experimental semantic search, citation-graph, and research-alert tools.
Tools & capabilities (8)
search_papersSearch arXiv with optional category, date range, and boolean query filters; supports sort by relevance or date. Enforces arXiv's 3-second rate limit automatically.
download_paperDownload a paper by arXiv ID (HTML first, PDF fallback) and store it locally. Returns content_length, returned_chars, next_start, and is_truncated for safe pagination of large papers.
list_papersList all papers downloaded locally, returning their arXiv IDs.
read_paperRead the full text of a locally downloaded paper in markdown; requires download_paper first. Supports start/max_chars pagination via the returned next_start value.
semantic_searchExperimental ([pro]): semantic similarity search over your locally downloaded papers, by query text or a reference paper_id.
citation_graphExperimental: fetch references and citing papers for any arXiv ID via Semantic Scholar (no local download required).
watch_topicExperimental: register an idempotent topic watch (same query syntax as search_papers) to track newly published papers.
check_alertsExperimental: poll saved topic watches and return only papers published since the last check; can target a single watch.
What this server can do
ArXiv MCP Server provides tools for these capabilities — tap one to see every MCP server that does the same:
When to use it
- Search arXiv for recent papers on a topic and have an AI assistant summarize the findings
- Download and read full paper text to support deep technical analysis or literature review
- Run the deep-paper-analysis prompt to generate a structured review from just a paper ID
- Build a local cache of papers and run semantic search to find related work
- Track a research topic over time with watch_topic and check_alerts
Security notes
Paper content retrieved from arXiv is untrusted external input and may contain prompt-injection attempts (OWASP LLM01) designed to hijack the AI's behavior. Use read-only MCP configurations where possible, review any AI summary that asks you to run commands or visit URLs, and be especially cautious in agentic pipelines that combine this server with filesystem, shell, or browser tools. In HTTP mode the server binds to 127.0.0.1 by default with DNS-rebinding protection; keep it on localhost unless you add authentication and set ALLOWED_HOSTS / ALLOWED_ORIGINS behind a reverse proxy.
ArXiv MCP Server FAQ
Do I need an API key or account?
No. The server queries the public arXiv API and requires no authentication.
Why does install use 'uv tool install' instead of npm?
The supported server is a Python package on PyPI. The README warns that the npm package of the same name is an unrelated third party, and that 'uv pip install' won't put the executable on your PATH — use 'uv tool install'.
How do I read older papers that only have a PDF?
Most arXiv papers have an HTML version handled automatically. For PDF-only papers, install the '[pdf]' extra: uv tool install 'arxiv-mcp-server[pdf]'.
Can it run over HTTP instead of stdio?
Yes. Set TRANSPORT=http with HOST/PORT to run Streamable HTTP (default bind 127.0.0.1), then point an HTTP-capable MCP client at http://host:port/mcp.
Is the paper content safe to feed to my agent?
Treat it as untrusted input. arXiv papers are user-generated and may contain prompt-injection attempts; apply sandboxing and review before acting on AI summaries, especially in multi-tool setups.
Alternatives to ArXiv MCP Server
Compare all alternatives →Official MCP reference server that fetches a URL and returns its content as clean Markdown, with chunking.
Official Firecrawl MCP server — scrape, crawl, map, search, and structured extraction for any LLM client.
Exa's neural web search and crawling MCP server — runs locally via npx or as a hosted remote endpoint.
Compare ArXiv MCP Server with: