ArXiv MCP Server

Search, download, and read arXiv research papers from your AI assistant via MCP.

Unverified

stdio (local)

No auth

Python

View repo 2.9k

Add to your client

Copy the config for your MCP client and paste it into its config file.

Install / run

uv tool install arxiv-mcp-server

Paste into ~/Library/Application Support/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "arxiv-mcp-server": {
      "command": "uv",
      "args": [
        "tool",
        "run",
        "arxiv-mcp-server",
        "--storage-path",
        "/path/to/paper/storage"
      ]
    }
  }
}

Step-by-step guides: Add to Claude Desktop · Add to Cursor · Add to Windsurf

Before you start

Python 3.11+
uv (Astral) for installation and launching
Optional: '[pdf]' extra (pymupdf4llm) for older PDF-only papers
Optional: '[pro]' extra for experimental semantic search and advanced prompts

About ArXiv MCP Server

An MCP server that gives AI assistants programmatic access to arXiv: search papers with filters, download and locally cache them, and read full text in markdown with pagination for large documents. Includes research prompts for structured paper analysis plus experimental semantic search, citation-graph, and research-alert tools.

Tools & capabilities (8)

search_papers

Search arXiv with optional category, date range, and boolean query filters; supports sort by relevance or date. Enforces arXiv's 3-second rate limit automatically.

download_paper

Download a paper by arXiv ID (HTML first, PDF fallback) and store it locally. Returns content_length, returned_chars, next_start, and is_truncated for safe pagination of large papers.

list_papers

List all papers downloaded locally, returning their arXiv IDs.

read_paper

Read the full text of a locally downloaded paper in markdown; requires download_paper first. Supports start/max_chars pagination via the returned next_start value.

semantic_search

Experimental ([pro]): semantic similarity search over your locally downloaded papers, by query text or a reference paper_id.

citation_graph

Experimental: fetch references and citing papers for any arXiv ID via Semantic Scholar (no local download required).

watch_topic

Experimental: register an idempotent topic watch (same query syntax as search_papers) to track newly published papers.

check_alerts

Experimental: poll saved topic watches and return only papers published since the last check; can target a single watch.

What this server can do

ArXiv MCP Server provides tools for these capabilities — tap one to see every MCP server that does the same:

Search the web

When to use it

Search arXiv for recent papers on a topic and have an AI assistant summarize the findings
Download and read full paper text to support deep technical analysis or literature review
Run the deep-paper-analysis prompt to generate a structured review from just a paper ID
Build a local cache of papers and run semantic search to find related work
Track a research topic over time with watch_topic and check_alerts

Security notes

Paper content retrieved from arXiv is untrusted external input and may contain prompt-injection attempts (OWASP LLM01) designed to hijack the AI's behavior. Use read-only MCP configurations where possible, review any AI summary that asks you to run commands or visit URLs, and be especially cautious in agentic pipelines that combine this server with filesystem, shell, or browser tools. In HTTP mode the server binds to 127.0.0.1 by default with DNS-rebinding protection; keep it on localhost unless you add authentication and set ALLOWED_HOSTS / ALLOWED_ORIGINS behind a reverse proxy.

ArXiv MCP Server FAQ

Do I need an API key or account?

No. The server queries the public arXiv API and requires no authentication.

Why does install use 'uv tool install' instead of npm?

The supported server is a Python package on PyPI. The README warns that the npm package of the same name is an unrelated third party, and that 'uv pip install' won't put the executable on your PATH — use 'uv tool install'.

How do I read older papers that only have a PDF?

Most arXiv papers have an HTML version handled automatically. For PDF-only papers, install the '[pdf]' extra: uv tool install 'arxiv-mcp-server[pdf]'.

Can it run over HTTP instead of stdio?

Yes. Set TRANSPORT=http with HOST/PORT to run Streamable HTTP (default bind 127.0.0.1), then point an HTTP-capable MCP client at http://host:port/mcp.

Is the paper content safe to feed to my agent?

Treat it as untrusted input. arXiv papers are user-generated and may contain prompt-injection attempts; apply sandboxing and review before acting on AI summaries, especially in multi-tool setups.

#arxiv #research #papers #academic #search #pdf #semantic-search #citations

Alternatives to ArXiv MCP Server

Compare all alternatives →

Fetch (Reference)

Search & Web

87k

Official MCP reference server that fetches a URL and returns its content as clean Markdown, with chunking.

Verified

stdio (local)

No auth

Python

2 tools

Updated 20 days agoRepo

Firecrawl

Search & Web

6.6k

Official Firecrawl MCP server — scrape, crawl, map, search, and structured extraction for any LLM client.

Verified

stdio (local)

API key

TypeScript

8 tools

Updated 18 days agoRepo

Exa Search

Search & Web

4.6k

Exa's neural web search and crawling MCP server — runs locally via npx or as a hosted remote endpoint.

Verified

stdio (local)

API key

TypeScript

6 tools

Updated 19 days agoRepo

Compare ArXiv MCP Server with:

vs Fetch (Reference)vs Firecrawl vs Exa Search vs Perplexity