MCP servers for vector & semantic search

Store and query embeddings for RAG and long-term agent memory.

13 servers · Last updated August 1, 2026

TL;DR: These servers give your agent a vector store — upserting embeddings and running similarity search for RAG and persistent memory. They differ on whether they're a dedicated vector DB, a general database with vector support, or an in-process memory store. This is the retrieval half of any RAG stack.

Bottom line: if you only try one, Memory (Knowledge Graph) is the most popular, verified option for this (74,000★). 12 more compared below.

Build a multi-server config →Check your config →

Compare 13 servers

Server	Transport	Auth	Verified	Stars	Tools for this
Memory (Knowledge Graph)	Local (stdio)	No auth		74,000	create_entities, delete_entities, read_graph +2
OpenMemory MCP	Remote (SSE)	API key	—	59,874	search_memory
Graphiti MCP	Remote (HTTP)	API key	—	28,240	add_memory, search_nodes, search_memory_facts +2
MCP Server Chart	Local (stdio)	No auth	—	4,169	generate_network_graph
Qdrant MCP Server	Local (stdio)	API key		1,100	qdrant-store, qdrant-find
Mem0 MCP Server	Local (stdio)	API key	—	655	add_memory, get_memory, update_memory +3
Chroma MCP Server	Local (stdio)	API key		600	chroma_list_collections, chroma_create_collection, chroma_peek_collection +4
dbt MCP Server	Local (stdio)	API key	—	584	get_entities, get_semantic_model_details
Pinecone Developer MCP Server	Local (stdio)	API key		500	upsert-records, search-records
Graphlit MCP Server	Local (stdio)	API key	—	375	Query Collections, Ingest Memory (Short-Term), Collection Operations
MCP ECharts	Local (stdio)	No auth	—	242	generate_graph_chart
ChatSpatial	Local (stdio)	No auth	—	40	compute_embeddings
ENCODE Toolkit	Local (stdio)	No auth	—	35	encode_summarize_collection

The servers

Memory (Knowledge Graph)

Official MCP server providing persistent, file-backed knowledge-graph memory across sessions.

create_entitiesdelete_entitiesread_graphsearch_nodesopen_nodes

Config & setup →Source ↗

OpenMemory MCP

Mem0's local-first memory layer: a Dockerized MCP server plus dashboard that keeps agent memories on your machine.

search_memory

Config & setup →Source ↗

Graphiti MCP

Temporal knowledge-graph memory for agents: add episodes and search facts over FalkorDB or Neo4j, from Zep.

add_memorysearch_nodessearch_memory_factsget_episode_entitiesclear_graph

Config & setup →Source ↗

MCP Server Chart

Generate 26+ chart types and data visualizations using AntV, for chart generation and data analysis.

generate_network_graph

Config & setup →Source ↗

Qdrant MCP Server

Official Qdrant server using a vector collection as semantic memory: store and find embeddings.

qdrant-storeqdrant-find

Config & setup →Source ↗

Mem0 MCP Server

Archived official Mem0 server for long-term agent memory: add, search, update, and delete memories via the Mem0 API.

add_memoryget_memoryupdate_memorydelete_memorydelete_entitieslist_entities

Config & setup →Source ↗

Chroma MCP Server

Official Chroma server: create collections and run vector, full-text, and metadata search.

chroma_list_collectionschroma_create_collectionchroma_peek_collectionchroma_get_collection_infochroma_get_collection_countchroma_modify_collection

Config & setup →Source ↗

dbt MCP Server

Give AI agents context of your dbt project — run dbt CLI, query the Semantic Layer, explore lineage, and manage jobs.

get_entitiesget_semantic_model_details

Config & setup →Source ↗

Pinecone Developer MCP Server

Official Pinecone server: manage indexes, upsert/search records, rerank, and search Pinecone docs.

upsert-recordssearch-records

Config & setup →Source ↗

Graphlit MCP Server

Ingest, search, and RAG over data from Slack, Discord, Google Drive, GitHub, Jira, Linear and more via Graphlit.

Query CollectionsIngest Memory (Short-Term)Collection Operations

Config & setup →Source ↗

MCP ECharts

Generate Apache ECharts charts locally from AI for chart generation and data analysis.

generate_graph_chart

Config & setup →Source ↗

ChatSpatial

MCP server for spatial transcriptomics analysis via natural language

compute_embeddings

Config & setup →Source ↗

ENCODE Toolkit

Search ENCODE, cross-reference 14 genomics databases, run analysis pipelines, and generate publication-ready methods from natural language.

encode_summarize_collection

Config & setup →Source ↗

Use these in a stack

RAG agent

FAQ

Dedicated vector DB vs memory server?

Use a dedicated vector store (Chroma, Qdrant, Pinecone) for scale and persistence; a lightweight memory server is fine for small, single-agent context.

What pairs with a vector server for RAG?

A scraping/search server to gather content and an embeddings step — see the RAG agent stack for a ready-made combination.

Other capabilities

Execute SQL Inspect database schema Automate a browser Search the web Scrape web pages Generate images Send email Send team messages