EF Core Vector Search: Semantic Search Without a Separate Database

Q: What is vector search and when do I need it?

Vector search finds semantically similar content using embeddings (numeric representations of meaning). Use it for semantic search ("find articles about deployment" matches "CI/CD pipeline"), recommendations, and RAG (retrieval-augmented generation) for AI applications.
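To make "semantically similar" concrete, here is a minimal sketch of cosine distance, one of the common metrics for comparing embeddings. The three-dimensional vectors are made-up toy values chosen for illustration; real embedding models emit hundreds or thousands of dimensions.

```csharp
using System;

// Toy 3-dimensional "embeddings" (hypothetical values, not real model output).
float[] query   = { 0.9f, 0.1f, 0.0f };   // "find articles about deployment"
float[] cicd    = { 0.8f, 0.2f, 0.1f };   // "CI/CD pipeline"
float[] cooking = { 0.0f, 0.1f, 0.9f };   // an unrelated article

// The CI/CD vector points in nearly the same direction as the query,
// so its distance is smaller and this prints True.
Console.WriteLine(CosineDistance(query, cicd) < CosineDistance(query, cooking));

// Cosine distance = 1 - cosine similarity.
// 0 means the vectors point the same way; larger values mean less similar.
static double CosineDistance(float[] a, float[] b)
{
    if (a.Length != b.Length)
        throw new ArgumentException("Vectors must have the same number of dimensions.");

    double dot = 0, normA = 0, normB = 0;
    for (int i = 0; i < a.Length; i++)
    {
        dot   += a[i] * b[i];
        normA += a[i] * a[i];
        normB += b[i] * b[i];
    }
    return 1.0 - dot / (Math.Sqrt(normA) * Math.Sqrt(normB));
}
```

A vector search runs this kind of comparison between the query embedding and the stored embeddings, then returns the closest matches.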

Q: Do I need a separate vector database?

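Not necessarily. If you already use SQL Server and have a modest number of vectors (roughly under 50,000 as a rule of thumb), built-in vector support keeps embeddings in the same database as your relational data, so there is no second system to deploy and keep in sync. Exact search compares the query against every candidate vector, so query cost grows with row count. Evaluate a dedicated vector database when scale, latency requirements, or operational complexity call for one.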

Q: What about the EFCore.SqlServer.VectorSearch extension?

EF Core 10's native support replaces this extension. Remove it from your project when upgrading to EF Core 10.

Q: Can I use approximate search with EF Core?

Not through LINQ yet. EF Core 10 supports exact search via VECTOR_DISTANCE, but the VECTOR_SEARCH() function for approximate search is not yet supported. Use raw SQL for approximate search, or wait for a future EF Core release.

Q: How do I handle documents without embeddings?

Filter them out in queries with .Where(d => d.Embedding != null). Consider making the embedding property nullable and generating embeddings asynchronously, so that a document can be saved before its embedding exists.
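The following is a hedged sketch of that filter combined with a similarity query. The Document entity, AppDbContext, DocumentSearch helper, and the 1536 dimension are hypothetical names and values chosen for illustration. It also assumes that EF Core maps a nullable SqlVector<float>? property and translates .Value inside the query; verify both against your provider version. It needs a database with vector support to run.

```csharp
using System.Collections.Generic;
using System.ComponentModel.DataAnnotations.Schema;
using System.Linq;
using System.Threading.Tasks;
using Microsoft.Data.SqlTypes;
using Microsoft.EntityFrameworkCore;

public class Document
{
    public int Id { get; set; }
    public string Content { get; set; } = "";

    // Nullable: stays NULL until an embedding has been generated for the row.
    // 1536 is an assumed dimension; it must match your embedding model's output.
    [Column(TypeName = "vector(1536)")]
    public SqlVector<float>? Embedding { get; set; }
}

public class AppDbContext : DbContext
{
    public AppDbContext(DbContextOptions<AppDbContext> options) : base(options) { }

    public DbSet<Document> Documents => Set<Document>();
}

public static class DocumentSearch
{
    // Returns the `count` documents whose embeddings are closest to the query embedding.
    public static Task<List<Document>> FindSimilarAsync(
        AppDbContext db, float[] queryEmbedding, int count = 5)
    {
        var queryVector = new SqlVector<float>(queryEmbedding);

        return db.Documents
            .Where(d => d.Embedding != null)   // skip documents not yet embedded
            .OrderBy(d => EF.Functions.VectorDistance("cosine", d.Embedding!.Value, queryVector))
            .Take(count)
            .ToListAsync();
    }
}
```

Filtering before ordering keeps rows without embeddings out of the distance calculation entirely.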

Q: What embedding model should I use?

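OpenAI's text-embedding-3-small and the older text-embedding-ada-002 are common choices. Azure OpenAI provides the same models with enterprise features. For on-premises deployments, consider Sentence Transformers. Whichever model you choose, the dimension of the vector column must match the model's output size.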

Q: How do I update embeddings when content changes?

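Regenerate the embedding whenever the source content changes. A stale embedding still describes the old text, so the document keeps matching queries about what it used to say. Consider a background job that processes updated documents.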
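One way a background job could decide which documents to reprocess is to store a hash of the content that was last embedded and compare it with the current content. This is a minimal sketch under that assumption; the Document class, the EmbeddedContentHash property, and the EmbeddingStaleness helper are hypothetical names, not part of EF Core.

```csharp
using System;
using System.Security.Cryptography;
using System.Text;

var doc = new Document { Content = "Deploying with a CI/CD pipeline" };
Console.WriteLine(EmbeddingStaleness.NeedsEmbedding(doc)); // True: never embedded

// After generating and saving the embedding, record what was embedded.
doc.EmbeddedContentHash = EmbeddingStaleness.HashOf(doc.Content);
Console.WriteLine(EmbeddingStaleness.NeedsEmbedding(doc)); // False: up to date

doc.Content += " on Azure";
Console.WriteLine(EmbeddingStaleness.NeedsEmbedding(doc)); // True: content changed

public class Document
{
    public string Content { get; set; } = "";

    // Hash of the content the current embedding was generated from (null if none yet).
    public string? EmbeddedContentHash { get; set; }
}

public static class EmbeddingStaleness
{
    // SHA-256 of the UTF-8 content, as an uppercase hex string.
    public static string HashOf(string content) =>
        Convert.ToHexString(SHA256.HashData(Encoding.UTF8.GetBytes(content)));

    // True when the document was never embedded or its content changed since.
    public static bool NeedsEmbedding(Document doc) =>
        doc.EmbeddedContentHash != HashOf(doc.Content);
}
```

Comparing hashes avoids calling the embedding model for edits that leave the content unchanged, such as metadata-only updates.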

Abstract

TL;DR: EF Core 10 adds native vector search with SqlVector and VECTOR_DISTANCE. Store embeddings alongside relational data and query by semantic similarity.