Vector Database
Learn how to create, populate and query embedding indexes in Pixeltable
Learn more about embedding/vector indexes with this in-depth guide.
What are Embedding/Vector Indexes?
Embedding indexes let you search your data based on meaning, not just keywords. They work with all kinds of content - text, images, audio, video, and documents - making it easy to build powerful search systems.
Multimodal Search Examples
Pixeltable makes it easy to build semantic search for different media types:
Audio
Build semantic search for audio files and podcasts
Image
Create visual search engines with embedding models
Document
Search through PDFs and other document formats
Video
Find relevant content within video libraries
Website
Search across web content with semantic understanding
Text
Use metadata to search for long term memory for ai agents
How Pixeltable Makes Embeddings Easy
- No infrastructure headaches - embeddings are managed automatically
- Works with any media type - text, images, audio, video, or documents
- Updates automatically - when data changes, embeddings update too
- Compatible with your favorite models - use Hugging Face, OpenAI, or your custom models
Phase 1: Setup Embeddings Model and Index
The setup phase defines your schema and creates embedding indexes.
Supported Index Options
Similarity Metrics
Index Configuration
Phase 2: Insert
The insert phase populates your indexes with data. Pixeltable automatically computes embeddings and maintains index consistency.
Large batch insertions are more efficient than multiple single insertions as they reduce the number of embedding computations.
Phase 3: Query
The query phase allows you to search your indexed content using the similarity()
function.
Management Operations
Drop Index
Update Index
Best Practices
- Cache embedding models in production UDFs
- Use batching for better performance
- Consider index size vs. search speed tradeoffs
- Monitor embedding computation time
Additional Resources
Was this page helpful?