Before Pixeltable
AI teams are building on images, video, audio, and text, but the infrastructure is broken:Fragmented Data
Data lives across object stores, vector DBs, SQL, and ad-hoc pipelines. No single source of truth.
Costly Iteration
Every model change requires reprocessing. Pipelines are brittle and hard to reproduce.
With Pixeltable
Persistent Storage
All data and computed results are automatically stored and versioned.
Incremental Updates
Data transformations run automatically on new data. No orchestration code needed.
Multimodal-Native
Images, video, audio, and documents integrate seamlessly with structured data.
AI Integration
Built-in support for OpenAI, Anthropic, Gemini, Hugging Face, and dozens more.
Get started
Quick Start
Install Pixeltable and run your first pipeline in 5 minutes.
10-Minute Tour
See Pixeltable in action with a hands-on image workflow.
Core Concepts
Learn about tables, computed columns, views, and the type system.
SDK Reference
Complete API reference for the Pixeltable Python SDK.
Core Primitives
Pixeltable provides a small set of primitives that compose into any multimodal AI workflow:Store
Store
Create tables with native multimodal types
Tables & Data
Create, insert, update, delete
Type System
All supported types
Orchestrate
Orchestrate
Declarative computed columns: API calls, LLM inference, local models, vision
Computed Columns
Incremental transforms
AI Integrations
OpenAI, Anthropic, Gemini, HuggingFace…
Iterate
Iterate
Explode rows: video→frames, doc→chunks, audio→segments
Views
Virtual tables
Iterators
Frame, Document, Audio splitters
Index
Index
Add embedding indexes for semantic search
Embedding Indexes
Vector search with automatic maintenance
Extend
Extend
Write custom functions with
@pxt.udf and @pxt.queryUDFs & Queries
Custom Python functions
Agents & Tools
Agents & Tools
Tool calling for AI agents and MCP integration
Tool Calling
Build agents with tools
Agents & MCP
MCP servers, memory, Pixelbot
Query & Experiment
Query & Experiment
SQL-like queries + test transformations before committing
Queries & Expressions
Select, filter, aggregate
Iterative Development
Test before commit
Version
Version
Time travel and automatic versioning
Version Control
History, snapshots, lineage
Import/Export
Import/Export
Load from any source, export to ML formats
Data Import
CSV, JSON, Parquet, S3, HF
Data Export
PyTorch, Parquet, COCO, LanceDB
Share
Share
Use Cases
Pixeltable’s primitives are use-case agnostic. They compose into any multimodal AI workflow:Data Wrangling for ML
Curate, augment, export training datasets. Pre-annotate with models, integrate Label Studio, export PyTorch.
Backend for AI Apps
Build RAG systems, semantic search, and multimodal APIs. Pixeltable handles storage, retrieval, and orchestration.
Agents & MCP
Tool-calling agents with persistent memory, MCP server integration, and automatic conversation history.
Choose How You Run Pixeltable
Pixeltable OSS
Open-source Python library. Install with
pip install pixeltable and run locally. Same APIs scale to production.Pixeltable Cloud
Data sharing available now. Managed endpoints and live tables coming soon.
Book a Demo
Schedule a call to discuss your use case and see how Pixeltable can help.
Next steps
Join the Community
Get help, share projects, and connect with other developers
GitHub
Star the repo, report issues, and contribute