Same foundation, different intent: This workflow uses the same Pixeltable capabilities as Data Wrangling for ML — tables, multimodal types, computed columns, iterators. The difference is the output: training datasets vs. live application intelligence.
Data Lifecycle
- 1. Store
- 2. Build
- 3. Index
- 4. Query
- 5. Serve
Create Tables
Define schema with native multimodal types — Pixeltable handles storage and references
create_table(), pxt.Image, pxt.Video, pxt.Audio, pxt.Document, pxt.JsonTables Guide
Create tables and manage data
Type System
Image, Video, Audio, Document, JSON & more
Ingest Data
Load from any source — local files, URLs, cloud storage, or databases
insert(), import_csv(), S3/GCS/AzureImport from S3
Load from cloud storage
Cloud Storage Setup
S3, GCS, Azure, R2 configuration
Deployment Patterns
- Orchestration Layer
- Full Backend
When: Keep existing RDBMS + blob storagePixeltable processes media, runs models, then exports results to your existing systems.
Orchestration Pattern Guide
Process → Export to your existing infrastructure
End-to-End Examples
Pixelbot AI Agent
Multimodal AI agent with memory, file search, and image generation
Similarity Search App
Next.js + FastAPI app for text & image search
RAG Pipeline
Retrieval-augmented generation workflow