This guide will get you from zero to a working AI application in under 5 minutes. Learn more from the Pixeltable Basics
and Pixeltable Fundamentals tutorials.
Create Your First Multimodal AI Application
Let’s build an image analysis application that combines object detection and OpenAI Vision.Installation
Please refer to our installation section here.1
Create Table Structure
2
Add Object Detection
3
Add OpenAI Vision Analysis
4
Use Your Application
What happened behind the scenes?
What happened behind the scenes?
Pixeltable automatically:
- Created a persistent table
- Downloaded and cached the ResNet model
- Orchestrated the OpenAI API call
- Created an efficient processing workflow
- Stored all results for future use
Key Features
Persistent Storage
All data and computed results are automatically stored and versioned. Your app state persists between sessions.
Computed Columns
Define transformations once, they run automatically on new data. Perfect for AI orchestration.
Multimodal Support
Handle images, video, audio, and text seamlessly in one unified interface.
AI Integration
Built-in support for popular AI services like OpenAI, YOLOX, Hugging Face, Label Studio, Replicate, Anthropic…
Custom Functions (UDFs)
Extend Pixeltable with your own functions using the@pxt.udf decorator: