> ## Documentation Index > Fetch the complete documentation index at: https://docs.pixeltable.com/llms.txt > Use this file to discover all available pages before exploring further. # Backend for AI Apps > Build multimodal AI applications on Pixeltable with declarative pipelines that combine images, video, audio, documents, and language data. **Who:** AI/App Developers **Output:** AI-powered application Add multimodal intelligence to applications with two deployment patterns. **Same foundation, different intent:** This workflow uses the same Pixeltable capabilities as [Data Wrangling for ML](/use-cases/ml-data-wrangling) — tables, multimodal types, computed columns, iterators. The difference is the output: training datasets vs. live application intelligence. *** ## Data Lifecycle Define schema with native multimodal types — Pixeltable handles storage and references [`create_table()`](/tutorials/tables-and-data-operations), [`pxt.Image`](/platform/type-system), [`pxt.Video`](/platform/type-system), [`pxt.Audio`](/platform/type-system), [`pxt.Document`](/platform/type-system), [`pxt.Json`](/platform/type-system) ```python theme={"theme":{"light":"light-plus","dark":"dark-plus"}} import pixeltable as pxt # Native multimodal types t = pxt.create_table('app.docs', { 'pdf': pxt.Document, 'metadata': pxt.Json }) ``` Create tables and manage data Image, Video, Audio, Document, JSON & more Load from any source — local files, URLs, cloud storage, or databases [`insert()`](/tutorials/tables-and-data-operations), [`import_csv()`](/sdk/latest/io), [S3/GCS/Azure](/integrations/cloud-storage) ```python theme={"theme":{"light":"light-plus","dark":"dark-plus"}} # Insert with URLs, local paths, or direct upload t.insert([ {'pdf': 'https://example.com/report.pdf'}, {'pdf': '/local/path/to/doc.pdf'}, {'pdf': 's3://bucket/documents/spec.pdf'} ]) ``` Load from cloud storage S3, GCS, Azure, R2 configuration Create UDFs and computed columns — they auto-update when data changes [`@pxt.udf`](/platform/udfs-in-pixeltable), [`@pxt.query`](/platform/udfs-in-pixeltable), [`add_computed_column()`](/tutorials/computed-columns) Write custom functions in Python Auto-update derived data Extract frames, transcribe audio, chunk documents [`frame_iterator()`](/platform/iterators), [`document_splitter()`](/platform/iterators), [`AudioSplitter`](/platform/iterators) Process video into searchable frames Audio to text with Whisper Add embedding indexes with **incremental sync** — only new/changed rows are embedded [`add_embedding_index()`](/platform/embedding-indexes) ```python theme={"theme":{"light":"light-plus","dark":"dark-plus"}} # Add index once — auto-updates on insert docs.add_embedding_index('content', string_embed=e5_embed) ``` Configure and query indexes Use OpenAI embedding models Define `@pxt.query` functions that return data from your tables [`@pxt.query`](/platform/udfs-in-pixeltable) ```python theme={"theme":{"light":"light-plus","dark":"dark-plus"}} @pxt.query def get_image(image_id: str) -> PIL.Image.Image: return ( images.where(images.uuid == image_id) .select(images.image) .limit(1) ) # Use in computed columns or API endpoints t.add_computed_column(thumbnail=get_image(t.image_id)) ``` Reusable parameterized queries Find relevant content by meaning, not keywords [`.similarity()`](/platform/embedding-indexes), `.order_by()`, `.where()`, `.collect()` ```python theme={"theme":{"light":"light-plus","dark":"dark-plus"}} sim = images.image.similarity(query) results = images.order_by(sim, asc=False).select( uuid=images.uuid, url=images.image.fileurl ).limit(10).collect() ``` Search documents by meaning Find visually similar images Expose Pixeltable functions as LLM tools for agents [`pxt.tools()`](/howto/cookbooks/agents/llm-tool-calling), [`invoke_tools()`](/howto/cookbooks/agents/llm-tool-calling) LLM agents with function calling Persistent conversation context Expose tables and queries as HTTP endpoints with a TOML config or a single CLI command [`pxt serve`](/howto/deployment/serving), [`FastAPIRouter`](/howto/deployment/serving#quickstart-python) ```toml theme={"theme":{"light":"light-plus","dark":"dark-plus"}} # service.toml [[service]] name = "image-service" port = 8000 [[service.routes]] type = "insert" table = "app/images" path = "/upload" uploadfile_inputs = ["image"] outputs = ["image", "caption"] [[service.routes]] type = "query" path = "/search" query = "app.queries.search_images" ``` ```bash theme={"theme":{"light":"light-plus","dark":"dark-plus"}} pxt serve image-service --config service.toml ``` TOML config, CLI, Python API, background jobs Full backend vs. orchestration layer For custom logic, middleware, or authentication, use Flask, FastAPI, or any Python web framework `pxt.get_table()`, `.insert()`, `.select()`, `.collect()` ```python theme={"theme":{"light":"light-plus","dark":"dark-plus"}} from flask import Flask, request import pixeltable as pxt app = Flask(__name__) images = pxt.get_table("app.images") @app.route("/api/search", methods=["POST"]) def search(): query = request.form.get("q") sim = images.image.similarity(query) return images.order_by(sim, asc=False).limit(10).collect() ``` Concurrency, error handling, sync endpoints Full Flask app with file upload & search Get pre-signed URLs for media files stored in cloud storage `.fileurl`, pre-signed URLs for S3/GCS/Tigris ```python theme={"theme":{"light":"light-plus","dark":"dark-plus"}} url = row["image"].fileurl presigned = s3.generate_presigned_url( "get_object", Params={"Bucket": bucket, "Key": key}, ExpiresIn=3600, ) ``` S3, GCS, Azure, R2, Tigris configuration *** ## Deployment Patterns **When:** Keep existing RDBMS + blob storage Pixeltable processes media, runs models, then exports results to your existing systems. ```python theme={"theme":{"light":"light-plus","dark":"dark-plus"}} from pixeltable.io.sql import export_sql # Process in Pixeltable with media stored directly to S3/GCS/Azure videos.add_computed_column( thumbnail=videos.frame.resize((256, 256)), destination='s3://my-bucket/thumbnails/' ) # Export structured results to serving DB export_sql( videos.select(videos.video, videos.transcript), 'video_metadata', db_connect_str='postgresql+psycopg://...', if_exists='replace', ) ``` Process with computed columns, export with `export_sql` **When:** Need versioning, lineage, and retrieval (RAG) from same system Pixeltable persists everything—use it as your primary data backend with automatic versioning. ```python theme={"theme":{"light":"light-plus","dark":"dark-plus"}} # Everything in one place: storage + compute + retrieval docs.add_computed_column(chunks=document_splitter(docs.pdf)) docs.add_embedding_index('chunks', string_embed=e5_embed) # Query with full lineage results = docs.chunks.similarity(query).limit(10).collect() ``` Versioning, lineage, and retrieval in one system *** ## End-to-End Examples Multimodal AI agent with memory, file search, and image generation Next.js + FastAPI app for text & image search Retrieval-augmented generation workflow **More sample apps:** Check out the [sample-apps directory](https://github.com/pixeltable/pixeltable/tree/main/docs/sample-apps) for chat applications, multimodal search, and more.