Backup Strategies
| Deployment Approach | Backup Strategy | Recovery Method |
|---|---|---|
| Orchestration Layer | External RDBMS + Blob Storage backups | Re-run transformation pipelines |
| Full Backend | pg_dump of ~/.pixeltable/pgdata + S3/GCS versioning | Restore pgdata + media files |
Full Backend Backup
For deployments using Pixeltable as the full backend:Orchestration Layer Backup
For orchestration-only deployments:- Primary data lives in your external RDBMS and blob storage
- Pixeltable state can be rebuilt by re-running transformation pipelines
- Back up your
setup_pixeltable.pyand UDF code in version control
Recovery Procedures
Full Backend Recovery
- Stop the Pixeltable application
- Restore PostgreSQL data:
psql -f backup.sql - Restore media files to
~/.pixeltable/media/ - Restart the application
Orchestration Layer Recovery
- Deploy fresh Pixeltable instance
- Run
setup_pixeltable.pyto recreate schema - Re-process data through computed columns (incremental)
Security Best Practices
| Security Layer | Recommendation | Implementation |
|---|---|---|
| Network | Deploy within private VPC | Do not expose PostgreSQL port (5432) to internet |
| Authentication | Application layer (FastAPI/Django) | Pixeltable does not manage end-user accounts |
| Cloud Credentials | IAM Roles / Workload Identity | Avoid long-lived keys in config.toml |
Network Security
Secrets Management
Never hardcode secrets. Use environment variables or secrets managers:- AWS: Secrets Manager, Parameter Store
- GCP: Secret Manager
- Kubernetes: Secrets, External Secrets Operator
Cloud Storage Credentials
For S3/GCS/Azure media storage:Audit and Compliance
Data Lineage
Pixeltable automatically tracks:- Table versions and schema changes
- Computed column definitions and dependencies
- Insert/update/delete operations
Access Logging
Implement application-level access logging:Disaster Recovery
Recovery Time Objectives
| Deployment | RTO | Strategy |
|---|---|---|
| Orchestration Layer | Minutes | Spin up new instance, re-run pipelines |
| Full Backend | Hours | Restore from backup, validate data integrity |
Recommendations
- Regular backups: Daily for production workloads
- Test recovery: Quarterly disaster recovery drills
- Multi-region: Store backups in different region than primary
- Immutable backups: Use S3 Object Lock or GCS retention policies