pixeltable.functions.video - Pixeltable Documentation

Pixeltable UDFs for VideoType. View source on GitHub

UDFs

`clip()` _udf

Extract a clip from a video, specified by start_time and either end_time or duration (in seconds). If start_time is beyond the end of the video, returns None. Can only specify one of end_time and duration. If both end_time and duration are None, the clip goes to the end of the video. Requirements:

ffmpeg needs to be installed and in PATH

Signature:

clip(
    video: Video,
    start_time: Float,
    end_time: Optional[Float],
    duration: Optional[Float]
)-> Optional[Video]

Parameters:

video (Video): Input video file
start_time (Float): Start time in seconds
end_time (Optional[Float]): End time in seconds
duration (Optional[Float]): Duration of the clip in seconds

Returns:

Optional[Video]: New video containing only the specified time range or None if start_time is beyond the end of the video.

`concat_videos()` _udf

Merge multiple videos into a single video. Requirements:

ffmpeg needs to be installed and in PATH

Signature:

concat_videos(videos: Json)-> Video

Parameters:

videos (Json): List of videos to merge.

Returns:

Video: A new video containing the merged videos.

`extract_audio()` _udf

Extract an audio stream from a video. Signature:

extract_audio(
    video_path: Video,
    stream_idx: Int,
    format: String,
    codec: Optional[String]
)-> Audio

Parameters:

stream_idx (Int): Index of the audio stream to extract.
format (String): The target audio format. ('wav', 'mp3', 'flac').
codec (Optional[String]): The codec to use for the audio stream. If not provided, a default codec will be used.

Returns:

Audio: The extracted audio.

Example: Add a computed column to a table tbl that extracts audio from an existing column video_col:

tbl.add_computed_column(extracted_audio=tbl.video_col.extract_audio(format='flac'))

`extract_frame()` _udf

Extract a single frame from a video at a specific timestamp. Signature:

extract_frame(
    video: Video,
    timestamp: Float
)-> Optional[Image]

Parameters:

video (Video): The video from which to extract the frame.
timestamp (Float): Extract frame at this timestamp (in seconds).

Returns:

Optional[Image]: The extracted frame as a PIL Image, or None if the timestamp is beyond the video duration.

Example: Extract the first frame from each video in the video column of the table tbl:

tbl.select(tbl.video.extract_frame(0.0)).collect()

Extract a frame close to the end of each video in the video column of the table tbl:

tbl.select(tbl.video.extract_frame(tbl.video.get_metadata().streams[0].duration_seconds - 0.1)).collect()

`get_duration()` _udf

Get video duration in seconds. Signature:

get_duration(video: Video)-> Optional[Float]

Parameters:

video (Video): The video for which to get the duration.

Returns:

Optional[Float]: The duration in seconds, or None if the duration cannot be determined.

`get_metadata()` _udf

Gets various metadata associated with a video file and returns it as a dictionary. Signature:

get_metadata(video: Video)-> Json

Parameters:

video (Video): The video for which to get metadata.

Returns:

Json: A dict such as the following:

{
    'bit_exact': False,
    'bit_rate': 967260,
    'size': 2234371,
    'metadata': {
        'encoder': 'Lavf60.16.100',
        'major_brand': 'isom',
        'minor_version': '512',
        'compatible_brands': 'isomiso2avc1mp41',
    },
    'streams': [
        {
            'type': 'video',
            'width': 640,
            'height': 360,
            'frames': 462,
            'time_base': 1.0 / 12800,
            'duration': 236544,
            'duration_seconds': 236544.0 / 12800,
            'average_rate': 25.0,
            'base_rate': 25.0,
            'guessed_rate': 25.0,
            'metadata': {
                'language': 'und',
                'handler_name': 'L-SMASH Video Handler',
                'vendor_id': '[0][0][0][0]',
                'encoder': 'Lavc60.31.102 libx264',
            },
            'codec_context': {'name': 'h264', 'codec_tag': 'avc1', 'profile': 'High', 'pix_fmt': 'yuv420p'},
        }
    ],
}

Example: Extract metadata for files in the video_col column of the table tbl:

tbl.select(tbl.video_col.get_metadata()).collect()

`overlay_text()` _udf

Overlay text on a video with customizable positioning and styling. Requirements:

ffmpeg needs to be installed and in PATH

Signature:

overlay_text(
    video: Video,
    text: String,
    font: Optional[String],
    font_size: Int,
    color: String,
    opacity: Float,
    horizontal_align: String,
    horizontal_margin: Int,
    vertical_align: String,
    vertical_margin: Int,
    box: Bool,
    box_color: String,
    box_opacity: Float,
    box_border: Optional[Json]
)-> Video

Parameters:

video (Video): Input video to overlay text on.
text (String): The text string to overlay on the video.
font (Optional[String]): Font family or path to font file. If None, uses the system default.
font_size (Int): Size of the text in points.
color (String): Text color (e.g., 'white', 'red', '#FF0000').
opacity (Float): Text opacity from 0.0 (transparent) to 1.0 (opaque).
horizontal_align (String): Horizontal text alignment ('left', 'center', 'right').
horizontal_margin (Int): Horizontal margin in pixels from the alignment edge.
vertical_align (String): Vertical text alignment ('top', 'center', 'bottom').
vertical_margin (Int): Vertical margin in pixels from the alignment edge.
box (Bool): Whether to draw a background box behind the text.
box_color (String): Background box color as a string.
box_opacity (Float): Background box opacity from 0.0 to 1.0.
box_border (Optional[Json]): Padding around text in the box in pixels.
- [10]: 10 pixels on all sides
- [10, 20]: 10 pixels on top/bottom, 20 on left/right
- [10, 20, 30]: 10 pixels on top, 20 on left/right, 30 on bottom
- [10, 20, 30, 40]: 10 pixels on top, 20 on right, 30 on bottom, 40 on left

Returns:

Video: A new video with the text overlay applied.

Example: Add a simple text overlay to videos in a table:

tbl.select(tbl.video.overlay_text('Sample Text')).collect()

Add a YouTube-style caption:

tbl.select(
    tbl.video.overlay_text(
        'Caption text',
        font_size=32,
        color='white',
        opacity=1.0,
        box=True,
        box_color='black',
        box_opacity=0.8,
        box_border=[6, 14],
        horizontal_margin=10,
        vertical_align='bottom',
        vertical_margin=70,
    )
).collect()

Add text with a semi-transparent background box:

tbl.select(
    tbl.video.overlay_text(
        'Important Message',
        font_size=32,
        color='yellow',
        box=True,
        box_color='black',
        box_opacity=0.6,
        box_border=[20, 10],
    )
).collect()

`segment_video()` _udf

Split a video into fixed-size segments. Requirements:

ffmpeg needs to be installed and in PATH

Signature:

segment_video(
    video: Video,
    duration: Float
)-> Json

Parameters:

video (Video): Input video file to segment
duration (Float): Approximate duration of each segment (in seconds).

Returns:

Json: List of file paths for the generated video segments.

Example: Split a video at 1 minute intervals

tbl.select(segment_paths=tbl.video.segment_video(duration=60)).collect()

Split video into two parts at the midpoint:

duration = tbl.video.get_duration()
tbl.select(segment_paths=tbl.video.segment_video(duration=duration / 2 + 1)).collect()

`with_audio()` _udf

Creates a new video that combines the video stream from video and the audio stream from audio. The start_time and duration parameters can be used to select a specific time range from each input. If the audio input (or selected time range) is longer than the video, the audio will be truncated. Requirements:

ffmpeg needs to be installed and in PATH

Signature:

with_audio(
    video: Video,
    audio: Audio,
    video_start_time: Float,
    video_duration: Optional[Float],
    audio_start_time: Float,
    audio_duration: Optional[Float]
)-> Video

Parameters:

video (Video): Input video.
audio (Audio): Input audio.
video_start_time (Float): Start time in the video input (in seconds).
video_duration (Optional[Float]): Duration of video segment (in seconds). If None, uses the remainder of the video after video_start_time. video_duration determines the duration of the output video.
audio_start_time (Float): Start time in the audio input (in seconds).
audio_duration (Optional[Float]): Duration of audio segment (in seconds). If None, uses the remainder of the audio after audio_start_time. If the audio is longer than the output video, it will be truncated.

Returns:

Video: A new video file with the audio track added.

Example: Add background music to a video:

tbl.select(tbl.video.with_audio(tbl.music_track)).collect()

Add audio starting 5 seconds into both files:

tbl.select(tbl.video.with_audio(tbl.music_track, video_start_time=5.0, audio_start_time=5.0)).collect()

Use a 10-second clip from the middle of both files:

tbl.select(
    tbl.video.with_audio(
        tbl.music_track, video_start_time=30.0, video_duration=10.0, audio_start_time=15.0, audio_duration=10.0
    )
).collect()

SDK Reference

​UDFs

​clip() udf

​concat_videos() udf

​extract_audio() udf

​extract_frame() udf

​get_duration() udf

​get_metadata() udf

​overlay_text() udf

​segment_video() udf

​with_audio() udf

UDFs

`clip()` _udf

`concat_videos()` _udf

`extract_audio()` _udf

`extract_frame()` _udf

`get_duration()` _udf

`get_metadata()` _udf

`overlay_text()` _udf

`segment_video()` _udf

`with_audio()` _udf