Question 1

How is the window chosen?

Accepted Answer

You supply it. The highlights pipeline (or any other upstream picker) emits start_sec and end_sec; this trim consumes them deterministically. There is no LLM in the trim itself — the editorial decision lives in the picker.

Question 2

Why does it require a transcript?

Accepted Answer

The transcript is what makes the cut snap to a full sentence instead of landing mid-word. The pipeline keeps every segment overlapping your window, takes the extent of the kept set, and trims the video to match — the same SRT is then sliced + rebased to t=0 and returned for the captions step to reuse.

Question 3

Can it run without a transcript?

Accepted Answer

Not anymore. The previous keyframe-vision fallback was removed when highlights took over the editorial pick — it consistently drifted from the chosen window. Transcribe the source first (the transcription pipeline caches by source hash, so re-runs are free).

Question 4

Does this cost credits?

Accepted Answer

0.1 credits per trim, flat.

Question 5

What if start_sec/end_sec land outside the transcript?

Accepted Answer

The pipeline errors out rather than silently producing a degenerate cut. Pass a window inside the source duration.

Video Trim

Best for

When to use

Tips

Recipes using this pipeline

Long video → multiple captioned clips, in one command

Video Trim — Deterministic Transcript-Aware Cut

How It Works

Why No LLM Here

Frequently Asked Questions

Explore more pipelines