
Pipelines

Pipe2.ai pipelines chain multiple AI models into a single automated workflow. Each pipeline takes simple inputs (photos, text, audio) and produces finished content — images, videos, audio, or a combination.

How pipelines work

  1. Upload your input: a photo, video, audio file, or text
  2. Choose a pipeline: each pipeline defines which AI models run and in what order
  3. Get your output: images, videos, or audio, depending on the pipeline
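The idea of chaining models in a fixed order can be sketched as a list of stages, where each stage receives the previous stage's output. This is only an illustration of the concept, not Pipe2.ai's implementation: `run_pipeline`, `write_script`, and `narrate` are hypothetical names, and the stages are plain string functions standing in for real AI models.

```python
from typing import Callable, List

# A stage takes the previous stage's output and returns its own.
Stage = Callable[[str], str]


def run_pipeline(stages: List[Stage], user_input: str) -> str:
    """Run each stage in order, feeding each output into the next stage."""
    result = user_input
    for stage in stages:
        result = stage(result)
    return result


# Hypothetical stand-ins for AI models; a real stage would call a model.
def write_script(topic: str) -> str:
    return f"script({topic})"


def narrate(script: str) -> str:
    return f"audio({script})"


output = run_pipeline([write_script, narrate], "volcanoes")
print(output)  # audio(script(volcanoes))
```

Because the order of the list is the order of execution, swapping or adding stages changes the workflow without touching `run_pipeline`.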

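Each pipeline has a set credit cost, listed in the table below; some costs are flat, while others are given as a range or as a per-item or per-minute rate. Credits are reserved when you start a run, then confirmed if the run succeeds or refunded if it fails.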
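The reserve, confirm, and refund behaviour described above can be modelled with a small ledger. This is a minimal sketch under stated assumptions: `CreditLedger`, `InsufficientCredits`, and `run_with_credits` are hypothetical names invented for illustration, and the actual accounting inside Pipe2.ai is not documented here.

```python
from typing import Callable, TypeVar

T = TypeVar("T")


class InsufficientCredits(Exception):
    """Raised when a run costs more than the available balance."""


class CreditLedger:
    # Hypothetical model of a credit balance; not the real Pipe2.ai API.
    def __init__(self, balance: float) -> None:
        self.balance = balance   # credits available to spend
        self.reserved = 0.0      # credits held by runs still in progress

    def reserve(self, cost: float) -> None:
        if cost > self.balance:
            raise InsufficientCredits(f"need {cost}, have {self.balance}")
        self.balance -= cost
        self.reserved += cost

    def confirm(self, cost: float) -> None:
        # The run succeeded: the held credits are spent for good.
        self.reserved -= cost

    def refund(self, cost: float) -> None:
        # The run failed: the held credits return to the balance.
        self.reserved -= cost
        self.balance += cost


def run_with_credits(ledger: CreditLedger, cost: float, run: Callable[[], T]) -> T:
    """Reserve credits, execute the run, then confirm or refund."""
    ledger.reserve(cost)
    try:
        result = run()
    except Exception:
        ledger.refund(cost)
        raise
    ledger.confirm(cost)
    return result
```

Reserving up front means a run that cannot be paid for never starts, and a failed run leaves the balance exactly where it was.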

Available pipelines

| Pipeline | Description | Category | Models | Credits |
|---|---|---|---|---|
| Video Generator | Generate AI videos from a text prompt, image, or reference clip | video | Veo 3.1, Seedance 2 Pro, Seedance 2 Fast | 12–96 |
| Image Generator | Generate AI images from text or reference photos | image | Imagen 4 Fast, Gemini 3.1 Flash, GPT Image 2 | 0.5–7 |
| Audio Generator | Turn text into natural speech with dozens of voices, 70+ languages, and custom style directions for tone, accent, and pacing | audio | Gemini 2.5 TTS | 0.6–1.2 |
| Music Generator | Generate original background music that matches any mood, genre, and length you describe | audio | Eleven Music v1, Lyria 3 Pro | 49 |
| Video Editor | Edit existing videos with AI | video | Grok Imagine | 8 |
| Image Editor | Edit any image with AI | image | Grok Imagine | 0.5–1.5 |
| Text Card | Create professional text overlays and title cards with typography, colors, and animation chosen automatically | image | Gemini 3.1 Flash | 1 |
| Image Motion | Turn any still image into a cinematic video clip with smooth camera movement, zoom, and transitions | video | | 0.5 |
| Script Writer | Turn any topic into a complete video production blueprint: narration, visual plan per segment, audio direction, and a step-by-step shot list | text | | 1 |
| Video Reel | Combine multiple video clips into one seamless video with AI-picked transitions, narration, and background music | video | | 1 |
| Video Trim | Deterministic transcript-aware video trim: slice an SRT to your explicit [start, end] window and ffmpeg-cut the source to match | video | | 0.1 |
| Image Search | Search museum archives and stock photo libraries with AI-targeted queries that curate the best results from five sources | image | | 0.5 |
| Footage Search | Search stock video libraries for b-roll, establishing shots, and background footage | video | | 0.5 |
| YouTube Cover | Generate scroll-stopping YouTube thumbnails from your video title: bold composition, expressive focal point, and room for overlay text | image | Gemini 3.1 Flash | 1.5 |
| Product Shots | Turn one product photo into a full marketplace-ready shoot: white background, lifestyle, gradient studio, in-use, and outdoor variants | image | Gemini 3.1 Flash | 1.5/item |
| Transcription | Transcribe any video or audio file to text | audio | ElevenLabs Scribe | from 0.8 + 0.08/min |
| Watermark | Brand any video output with your logo | video | | 0.5 |
| Captions | Burn styled captions into any video from a transcript or SRT | video | | from 0.2 + 0.02/min |
| Highlights | Read a transcript, return N editorial picks: the most quotable moments for clipping, chapter generation, or social posts | text | Claude Sonnet 4.6 | 10 |
| Video Reframe | Auto-crop horizontal video to vertical with AI active-speaker framing | video | | from 2 |
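For the duration-based prices (Transcription at "from 0.8 + 0.08/min", Captions at "from 0.2 + 0.02/min"), a rough estimate can be computed as a base charge plus a per-minute rate. The sketch below assumes that minutes are billed fractionally with no rounding and that "from" marks the base charge; neither is confirmed by the table, and `estimate_credits` is a hypothetical helper, not part of any Pipe2.ai API.

```python
from decimal import Decimal


def estimate_credits(base: str, per_minute: str, duration_seconds: int) -> Decimal:
    """Estimate credits as base + per_minute * minutes.

    Assumption: fractional minutes are billed proportionally. The real
    service may round durations up or apply a minimum charge.
    """
    minutes = Decimal(duration_seconds) / Decimal(60)
    return Decimal(base) + Decimal(per_minute) * minutes


# A 10-minute file: 0.8 + 0.08 * 10 credits for Transcription,
# and 0.2 + 0.02 * 10 credits for Captions.
transcription = estimate_credits("0.8", "0.08", 600)
captions = estimate_credits("0.2", "0.02", 600)
print(transcription, captions)
```

`Decimal` is used instead of `float` so that small rates such as 0.08 add up exactly rather than accumulating binary rounding error.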

Inputs and outputs

Each pipeline defines its own input schema and output format. Common patterns:

Inputs: photos, text scripts, audio files, style/provider configuration

Outputs: generated images, animated videos, audio files, or multiple assets per run