Transcode, clip, concatenate, overlay, burn subtitles, or add text labels to video or audio.
by Faro
Transcodes, clips, concatenates, overlays images or video, burns in subtitles, and draws text labels on video or audio files in a single call. Outputs MP4, WebM, GIF, MKV, or AVI for video, and MP3, AAC, WAV, FLAC, or OGG for audio.
A single call handles the full chain of common edits (cut, join, overlay, caption, transcode), so agents do not need to orchestrate multiple ffmpeg-style steps or manage intermediate files. Wide format support covers both modern web delivery (MP4, WebM) and legacy or editorial workflows (MKV, AVI, FLAC).
Subtitles can be burned in from a provided track, and text labels support an optional background box for readability. Returns a downloadable output ready to drop into a CDN or social-upload step.
Burn a single text label into the video, optionally with a background box and time window.
Additional clip URLs to concatenate after input_url. Output duration = sum of all clips. Up to 20 extra clips.
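The duration rule and the clip limit above can be checked before submitting a request. The following is a minimal sketch, not part of the tool itself: the helper name `concat_duration` and the constant `MAX_EXTRA_CLIPS` are hypothetical, and it assumes the caller already knows each clip's duration in seconds.

```python
# Hypothetical client-side helper; not part of the tool's API.
MAX_EXTRA_CLIPS = 20  # limit stated in the parameter description


def concat_duration(primary_seconds: float, extra_seconds: list[float]) -> float:
    """Return the expected output duration when clips are concatenated.

    primary_seconds is the duration of input_url; extra_seconds holds the
    durations of the additional clips, in the order they are appended.
    """
    if len(extra_seconds) > MAX_EXTRA_CLIPS:
        raise ValueError(
            f"at most {MAX_EXTRA_CLIPS} extra clips are allowed, "
            f"got {len(extra_seconds)}"
        )
    # Output duration = sum of all clips (primary plus extras).
    return primary_seconds + sum(extra_seconds)


if __name__ == "__main__":
    print(concat_duration(12.5, [30.0, 7.5]))  # 50.0
```

Rejecting an over-long list locally is a design choice in this sketch; the service may report the same condition with its own error.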
Overlay another video or image on top of the primary input.
Clip end time in seconds. Single-input mode only — ignored if concat is set.
Pre-signed GET URL of the input video or audio file. Obtain via POST /uploads/presign on faro-api.
Video quality (CRF). 0=lossless, 18=visually lossless, 23=default H.264, 28=lower quality. Only applies to H.264 (mp4/mkv/avi) output.
Download URL TTL in seconds (default 3600, i.e. 1h; max 86400, i.e. 24h).
Output resolution, e.g. '1280x720'. Set one dimension to '-2' to have it auto-scaled so the aspect ratio is kept, e.g. '1280:-2'.
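To illustrate what an auto-scaled dimension works out to, here is a rough sketch that resolves a resolution string against known source dimensions. The function names are hypothetical. It accepts both 'x' and ':' as separators because both appear in the description, and it assumes the ffmpeg scale-filter convention that '-2' yields a value divisible by 2; the service's exact rounding may differ.

```python
# Hypothetical helpers for previewing the output size; not part of the tool.


def round_even(value: float) -> int:
    """Round to the nearest even integer, never below 2."""
    return max(2, 2 * round(value / 2))


def resolve_resolution(spec: str, src_width: int, src_height: int) -> tuple[int, int]:
    """Resolve a spec such as '1280x720' or '1280:-2' to concrete pixels."""
    separator = "x" if "x" in spec else ":"
    width_text, height_text = spec.split(separator)
    width, height = int(width_text), int(height_text)
    if width == -2 and height == -2:
        raise ValueError("only one dimension can be auto-scaled")
    if width == -2:
        # Derive width from the requested height and the source aspect ratio.
        width = round_even(height * src_width / src_height)
    elif height == -2:
        # Derive height from the requested width and the source aspect ratio.
        height = round_even(width * src_height / src_width)
    return width, height


if __name__ == "__main__":
    print(resolve_resolution("1280:-2", 1920, 1080))  # (1280, 720)
```

For a 1920x1080 source, '1280:-2' resolves to 1280x720, the same result as requesting '1280x720' explicitly.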
Clip start time in seconds.
Audio bitrate, e.g. '128k', '192k', '320k'.
Target container/format. Use mp3/aac/wav/flac/ogg to extract audio only.
Pre-signed GET URL of an SRT or VTT subtitle file to burn into the video.
Filename for the output file. Defaults to `output.<ext>`.
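Several of the parameters above interact: the clip end time is ignored when concat is set, CRF only applies to H.264 output, and the TTL has a 24h ceiling. The sketch below assembles a request body that respects those rules. Only `input_url` and `concat` are names taken from the descriptions; every other field name (`format`, `start`, `end`, `crf`, `ttl`, `filename`), the function name, and the example URLs are assumptions for illustration and should be checked against the tool's actual schema.

```python
# Hypothetical request builder. Field names other than input_url and concat
# are assumed, not confirmed by the tool's documentation.
H264_FORMATS = {"mp4", "mkv", "avi"}  # formats the CRF setting applies to
DEFAULT_TTL = 3600                    # 1h, in seconds
MAX_TTL = 86400                       # 24h, in seconds


def build_request(input_url, output_format="mp4", concat=None, start=None,
                  end=None, crf=None, ttl=DEFAULT_TTL, filename=None):
    """Build a request dict, dropping options the tool would ignore."""
    payload = {"input_url": input_url, "format": output_format}
    if concat:
        payload["concat"] = list(concat)
    if start is not None:
        payload["start"] = start
    # The end time is single-input only, so omit it when concat is set.
    if end is not None and not concat:
        payload["end"] = end
    # CRF only affects H.264 output, so omit it for other formats.
    if crf is not None and output_format in H264_FORMATS:
        payload["crf"] = crf
    # Clamp the TTL into the documented range rather than sending an
    # out-of-range value (a choice made for this sketch).
    payload["ttl"] = min(max(1, ttl), MAX_TTL)
    if filename is not None:
        payload["filename"] = filename  # otherwise the tool's default applies
    return payload


if __name__ == "__main__":
    request = build_request(
        "https://example.com/a.mp4",            # placeholder URL
        concat=["https://example.com/b.mp4"],   # placeholder URL
        end=10, crf=18, ttl=999999,
    )
    print(request)
```

Dropping ignored options on the client keeps the request unambiguous; an alternative is to raise an error so the caller notices the conflicting settings.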