Glossary
- pts
Presentation Time Stamp. The time at which a frame should be played. In TorchCodec, pts are expressed in seconds.
- best stream
The notion of “best” stream is determined by FFmpeg. Quoting the FFmpeg docs:
The best stream is determined according to various heuristics as the most likely to be what the user expects.
- scan
A scan corresponds to an entire pass over a video file, with the purpose of retrieving metadata about the different streams and frames. It does not involve decoding, so it is a lot cheaper than decoding the file. The
VideoDecoder
performs a scan when usingseek_mode="exact"
, and doesn’t scan when usingseek_mode="approximate"
.- clips
A clip is a sequence of frames, usually in pts order. The frames may not necessarily be consecutive. A clip is represented as a 4D
FrameBatch
. A group of clips, which is what the samplers return, is represented as 5DFrameBatch
.