A lighter network (like FlowNet) that estimates the motion between frames.

Following a specific target through various lighting and occlusion changes.

The resulting multi-dimensional data used to identify objects or segment pixels within the mp4 file.

🧪 Common Use Cases

A heavy network (like ResNet) extracts "deep features" only from select frames.
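
The split between a heavy feature network on select frames and a lighter flow network on the rest can be sketched roughly as below. This is a minimal illustration, not a definitive implementation: `heavy_net` and `flow_net` are hypothetical callables standing in for ResNet-style and FlowNet-style models, `key_interval` is an assumed fixed key-frame spacing, and the warp uses nearest-neighbor sampling for brevity where a real system would more likely use bilinear sampling.

```python
import numpy as np


def warp_features(features, flow):
    """Warp an (H, W, C) feature map with an (H, W, 2) flow field.

    Assumed convention: flow[y, x] = (dx, dy) points from a pixel in the
    current frame to its matching location in the key frame.
    Sampling is nearest-neighbor, clamped at the borders.
    """
    h, w, _ = features.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    src_x = np.clip(np.rint(xs + flow[..., 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + flow[..., 1]).astype(int), 0, h - 1)
    return features[src_y, src_x]


def propagate(frames, heavy_net, flow_net, key_interval=5):
    """Run heavy_net only on key frames; warp its features elsewhere."""
    outputs = []
    key_frame = None
    key_feats = None
    for i, frame in enumerate(frames):
        if i % key_interval == 0:
            # Key frame: pay for the expensive feature extraction.
            key_frame, key_feats = frame, heavy_net(frame)
            outputs.append(key_feats)
        else:
            # Other frames: estimate motion to the key frame, then reuse
            # the key-frame features by warping them.
            flow = flow_net(frame, key_frame)
            outputs.append(warp_features(key_feats, flow))
    return outputs
```

With `key_interval=5`, the heavy network runs on one frame in five, and the remaining frames cost only a flow estimate plus a warp. Whether that trade is worthwhile depends on how quickly the scene changes between key frames.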