A lighter network (like FlowNet) that estimates the motion between frames.
Following a specific target through various lighting and occlusion changes.
The resulting multi-dimensional data used to identify objects or segment pixels within the mp4 file. 🧪 Common Use Cases
A heavy network (like ResNet) extracts "deep features" only from select frames.