Movement clustering · 6-modality ablation
When the hand is gone for 70-90% of frames, what else can we cluster on? Side-by-side comparison of V-JEPA, OWLv2 tool-presence, optical-flow rhythm, ego-motion, body-pose, and a late-fusion baseline across all 5 featured demo clips.
6 modalities
5 clips
RepNet stand-in