The dataset is diverse in its geographic coverage, scenarios,
participants and captured modalities. We consulted a survey
U.S. Bureau of Labor Statistics
that captures how people spend the bulk of their time.
Data was captured using seven different off-the-shelf
head-mounted cameras: GoPro, Vuzix Blade, Pupil Labs, ZShades,
OR- DRO EP6, iVue Rincon 1080, and Weeview.
In addition to video, portions of Ego4D offer other data
modalities: 3D scans, audio, gaze, stereo, multiple
synchronized wearable cameras, and textual narrations.
Check our ArXiv paper for details .