Metadata
Top level metadata for the set can be found primarily through ego4d.json available via the EGO4D CLI with the --metdata option (or enabled by default with any other primary dataset).
Metadata - ego4d.json schema
date(string): Date of generation.version(string): Dataset specific version.description(string)videos(array)- Items (object)
video_uid(string): The unique, primary video id.duration_sec(number)scenarios(array)- Items (string)
video_metadata(object)fps(number)num_frames(integer): The number of frames in the video stream.video_codec(string)display_resolution_width(['integer', 'null'])display_resolution_height(['integer', 'null'])sample_resolution_width(['integer', 'null'])sample_resolution_height(['integer', 'null'])mp4_duration_sec(number)video_start_sec(number): The start time of the vido stream (>= 0 for sync offset).video_duration_sec(number): The duration of the video stream (<= container duration).audio_start_sec(['null', 'number']): The start time of the audio stream (>= 0 for sync offset).audio_duration_sec(['null', 'number']): The duration of the audio stream (<= container duration).video_start_pts(integer)video_duration_pts(integer)video_base_numerator(integer)video_base_denominator(integer)audio_start_pts(['integer', 'null'])audio_duration_pts(['integer', 'null'])audio_base_numerator(['integer', 'null'])audio_base_denominator(['integer', 'null'])
split_em(['null', 'string']): Split (train/test/val) for Episodic Memory benchmark tasks (per video).split_av(['null', 'string']): FHO splits are clip dependent - specified for video only where consistent (or multi).split_fho(['null', 'string']): Split (train/test/val) for AV benchmark tasks (per video).s3_path(string): Path on AWS share - for reference, download via the CLI.origin_video_id(string): A university assigned id (no standardization across universities).video_source(string): The origin university that collected the data.device(['null', 'string'])physical_setting_name(['null', 'string']): The physical setting if a 3d scan exists.fb_participant_id(['integer', 'null']): A sequentially assigned participant id - entirely unrelated to FB.is_stereo(boolean): Is the video stereoscopic.has_imu(boolean)has_gaze(boolean)imu_s3_path(['null', 'string'])imu_manifold_path(['null', 'string'])gaze_s3_path(['null', 'string'])gaze_manifold_path(['null', 'string'])video_components(array)- Items (object)
video_component_uid(string)video_uid(string)component_idx(integer)redacted(boolean)canonical_video_start_sec(number)canonical_video_end_sec(number)canonical_video_start_frame(integer)canonical_video_end_frame(integer)video_metadata(object)fps(number)num_frames(integer)video_codec(string)display_resolution_width(integer)display_resolution_height(integer)sample_resolution_width(integer)sample_resolution_height(integer)mp4_duration_sec(number)video_start_sec(['null', 'number'])video_duration_sec(['null', 'number'])audio_start_sec(['null', 'number'])audio_duration_sec(['null', 'number'])video_start_pts(integer)video_duration_pts(['integer', 'null'])video_base_numerator(integer)video_base_denominator(integer)audio_start_pts(['integer', 'null'])audio_duration_pts(['integer', 'null'])audio_base_numerator(['integer', 'null'])audio_base_denominator(['integer', 'null'])
- Items (object)
concurrent_setshas_redacted_regions(boolean)redacted_intervals(array)- Items (object)
start_sec(number)end_sec(number)start_frame(integer)end_frame(integer)
- Items (object)
gaps(null)
- Items (object)
concurrent_video_sets(array)- Items (object)
concurrent_video_set_id(integer)valid(boolean)videos(array)- Items (object)
concurrent_video_set_id(integer)video_uid(string)video_start_offset_sec(number)
- Items (object)
- Items (object)
physical_settings(array)- Items (object)
name(string)fb_physical_setting_id(integer)source(string)s3_path(string)
- Items (object)
clips(array)- Items (object)
clip_uid(string)video_uid(string)video_start_sec(number)video_end_sec(number)video_start_frame(integer)video_end_frame(integer)clip_metadata(object)fps(number)num_frames(integer)video_codec(string)display_resolution_width(integer)display_resolution_height(integer)sample_resolution_width(integer)sample_resolution_height(integer)mp4_duration_sec(number)video_start_sec(null)video_duration_sec(number)audio_start_sec(null)audio_duration_sec(['null', 'number'])video_start_pts(integer)video_duration_pts(integer)video_base_numerator(integer)video_base_denominator(integer)audio_start_pts(['integer', 'null'])audio_duration_pts(['integer', 'null'])audio_base_numerator(['integer', 'null'])audio_base_denominator(['integer', 'null'])
s3_path(string)manifold_path(string)
- Items (object)