Visualization Tool

The dataset visualization tool is publicly available at: https://visualize.ego4d-data.org. You will need an approved license for access.

Filtering

One of the first things you'll likely want to do is filter by benchmark.
Filter queries use a simple syntax: property operator value expressions, chained together with AND, OR, and parentheses. Any value that contains a space or parentheses must be surrounded by double quotes.
For example:

  1. benchmarks include moments
  2. video_uid == a37f501d-5cc1-4cc2-8ac2-1ec4e66a86d2
  3. benchmarks include fho_hands AND modalities include imu
  4. duration > 5000
  5. moments.activities include "cut_open_a_package_(e.g._with_scissors)"

Autocomplete helps you type these queries. Once your query is entered, click anywhere outside the autocomplete dropdown to finish, rather than pressing Enter.
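
If you assemble filter queries outside the tool before pasting them into the filter box, the quoting rule above can be sketched in a few lines of Python. This is only an illustration: `quote_value` and `build_filter` are hypothetical helper names, not part of the visualization tool, and values that themselves contain double quotes are not handled, since the source does not say how the tool treats them.

```python
def quote_value(value: str) -> str:
    """Wrap a value in double quotes if it contains a space or a parenthesis."""
    if any(ch in value for ch in " ()"):
        return f'"{value}"'
    return value


def build_filter(prop: str, op: str, value: str) -> str:
    """Format a single `property operator value` expression (hypothetical helper)."""
    return f"{prop} {op} {quote_value(value)}"


# Chain two expressions with AND, as in example 3 above.
query = " AND ".join([
    build_filter("benchmarks", "include", "fho_hands"),
    build_filter("modalities", "include", "imu"),
])
print(query)

# A value containing parentheses gets quoted, as in example 5 above.
print(build_filter("moments.activities", "include",
                   "cut_open_a_package_(e.g._with_scissors)"))
```

The resulting strings are meant to be pasted into the filter box by hand; nothing here talks to the tool itself.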

Interactive Annotations

Once you click into a video, you'll see it with all its annotations. Many annotations are interactive.
These include:

  1. Video Frames
  2. Video Times
  3. Response Tracks
  4. Visual Crops

Any annotation that shows an underline on hover is clickable and will usually take you to that moment in the video.

You'll also notice that many annotations have custom labels based on their context. For example, SCOD object state changes show their pre, PNR (point of no return), and post times before expansion.

Visualization Modules

Each benchmark has modules to visualize its data types.
These include:

  1. Time Segments

    These show interactive start/end segments. Click a block to jump to its start; shift+click it to jump to its end. The black line indicates the streaming video's current timestamp. A greedy algorithm assigns segments to tracks so that no two segments on the same track overlap and as few tracks as possible are used. Each label always has the same color, but a color can be reused across multiple labels.

  2. Labeled Timestamps

    Some annotations, such as narrations, are labels attached to specific times in the video. This module shows any timestamp labels near the current video time.

  3. Bounding Boxes

    Benchmarks with bounding boxes, like visual queries, are visualized directly on the video. Bounding boxes that track an object across frames (e.g. response tracks) are interpolated across them. The FHO hands benchmark uses points, not areas, to represent hands, so these are shown with fixed-size circles instead of bounding boxes.
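
The greedy track assignment mentioned under Time Segments can be illustrated with the classic interval-partitioning approach. The tool's actual implementation is not documented here, so treat this as a sketch under assumptions: segments are (start, end) pairs, a segment that starts exactly where another ends does not count as overlapping, and `assign_tracks` is a hypothetical name.

```python
import heapq


def assign_tracks(segments):
    """Assign each (start, end) segment a track index so that segments on the
    same track never overlap, using as few tracks as possible.

    Returns a list of track indices aligned with the input order.
    """
    # Visit segments in order of start time (ties broken by end time).
    order = sorted(range(len(segments)), key=lambda i: segments[i])
    tracks = [None] * len(segments)
    # Min-heap of (end time of the last segment on the track, track index).
    heap = []
    n_tracks = 0
    for i in order:
        start, end = segments[i]
        if heap and heap[0][0] <= start:
            # The track that frees up earliest is free: reuse it.
            _, track = heapq.heappop(heap)
        else:
            # Every existing track is still busy: open a new one.
            track = n_tracks
            n_tracks += 1
        tracks[i] = track
        heapq.heappush(heap, (end, track))
    return tracks


print(assign_tracks([(0, 10), (5, 15), (10, 20), (12, 18)]))  # [0, 1, 0, 2]
```

In this example three segments are active at the same time between 12 and 15, so three tracks is the minimum any layout could use.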
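
To make the interpolation of tracked bounding boxes concrete, here is a minimal sketch that assumes simple linear interpolation between two annotated frames. The source does not state which interpolation method the tool uses, and the (x, y, width, height) box layout and the `interpolate_box` name are assumptions made for illustration only.

```python
def interpolate_box(frame, key_a, key_b):
    """Linearly interpolate a bounding box at `frame` between two keyframes.

    Each keyframe is (frame_number, (x, y, width, height)); this layout is an
    assumption, not the tool's documented format.
    """
    frame_a, box_a = key_a
    frame_b, box_b = key_b
    if frame_b == frame_a:
        # Both keyframes refer to the same frame: nothing to interpolate.
        return tuple(box_a)
    # Fraction of the way from keyframe A to keyframe B.
    t = (frame - frame_a) / (frame_b - frame_a)
    return tuple(a + t * (b - a) for a, b in zip(box_a, box_b))


# Halfway between frame 10 and frame 20.
box = interpolate_box(15, (10, (100, 50, 40, 40)), (20, (120, 70, 60, 40)))
print(box)  # (110.0, 60.0, 50.0, 40.0)
```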