
Ego4D and EgoExo4D Challenge 2025

Overview

At the EgoVis workshop during CVPR 2025, we will host 14 challenges representing the Ego4D and EgoExo4D benchmarks. This year we will have 9 challenges from the Ego4D dataset and 5 from the EgoExo4D dataset. Please find details on the challenges below:

Ego4D challenges

Episodic Memory:

  • Visual queries with 2D localization (VQ2D): Given an egocentric video clip and an image crop depicting the query object, return the most recent occurrence of the object in the input video, in terms of contiguous bounding boxes (2D + temporal localization).
    • Quickstart: Open in Colab
  • Natural language queries (NLQ): Given a video clip and a query expressed in natural language, localize the temporal window within all the video history where the answer to the question is evident (a sketch of this output format follows this list).
    • Quickstart: Open in Colab
  • Moments queries (MQ): Given an egocentric video and an activity name (i.e., a “moment”), localize all instances of that activity in the past video.
  • Goal Step: Given an untrimmed egocentric video, identify the temporal action segment corresponding to a natural language description of the step. Specifically, predict the (start_time, end_time) for a given keystep description.
  • Ego Schema: Given a very long-form video and a multiple-choice question about it, select the correct answer. The benchmark evaluates the long-form video understanding capabilities of modern vision and language systems.
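
For temporal-localization tracks such as NLQ and Goal Step, predictions are typically serialized as a single JSON file for upload to EvalAI. The sketch below is illustrative only: the field names (clip_uid, predicted_times, etc.) are assumptions, and the authoritative schema is specified on each challenge's EvalAI page.

```python
# Hypothetical sketch of a temporal-localization predictions file (NLQ /
# Goal Step style). All field names are illustrative assumptions; consult
# the challenge's EvalAI page for the authoritative schema.
import json

predictions = {
    "version": "1.0",
    "challenge": "ego4d_nlq",  # hypothetical identifier
    "results": [
        {
            "clip_uid": "0000-example-clip-uid",  # placeholder clip UID
            "query_idx": 0,
            # Candidate temporal windows as (start_sec, end_sec), best first.
            "predicted_times": [[12.4, 18.9], [40.1, 45.0]],
        }
    ],
}

with open("nlq_predictions.json", "w") as f:
    json.dump(predictions, f)
```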

Social Understanding:

  • Looking at me: Given an egocentric video clip, identify whether someone in the scene is looking at the camera wearer.
  • Talking to me: Given an egocentric video clip, identify whether someone in the scene is talking to the camera wearer.

Forecasting:

  • Short-term object interaction anticipation: Given a video clip, predict the next active objects and, for each of them, the next action and the time to contact (a sketch of the expected output follows this list).
    • Quickstart: Open in Colab
  • Long-term activity prediction: Given a video clip, the goal is to predict what sequence of activities will happen in the future. For example, after kneading dough, list the actions that the baker will do next.
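
The short-term anticipation task couples detection with anticipation. Below is a minimal sketch of what one prediction might contain; the field names are illustrative assumptions, not the official submission schema.

```python
# Hypothetical sketch of one short-term anticipation prediction: for each
# next active object, a 2D box, a (verb, noun) action label, a time to
# contact in seconds, and a confidence score. Field names are assumptions.
prediction = {
    "uid": "example-annotation-uid",  # placeholder UID
    "detections": [
        {
            "box": [120.0, 80.0, 260.0, 210.0],  # (x1, y1, x2, y2) in pixels
            "verb": "take",
            "noun": "dough",
            "time_to_contact": 0.75,  # seconds until hand-object contact
            "score": 0.91,
        }
    ],
}
```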

Other Ego4D challenges which are not part of the CVPR 2025 workshop remain open for submissions on the EvalAI website but are not eligible for prizes.

EgoExo4D challenges

Ego-Exo4D is a diverse, large-scale, multimodal, multi-view video dataset and benchmark challenge. It centers on simultaneously captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair).

Here are the specific challenge tracks we will host at the EgoVis workshop during CVPR 2025:

  • Ego-Pose Body: Given an egocentric video, estimate the 3D body pose of the camera wearer. Specifically, predict the 3D positions of the 17 annotated body joints for each frame (a sketch of this output follows this list).

  • Ego-Pose Hands: Estimate the 3D locations of the defined hand joints for visible hand(s). Specifically, estimate the (x,y,z) coordinates of each joint in the egocentric coordinate frame.

  • Correspondence: Given a pair of time-synchronized egocentric and exocentric videos, as well as a query object track in one of the views, output the corresponding mask for the same object instance in the other view for all frames where the object is visible in both views.

    • Refer to the GitHub repository to get started.
  • Fine-grained Keystep Recognition: Predict the keystep label for a trimmed egocentric video clip.

    • Refer to the GitHub repository to get started.
  • Demonstrator Proficiency: Given synchronized egocentric and exocentric video of a demonstrator performing a task, classify the proficiency skill level of the demonstrator.

    • Refer to the GitHub repository to get started.
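
To make the Ego-Pose output concrete, the sketch below shows the expected dimensionality for the body track: one (17, 3) array of joint positions per frame. Joint ordering and coordinate conventions are defined by the official annotations; this illustrates the shape only.

```python
# Minimal sketch of Ego-Pose Body output: per frame, the 3D positions of
# the 17 annotated body joints. Joint ordering and coordinate conventions
# are defined by the official annotations; this shows the shape only.
import numpy as np

num_frames, num_joints = 300, 17  # placeholder clip length
body_pose = np.zeros((num_frames, num_joints, 3), dtype=np.float32)

# Predicted 3D position of joint 0 in frame 10:
x, y, z = body_pose[10, 0]
```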

Other EgoExo4D challenges which are not part of the CVPR 2025 workshop remain open for submissions on the EvalAI website but are not eligible for prizes.

Dataset

Ego4D challenge participants will use Ego4D’s annotated dataset of more than 3,670 hours of video, capturing the daily-life scenarios of more than 900 unique individuals from nine different countries around the world. Distinct train, validation, and unannotated test sets are available to download per challenge at https://ego4d-data.org/docs/. This year’s challenges will continue to use Ego4D v2.0, which contains ~2X the train and validation annotations for Forecasting, Hands & Objects, and NLQ, a number of corrections and usability enhancements, and two new related dataset enhancements (Ego Schema and Goal Step). The test set remains the same as in previous versions of the challenge. More details can be found here.
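
Once downloaded, the benchmark annotations are plain JSON files that can be inspected directly. The path and top-level keys below are assumptions about the v2.0 layout; see the documentation linked above for the authoritative structure.

```python
# Inspect a downloaded Ego4D annotation file. The path and the "videos"
# key are assumptions about the v2.0 layout; consult
# https://ego4d-data.org/docs/ for the authoritative structure.
import json

with open("ego4d_data/v2/annotations/nlq_train.json") as f:
    ann = json.load(f)

print(ann.keys())          # top-level fields of the annotation file
print(len(ann["videos"]))  # number of annotated videos (schema assumption)
```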

EgoExo4D challenge participants will use the EgoExo4D dataset for these challenges. Documentation for the dataset can be found here.

Participation Guidelines

Participate in the contest by registering on the EvalAI challenge page and creating a team. All participants must register as part of a “participating team” on EvalAI to ensure the submission limits are honored. Participants will upload their predictions in the format specified for each challenge, and submissions will be evaluated on AWS instances by comparison against the ground truth annotations. Instructions for training, local evaluation, and online submission are provided on EvalAI. Please refer to the individual EvalAI pages for each challenge for submission guidelines, task specifications, and evaluation criteria.
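
Because submissions are evaluated automatically and submission limits apply, a malformed file costs an attempt. A quick local sanity check before uploading, sketched below under the assumption that the challenge expects a single JSON file, can catch this early.

```python
# Quick sanity check before uploading a predictions file to EvalAI:
# confirm it parses as valid JSON and report its size. Assumes the
# challenge expects a single JSON file named as below.
import json
import os

path = "nlq_predictions.json"  # hypothetical file from the earlier sketch
with open(path) as f:
    json.load(f)  # raises json.JSONDecodeError if the file is malformed

print(f"{path}: {os.path.getsize(path) / 1e6:.2f} MB, valid JSON")
```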

Dates

  • Ego4D and EgoExo4D challenges will launch on March 5, 2025, with leaderboards closing on May 19, 2025.
  • Winners for both will be announced at the Second Joint Egocentric Vision Workshop at CVPR 2025.

Competition Rules and Prize Information

Competition rules can be found here. Additionally, we are thrilled that FAIR is able to offer the following prizes for each challenge:

  • First place: $500
  • Second place: $300
  • Third place: $200

Challenge Reports

In addition to the submission on EvalAI, participants must submit a report describing their method via the workshop CMT link. Along with your method and results, please remember to include examples of positive and negative results (limitations) of your model. These validation reports will be evaluated by challenge hosts from the Ego4D consortium before winners are determined. For all successful submissions, the challenge validation report, research code from winning entries, and the names of participants on winning teams must be shared publicly with the research community. More details can be found on the EgoVis workshop page.

Acknowledgements

The Ego4D and EgoExo4D challenges would not have been possible without the infrastructure and support of the EvalAI team. Thank you!

Organizers

  • Xizi Wang
  • Suyog Jain
  • Andrew Westbury
  • Chen Zhao
  • Merey Ramazanova
  • Francesco Ragusa
  • Tushar Nagarajan
  • Karttikeya Mangalam
  • Raiymbek Akshulakov
  • Sherry Xue
  • Jinxu Zhang
  • Shan Shu
  • Gabriel Pérez Santamaria
  • Juanita Puentes
  • Maria Camila Escobar Palomeque
  • Arjun Somayazulu
  • Sanjay Haresh
  • Yale Song
  • Manolis Savva
  • Giovanni Maria Farinella
  • Pablo Arbelaez
  • Jianbo Shi
  • Kristen Grauman

Past Challenges / Winners

CVPR Workshop 2024 (June 17, 2024)

CVPR Workshop 2023 (June 19, 2023)

ECCV Workshop 2022 (Oct 24, 2022)

CVPR Workshop 2022 (June 19, 2022)