Awesome Egocentric

Surveys

Papers

Episodic Memory

Moments Queries

Referring Image Segmentation

Referring Video Object Segmentation

Video Captioning

Embodied Agent Learning

VLN (Vision-and-Language Navigation)

RL (Reinforcement Learning)

MARL (Multiagent Reinforcement Learning)

Egocentric Video Summarization

VQA (Visual Question Answering)

VLP (Vision Language Pretraining)

Action/Activity Recognition

Hand-Object Interactions

Unsupervised Domain Adaptation

Domain Generalization

Action Anticipation

Short-Term Action Anticipation

Long-Term Action Anticipation

Future Gaze Prediction

Trajectory Prediction

Region Prediction

Multi-Modalities

Audio-Visual

Depth

Thermal

Event

Temporal Segmentation (Action Detection)

Retrieval

Few-Shot Action Recognition

Gaze

From Third-Person to First-Person

User Data from an Egocentric Point of View

Localization

Privacy Protection

Social Interactions

Multiple Egocentric Tasks

  • Ego4D: Around the World in 3,000 Hours of Egocentric Video - Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbelaez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik, arXiv. [Github] [project page] [video]

Activity-context

Video summarization

Applications

Human to Robot

Assistive Egocentric Vision

Popular Architectures

2D

3D

RNN

Transformer

Other EGO-Context

Challenges

  • Ego4D - Episodic Memory, Hand-Object Interactions, AV Diarization, Social, Forecasting.

  • EPIC-Kitchens Challenge - Action Recognition, Action Detection, Action Anticipation, Unsupervised Domain Adaptation for Action Recognition, Multi-Instance Retrieval.