Name: Keji
Type: User
Company: Institute of Automation, Chinese Academy of Sciences
Bio: Ph.D. student at CASIA NLPR CRIPAC. Research interests involve Machine Learning, Multimodality, and Embodied AI.
Location: BEIJING, CHINA
Keji's Projects
Reading list for research topics in embodied vision
Reading list for research topics in multimodal machine learning
A curated list of Multimodal Related Research.
Extended LaTeX template for CVPR/ICCV papers
Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)
A human-annotated, fine-grained dataset for Vision-and-Language Navigation
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It contains 126k navigation instructions in English, Hindi and Telugu, and 126k navigation following demonstrations. Both annotation types include dense spatiotemporal alignments between the text and the visual perceptions of the annotators
[ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"