👨🏻‍💻 About

I am Xuchen Li (李旭宸), a first-year Ph.D. student at Institute of Automation, Chinese Academy of Sciences (CASIA ), supervised by Prof. Kaiqi Huang (IAPR Fellow). Additionally, I am a member of Visual Intelligence Interest Group (VIIG ).

Before that, I received my B.E. degree in Computer Science and Technology at School of Computer Science (SCS ) from Beijing University of Posts and Telecommunications (BUPT ) in Jun. 2024.

I am grateful to work with Dr. Shiyu Hu, which has a significant impact on me. I am also grateful to be growing up and studying with my twin brother Xuzhao Li, which is a truly unique and special experience for me.

My research focuses on Visual Language Tracking, Multi-modal Learning, Data-centric AI, Large Language Model and AI4Science. If you are interested in my work or would like to collaborate, please feel free to contact me.

🔥 News

2024.06: 📝 One paper has been accepted by the 7th Chinese Conference on Pattern Recognition and Computer Vision (PRCV, CCF-C Conference).
2024.06: 👨‍🎓 Obtain my B.E. degree from Beijing University of Posts and Telecommunications (BUPT). I will always remember the wonderful 4 years I spent here. Thanks to all!
2024.05: 🏆 Obtain Beijing Outstanding Graduates (北京市优秀毕业生) (Top 5%, only 38 students obtain this honor of SCS, BUPT)!
2024.05: 📣 Present our work during the 14th Vision and Learning Seminar (VALSE), see our poster for more information!
2024.04: 📝 One paper has been accepted as Oral Presentation and awarded Best Paper Honorable Mention Award by the 3rd CVPR Workshop on Vision Datasets Understanding (CVPRW, Workshop in CCF-A Conference, Oral, Best Paper Honorable Mention Award)!
2023.12: 🏆 Obtain College Scholarship of University of Chinese Academy of Sciences (中国科学院大学大学生奖学金) (only 17 students win this scholarship of CASIA)!
2023.12: 🏆 Obtain China National Scholarship (国家奖学金) with a rank of 1/455 (0.22%) (Top 1%, the highest honor for undergraduates in China)!
2023.11: 🏆 Obtain Beijing Merit Student (北京市三好学生) (Top 1%, only 36 students obtain this honor of BUPT)!
2023.09: 📝 One paper has been accepted by the 37th Conference on Neural Information Processing Systems (NeurIPS, CCF-A Conference, Poster)!
2022.12: 🏆 Obtain Huawei AI Education Base Scholarship (华为智能基座奖学金) (only 20 students win this scholarship of BUPT)!
2022.12: 🏆 Obtain China National Scholarship (国家奖学金) with a rank of 2/430 (0.47%) (Top 1%, the highest honor for undergraduates in China)!

📖 Educations

2024.09 - 2029.06 (Expected), Ph.D. student
Pattern Recognition and Intelligent System
Institute of Automation, Chinese Academy of Sciences, Beijing

2020.09 - 2024.06, B.E. degree
Computer Science and Technology, Overall Ranking 1/449 (0.22%)
School of Computer Science
Beijing University of Posts and Telecommunications, Beijing

💻 Experiences

2024.06 - now: Research intern on multi-modal large language model at Ant Group (ANT ), advised by Dr. Jian Wang and Dr. Ming Yang.
2023.05 - 2024.04: Member of Artificial Intelligence Elites Class at Institute of Automation, Chinese Academy of Sciences (CASIA ), supervised by Prof. Kaiqi Huang (IAPR Fellow).
2023.01 - 2023.05: Research intern on 3D vision at Tsinghua University (THU ), advised by Prof. Haoqian Wang.

📝 Publications

✅ Acceptance

CVPRW 2024

DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM

Xuchen Li, Xiaokun Feng, Shiyu Hu, Meiqi Wu, Dailing Zhang, Jing Zhang, Kaiqi Huang

CVPRW 2024 (Workshop in CCF-A Conference, Oral, Best Paper Honorable Mention Award): the 3rd CVPR Workshop on Vision Datasets Understanding
[Paper] [PDF] [Code] [Website] [Award] [Poster] [Slides] [BibTeX]
📌 Visual Language Tracking 📌 LLM 📌 Evaluation Technique

NeurIPS 2023

A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and Causal Relationship

Shiyu Hu, Dailing Zhang, Meiqi Wu, Xiaokun Feng, Xuchen Li, Xin Zhao, Kaiqi Huang

NeurIPS 2023 (CCF-A Conference, Poster): the 37th Conference on Neural Information Processing Systems
[Paper] [PDF] [Code] [Website] [Poster] [Slides] [BibTeX]
📌 Visual Language Tracking 📌 Video Understanding 📌 Hierarchical Annotation

PRCV 2024

VS-LLM: Visual-Semantic Depression Assessment based on LLM for Drawing Projection Test

Meiqi Wu, Yaxuan Kang, Xuchen Li, Shiyu Hu, Xiaotang Chen, Yunfeng Kang, Weiqiang Wang, Kaiqi Huang

PRCV 2024 (CCF-C Conference): the 7th Chinese Conference on Pattern Recognition and Computer Vision
[PDF] [Code]
📌 Psychological Assessment 📌 LLM 📌 AI4Science

☑️ Ongoing

CCF-A

DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM

Xuchen Li, Shiyu Hu, Xiaokun Feng, Dailing Zhang, Meiqi Wu, Jing Zhang, Kaiqi Huang

Submitted to a CCF-A conference, Under Review
📌 Visual Language Tracking 📌 LLM 📌 Benchmark Construction

CCF-A

MemVLT: Visual-Language Tracking with Adaptive Memory-based Prompts

Xiaokun Feng, Xuchen Li, Shiyu Hu, Dailing Zhang, Meiqi Wu, Jing Zhang, Xiaotang Chen, Kaiqi Huang

Submitted to a CCF-A conference, Under Review
📌 Visual Language Tracking 📌 Human-like Modeling 📌 Adaptive Prompts

CCF-A

Unconstrained Multimodal Air-Writing Benchmark: Writing by Moving Your Fingers in 3D

Meiqi Wu, Xuchen Li, Shiyu Hu, Yuanqiang Cai, Kaiqi Huang, Weiqiang Wang

Submitted to a CCF-A conference, Under Review
📌 Air Writing 📌 Benchmark Construction 📌 Human-machine Interaction

CCF-A

Beyond Accuracy: Tracking more like Human through Visual Search

Dailing Zhang, Shiyu Hu, Xiaokun Feng, Xuchen Li, Meiqi Wu, Jing Zhang, Kaiqi Huang

Submitted to a CCF-A conference, Under Review
📌 Visual Object Tracking 📌 Visual Search Mechanism 📌 Visual Turing Test

🏆 Honors

Best Paper Honorable Mention Award (最佳论文荣誉提名奖), at the 3rd CVPR Workshop on Vision Datasets Understanding, 2024
China National Scholarship (国家奖学金), My Rank: 1/455 (0.22%), Top 1%, at BUPT, by Ministry of Education of China, 2023
China National Scholarship (国家奖学金), My Rank: 2/430 (0.47%), Top 1%, at BUPT, by Ministry of Education of China, 2022
Huawei AI Education Base Scholarship (华为智能基座奖学金), at BUPT, by Ministry of Education of China and Huawei AI Education Base Joint Working Group, 2022
Beijing Merit Student (北京市三好学生), Top 1%, at BUPT, by Beijing Municipal Education Commission, 2023
Beijing Outstanding Graduates (北京市优秀毕业生), Top 5%, at BUPT, by Beijing Municipal Education Commission, 2024
College Scholarship of University of Chinese Academy of Sciences (中国科学院大学大学生奖学金), at CASIA, by University of Chinese Academy of Sciences, 2023

🔗 Services

Reviewer: ICPR 2024

🌟 Projects

GOT-10k Platform

GOT-10k: A Large High-diversity Benchmark and Evaluation Platform for Single Object Tracking

Visual Object Tracking / Evaluation Technology / Large High-diversity Benchmark
As of June 2024, the platform has received 3.66M+ page views, 7.2k+ downloads, 20.5k+ trackers from 160+ countries and regions worldwide.
GOT-10k is the supporting platform for research accepted by IEEE TPAMI 2021.

VideoCube / MGIT Platform

VideoCube / MGIT: A Large-scale Multi-dimensional Multi-modal Global Instance Tracking Intelligent Evaluation Platform

Visual Object Tracking / Visual Language Tracking / Long Video Understanding and Reasoning / Intelligent Evaluation Technology
As of June 2024, the platform has received 389k+ page views, 1.1k+ downloads, 410+ trackers from 130+ countries and regions worldwide.
VideoCube / MGIT is the supporting platform for research accepted by IEEE TPAMI 2023 and NeurIPS 2023.

SOTVerse Platform

SOTVerse: A User-defined Single Object Tracking Task Space

Visual Object Tracking / Dynamic Open Environment Construction / Visual Evaluation Technique
As of June 2024, the platform has received 120k+ page views from 100+ countries and regions worldwide.
SOTVerse is the supporting platform for research accepted by IJCV 2023.

🤝 Collaborators

Shiyu Hu, Ph.D. at the Institute of Automation, Chinese Academy of Sciences (CASIA ) and University of Chinese Academy of Sciences (UCAS ), focusing on visual object tracking, visual language tracking, benchmark construction, intelligent evaluation technique, and AI4Science.
Xiaokun Feng, Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (CASIA ), focusing on visual object tracking, visual language tracking, and AI4Science.
Dailing Zhang, Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (CASIA ), focusing on visual object tracking, visual language tracking, and AI4Science.
Meiqi Wu, Ph.D. student at the University of Chinese Academy of Sciences (UCAS ), focusing on computer vision, intelligent evaluation technique, and human-computer interaction.
Xuzhao Li, M.S. student at Beijing Institute of Technology (BIT ), focusing on multi-agent path planning and trajectory prediction.
Jing Zhang, research assistant at the Institute of Automation, Chinese Academy of Sciences (CASIA ), focusing on computer vision and AI4Science.
Yaxuan Kang, design researcher, research assistant and interaction designer at the Institute of Automation, Chinese Academy of Sciences (CASIA ), focusing on human-computer interaction.

My homepage visitors have been recorded since February 2024. Thanks for your attention.