👨🏻💻 About
I am Xuchen Li (李旭宸), an incoming Ph.D. student at Institute of Automation, Chinese Academy of Sciences (CASIA ), supervised by Prof. Kaiqi Huang (IAPR Fellow). Additionally, I am a member of Visual Intelligence Interest Group (VIIG
).
Before that, I received my B.E. degree in Computer Science and Technology at School of Computer Science (SCS ) from Beijing University of Posts and Telecommunications (BUPT
) in Jun. 2024.
I am grateful to work with Dr. Shiyu Hu, which has a significant impact on me. I am also grateful to be growing up and studying with my twin brother Xuzhao Li, which is a truly unique and special experience for me.
My research focuses on Visual Language Tracking, Multi-modal Learning and Large Language Model. If you are interested in my work or would like to collaborate, please feel free to contact me.
🔥 News
- 2024.06: 📝 One paper has been accepted by the 7th Chinese Conference on Pattern Recognition and Computer Vision (PRCV, CCF-C Conference).
- 2024.06: 👨🎓 Obtain my B.E. degree from Beijing University of Posts and Telecommunications (BUPT). I will always remember the wonderful 4 years I spent here. Thanks to all!
- 2024.05: 🏆 Obtain Beijing Outstanding Graduates (北京市优秀毕业生) (Top 5%, only 38 students obtain this honor of SCS, BUPT)!
- 2024.05: 📣 Present our work during the 14th Vision and Learning Seminar (VALSE), see our poster for more information!
- 2024.04: 📝 One paper has been accepted as Oral Presentation and awarded Best Paper Honorable Mention Award by the 3rd CVPR Workshop on Vision Datasets Understanding (CVPRW, Workshop in CCF-A Conference, Oral, Best Paper Honorable Mention Award)!
- 2023.12: 🏆 Obtain College Scholarship of University of Chinese Academy of Sciences (中国科学院大学大学生奖学金) (only 17 students win this scholarship of CASIA)!
- 2023.12: 🏆 Obtain China National Scholarship (国家奖学金) with a rank of 1/455 (0.22%) (Top 1%, the highest honor for undergraduates in China)!
- 2023.11: 🏆 Obtain Beijing Merit Student (北京市三好学生) (Top 1%, only 36 students obtain this honor of BUPT)!
- 2023.09: 📝 One paper has been accepted by the 37th Conference on Neural Information Processing Systems (NeurIPS, CCF-A Conference, Poster)!
- 2022.12: 🏆 Obtain Huawei AI Education Base Scholarship (华为智能基座奖学金) (only 20 students win this scholarship of BUPT)!
- 2022.12: 🏆 Obtain China National Scholarship (国家奖学金) with a rank of 2/430 (0.47%) (Top 1%, the highest honor for undergraduates in China)!
📖 Educations
2024.09 - 2029.06 (Expected), Ph.D. student
Pattern Recognition and Intelligent System
Institute of Automation, Chinese Academy of Sciences, Beijing
2020.09 - 2024.06, B.E. degree
Computer Science and Technology, Ranking 1/449 (0.22%)
School of Computer Science
Beijing University of Posts and Telecommunications, Beijing
💻 Experiences
- 2024.06 - now: Research intern on multi-modal large language model at Ant Group (ANT
), advised by Jian Wang and Ming Yang.
- 2023.05 - 2024.04: Member of Artificial Intelligence Elites Class at Institute of Automation, Chinese Academy of Sciences (CASIA
), supervised by Prof. Kaiqi Huang (IAPR Fellow).
- 2023.01 - 2023.05: Research intern on 3D vision at Tsinghua University (THU
), advised by Prof. Haoqian Wang.
📝 Publications
✅ Acceptance
DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM
Xuchen Li, Xiaokun Feng, Shiyu Hu, Meiqi Wu, Dailing Zhang, Jing Zhang, Kaiqi Huang
CVPRW 2024 (Workshop in CCF-A Conference, Oral, Best Paper Honorable Mention Award): the 3rd CVPR Workshop on Vision Datasets Understanding
[Paper]
[PDF]
[Code]
[Website]
[Award]
[Poster]
[Slides]
[BibTeX]
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and Causal Relationship
Shiyu Hu, Dailing Zhang, Meiqi Wu, Xiaokun Feng, Xuchen Li, Xin Zhao, Kaiqi Huang
NeurIPS 2023 (CCF-A Conference, Poster): the 37th Conference on Neural Information Processing Systems
[Paper]
[PDF]
[Code]
[Website]
[Poster]
[Slides]
[BibTeX]
![sym](../publications/VSLLM.png)
VS-LLM: Visual-Semantic Depression Assessment based on LLM for Drawing Projection Test
Meiqi Wu, Yaxuan Kang, Xuchen Li, Shiyu Hu, Xiaotang Chen, Yunfeng Kang, Weiqiang Wang, Kaiqi Huang
PRCV 2024 (CCF-C Conference): the 7th Chinese Conference on Pattern Recognition and Computer Vision
☑️ Ongoing
![sym](../publications/DTVLT.png)
DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Xuchen Li, Shiyu Hu, Xiaokun Feng, Dailing Zhang, Meiqi Wu, Jing Zhang, Kaiqi Huang
Submitted to a CCF-A conference, Under Review
![sym](../publications/MemVLT.png)
MemVLT: Visual-Language Tracking with Adaptive Memory-based Prompts
Xiaokun Feng, Xuchen Li, Shiyu Hu, Dailing Zhang, Meiqi Wu, Jing Zhang, Xiaotang Chen, Kaiqi Huang
Submitted to a CCF-A conference, Under Review
![sym](../publications/MMAW.png)
Unconstrained Multimodal Air-Writing Benchmark: Writing by Moving Your Fingers in 3D
Meiqi Wu, Xuchen Li, Shiyu Hu, Yuanqiang Cai, Kaiqi Huang, Weiqiang Wang
Submitted to a CCF-A conference, Under Review
![sym](../publications/CPDTrack.png)
Beyond Accuracy: Tracking more like Human through Visual Search
Dailing Zhang, Shiyu Hu, Xiaokun Feng, Xuchen Li, Meiqi Wu, Jing Zhang, Kaiqi Huang
Submitted to a CCF-A conference, Under Review
🎖 Honors
- Best Paper Honorable Mention Award (最佳论文荣誉提名奖), at the 3rd CVPR Workshop on Vision Datasets Understanding, 2024
- China National Scholarship (国家奖学金), My Rank: 1/455 (0.22%), Top 1%, at BUPT, by Ministry of Education of China, 2023
- China National Scholarship (国家奖学金), My Rank: 2/430 (0.47%), Top 1%, at BUPT, by Ministry of Education of China, 2022
- Huawei AI Education Base Scholarship (华为智能基座奖学金), at BUPT, by Ministry of Education of China and Huawei AI Education Base Joint Working Group, 2022
- Beijing Merit Student (北京市三好学生), Top 1%, at BUPT, by Beijing Municipal Education Commission, 2023
- Beijing Outstanding Graduates (北京市优秀毕业生), Top 5%, at BUPT, by Beijing Municipal Education Commission, 2024
- College Scholarship of University of Chinese Academy of Sciences (中国科学院大学大学生奖学金), at CASIA, by University of Chinese Academy of Sciences, 2023
🌟 Projects
GOT-10k: A Large High-diversity Benchmark and Evaluation Platform for Single Object Tracking
- Visual Object Tracking / Evaluation Technology / Large High-diversity Benchmark
- As of June 2024, the platform has received 3.66M+ page views, 7.2k+ downloads, 20.5k+ trackers from 160+ countries and regions worldwide.
- GOT-10k is the supporting platform for research accepted by IEEE TPAMI 2021.
- Visual Object Tracking / Visual Language Tracking / Long Video Understanding and Reasoning / Intelligent Evaluation Technology
- As of June 2024, the platform has received 389k+ page views, 1.1k+ downloads, 410+ trackers from 130+ countries and regions worldwide.
- VideoCube / MGIT is the supporting platform for research accepted by IEEE TPAMI 2023 and NeurIPS 2023.
SOTVerse: A User-defined Single Object Tracking Task Space
- Visual Object Tracking / Dynamic Open Environment Construction / Visual Evaluation Technique
- As of June 2024, the platform has received 120k+ page views from 100+ countries and regions worldwide.
- SOTVerse is the supporting platform for research accepted by IJCV 2023.
🤝 Collaborators
- Shiyu Hu, Ph.D. at the Institute of Automation, Chinese Academy of Sciences (CASIA
) and University of Chinese Academy of Sciences (UCAS
), focusing on visual object tracking, visual language tracking, benchmark construction, intelligent evaluation technique, and AI4Science.
- Xiaokun Feng, Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (CASIA
), focusing on visual object tracking, visual language tracking, and AI4Science.
- Dailing Zhang, Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (CASIA
), focusing on visual object tracking, visual language tracking, and AI4Science.
- Meiqi Wu, Ph.D. student at the University of Chinese Academy of Sciences (UCAS
), focusing on computer vision, intelligent evaluation technique, and human-computer interaction.
- Xuzhao Li, B.E. and incoming M.S. student at Beijing Institute of Technology (BIT
), focusing on multi-agent path planning and trajectory prediction.
- Jing Zhang, research assistant at the Institute of Automation, Chinese Academy of Sciences (CASIA
), focusing on computer vision and AI4Science.
- Yaxuan Kang, design researcher, research assistant and interaction designer at the Institute of Automation, Chinese Academy of Sciences (CASIA
), focusing on human-computer interaction.
My homepage visitors have been recorded since February 2024. Thanks for your attention.
© Xuchen Li | Last updated: Jun. 2024