👨🏻‍💻 About

I am Xuchen Li (李旭宸), an incoming Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (CASIA), supervised by Prof. Kaiqi Huang (IAPR Fellow). I am also a member of the Visual Intelligence Interest Group (VIIG).

Before that, I received my B.E. degree in Computer Science and Technology from the School of Computer Science (SCS) at Beijing University of Posts and Telecommunications (BUPT) in Jun. 2024.

I am grateful to work with Dr. Shiyu Hu, who has had a significant impact on me. I am also grateful to be growing up and studying with my twin brother Xuzhao Li, which is a truly unique and special experience for me.

My research focuses on Visual Language Tracking, Multi-modal Learning and Large Language Models. If you are interested in my work or would like to collaborate, please feel free to contact me.

🔥 News

  • 2024.06: 📝 One paper has been accepted by the 7th Chinese Conference on Pattern Recognition and Computer Vision (PRCV, CCF-C Conference).
  • 2024.06: 👨‍🎓 Obtained my B.E. degree from Beijing University of Posts and Telecommunications (BUPT). I will always remember the wonderful four years I spent here. Thanks to all!
  • 2024.05: 🏆 Obtained Beijing Outstanding Graduates (北京市优秀毕业生) (Top 5%; only 38 students at SCS, BUPT received this honor)!
  • 2024.05: 📣 Presented our work at the 14th Vision and Learning Seminar (VALSE); see our poster for more information!
  • 2024.04: 📝 One paper has been accepted as an Oral Presentation and received the Best Paper Honorable Mention Award at the 3rd CVPR Workshop on Vision Datasets Understanding (CVPRW, Workshop in CCF-A Conference)!
  • 2023.12: 🏆 Obtained the College Scholarship of University of Chinese Academy of Sciences (中国科学院大学大学生奖学金) (only 17 students at CASIA won this scholarship)!
  • 2023.12: 🏆 Obtained the China National Scholarship (国家奖学金) with a rank of 1/455 (0.22%) (Top 1%, the highest honor for undergraduates in China)!
  • 2023.11: 🏆 Obtained Beijing Merit Student (北京市三好学生) (Top 1%; only 36 students at BUPT received this honor)!
  • 2023.09: 📝 One paper has been accepted by the 37th Conference on Neural Information Processing Systems (NeurIPS, CCF-A Conference, Poster)!
  • 2022.12: 🏆 Obtained the Huawei AI Education Base Scholarship (华为智能基座奖学金) (only 20 students at BUPT won this scholarship)!
  • 2022.12: 🏆 Obtained the China National Scholarship (国家奖学金) with a rank of 2/430 (0.47%) (Top 1%, the highest honor for undergraduates in China)!

📖 Education

2024.09 - 2029.06 (Expected), Ph.D. student
Pattern Recognition and Intelligent System
Institute of Automation, Chinese Academy of Sciences, Beijing

2020.09 - 2024.06, B.E. degree
Computer Science and Technology, Ranking 1/449 (0.22%)
School of Computer Science
Beijing University of Posts and Telecommunications, Beijing

💻 Experiences

📝 Publications

✅ Acceptance

CVPRW 2024

DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM

Xuchen Li, Xiaokun Feng, Shiyu Hu, Meiqi Wu, Dailing Zhang, Jing Zhang, Kaiqi Huang

CVPRW 2024 (Workshop in CCF-A Conference, Oral, Best Paper Honorable Mention Award): the 3rd CVPR Workshop on Vision Datasets Understanding
[Paper] [PDF] [Code] [Website] [Award] [Poster] [Slides] [BibTeX]

NeurIPS 2023

A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and Causal Relationship

Shiyu Hu, Dailing Zhang, Meiqi Wu, Xiaokun Feng, Xuchen Li, Xin Zhao, Kaiqi Huang

NeurIPS 2023 (CCF-A Conference, Poster): the 37th Conference on Neural Information Processing Systems
[Paper] [PDF] [Code] [Website] [Poster] [Slides] [BibTeX]

PRCV 2024

VS-LLM: Visual-Semantic Depression Assessment based on LLM for Drawing Projection Test

Meiqi Wu, Yaxuan Kang, Xuchen Li, Shiyu Hu, Xiaotang Chen, Yunfeng Kang, Weiqiang Wang, Kaiqi Huang

PRCV 2024 (CCF-C Conference): the 7th Chinese Conference on Pattern Recognition and Computer Vision

☑️ Ongoing

CCF-A

DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM

Xuchen Li, Shiyu Hu, Xiaokun Feng, Dailing Zhang, Meiqi Wu, Jing Zhang, Kaiqi Huang

Submitted to a CCF-A conference, Under Review

CCF-A

MemVLT: Visual-Language Tracking with Adaptive Memory-based Prompts

Xiaokun Feng, Xuchen Li, Shiyu Hu, Dailing Zhang, Meiqi Wu, Jing Zhang, Xiaotang Chen, Kaiqi Huang

Submitted to a CCF-A conference, Under Review

CCF-A

Unconstrained Multimodal Air-Writing Benchmark: Writing by Moving Your Fingers in 3D

Meiqi Wu, Xuchen Li, Shiyu Hu, Yuanqiang Cai, Kaiqi Huang, Weiqiang Wang

Submitted to a CCF-A conference, Under Review

CCF-A

Beyond Accuracy: Tracking more like Human through Visual Search

Dailing Zhang, Shiyu Hu, Xiaokun Feng, Xuchen Li, Meiqi Wu, Jing Zhang, Kaiqi Huang

Submitted to a CCF-A conference, Under Review

🎖 Honors

  • Best Paper Honorable Mention Award (最佳论文荣誉提名奖), at the 3rd CVPR Workshop on Vision Datasets Understanding, 2024
  • China National Scholarship (国家奖学金), My Rank: 1/455 (0.22%), Top 1%, at BUPT, by Ministry of Education of China, 2023
  • China National Scholarship (国家奖学金), My Rank: 2/430 (0.47%), Top 1%, at BUPT, by Ministry of Education of China, 2022
  • Huawei AI Education Base Scholarship (华为智能基座奖学金), at BUPT, by Ministry of Education of China and Huawei AI Education Base Joint Working Group, 2022
  • Beijing Merit Student (北京市三好学生), Top 1%, at BUPT, by Beijing Municipal Education Commission, 2023
  • Beijing Outstanding Graduates (北京市优秀毕业生), Top 5%, at BUPT, by Beijing Municipal Education Commission, 2024
  • College Scholarship of University of Chinese Academy of Sciences (中国科学院大学大学生奖学金), at CASIA, by University of Chinese Academy of Sciences, 2023

🌟 Projects

GOT-10k Platform

GOT-10k: A Large High-diversity Benchmark and Evaluation Platform for Single Object Tracking

  • Visual Object Tracking / Evaluation Technology / Large High-diversity Benchmark
  • As of June 2024, the platform has received 3.66M+ page views, 7.2k+ downloads, and 20.5k+ trackers from 160+ countries and regions worldwide.
  • GOT-10k is the supporting platform for research accepted by IEEE TPAMI 2021.

VideoCube / MGIT Platform

VideoCube / MGIT: A Large-scale Multi-dimensional Multi-modal Global Instance Tracking Intelligent Evaluation Platform

  • Visual Object Tracking / Visual Language Tracking / Long Video Understanding and Reasoning / Intelligent Evaluation Technology
  • As of June 2024, the platform has received 389k+ page views, 1.1k+ downloads, and 410+ trackers from 130+ countries and regions worldwide.
  • VideoCube / MGIT is the supporting platform for research accepted by IEEE TPAMI 2023 and NeurIPS 2023.

SOTVerse Platform

SOTVerse: A User-defined Single Object Tracking Task Space

  • Visual Object Tracking / Dynamic Open Environment Construction / Visual Evaluation Technique
  • As of June 2024, the platform has received 120k+ page views from 100+ countries and regions worldwide.
  • SOTVerse is the supporting platform for research accepted by IJCV 2023.

🤝 Collaborators

My homepage visitors have been recorded since February 2024. Thanks for your attention.


© Xuchen Li | Last updated: Jun. 2024