
Xin Lai*, Junyi Li*, Wei Li, Tao Liu, Tianjian Li, Hengshuang Zhao# (* equal contribution, # corresponding author)
Under review. 2025
A full training recipe to reproduce OpenAI o3-style thinking-with-images capability.
Xin Lai*, Junyi Li*, Wei Li, Tao Liu, Tianjian Li, Hengshuang Zhao# (* equal contribution, # corresponding author)
Under review. 2025
A full training recipe to reproduce OpenAI o3-style thinking-with-images capability.

Senqiao Yang*, Junyi Li*, Xin Lai*, Bei Yu, Hengshuang Zhao, Jiaya Jia (* equal contribution)
Advances in Neural Information Processing Systems (NeurIPS) 2025 Poster
A new paradigm of efficient VLMs with token compression.
Senqiao Yang*, Junyi Li*, Xin Lai*, Bei Yu, Hengshuang Zhao, Jiaya Jia (* equal contribution)
Advances in Neural Information Processing Systems (NeurIPS) 2025 Poster
A new paradigm of efficient VLMs with token compression.

Junfeng Wu*, Dongliang Luo*, Weizhi Zhao, Zhihao Xie, Yuanhao Wang, Junyi Li, Xudong Xie, Yuliang Liu, Xiang Bai# (* equal contribution, # corresponding author)
Under review. 2025
A simple Benchmark for evaluating your visual tokenizer.
Junfeng Wu*, Dongliang Luo*, Weizhi Zhao, Zhihao Xie, Yuanhao Wang, Junyi Li, Xudong Xie, Yuliang Liu, Xiang Bai# (* equal contribution, # corresponding author)
Under review. 2025
A simple Benchmark for evaluating your visual tokenizer.

Junyi Li*, Junfeng Wu*, Weizhi Zhao, Song Bai, Xiang Bai# (* equal contribution, # corresponding author)
Eurepean Conference on Computer Vision (ECCV) 2024 Poster
The first part-level foundation model for locating and identifying both objects and parts in images through a hierarchical framework.
Junyi Li*, Junfeng Wu*, Weizhi Zhao, Song Bai, Xiang Bai# (* equal contribution, # corresponding author)
Eurepean Conference on Computer Vision (ECCV) 2024 Poster
The first part-level foundation model for locating and identifying both objects and parts in images through a hierarchical framework.