I am a researcher at Shanghai AI Laboratory and a member of InternLM (书生·浦语) team. My research currently focuses on Large Language Models, with special interests in efficient post-training and self-alignmnet. And my research interests also include human-centric computer vision. I have been a core member of OpenMMLab since 2021, leading the development of MMPose and MMAction2.

Before joining Shanghai AI Laboratory, I was a research scientist at SenseTime, working on environment perception for autonomous driving.

I received my Ph.D. degree from MMLab, The Chinese University of Hong Kong in 2019, superviesd by Prof. Chen Change Loy and Prof. Xiaoou Tang. Earlier I received my B.S. degree from Tsinghua University in 2014.

Recruiting!
We have opening positions for full-time reasearchers, interns and joint PhD students. Please drop me an email if you are interested.

News

  • [2024.07] InternLM2.5-7B is released, ranking 1st on the Open LLM Leaderboard (under 10B scale). Check out the model zoo.

  • [2024.07] We released RTMW, the newest addition to RTMPose series, which specializes in predicting whole-body 2D and 3D keypoints simultaneously in realtime.

  • [2024.07] Open-Vocabulary SAM is accepted to ECCV 2024.

  • [2024.05] InternLM2 technical report has been released.

  • [2024.05] MMPose has reached a milestone of 5k stars on Github.

  • [2024.02] 3 papers accepted to CVPR 2024 (RTMO, OMG-Seg and [ROVI(#pub-rovi)]).

  • [2023.12] Introduce AgentLego, an open-source library of tool APIs to extend and enhance LLM agents with multi-modal perception capability.

Publications

Technical Report
sym

InternLM2 technical report

Full author list

Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, Fukai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Chao Xu, Ruiliang Xu, Hang Yan, Yirong Yan, Xiaogui Yang, Haochen Ye, Huaiyuan Ying, Jia Yu, Jing Yu, Yuhang Zang, Chuyu Zhang, Li Zhang, Pan Zhang, Peng Zhang, Ruijie Zhang, Shuo Zhang, Songyang Zhang, Wenjian Zhang, Wenwei Zhang, Xingcheng Zhang, Xinyue Zhang, Hui Zhao, Qian Zhao, Xiaomeng Zhao, Fengzhe Zhou, Zaida Zhou, Jingming Zhuo, Yicheng Zou, Xipeng Qiu, Yu Qiao, Dahua Lin

Arxiv, 2024
Project \(|\) HuggingFace \(|\) ArXiv

Technical Report
sym

Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model

Full author list

Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang

Arxiv, 2024
Project \(|\) HuggingFace \(|\) ArXiv

Arxiv
sym

GTA: A Benchmark for General Tool Agents

Jize Wang, Zerun Ma, Yining Li, Songyang Zhang, Cailian Chen, Kai Chen, Xinyi Le

Arxiv, 2024
Project \(|\) Code \(|\) ArXiv \(|\) Dataset

CVPR
sym

RTMW: Real-Time Multi-Person 2D and 3D Whole-body Pose Estimation

Tao Jiang\(^*\), Xinchen Xie\(^*\), Yining Li

CVPR, 2024
Code \(|\) ArXiv

ECCV
sym

Open-vocabulary SAM: Segment and recognize twenty-thousand classes interactively

Haobo Yuan, Xiangtai Li, Chong Zhou, Yining Li, Kai Chen, Chen Change Loy

ECCV, 2024
Project \(|\) Code \(|\) ArXiv \(|\) Demo

Arxiv
sym

RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation

Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang

CVPR, 2024
Code \(|\) ArXiv \(|\) Demo

CVPR
sym

OMG-Seg: Is one model good enough for all segmentation?

Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy

CVPR, 2024
Project \(|\) Code \(|\) ArXiv

CVPR
sym

Towards language-driven video inpainting via multimodal large language models

Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy

CVPR, 2024
Project \(|\) Code \(|\) ArXiv

Arxiv
sym

Rtmpose: Real-time multi-person pose estimation based on mmpose

Tao Jiang, Peng Lu, Li Zhang, Ningsheng Ma, Rui Han, Chengqi Lyu, Yining Li, Kai Chen

ArXiv, 2023
Code \(|\) ArXiv \(|\) Demo

T-PAMI
sym

Deep imbalanced learning for face recognition and attribute prediction

Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang

T-PAMI, 2019
ArXiv

CVPR
sym

Dense intrinsic appearance flow for human pose transfer

Yining Li, Chen Huang, Chen Change Loy

CVPR, 2019
Project \(|\) Code \(|\) ArXiv

ICCV
sym

Learning to disambiguate by asking discriminative questions

Yining Li, Chen Huang, Xiaoou Tang, Chen-Change Loy

ICCV, 2017
Project \(|\) Dataset \(|\) ArXiv

CVPR
sym

Learning Deep Representation for Imbalanced Classification

Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang

CVPR, 2016 (Spotlight)
Project \(|\) Slides \(|\) Code

ECCV
sym

Human attribute recognition by deep hierarchical contexts

Yining Li, Chen Huang, Chen Change Loy, Xiaoou Tang

ECCV, 2016
Project \(|\) Dataset

Projects

sym

InternLM

A series of multi-lingual foundation models and chat models.

sym

MMPose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Provide high-performance human pose estimation models including RTMPose, RTMO and RTMW-2D/3D, and full support for deployment and inference.

sym

AgentLego

An open-source library of tool APIs to extend and enhance large language model (LLM) based agents.