Research
My current research focuses on Multimodal Large Language Models and Vision-Language Models.

Specifically, I work on improving the general understanding and reasoning capability of multimodal foundation models as well as extending them to specific domains.
Browse →