Le Thien Phuc Nguyen
Affiliated with University of Wisconsin - Madison
Undergraduate Researcher

Hi, my name is Le Thien Phuc Nguyen.

I am from Vietnam, and I am a senior undergraduate student at the University of Wisconsin-Madison, advised by Professor Yong Jae Lee. I am fortunate to work with my mentor, Zhuoran Yu, a PhD student in Professor Lee's lab.

My research focuses on multimodal models, with particular emphasis on video, audio, image, and large language models (LLMs).


Education
  • University of Wisconsin - Madison
    B.S. in Computer Science, Data Science, Math, and Statistics
    Sep. 2022 - present
Honors & Awards
  • Gold medal in the ICPC North Central North America (NCNA)
    2023
  • Silver medal in the ICPC North Central North America (NCNA)
    2022
  • Third prize in the Vietnam National Olympiad in Informatics
    2022
  • Second prize in the ICPC Vietnam National Round
    2021
  • Second prize in the Vietnam National University Olympiad in Informatics
    2021
News
2025
My paper LASER has been accepted to WACV 2026!
Sep 05
2024
I became a mentee of Zhuoran Yu, a PhD student in Professor Lee's lab
Sep 05
I got accepted into Professor Yong Jae Lee's lab
Jun 01
2022
I have just landed in the United States to start my educational journey at the University of Wisconsin - Madison
Aug 19
Selected Publications
Describe Anything Model for Visual Question Answering on Text-rich Images

Yen-Linh Vu, Dinh-Thang Duong, Truong-Binh Duong, Anh-Khoi Nguyen, Thanh-Huy Nguyen, Le Thien Phuc Nguyen, Jianhua Xing, Xingjian Li, Tianyang Wang, Ulas Bagci, Min Xu

VisionDocs Workshop @ the International Conference on Computer Vision (ICCV) 2025

UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios

Le Thien Phuc Nguyen*, Zhuoran Yu*, Khoa Quang Nhat Cao, Yuwei Guo, Tu Ho Manh Pham, Tuan Tai Nguyen, Toan Ngo Duc Vo, Lucas Poon, Soochahn Lee, Yong Jae Lee (* equal contribution)

arXiv 2025

LASER: Lip Landmark Assisted Speaker Detection for Robustness

Le Thien Phuc Nguyen*, Zhuoran Yu*, Yong Jae Lee (* equal contribution)

The IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026
