I am a Ph.D. student in Computer Science at Yonsei University, advised by Prof. Seong Jae Hwang at MICV.
My research interests revolve around deriving impactful insights from Transformers to deliver tangible improvements across multiple applications.
Specifically, I focus on Vision-Language Models (VLMs) and Large Vision-Language Models (LVLMs). For example, I’ve conducted research on enhancing semantic alignment and interpretability in LVLMs.
Additionally, I’m deeply interested in exploring and advancing Transformer-based generative methods in computer vision, particularly Diffusion Transformers, to further improve their effectiveness and versatility!