Spatio-Temporal Audio Language Modeling for Dynamic Sound Sources
Preprint
Ph.D. student at POSTECH
Contact: hyunbinoh-AT-postech-DOT-ac-DOT-kr
I welcome research internships and collaborations; please feel free to reach out.
I am a Ph.D. student at POSTECH, advised by Prof. Tae-Hyun Oh. I am also a research scientist intern at Sony AI.
My research lies at the intersection of computer vision and machine learning, with a focus on multimodal signals that evolve over time, such as video and audio. I am especially interested in audio-visual understanding and generation.
* denotes equal contribution. † denotes co-corresponding authors.
Preprint
TMLR 2026
INTERSPEECH 2026
CVPR 2026 Oral presentation
CVPR 2025 Highlight
ICLR 2025
ECCV 2024
INTERSPEECH 2024
INTERSPEECH 2024
RA-L 2024 IROS Oral
Sony AI
Jul 2025 - Present