I am a Ph.D. student at AMILAB in Electrical Engineering at POSTECH, advised by Tae-Hyun Oh. I received my bachelor’s degrees in Physics and Electrical Engineering (double major) from Chung-Ang University. My research lies at the intersection of computer vision and machine learning, with a focus on modalities that carry temporal information, such as video and audio.

My research interests center on multi-modal learning, particularly audio-visual understanding and generation, though they are not limited to these areas.

🔥 News

  • 2024.07: One paper is accepted to ECCV 2024.
  • 2024.06: Two papers are accepted to INTERSPEECH 2024.
  • 2024.04: One paper is accepted to RA-L 2024 and will be presented at IROS 2024 (oral presentation).

📝 Pre-prints (*: equal contribution / co-first author)

Preprint

A Benchmark for Audio-Visual Large Language Models

Kim Sung-Bin*, Oh Hyun-Bin*, JungMok Lee, Arda Senocak, Joon Son Chung, Tae-Hyun Oh

Project (to appear)

  • We introduce a comprehensive benchmark specifically designed to evaluate the perception and comprehension capabilities of audio-visual LLMs.
Preprint

Revisiting Learning-based Video Motion Magnification for Real-time Processing

Hyunwoo Ha*, Oh Hyun-Bin*, Kim Jun-Seong, Kwon Byung-Ki, Kim Sung-Bin, Linh-Tam Tran, Ji-Yun Kim, Sung-Ho Bae, Tae-Hyun Oh

Project (to appear)

  • We magnify invisible small motions, but in real time.
  • Short version at CVPRW 2023 ‘Vision-based InduStrial InspectiON’ Workshop.

📝 Publications

ECCV 2024

Learning-based Axial Video Motion Magnification

Kwon Byung-Ki, Oh Hyun-Bin, Kim Jun-Seong, Hyunwoo Ha, Tae-Hyun Oh

Project

  • We magnify invisible small motions, but only along user-specified directions.
INTERSPEECH 2024

Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert

Han EunGi*, Oh Hyun-Bin*, Kim Sung-Bin, Corentin Nivelet Etcheberry, Suekyeong Nam, JangHoon Ju, Tae-Hyun Oh

Project

  • We generate 3D facial animation with speech-synchronized lip movements, guided by an audio-visual lip-reading expert.
INTERSPEECH 2024

MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset

Kim Sung-Bin*, Lee Chae-Yeon*, Gihun Son*, Oh Hyun-Bin, JangHoon Ju, Suekyeong Nam, Tae-Hyun Oh

Project

  • We generate accurate 3D talking heads from multilingual speech.
RA-L 2024

Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation

Kim Ji-Yeon, Oh Hyun-Bin, Kwon Byung-Ki, Dahun Kim, Yongjin Kwon, Tae-Hyun Oh

Project

  • We present Uni-DVPS, a unified model for Depth-aware Video Panoptic Segmentation (DVPS).

🎖 Honors and Awards

  • 2024: Best Paper Award, KRoC
  • 2023: Best Paper Award, IPIU
  • 2021: Summa Cum Laude, Chung-Ang University
  • 2021: Department Honors Scholarship, Chung-Ang University
  • 2018: Science Scholarship, Suwon Municipal Scholarship Foundation
  • 2017: Department Honors Scholarship, Chung-Ang University

📖 Education

  • 2022.03 - Present, Joint M.S. & Ph.D., Pohang University of Science and Technology (POSTECH), South Korea.
  • 2017.03 - 2021.08, B.S. in Physics and Electrical Engineering (double major), Chung-Ang University, South Korea.

Teaching Experience

  • 2023, NAVER Boostcamp AI Tech Computer Vision Track (5th, 6th)
  • 2022, Introduction to Machine Learning (EECE454), POSTECH