ML Anthology
Authors
Search
About
Seo, Paul Hongsuck
22 publications
ICCV
2025
DialNav: Multi-Turn Dialog Navigation with a Remote Guide
Leekyeung Han
,
Hyunji Min
,
Gyeom Hwangbo
,
Jonghyun Choi
,
Paul Hongsuck Seo
AAAI
2025
Multi-Granularity Video Object Segmentation
Sangbeom Lim
,
Seongchan Kim
,
Seungjun An
,
Seokju Cho
,
Paul Hongsuck Seo
,
Seungryong Kim
CVPR
2025
Random Conditioning for Diffusion Model Compression with Distillation
Dohyun Kim
,
Sehwan Park
,
Geonhee Han
,
Seung Wook Kim
,
Paul Hongsuck Seo
NeurIPS
2025
Seg4Diff: Unveiling Open-Vocabulary Semantic Segmentation in Text-to-Image Diffusion Transformers
Chaehyun Kim
,
Heeseong Shin
,
Eunbeen Hong
,
Heeji Yoon
,
Anurag Arnab
,
Paul Hongsuck Seo
,
Sunghwan Hong
,
Seungryong Kim
CVPR
2024
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Seokju Cho
,
Heeseong Shin
,
Sunghwan Hong
,
Anurag Arnab
,
Paul Hongsuck Seo
,
Seungryong Kim
CVPR
2024
Learning Correlation Structures for Vision Transformers
Manjin Kim
,
Paul Hongsuck Seo
,
Cordelia Schmid
,
Minsu Cho
ECCV
2024
Pseudo-RIS: Distinctive Pseudo-Supervision Generation for Referring Image Segmentation
Seonghoon Yu
,
Paul Hongsuck Seo
,
Jeany Son
NeurIPS
2024
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
Heeseong Shin
,
Chaehyun Kim
,
Sunghwan Hong
,
Seokju Cho
,
Anurag Arnab
,
Paul Hongsuck Seo
,
Seungryong Kim
NeurIPS
2024
TrackIME: Enhanced Video Point Tracking via Instance Motion Estimation
Seong Hyeon Park
,
Huiwon Jang
,
Byungwoo Jeon
,
Sukmin Yun
,
Paul Hongsuck Seo
,
Jinwoo Shin
CVPR
2023
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR
Paul Hongsuck Seo
,
Arsha Nagrani
,
Cordelia Schmid
CVPR
2023
IFSeg: Image-Free Semantic Segmentation via Vision-Language Model
Sukmin Yun
,
Seong Hyeon Park
,
Paul Hongsuck Seo
,
Jinwoo Shin
CVPR
2023
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Antoine Yang
,
Arsha Nagrani
,
Paul Hongsuck Seo
,
Antoine Miech
,
Jordi Pont-Tuset
,
Ivan Laptev
,
Josef Sivic
,
Cordelia Schmid
CVPR
2023
Zero-Shot Referring Image Segmentation with Global-Local Context Features
Seonghoon Yu
,
Paul Hongsuck Seo
,
Jeany Son
CVPR
2022
End-to-End Generative Pretraining for Multimodal Video Captioning
Paul Hongsuck Seo
,
Arsha Nagrani
,
Anurag Arnab
,
Cordelia Schmid
ECCV
2022
Learning Audio-Video Modalities from Image Captions
Arsha Nagrani
,
Paul Hongsuck Seo
,
Bryan Seybold
,
Anja Hauth
,
Santiago Manen
,
Chen Sun
,
Cordelia Schmid
CVPR
2021
Look Before You Speak: Visually Contextualized Utterances
Paul Hongsuck Seo
,
Arsha Nagrani
,
Cordelia Schmid
AAAI
2020
Reinforcing an Image Caption Generator Using Off-Line Human Feedback
Paul Hongsuck Seo
,
Piyush Sharma
,
Tomer Levinboim
,
Bohyung Han
,
Radu Soricut
NeurIPS
2019
Combinatorial Inference Against Label Noise
Paul Hongsuck Seo
,
Geeho Kim
,
Bohyung Han
ACML
2019
Regularizing Neural Networks via Stochastic Branch Layers
Wonpyo Park
,
Paul Hongsuck Seo
,
Bohyung Han
,
Minsu Cho
ICCV
2017
MarioQA: Answering Questions by Watching Gameplay Videos
Jonghwan Mun
,
Paul Hongsuck Seo
,
Ilchae Jung
,
Bohyung Han
NeurIPS
2017
Visual Reference Resolution Using Attention Memory for Visual Dialog
Paul Hongsuck Seo
,
Andreas Lehrmann
,
Bohyung Han
,
Leonid Sigal
CVPR
2016
Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction
Hyeonwoo Noh
,
Paul Hongsuck Seo
,
Bohyung Han