Learning to localize sound source in visual scenes A Senocak, TH Oh, J Kim, MH Yang, IS Kweon Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 353 | 2018 |
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications A Senocak, TH Oh, J Kim, MH Yang, IS Kweon IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (5), 1605-1619, 2021 | 52 | 2021 |
Part-based Player Identification using Deep Convolutional Representation and Multi-scale Pooling A Senocak, TH Oh, J Kim, IS Kweon Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 49 | 2018 |
Learning Sound Localization Better From Semantically Similar Samples A Senocak*, H Ryu*, J Kim*, IS Kweon ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2022 | 29 | 2022 |
Less can be more: Sound source localization with a classification model A Senocak*, H Ryu*, J Kim*, IS Kweon Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022 | 25 | 2022 |
Sound to visual scene generation by audio-to-visual latent alignment K Sung-Bin, A Senocak, H Ha, A Owens, TH Oh Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 22 | 2023 |
Sound source localization is all about cross-modal alignment A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 11 | 2023 |
MarginNCE: Robust Sound Localization with a Negative Margin S Park*, A Senocak*, JS Chung ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023 | 9 | 2023 |
Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding A Senocak*, J Kim*, TH Oh, D Li, IS Kweon Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 8* | 2023 |
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples H Ryu*, A Senocak*, IS Kweon, JS Chung ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023 | 5 | 2023 |
Can CLIP Help Sound Source Localization? S Park, A Senocak, JS Chung Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 3 | 2024 |
FlexiAST: Flexibility is What AST Needs J Feng, MH Erol, JS Chung, A Senocak Interspeech, 2023 | 3 | 2023 |
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning MH Erol, A Senocak, J Feng, JS Chung arXiv preprint arXiv:2406.03344, 2024 | 1 | 2024 |
From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers J Feng, MH Erol, JS Chung, A Senocak ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions J Feng, MH Erol, JS Chung, A Senocak arXiv preprint arXiv:2407.08691, 2024 | | 2024 |
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning M Hamza Erol, A Senocak, J Feng, J Son Chung arXiv e-prints, arXiv: 2406.03344, 2024 | | 2024 |
Speech Guided Masked Image Modeling for Visually Grounded Speech J Woo, H Ryu, A Senocak, JS Chung ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Learning audio-visual relationships and correspondences in the visual scenes A Senocak 한국과학기술원, 2022 | | 2022 |
Nearly-Unsupervised Localization of Sound Sources in Videos A Senocak, TH Oh, J Kim, MH Yang, IS Kweon MMTC Communications–Review, 1-15, 2021 | | 2021 |
Deformable parts model based player identification using deep convolutional representations A Senocak, S Arda 한국과학기술원, 2015 | | 2015 |