Publication List

TitleAuthorsVenuesYearsMisc
Rethinking Spatial Dimensions of Vision TransformersByeongho Heo, Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Junsuk Choe, Seong Joon OharXiv2021Github
GPT3Mix: Leveraging Large-scale Language Models for Text AugmentationKang Min Yoo, Dongju Park, Jaewook Kang, Sang-Woo Lee, Woomyeong ParkarXiv2021
Consistency Training with Virtual Adversarial Discrete PerturbationJungsoo Park, Gyuwan Kim, Jaewoo Kang (Korea U.)arXiv2021
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with SearchGyuwan Kim, Kyunghyun Cho (NYU)arXiv2021Github
Reward Optimization for Neural Machine Translation with Learned MetricsRaphael Shu, Kang Min Yoo, Jung-Woo HaarXiv2021Github
Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language UnderstandingSeunghyun Seo, Donghyun Kwak, Bowon Lee (Inha Univ.)arXiv2021
Bayesian Perspective on Visual Data Augmentation for Efficient Utilization of Sub-sampled DataJoonhyun Jeong, Sungmin Cha, Youngjoon Yoo, Sangdoo Yun, Jongwon ChoiICLRW 20212021
Rainbow Memory: Continual Learning with a Memory of Diverse SamplesJihwan Bang, Heesu Kim, Youngjoon Yoo, Jung-Woo Ha, Jonghyun Choi (GIST)CVPR 20212021Github
Rethinking Channel Dimensions for Efficient Model DesignDongyoon Han, Sangdoo Yun, Byeongho Heo, YoungJoon YooCVPR 20212021Github
Exploiting Spatial Dimensions of Latent in GAN for Real-time Image EditingHyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo (Seoul National University), Youngjung Uh (Yonsei University)CVPR 20212021
Probabilistic Embeddings for Cross-Modal RetrievalSanghyuk Chun, Seong Joon Oh, Rafael Sampaio de Rezende, Yannis Kalantidis, Diane LarlusCVPR 20212021
Re-labeling ImageNet: from Single to Multi-Labels, from Global to Localized LabelsSangdoo Yun, Seong Joon Oh, Byeongho Heo, Dongyoon Han, Junsuk Choe, Sanghyuk ChunCVPR 20212021Github
Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech SeparationJiyoung Lee (Yonsei), Soo-Whan Chung, Sunok Kim(Yonsei, Korea Aerospace), Hong-Goo Kang(Yonsei), Kwanghoon Sohn(Yonsei)CVPR 20212021
Designing a Minimal Retrieve-and-Read System for Open-Domain Question AnsweringSohee Yang, Minjoon SeoNAACL 20212021
M2FN: Multi-step modality fusion for advertisement image assessmentKyung-Wha Park (SNU), Jung-Woo Ha, JungHoon Lee (Soongsil Univ.), Sunyoung Kwon (Pusan Univ.), Kyung-Min Kim, Byoung-Tak Zhang (SNU)Applied Soft Computing2021
Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language UnderstandingSeongbin Kim*, Gyuwan Kim*, Seongjin Shin, Sangmin Lee (Inha Univ)ICASSP 20212021Arxiv
ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language UnderstandingMinjeong Kim, Gyuwan Kim, Sang-Woo Lee, Jung-Woo HaICASSP 20212021Arxiv
TTS-by-TTS: TTS-driven Data Augmentation for Fast and High-Quality Speech SynthesisMin-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, and Jae-Min KimICASSP 20212021
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminatorsRyuichi Yamamoto, Eunwoo Song , Min-Jae Hwang, Jae-Min KimICASSP 20212021
NN-KOG2P: A Novel Grapheme-Phoneme model for Korean languageKim Hwayeon, Kim Jonghwan, Kim Jae MinICASSP 20212021
The ins and outs of speaker recognition: lessons from VoxSRC 2020Yoohwan Kwon, Hee-Soo Heo, Bong-Jin Lee, Joon Son ChungICASSP 20212021
Playing a Part: Speaker Verification at the MoviesAndrew Brown (U. of Oxford), Jaesung Huh (U. of Oxford), Arsha Nagrani (U. of Oxford), Joon Son Chung, Andrew Zisserman (U. of Oxford)ICASSP 20212021
Graph Attention Networks for Speaker VerificationJee-weon Jung, Hee-Soo Heo, Ha-Jin Yu(UOS), Joon Son ChungICASSP 20212021
Intermediate Loss Regularization for CTC-based Speech RecognitionJaesong Lee, Shinji Watanabe (CMU)ICASSP 20212021
Proxy Synthesis: Learning with Synthetic Classes for Deep Metric LearningGeonmo Gu, Byungsoo Ko, Han-Gyu KimAAAI 20212021
Discriminative Region Suppression for Weakly-Supervised Semantic SegmentationBeomyoung Kim(Naver, KAIST), Sangeun Han (KAIST), Junmo Kim (KAIST)AAAI 20212021
Few-shot Font Generation with Localized Style Representations and FactorizationSong Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung ShimAAAI 20212021Github
Show, Attend and Distill: Knowledge Distillation via Attention-based Feature MatchingMingi Ji, Byeongho Heo, Sungrae ParkAAAI 20212021
DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank UtterancesXiaodong Gu, Kang Min Yoo, Jung-Woo HaAAAI 20212021
AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant WeightsByeongho Heo, Sanghyuk Chun, Seong Joon Oh, Dongyoon Han, Sangdoo Yun, Gyuwan Kim, Youngjung Uh, Jung-Woo HaICLR 20212021Github
Project page
Diagnosing Bias in the Gender Representation of HCI Research Participants: How it Happens and Where We Are?Anna Offenwanger (U of British Columbia), Alan John Milligan (University of British Columbia), Minsuk Chang (Naver AI Lab & KAIST), Julia Bullard (U of British Columbia), Dongwook Yoon (U of British Columbia)CHI 20212021
RubySlippers: Supporting Content-based Voice Navigation for How-to VideosMinsuk Chang (Naver AI Lab & KAIST), Mina Huh (KAIST), Juho Kim (KAIST)CHI 20212021
Personalizing Ambience and Illusionary Presence: How People Use “Study with Me” Videos to Create Effective Studying EnvironmentsYoonjoo Lee (KAIST), John Joon Young Chung(U of Michigan), Jean Y Song (KAIST), Minsuk Chang (Naver AI Lab & KAIST), Juho Kim (KAIST)CHI 20212021
A Simulation Model of Intermittently Controlled Point-and-Click BehaviorSeungwon Do (ETRI), Minsuk Chang (Naver AI Lab & KAIST), Byungjoo Lee (KAIST)CHI 20212021
Scale down Transformer by Grouping Features for a Lightweight Character-level Language ModelSungrae Park, Geewook Kim, Junyeop Lee, Junbum Cha, Ji-Hoon Kim, Hwalsuk LeeCOLING 20202020Github
Self-supervised Auxiliary Learning with Meta-paths for Heterogeneous GraphsDasol Hwang (Korea Univ), Jinyoung Park (Korea Univ), Sunyoung Kwon, Kyung-Min Kim, Jung-Woo Ha, Hyunwoo J. Kim (Korea Univ)NeurIPS 20202020
A Worrying Analysis of Probabilistic Time-series Models for Sales ForecastingSeungjae Jung, Kyung-Min Kim, Hanock Kwak, Young-Jin ParkPMLR (in press), ICBINB@NeurIPS 2020Article
Multi-Manifold Learning for Large-scale Targeted Advertising System







Kyuyong Shin, Young-Jin Park, Kyung-Min Kim, Sunyoung KwonAD@KDD 20202020
Hop Sampling: A Simple Regularized Graph Learning for Non-Stationary EnvironmentsYoung-Jin Park, Kyuyong Shin, Kyung-Min KimMLG@KDD 20202020
Large Product Key Memory for Pretrained Language ModelsGyuwan Kim, Tae-Hwan JungEMNLP 2020 (Findings)2020Github
Variational Hierarchical Dialog Autoencoder for Dialog State Tracking Data AugmentationKang Min Yoo, Hanbit Lee (Seoul National University), Franck Dernoncourt (Adobe), Trung Bui (Adobe), Walter Chang (Adobe), Sang-goo Lee (Seoul National University)EMNLP 20202020
Context-Aware Answer Extraction in Question AnsweringYeon Seonwoo (KAIST), Ji-Hoon Kim, Jung-Woo Ha, Alice Oh (KAIST)EMNLP 20202020
Exploring Lexicon-Free Modeling Units for End-to-End Korean and Korean-English Code-Switching Speech RecognitionJisung Wang, Jihwan Kim (VUNO), Sangki Kim (VUNO), Yeha Lee (VUNO)Interspeech 20202020
End-to-End Task-oriented Dialog System through Template Slot Value GenerationTeakgyu Hong, Oh-Woog Kwon(ETRI) Institute), Young-Kil Kim (ETRI)Interspeech 20202020
Speech to Text Adaptation: Towards an Efficient Cross-Modal DistillationWon Ik Cho(Seoul National University), Donghyun Kwak, Jiwon Yoon (Seoul National University), Nam Soo Kim (Seoul National University)Interspeech 20202020
Neural Text-to-Speech with a Modeling-by-Generation Excitation VocoderEunwoo Song, Min-Jae Hwang, Ryuichi Yamamoto, Jin-Seob Kim, Ohsung Kwon, Jae-Min KimInterspeech 20202020
Now you're speaking my language: Visual language identificationTriantafyllos Afouras (University of Oxford), Joon Son Chung, Andrew Zisserman(University of Oxford)Interspeech 20202020
Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervisionSoo-Whan Chung, Hong-Goo Kang (Yonsei University) , Joon Son ChungInterspeech 20202020
Spot the conversation: speaker diarisation in the wildJoon Son Chung, Jaesung Huh, Arsha Nagrani (University of Oxford), Triantafyllos Afouras (University of Oxford), Andrew Zisserman(University of Oxford)Interspeech 20202020
FaceFilter: Audio-visual speech separation using still imagesSoo-Whan Chung, Soyeon Choe, Joon Son Chung, Hong-Goo Kang (Yonsei University)Interspeech 20202020
Self-supervised Pre-training with Acoustic Configurations for Replay Spoofing DetectionHye-jin Shim(University of Seoul), Hee-Soo Heo, Jee-weon Jung(University of Seoul), Ha-Jin Yu(University of Seoul)Interspeech 20202020
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact CentersJung-Woo Ha, Kihyun Nam, Jingu Kang, Sang-Woo Lee, Sohee Yang, Hyunhoon Jung, Eunmi Kim, Hyeji Kim, Soojin Kim, Hyun Ah Kim, Kyoungtae Doh, Chan Kyu Lee, Nako Sung, Sunghun KimInterspeech 20202020Github
In defence of metric learning for speaker recognitionJoon Son Chung, Jaesung Huh, Seongkyu Mun, Minjae Lee, Hee Soo Heo, Soyeon Choe, Chiheon Ham, Sunghwan Jung, Bong-Jin Lee, Icksang HanInterspeech 20202020Github
VideoMix: Rethinking Data Augmentation for Video ClassificationSangdoo Yun, Seong Joon Oh, Byeongho Heo, Dongyoon Han, Jinhyung KimArXiv2020
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and DatasetsJunsuk Choe, Seong Joon Oh, Sanghyuk Chun, Zeynep Akata, Hyunjung ShimArXiv2020Github
CareCall: a Call-Based Active Monitoring Dialog Agent for Managing COVID-19 PandemicSang-Woo Lee, Hyunhoon Jung, SukHyun Ko, Sunyoung Kim, Hyewon Kim, Kyoungtae Doh, Hyunjung Park, Joseph Yeo, Sang-Houn Ok, Joonhaeng Lee, Sungsoon Lim, Minyoung Jeong, Seongjae Choi, SeungTae Hwang, Eun-Young Park (Seongnam city), Gwang-Ja Ma (Seongnam city), Seok-Joo Han (Seongnam city), Kwang-Seung Cha (Seongnam city), Nako Sung, Jung-Woo HaArXiv2020
Efficient Active Learning for Automatic Speech Recognition via Augmented Consistency RegularizationJihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo HaArXiv2020
Graphs, Entities, and Step MixtureKyuyong Shin, Wonyoung Shin, Jung-Woo Ha, Sunyoung KwonGRL+ WS@ICML 20202020
Understanding Differences between Heavy Users and Light Users in Difficulties with Voice User InterfacesHyunhoon Jung, Hyeji Kim, Jung-Woo HaCUI 20202020
Which Strategies Matter for Noisy Label Classification? Insight into Loss and UncertaintyWonyoung Shin, Jung-Woo Ha, Shengzhe Li, Yongwoo Cho, Hoyean Song, Sunyoung KwonArXiv2020
Rethinking the Truly Unsupervised Image-to-Image TranslationKyungjune Baek, Yunjey Choi, Youngjung Uh, Jaejun Yoo, Hyunjung ShimArXiv2020Github
StatAssist & GradBoost: A Study on Optimal INT8 Quantization-aware Training from ScratchTaehoon Kim, Youngjoon Yoo, Jihoon YangArXiv2020Github
BSL-1K: Scaling up co-articulated sign recognition using mouthing cuesSamuel Albanie, Gul Varol, Liliane Momeni, Triantafyllos Afouras, Joon Son Chung, Neil Fox, Andrew ZissermanECCV 20202020
Self-supervised learning of audio-visual objects from videoTriantafyllos Afouras, Andrew Owens, Joon Son Chung, Andrew ZissermanECCV 20202020
Few-shot Compositional Font Generation with Dual MemoryJunbum Cha, Sanghyuk Chun, Gayoung Lee, Bado Lee, Seonghyeon Kim, Hwalsuk LeeECCV 20202020Github
Character Region Attention For Text SpottingYoungmin Baek, Seung Shin, Jeonghun Baek, Sungrae Park, JunyeopLee, Daehyun Nam, Hwalsuk LeeECCV 20202020
ReAD: Reciprocal Attention Discriminator for Image-to-Video Re-IdentificationMinho Shim, Hsuan-I Ho, Jinhyung Kim, Dongyoon WeeECCV 20202020
Reliable Fidelity and Diversity Metrics for Generative ModelsMuhammad Ferjad Naeem, Seong Joon Oh, Youngjung Uh, Yunjey Choi, Jaejun YooICML 20202020Github
Learning De-biased Representations with Biased RepresentationsHyojin Bahng, Sanghyuk Chun, Sangdoo Yun, Jaegul Choo (Korea Univ.), Seong Joon OhICML 20202020Github
Efficient Dialogue State Tracking by Selectively Overwriting MemorySungdong Kim, Sohee Yang, Gyuwan Kim, Sang-Woo LeeACL 20202020Github
Contextualized Sparse Representations for
Real-Time Open-Domain Question Answering
Jinhyuk Lee, Minjoon Seo, Hanna Hajishirzi, Jaewoo KangACL 20202020
Embedding Expansion: Augmentation in Embedding Space for Deep Metric LearningByungsoo Ko*, Geonmo Gu*CVPR 20202020Github
Regularization on Spatio-Temporally Smoothed Feature for Action RecognitionJinhyung Kim, Dongyoon Wee, Soonmin Bae, Junmo KimCVPR 20202020
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Analysis and a New StrategyJaejun Yoo*, Namhyuk Ahn, Kyung-Ah SohnCVPR 20202020Github
Evaluating Weakly Supervised Object Localization Methods RightJunsuk Choe*, Seong Joon Oh*, Seungho Lee (Yonsei Univ.), Sanghyuk Chun, Zeynep Akata (Univ. of Tubingen), Hyunjung Shim (Yonsei Univ.)CVPR 20202020Github
StarGAN v2: Diverse Image Synthesis for Multiple DomainsYunjey Choi*, Youngjung Uh*, Jaejun Yoo*, Jung-Woo HaCVPR 20202020Github
U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image TranslationJunho Kim, Minjae Kim, Hyeonwoo Kang, Kwang Hee LeeICLR 20202020Github
Data-Driven Harmonic Filters for Audio Representation LearningMinz Won (Univ. of Pompeu Fabra), Sanghyuk Chun, Oriol Nieto (Pandora), Xavier Serra (Univ. of Pompeu Fabra)ICASSP 20202020
The Sound of My Voice: Speaker Representation Loss for Target Voice SeparationSeongkyu Mun, Soyeon Choe, Jaesung Huh, Joon Son ChungICASSP 20202020
Disentangled Speech Embeddings Using Cross-modal Self-supervisionArsha Nagrani* (Univ. of Oxford), Joon Son Chung*, Samuel Albanie (Univ. of Oxford), Andrew Zisserman (Univ. of Oxford)ICASSP 20202020
ASR is All You Need: Cross-modal Distillation for Lip ReadingTriantafyllos Afouras (Univ. of Oxford), Joon Son Chung, Andrew Zisserman (Univ. of Oxford)ICASSP 20202020
Learning From Dances : Pose-invariant Re-identification for Multi-person TrackingHsuan-I Ho, Minho Shim, Dongyoon WeeICASSP 20202020
Parallel WaveGAN: A Fast waveform generation model based on generative adversarial networks with multi-resolution spectrogramRyuichi Yamamoto, Eunwoo Song, Jae-Min KimICASSP 20202020
Improving LPCNet-based text-to-speech with linear prediction-structured mixture density networkMin-Jae Hwang, Eunwoo Song, Ryuichi Yamamoto, Frank Soong (MSRA), Hong-Goo Kang (Yonsei Univ)ICASSP 20202020
Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural NetworkJungkyu Lee, Taeryun Won, Kiho HongarXiv2020Github
Lightweight 3D Human Pose Estimation Network Training Using Teacher-Student LearningDong-Hyun Hwang, Suntae Kim, Nicolas Monet, Hideki Koike, Soonmin BaeWACV 20202020
SINet: Extreme Lightweight Portrait Segmentation Networks with Spatial Squeeze Modules and Information Blocking DecoderHyojin Park, Lars Lowe Sjösund, YoungJoon Yoo, Nicolas Monet (NAVER LABS Europe), Jihwan Bang, Nojun Kwak (Seoul National Univ.)WACV 20202020Github
Symmetrical synthesis for deep metric learningGeonmo Gu, Byung Soo KoAAAI 20202020Github
Background Suppression Networks for Weakly-supervised Temporal Action LocalizationPilhyeon Lee (Yonsei Univ.), Youngjung Uh, Heyran Byun (Yonsei Univ.)AAAI 20202020
div2vec: Diversity-Emphasized Node EmbeddingJisu Jeong, Jeong-Min Yun, Hongi Keam, Young-Jin Park, Zimin Park, Junki ChoWorkshop on the Impact of Recommender Systems, RecSys 20202020
An Effective Style Token Weight Control Technique for End-to-End Emotional Speech SynthesisOhsung Kwon, Inseon Jang (ETRI), ChungHyun Ahn (ETRI), Hong-Goo Kang (Yonsei Univ.)IEEE Signal Processing Letters (presented @ ICASSP 2020)2019
CORD: A Consolidated Receipt Dataset for Post-OCR ParsingSeunghyun Park, Seung Shin, Bado Lee, Junyeop Lee, Jaeheung Surh, Minjoon Seo, Hwalsuk LeeDocument Intelligence WS@NeurIPS 20192019Github
Unpaired Sketch-to-Line Translation via Synthesis of SketchesGayoung Lee, Dohyun Kim (NAVER Webtoon), Youngjoon Yoo, Dongyoon Han, Jung-Woo Ha, Jaehyuk Chang (NAVER Webtoon)SIGGRAPH-Asia2019
CodeKernel: A Graph Kernel Based Approach to the Selection of API Usage ExamplesXiaodong Gu, Hongyu Zhang (Univ. of Newcastle), Sunghun KimIEEE/ACM ASE 20192019
Subword Language Model for Query Auto-CompletionGyuwan KimEMNLP-IJCNLP 20192019Blog
NL2pSQL: Generating Pseudo-SQL Queries from Under-Specified Natural Language QuestionsFuxiang Chen, Seung-won Hwang (Yonsei Univ.), Jaegul Choo (Korea Univ.), Jung-Woo Ha, Sung KimEMNLP-IJCNLP 20192019Blog
Mixture Content Selection for Diverse Sequence GenerationJaemin Cho, Minjoon Seo, Hannaneh Hajishirzi (Univ. of Washington)EMNLP-IJCNLP 20192019
CutMix:
Regularization Strategy to Train Strong Classifiers with Localizable Features
Classification Robustness and Uncertainty
Sangdoo Yun, Dongyoon Han, Seong Joon Oh, Sanghyuk Chun, Junsuk Choe, Youngjoon YooICCV 2019 (Oral)2019Blog
What is
Wrong with Scene Text Recognition Model Comparisons? Dataset and Model
Analysis
Jeonghun Baek, Geewook Kim, Junyeop Lee, Sungrae Park, Dongyoon Han, Sangdoo Yun, Seong Joon Oh, Hwalsuk LeeICCV 2019 (Oral)2019Blog
Photorealistic
Style Transfer via Wavelet Transforms
Jaejun Yoo, Youngjung Uh, Sanghyuk Chun, Byungkyu Kang, Jung-Woo HaICCV 20192019Blog
A Comprehensive Overhaul of Feature DistillationByeongho Heo, Jeesoo Kim, Sangdoo Yun, Hyojin Park, Nojun Kwak, Jin Young ChoiICCV 20192019Blog
Automatic music tagging with Harmonic CNNMinz Won, Sanghyuk Chun, Xavier SerraISMIR 2019 (Late break demo)2019
Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform GenerationRyuichi Yamamoto, Eunwoo Song, Jae-Min KimINTERSPEECH 20192019
Parameter
Enhancement for MELP Speech Codec in Noisy Communication Environment
Min-Jae Hwang, Hong-Goo KangINTERSPEECH 20192019
Who Said that: Audio-Visual Speaker Diarisation of Real-World MeetingsJoon Son Chung, Bong-Jin Lee, Icksang HanINTERSPEECH 20192019
My Lips are Concealed: Audio-Visual Speech Enhancement through ObstructionsTriantafyllos Afouras, Joon Son Chung, Andrew ZissermanINTERSPEECH 20192019
BioBERT: a pre-trained biomedical language representation model for biomedical text miningJinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, Jaewoo KangBioinformatics2019
ExcitNet
Vocoder: A Neural Excitation Model for Parametric Speech Synthesis Systems
Eunwoo Song, Kyungguen Byun, Hong-Goo KangEUSIPCO 20192019
Tripartite Heterogeneous Graph Propagation for Large-scale Social RecommendationKyung-Min Kim, Donghyun Kwak, Hanock Kwak, Young-Jin Park, Sangkwon Sim, Jae-Han Cho, Minkyu Kim, Jihun Kwon, Nako Sung, Jung-Woo HaRecsys 2019 (LBR)2019
Real-Time
Open-Domain Question Answering with Dense-Sparse Phrase Index
Minjoon Seo, Jinhyuk Lee, Tom Kwiatkowski, Ankur P. Parikh, Ali Farhadi, Hannaneh HajishirziACL 20192019
TedEval: A Fair Evaluation Metric for Scene Text DetectorsChae Young Lee, Youngmin Baek, Hwalsuk LeeWorkshop on Industrial Applications of Document Analysis and Recognition 20192019Blog
Excitation-by-SampleRNN
Model for Text-to-Speech
Kyungguen Byun, Eunwoo Song, Jinseob Kim, Jae-Min Kim, Hong-Goo KangITC-CSCC 20192019
Character
Region Awareness for Text Detection
Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, Hwalsuk LeeCVPR 20192019
EXTD:
Extremely Tiny Face Detector via Iterative Filter Reuse
YoungJoon Yoo, Dongyoon Han, Sangdoo YunarXiv2019
Visualizing
and Understanding Self-Attention Based Music Tagging
Minz Won, Sanghyuk Chun, Xavier SerraMachine Learning for Music Discovery Workshop (Contributed Talk)@ICML 20192019
An
Empirical Evaluation on Robustness and Uncertainty of Regularization methods
Robustness and Uncertainty
Sanghyuk Chun, Seong Joon Oh, Sangdoo Yun, Dongyoon Han, Junsuk Choe, Youngjoon YooUncertainty & Robustness in Deep Learning Workshop@ICML 20192019
Curiosity-Bottleneck: Exploration By Distilling Task-Specific NoveltyYoungjin Kim, Wontae Nam, Hyunwoo Kim, Ji-Hoon Kim, Gunhee KimICML 20192019
Toward
Interpretable Music Tagging with Self-Attention
Minz Won, Sanghyuk Chun, Xavier SerraarXiv2019
Effective
Parameter Estimation Methods for an ExcitNet Model in Generative
Text-to-Speech Systems
Ohsung Kwon, Eunwoo Song, Jae-Min Kim,Hong-Goo KangarXiv2019
Domain
Mismatch Robust Acoustic Scene Classification Using Channel Information
Conversion
Sung Kyu Moon, Suwon ShonICASSP 20192019
Perfect
Match: Improved Cross-Modal Embeddings for Audio-Visual Synchronisation
Soo-Whan Chung, Joon Son Chung, Hong-Goo KangICASSP 20192019
DialogWAE:
Multimodal Response Generation with Conditional Wasserstein Auto-Encoder
Xiaodong Gu, Kyunghyun Cho, Jung-Woo Ha, Sunghun KimICLR 20192019
Large-Scale
Answerer in Questioner's Mind for Visual Dialog Question Generation
Sang-Woo Lee, Tong Gao, Sohee Yang, Jaejun Yoo, Jung-Woo HaICLR 20192019
Modeling
Uncertainty with Hedged Instance Embeddings
Seong Joon Oh, Andrew C. Gallagher, Kevin P. Murphy, Florian Schroff, Jiyan Pan, Joseph RothICLR 20192019
Where To
Be Adversarial Perturbations Added? Investigating and Manipulating Pixel
Robustness Using Input Gradients
Jisung Hwang, Younghoon Kim, Sanghyuk Chun, Jaejun Yoo, Ji-Hoon Kim, Dongyoon HanDebugging Machine Learning Models Workshop@ICLR 20192019
A
Comprehensive Exploration on WikiSQL with Table-Aware Word Contextualization
Wonseok Hwang, Jinyeong Yim, Seunghyun Park, Minjoon SeoarXiv2019
Adversarial
Dropout for Recurrent Neural Networks
Sungrae Park, Jun-Keon Park, Su-Jin Shin, Il-Chul MoonAAAI 20192019
Hierarchical
Context Enabled Recurrent Neural Network for Recommendation
Kyungwoo Song, Mingi Ji, Sungrae Park, Il-Chul MoonAAAI 20192019
Knowledge Distillation with Adversarial Samples Supporting Decision BoundaryByeongho Heo, Minsik Lee, Sangdoo Yun, Jin Young ChoiAAAI 20192019
Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden NeuronsByeongho Heo, Minsik Lee, Sangdoo Yun, Jin Young ChoiAAAI 2019, Oral2019
Paraphrase
Diversification Using Counterfactual Debiasing
Sunghyun Park, Seung-Won Hwang, Fuxiang Chen, Jaegul Choo, Jung-Woo Ha, Sunghun KimAAAI 20192019
End-to-End
Question Answering Models for Goal-Oriented Dialog Learning
Jamin Shin, Andrea Madotto, Minjoon Seo, Pascale FungWorkshop on DSTC 2019 (at AAAI)2019
Dirichlet
Variational Autoencoder
Weonyoung Joo, Wonsung Lee, Sungrae Park, Il-Chul MoonarXiv2019
Multi-Domain
Processing via Hybrid Denoising Networks for Speech Enhancement
Jang-Hyun Kim, Jaejun Yoo, Sanghyuk Chun, Adrian Kim, Jung-Woo HaarXiv2018
Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual DialogSang-Woo Lee, Yu-Jung Heo, Byoung-Tak ZhangNeurIPS 2018, Spotlight2018
Speaker-Adaptive
Neural Vocoders for Statistical Parametric Speech Synthesis Systems
Eunwoo Song, Jinseob Kim, Kyungguen Byun, Hong-Goo KangarXiv2018
Phrase-Indexed
Question Answering: A New Challenge towards Scalable Document Comprehension
Minjoon Seo, Tom Kwiatkowski, Ankur P. Parikh, Ali Farhadi, Hannaneh HajishirziEMNLP 20182018
Interpretable
Prediction of Vascular Diseases from Electronic Health Records via Deep Attention Networks
Seunghyun Park, You Jin Kim, Jeong Whun Kim, Jin Joo Park, Borim Ryu, Jung-Woo HaIEEE BIBE 20182018
CHOPT: Automated Hyperparameter Optimization Framework for Cloud-Based Machine Learning PlatformsJinwoong Kim, Minkyu Kim, Heungseok Park, Ernar Kusdavletov, Adrian Kim, Ji-Hoon Kim, Jung-Woo Ha, Nako SungarXiv2018
NSML: Meet the MLaaS Platform with a Real-World Case StudyHanjoo Kim, Minkyu Kim, Dongjoo Seo, Jinwoong Kim, Heungseok Park, Soeun Park, Hyunwoo Jo, KyungHyun Kim, Youngil Yang, Youngkwan Kim, Nako Sung, Jung-Woo HaarXiv2018
Representation
Learning of Music Using Artist Labels
Jiyoung Park, Jongpil Lee, Jangyeon Park, Jung-Woo Ha, Juhan NamISMIR 20182018
Multimodal Dual Attention Memory for Video Story Question AnsweringKyung-Min Kim, Seong-Ho Choi, Jin-Hwa Kim, Byoung-Tak ZhangECCV 20182018
Unsupervised Holistic Image Generation from Key Local PatchesDonghoon Lee, Sangdoo Yun, Sungjoon Choi, Hwiyeon Yoo, Ming-Hsuan Yang, Songhwai OhECCV 20182018
A Unified Framework for the Generation of Glottal Signals in Deep Learning-Based Parametric Speech Synthesis SystemsMin-Jae Hwang, Eunwoo Song, Jin-Seob Kim, Hong-Goo KangInterspeech 20182018
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for
Speech Synthesis
Joun Yeop Lee, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim, Eunwoo SongInterspeech 20182018
Deep Lip Reading: a Comparison of Models and an Online ApplicationTriantafyllos Afouras, Joon Son Chung, Andrew ZissermanInterspeech 20182018
The Conversation: Deep Audio Visual Speech EnhancementTriantafyllos Afouras, Joon Son Chung, Andrew ZissermanInterspeech 20182018
VoxCeleb2: Deep Speaker RecognitionJoon Son Chung, Arsha Nagrani, Andrew ZissermanInterspeech 20182018
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image TranslationYunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, Jaegul ChooCVPR 2018 (Oral)2018
Deep Code SearchXiaodong Gu, Hongyu Zhang, Sunghun KimICSE 20182018
Neural Speed Reading via Skim-RNNMinjoon Seo, Sewon Min, Ali Farhadi, Hannaneh HajishirziICLR 20182018
Perceptual Quality and Modeling Accuracy of Excitation Parameters in DLSTM-Based Speech Synthesis SystemsEunwoo Song, Frank K. Soong, Hong-Goo KangASRU 20172017
Automatic Music Highlight Extraction Using Convolutional Recurrent Attention NetworksJung-Woo Ha, Adrian Kim, Chanju Kim, Jangyeon Park, Sung KimarXiv2017
NSML: A Machine Learning Platform That Enables You to Focus on Your ModelsNako Sung, Minkyu Kim, Hyunwoo Jo, Youngil Yang, Jinwoong Kim, Leonard Lausen, Youngkwan Kim, Gayoung Lee, Donghyun Kwak, Jung-Woo Ha, Sung KimNIPS WS ML Systems 20172017
Highrisk Prediction from Electronic Medical Records via Deep Attention NetworksYou Jin Kim, Yun-Geun Lee, Jeong Whun Kim, Jin Joo Park, Borim Ryu, Jung-Woo HaNIPS WS ML4H 20172017
Overcoming Catastrophic Forgetting by Incremental Moment MatchingSang-Woo Lee, Jin-Hwa Kim, Jaehyun Jun, Jung-Woo Ha, Byoung-Tak ZhangNIPS 20172017
Building a Better Bitext for Structurally Different Languages through Self-TrainingJungyeul Park, Loic Dugast, Jeen-Pyo Hong, Chang-Uk Shin, Jeong-Won ChaWorkshop on Curation and Applications of
Parallel and Comparable Corpora in IJCNLP 2017
2017
Deep Neural Networks for News RecommendationsKeunchan Park, Jisoo Lee, Jaeho ChoiCIKM 20172017
Dual attention networks for multimodal reasoning and matchingHyeonseob Nam, Jung-Woo Ha, Jeonghee KimCVPR 2017 (Spot)2017
Hadamard product for low-rank bilinear poolingJin-Hwa Kim, Kyoung-Woon On, Jeonghee Kim, Jung-Woo Ha, Byoung-Tak ZhangICLR 20172017