Title | Authors | Venues | Years | Misc |
---|---|---|---|---|
Scale down Transformer by Grouping Features for a Lightweight Character-level Language Model | Sungrae Park, Geewook Kim, Junyeop Lee, Junbum Cha, Ji-Hoon Kim, Hwalsuk Lee | COLING 2020 | 2020 | |
Few-shot Font Generation with Localized Style Representations and Factorization | Song Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung Shim | ArXiv | 2020 | Github |
Self-supervised Auxiliary Learning with Meta-paths for Heterogeneous Graphs | Dasol Hwang (Korea Univ), Jinyoung Park (Korean Univ), Sunyoung Kwon, Kyung-Min Kim, Jung-Woo Ha, Hyunwoo J. Kim (Korea Univ) | NeurIPS 2020 | 2020 | |
DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances | Xiaodong Gu, Kang Min Yoo, Jung-Woo Ha | EMNLP 2020 (Findings) | 2020 | |
Large Product Key Memory for Pretrained Language Models | Gyuwan Kim, Tae-Hwan Jung | EMNLP 2020 (Findings) | 2020 | Github |
Variational Hierarchical Dialog Autoencoder for Dialog State Tracking Data Augmentation | Kang Min Yoo, Hanbit Lee (Seoul National University), Franck Dernoncourt (Adobe), Trung Bui (Adobe), Walter Chang (Adobe), Sang-goo Lee (Seoul National University) | EMNLP 2020 | 2020 | |
Context-Aware Answer Extraction in Question Answering | Yeon Seonwoo (KAIST), Ji-Hoon Kim, Jung-Woo Ha, Alice Oh (KAIST) | EMNLP 2020 | 2020 | |
Exploring Lexicon-Free Modeling Units for End-to-End Korean and Korean-English Code-Switching Speech Recognition | Jisung Wang, Jihwan Kim (VUNO), Sangki Kim (VUNO), Yeha Lee (VUNO) | Interspeech 2020 | 2020 | |
End-to-End Task-oriented Dialog System through Template Slot Value Generation | Teakgyu Hong, Oh-Woog Kwon(ETRI) Institute), Young-Kil Kim (ETRI) | Interspeech 2020 | 2020 | |
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation | Won Ik Cho(Seoul National University), Donghyun Kwak, Jiwon Yoon (Seoul National University), Nam Soo Kim (Seoul National University) | Interspeech 2020 | 2020 | |
Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder | Eunwoo Song, Min-Jae Hwang, Ryuichi Yamamoto, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim | Interspeech 2020 | 2020 | |
Now you're speaking my language: Visual language identification | Triantafyllos Afouras (University of Oxford), Joon Son Chung, Andrew Zisserman(University of Oxford) | Interspeech 2020 | 2020 | |
Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision | Soo-Whan Chung, Hong-Goo Kang (Yonsei University) , Joon Son Chung | Interspeech 2020 | 2020 | |
Spot the conversation: speaker diarisation in the wild | Joon Son Chung, Jaesung Huh, Arsha Nagrani (University of Oxford), Triantafyllos Afouras (University of Oxford), Andrew Zisserman(University of Oxford) | Interspeech 2020 | 2020 | |
FaceFilter: Audio-visual speech separation using still images | Soo-Whan Chung, Soyeon Choe, Joon Son Chung, Hong-Goo Kang (Yonsei University) | Interspeech 2020 | 2020 | |
Self-supervised Pre-training with Acoustic Configurations for Replay Spoofing Detection | Hye-jin Shim(University of Seoul), Hee-Soo Heo, Jee-weon Jung(University of Seoul), Ha-Jin Yu(University of Seoul) | Interspeech 2020 | 2020 | |
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers | Jung-Woo Ha, Kihyun Nam, Jingu Kang, Sang-Woo Lee, Sohee Yang, Hyunhoon Jung, Eunmi Kim, Hyeji Kim, Soojin Kim, Hyun Ah Kim, Kyoungtae Doh, Chan Kyu Lee, Nako Sung, Sunghun Kim | Interspeech 2020 | 2020 | Github |
In defence of metric learning for speaker recognition | Joon Son Chung, Jaesung Huh, Seongkyu Mun, Minjae Lee, Hee Soo Heo, Soyeon Choe, Chiheon Ham, Sunghwan Jung, Bong-Jin Lee, Icksang Han | Interspeech 2020 | 2020 | Github |
CareCall: a Call-Based Active Monitoring Dialog Agent for Managing COVID-19 Pandemic | Sang-Woo Lee, Hyunhoon Jung, SukHyun Ko, Sunyoung Kim, Hyewon Kim, Kyoungtae Doh, Hyunjung Park, Joseph Yeo, Sang-Houn Ok, Joonhaeng Lee, Sungsoon Lim, Minyoung Jeong, Seongjae Choi, SeungTae Hwang, Eun-Young Park (Seongnam city), Gwang-Ja Ma (Seongnam city), Seok-Joo Han (Seongnam city), Kwang-Seung Cha (Seongnam city), Nako Sung, Jung-Woo Ha | ArXiv | 2020 | |
Efficient Active Learning for Automatic Speech Recognition via Augmented Consistency Regularization | Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha | ArXiv | 2020 | |
Graphs, Entities, and Step Mixture | Kyuyong Shin, Wonyoung Shin, Jung-Woo Ha, Sunyoung Kwon | GRL+ WS@ICML 2020 | 2020 | |
Understanding Differences between Heavy Users and Light Users in Difficulties with Voice User Interfaces | Hyunhoon Jung, Hyeji Kim, Jung-Woo Ha | CUI 2020 | 2020 | |
Which Strategies Matter for Noisy Label Classification? Insight into Loss and Uncertainty | Wonyoung Shin, Jung-Woo Ha, Shengzhe Li, Yongwoo Cho, Hoyean Song, Sunyoung Kwon | ArXiv | 2020 | |
Rethinking the Truly Unsupervised Image-to-Image Translation | Kyungjune Baek, Yunjey Choi, Youngjung Uh, Jaejun Yoo, Hyunjung Shim | ArXiv | 2020 | Github |
StatAssist & GradBoost: A Study on Optimal INT8 Quantization-aware Training from Scratch | Taehoon Kim, Youngjoon Yoo, Jihoon Yang | ArXiv | 2020 | Github |
Slowing Down the Weight Norm Increase in Momentum-based Optimizers | Byeongho Heo, Sanghyuk Chun, Seong Joon Oh, Dongyoon Han, Sangdoo Yun, Youngjung Uh, Jung-Woo Ha | ArXiv | 2020 | Github Project page |
BSL-1K: Scaling up co-articulated sign recognition using mouthing cues | Samuel Albanie, Gul Varol, Liliane Momeni, Triantafyllos Afouras, Joon Son Chung, Neil Fox, Andrew Zisserman | ECCV 2020 | 2020 | |
Self-supervised learning of audio-visual objects from video | Triantafyllos Afouras, Andrew Owens, Joon Son Chung, Andrew Zisserman | ECCV 2020 | 2020 | |
Few-shot Compositional Font Generation with Dual Memory | Junbum Cha, Sanghyuk Chun, Gayoung Lee, Bado Lee, Seonghyeon Kim, Hwalsuk Lee | ECCV 2020 | 2020 | Github |
Character Region Attention For Text Spotting | Youngmin Baek, Seung Shin, Jeonghun Baek, Sungrae Park, JunyeopLee, Daehyun Nam, Hwalsuk Lee | ECCV 2020 | 2020 | |
ReAD: Reciprocal Attention Discriminator for Image-to-Video Re-Identification | Minho Shim, Hsuan-I Ho, Jinhyung Kim, Dongyoon Wee | ECCV 2020 | 2020 | |
Reliable Fidelity and Diversity Metrics for Generative Models | Muhammad Ferjad Naeem, Seong Joon Oh, Youngjung Uh, Yunjey Choi, Jaejun Yoo | ICML 2020 | 2020 | Github |
Learning De-biased Representations with Biased Representations | Hyojin Bahng, Sanghyuk Chun, Sangdoo Yun, Jaegul Choo (Korea Univ.), Seong Joon Oh | ICML 2020 | 2020 | Github |
Efficient Dialogue State Tracking by Selectively Overwriting Memory | Sungdong Kim, Sohee Yang, Gyuwan Kim, Sang-Woo Lee | ACL 2020 | 2020 | Github |
Contextualized Sparse Representations for Real-Time Open-Domain Question Answering | Jinhyuk Lee, Minjoon Seo, Hanna Hajishirzi, Jaewoo Kang | ACL 2020 | 2020 | |
Embedding Expansion: Augmentation in Embedding Space for Deep Metric Learning | Byungsoo Ko*, Geonmo Gu* | CVPR 2020 | 2020 | Github |
Regularization on Spatio-Temporally Smoothed Feature for Action Recognition | Jinhyung Kim, Dongyoon Wee, Soonmin Bae, Junmo Kim | CVPR 2020 | 2020 | |
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Analysis and a New Strategy | Jaejun Yoo*, Namhyuk Ahn, Kyung-Ah Sohn | CVPR 2020 | 2020 | Github |
Evaluating Weakly Supervised Object Localization Methods Right | Junsuk Choe*, Seong Joon Oh*, Seungho Lee (Yonsei Univ.), Sanghyuk Chun, Zeynep Akata (Univ. of Tubingen), Hyunjung Shim (Yonsei Univ.) | CVPR 2020 | 2020 | Github |
StarGAN v2: Diverse Image Synthesis for Multiple Domains | Yunjey Choi*, Youngjung Uh*, Jaejun Yoo*, Jung-Woo Ha | CVPR 2020 | 2020 | Github |
U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation | Junho Kim, Minjae Kim, Hyeonwoo Kang, Kwang Hee Lee | ICLR 2020 | 2020 | Github |
Data-Driven Harmonic Filters for Audio Representation Learning | Minz Won (Univ. of Pompeu Fabra), Sanghyuk Chun, Oriol Nieto (Pandora), Xavier Serra (Univ. of Pompeu Fabra) | ICASSP 2020 | 2020 | |
The Sound of My Voice: Speaker Representation Loss for Target Voice Separation | Seongkyu Mun, Soyeon Choe, Jaesung Huh, Joon Son Chung | ICASSP 2020 | 2020 | |
Disentangled Speech Embeddings Using Cross-modal Self-supervision | Arsha Nagrani* (Univ. of Oxford), Joon Son Chung*, Samuel Albanie (Univ. of Oxford), Andrew Zisserman (Univ. of Oxford) | ICASSP 2020 | 2020 | |
ASR is All You Need: Cross-modal Distillation for Lip Reading | Triantafyllos Afouras (Univ. of Oxford), Joon Son Chung, Andrew Zisserman (Univ. of Oxford) | ICASSP 2020 | 2020 | |
Learning From Dances : Pose-invariant Re-identification for Multi-person Tracking | Hsuan-I Ho, Minho Shim, Dongyoon Wee | ICASSP 2020 | 2020 | |
Parallel WaveGAN: A Fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram | Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim | ICASSP 2020 | 2020 | |
Improving LPCNet-based text-to-speech with linear prediction-structured mixture density network | Min-Jae Hwang, Eunwoo Song, Ryuichi Yamamoto, Frank Soong (MSRA), Hong-Goo Kang (Yonsei Univ) | ICASSP 2020 | 2020 | |
Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network | Jungkyu Lee, Taeryun Won, Kiho Hong | arXiv | 2020 | Github |
Lightweight 3D Human Pose Estimation Network Training Using Teacher-Student Learning | Dong-Hyun Hwang, Suntae Kim, Nicolas Monet, Hideki Koike, Soonmin Bae | WACV 2020 | 2020 | |
SINet: Extreme Lightweight Portrait Segmentation Networks with Spatial Squeeze Modules and Information Blocking Decoder | Hyojin Park, Lars Lowe Sjösund, YoungJoon Yoo, Nicolas Monet (NAVER LABS Europe), Jihwan Bang, Nojun Kwak (Seoul National Univ.) | WACV 2020 | 2020 | Github |
Symmetrical synthesis for deep metric learning | Geonmo Gu, Byung Soo Ko | AAAI 2020 | 2020 | Github |
Background Suppression Networks for Weakly-supervised Temporal Action Localization | Pilhyeon Lee (Yonsei Univ.), Youngjung Uh, Heyran Byun (Yonsei Univ.) | AAAI 2020 | 2020 | |
An Effective Style Token Weight Control Technique for End-to-End Emotional Speech Synthesis | Ohsung Kwon, Inseon Jang (ETRI), ChungHyun Ahn (ETRI), Hong-Goo Kang (Yonsei Univ.) | IEEE Signal Processing Letters (presented @ ICASSP 2020) | 2019 | |
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing | Seunghyun Park, Seung Shin, Bado Lee, Junyeop Lee, Jaeheung Surh, Minjoon Seo, Hwalsuk Lee | Document Intelligence WS@NeurIPS 2019 | 2019 | Github |
Unpaired Sketch-to-Line Translation via Synthesis of Sketches | Gayoung Lee, Dohyun Kim (NAVER Webtoon), Youngjoon Yoo, Dongyoon Han, Jung-Woo Ha, Jaehyuk Chang (NAVER Webtoon) | SIGGRAPH-Asia | 2019 | |
CodeKernel: A Graph Kernel Based Approach to the Selection of API Usage Examples | Xiaodong Gu, Hongyu Zhang (Univ. of Newcastle), Sunghun Kim | IEEE/ACM ASE 2019 | 2019 | |
Subword Language Model for Query Auto-Completion | Gyuwan Kim | EMNLP-IJCNLP 2019 | 2019 | Blog |
NL2pSQL: Generating Pseudo-SQL Queries from Under-Specified Natural Language Questions | Fuxiang Chen, Seung-won Hwang (Yonsei Univ.), Jaegul Choo (Korea Univ.), Jung-Woo Ha, Sung Kim | EMNLP-IJCNLP 2019 | 2019 | Blog |
Mixture Content Selection for Diverse Sequence Generation | Jaemin Cho, Minjoon Seo, Hannaneh Hajishirzi (Univ. of Washington) | EMNLP-IJCNLP 2019 | 2019 | Blog |
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features Classification Robustness and Uncertainty | Sangdoo Yun, Dongyoon Han, Seong Joon Oh, Sanghyuk Chun, Junsuk Choe, Youngjoon Yoo | ICCV 2019 (Oral) | 2019 | Blog |
What is Wrong with Scene Text Recognition Model Comparisons? Dataset and Model Analysis | Jeonghun Baek, Geewook Kim, Junyeop Lee, Sungrae Park, Dongyoon Han, Sangdoo Yun, Seong Joon Oh, Hwalsuk Lee | ICCV 2019 (Oral) | 2019 | Blog |
Photorealistic Style Transfer via Wavelet Transforms | Jaejun Yoo, Youngjung Uh, Sanghyuk Chun, Byungkyu Kang, Jung-Woo Ha | ICCV 2019 | 2019 | Blog |
A Comprehensive Overhaul of Feature Distillation | Byeongho Heo, Jeesoo Kim, Sangdoo Yun, Hyojin Park, Nojun Kwak, Jin Young Choi | ICCV 2019 | 2019 | Blog |
Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation | Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim | INTERSPEECH 2019 | 2019 | |
Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment | Min-Jae Hwang, Hong-Goo Kang | INTERSPEECH 2019 | 2019 | |
Who Said that: Audio-Visual Speaker Diarisation of Real-World Meetings | Joon Son Chung, Bong-Jin Lee, Icksang Han | INTERSPEECH 2019 | 2019 | |
My Lips are Concealed: Audio-Visual Speech Enhancement through Obstructions | Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman | INTERSPEECH 2019 | 2019 | |
BioBERT: a pre-trained biomedical language representation model for biomedical text mining | Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, Jaewoo Kang | Bioinformatics | 2019 | |
ExcitNet Vocoder: A Neural Excitation Model for Parametric Speech Synthesis Systems | Eunwoo Song, Kyungguen Byun, Hong-Goo Kang | EUSIPCO 2019 | 2019 | |
Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation | Kyung-Min Kim, Donghyun Kwak, Hanock Kwak, Young-Jin Park, Sangkwon Sim, Jae-Han Cho, Minkyu Kim, Jihun Kwon, Nako Sung, Jung-Woo Ha | Recsys 2019 (LBR) | 2019 | |
Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index | Minjoon Seo, Jinhyuk Lee, Tom Kwiatkowski, Ankur P. Parikh, Ali Farhadi, Hannaneh Hajishirzi | ACL 2019 | 2019 | |
TedEval: A Fair Evaluation Metric for Scene Text Detectors | Chae Young Lee, Youngmin Baek, Hwalsuk Lee | Workshop on Industrial Applications of Document Analysis and Recognition 2019 | 2019 | Blog |
Excitation-by-SampleRNN Model for Text-to-Speech | Kyungguen Byun, Eunwoo Song, Jinseob Kim, Jae-Min Kim, Hong-Goo Kang | ITC-CSCC 2019 | 2019 | |
Character Region Awareness for Text Detection | Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, Hwalsuk Lee | CVPR 2019 | 2019 | |
EXTD: Extremely Tiny Face Detector via Iterative Filter Reuse | YoungJoon Yoo, Dongyoon Han, Sangdoo Yun | arXiv | 2019 | |
Visualizing and Understanding Self-Attention Based Music Tagging | Minz Won, Sanghyuk Chun, Xavier Serra | Machine Learning for Music Discovery Workshop (Contributed Talk)@ICML 2019 | 2019 | |
An Empirical Evaluation on Robustness and Uncertainty of Regularization methods Robustness and Uncertainty | Sanghyuk Chun, Seong Joon Oh, Sangdoo Yun, Dongyoon Han, Junsuk Choe, Youngjoon Yoo | Uncertainty & Robustness in Deep Learning Workshop@ICML 2019 | 2019 | |
Curiosity-Bottleneck: Exploration By Distilling Task-Specific Novelty | Youngjin Kim, Wontae Nam, Hyunwoo Kim, Ji-Hoon Kim, Gunhee Kim | ICML 2019 | 2019 | |
Toward Interpretable Music Tagging with Self-Attention | Minz Won, Sanghyuk Chun, Xavier Serra | arXiv | 2019 | |
Effective Parameter Estimation Methods for an ExcitNet Model in Generative Text-to-Speech Systems | Ohsung Kwon, Eunwoo Song, Jae-Min Kim,Hong-Goo Kang | arXiv | 2019 | |
Domain Mismatch Robust Acoustic Scene Classification Using Channel Information Conversion | Sung Kyu Moon, Suwon Shon | ICASSP 2019 | 2019 | |
Perfect Match: Improved Cross-Modal Embeddings for Audio-Visual Synchronisation | Soo-Whan Chung, Joon Son Chung, Hong-Goo Kang | ICASSP 2019 | 2019 | |
DialogWAE: Multimodal Response Generation with Conditional Wasserstein Auto-Encoder | Xiaodong Gu, Kyunghyun Cho, Jung-Woo Ha, Sunghun Kim | ICLR 2019 | 2019 | |
Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation | Sang-Woo Lee, Tong Gao, Sohee Yang, Jaejun Yoo, Jung-Woo Ha | ICLR 2019 | 2019 | |
Modeling Uncertainty with Hedged Instance Embeddings | Seong Joon Oh, Andrew C. Gallagher, Kevin P. Murphy, Florian Schroff, Jiyan Pan, Joseph Roth | ICLR 2019 | 2019 | |
Where To Be Adversarial Perturbations Added? Investigating and Manipulating Pixel Robustness Using Input Gradients | Jisung Hwang, Younghoon Kim, Sanghyuk Chun, Jaejun Yoo, Ji-Hoon Kim, Dongyoon Han | Debugging Machine Learning Models Workshop@ICLR 2019 | 2019 | |
A Comprehensive Exploration on WikiSQL with Table-Aware Word Contextualization | Wonseok Hwang, Jinyeong Yim, Seunghyun Park, Minjoon Seo | arXiv | 2019 | |
Adversarial Dropout for Recurrent Neural Networks | Sungrae Park, Jun-Keon Park, Su-Jin Shin, Il-Chul Moon | AAAI 2019 | 2019 | |
Hierarchical Context Enabled Recurrent Neural Network for Recommendation | Kyungwoo Song, Mingi Ji, Sungrae Park, Il-Chul Moon | AAAI 2019 | 2019 | |
Knowledge Distillation with Adversarial Samples Supporting Decision Boundary | Byeongho Heo, Minsik Lee, Sangdoo Yun, Jin Young Choi | AAAI 2019 | 2019 | |
Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden Neurons | Byeongho Heo, Minsik Lee, Sangdoo Yun, Jin Young Choi | AAAI 2019, Oral | 2019 | |
Paraphrase Diversification Using Counterfactual Debiasing | Sunghyun Park, Seung-Won Hwang, Fuxiang Chen, Jaegul Choo, Jung-Woo Ha, Sunghun Kim | AAAI 2019 | 2019 | |
End-to-End Question Answering Models for Goal-Oriented Dialog Learning | Jamin Shin, Andrea Madotto, Minjoon Seo, Pascale Fung | Workshop on DSTC 2019 (at AAAI) | 2019 | |
Dirichlet Variational Autoencoder | Weonyoung Joo, Wonsung Lee, Sungrae Park, Il-Chul Moon | arXiv | 2019 | |
Multi-Domain Processing via Hybrid Denoising Networks for Speech Enhancement | Jang-Hyun Kim, Jaejun Yoo, Sanghyuk Chun, Adrian Kim, Jung-Woo Ha | arXiv | 2018 | |
Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual Dialog | Sang-Woo Lee, Yu-Jung Heo, Byoung-Tak Zhang | NeurIPS 2018, Spotlight | 2018 | |
Speaker-Adaptive Neural Vocoders for Statistical Parametric Speech Synthesis Systems | Eunwoo Song, Jinseob Kim, Kyungguen Byun, Hong-Goo Kang | arXiv | 2018 | |
Phrase-Indexed Question Answering: A New Challenge towards Scalable Document Comprehension | Minjoon Seo, Tom Kwiatkowski, Ankur P. Parikh, Ali Farhadi, Hannaneh Hajishirzi | EMNLP 2018 | 2018 | |
Interpretable Prediction of Vascular Diseases from Electronic Health Records via Deep Attention Networks | Seunghyun Park, You Jin Kim, Jeong Whun Kim, Jin Joo Park, Borim Ryu, Jung-Woo Ha | IEEE BIBE 2018 | 2018 | |
CHOPT: Automated Hyperparameter Optimization Framework for Cloud-Based Machine Learning Platforms | Jinwoong Kim, Minkyu Kim, Heungseok Park, Ernar Kusdavletov, Adrian Kim, Ji-Hoon Kim, Jung-Woo Ha, Nako Sung | arXiv | 2018 | |
NSML: Meet the MLaaS Platform with a Real-World Case Study | Hanjoo Kim, Minkyu Kim, Dongjoo Seo, Jinwoong Kim, Heungseok Park, Soeun Park, Hyunwoo Jo, KyungHyun Kim, Youngil Yang, Youngkwan Kim, Nako Sung, Jung-Woo Ha | arXiv | 2018 | |
Representation Learning of Music Using Artist Labels | Jiyoung Park, Jongpil Lee, Jangyeon Park, Jung-Woo Ha, Juhan Nam | ISMIR 2018 | 2018 | |
Multimodal Dual Attention Memory for Video Story Question Answering | Kyung-Min Kim, Seong-Ho Choi, Jin-Hwa Kim, Byoung-Tak Zhang | ECCV 2018 | 2018 | |
Unsupervised Holistic Image Generation from Key Local Patches | Donghoon Lee, Sangdoo Yun, Sungjoon Choi, Hwiyeon Yoo, Ming-Hsuan Yang, Songhwai Oh | ECCV 2018 | 2018 | |
A Unified Framework for the Generation of Glottal Signals in Deep Learning-Based Parametric Speech Synthesis Systems | Min-Jae Hwang, Eunwoo Song, Jin-Seob Kim, Hong-Goo Kang | Interspeech 2018 | 2018 | |
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis | Joun Yeop Lee, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim, Eunwoo Song | Interspeech 2018 | 2018 | |
Deep Lip Reading: a Comparison of Models and an Online Application | Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman | Interspeech 2018 | 2018 | |
The Conversation: Deep Audio Visual Speech Enhancement | Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman | Interspeech 2018 | 2018 | |
VoxCeleb2: Deep Speaker Recognition | Joon Son Chung, Arsha Nagrani, Andrew Zisserman | Interspeech 2018 | 2018 | |
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation | Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, Jaegul Choo | CVPR 2018 (Oral) | 2018 | |
Deep Code Search | Xiaodong Gu, Hongyu Zhang, Sunghun Kim | ICSE 2018 | 2018 | |
Neural Speed Reading via Skim-RNN | Minjoon Seo, Sewon Min, Ali Farhadi, Hannaneh Hajishirzi | ICLR 2018 | 2018 | |
Perceptual Quality and Modeling Accuracy of Excitation Parameters in DLSTM-Based Speech Synthesis Systems | Eunwoo Song, Frank K. Soong, Hong-Goo Kang | 2017 IEEE Automatic Speech Recognition and Understanding Workshop | 2017 | |
Automatic Music Highlight Extraction Using Convolutional Recurrent Attention Networks | Jung-Woo Ha, Adrian Kim, Chanju Kim, Jangyeon Park, Sung Kim | arXiv | 2017 | |
NSML: A Machine Learning Platform That Enables You to Focus on Your Models | Nako Sung, Minkyu Kim, Hyunwoo Jo, Youngil Yang, Jinwoong Kim, Leonard Lausen, Youngkwan Kim, Gayoung Lee, Donghyun Kwak, Jung-Woo Ha, Sung Kim | NIPS WS ML Systems 2017 | 2017 | |
Highrisk Prediction from Electronic Medical Records via Deep Attention Networks | You Jin Kim, Yun-Geun Lee, Jeong Whun Kim, Jin Joo Park, Borim Ryu, Jung-Woo Ha | NIPS WS ML4H 2017 | 2017 | |
Overcoming Catastrophic Forgetting by Incremental Moment Matching | Sang-Woo Lee, Jin-Hwa Kim, Jaehyun Jun, Jung-Woo Ha, Byoung-Tak Zhang | NIPS 2017 | 2017 | |
Building a Better Bitext for Structurally Different Languages through Self-Training | Jungyeul Park, Loic Dugast, Jeen-Pyo Hong, Chang-Uk Shin, Jeong-Won Cha | Workshop on Curation and Applications of Parallel and Comparable Corpora in IJCNLP 2017 | 2017 | |
Deep Neural Networks for News Recommendations | Keunchan Park, Jisoo Lee, Jaeho Choi | CIKM 2017 | 2017 |