I am a postdoc at University of California, Berkeley working with Marti A. Hearst, and a part of Berkeley AI Research (BAIR). I obtained my Ph.D. in the Language Technologies Institute of the School of Computer Science at Carnegie Mellon University, under Eduard Hovy. I interned at Facebook AI, Allen Institute for AI (AI2), and Microsoft Research. My Ph.D. study has been supported by Allen Institute for AI (AI2) Fellowship, CMU Presidential Fellowship, and ILJU Graduate Fellowship. In the middle of the study, I completed my alternative military service in South Korea at Naver Labs and KAIST Institute. Before joining CMU, I obtained my BS and MS in Computer Science Engineering at KAIST, Korea.
I am on academic job market this year.
[ CV | Research Statement | Teaching Statement | Diversity Statement ]
Augmenting Scientific Papers with Just-in-Time, Position-Sensitive Definitions of Terms and Symbols
Andrew Head, Kyle Lo, Dongyeop Kang, Raymond Fok, Sam Skjonsberg, Daniel S. Weld, Marti A. Hearst
CHI 2021 [arxiv | code | video | project page | bib]
GenAug: Data Augmentation for Finetuning Text Generators
Steven Y. Feng*, Varun Gangal*, Dongyeop Kang, Teruko Mitamura, Eduard Hovy (*equal contribution)
EMNLP 2020, Deep Learning Inside Out (DeeLIO) Workshop [pdf | arxiv | code | bib]
Document-Level Definition Detection in Scholarly Documents: Existing Models, Error Analyses, and Future Directions
Dongyeop Kang, Andrew Head, Risham Sidhu, Kyle Lo, Daniel Weld and Marti A. Hearst
EMNLP 2020, Workshop on Scholarly Document Processing (SDP) [pdf | arxiv | code | bib]
Plan ahead: Self-Supervised Text Planning for Paragraph Completion Task
Dongyeop Kang and Eduard Hovy
EMNLP 2020 [pdf | arxiv | bib]
INSPIRED: Toward Sociable Recommendation Dialog Systems
Shirley Anugrah Hayati, Dongyeop Kang, Qingxiaoyang Zhu, Weiyan Shi and Zhou Yu
EMNLP 2020 [pdf | arxiv | code | bib]
Linguistically Informed Language Generation: A Multifaceted Approach
Dongyeop Kang
Committee: Eduard Hovy, Jeffrey Bigham, Alan W Black, Jason Weston, Dan Jurafsky
Ph.D thesis [pdf | bib]
Posterior Calibrated Training on Sentence Classification Tasks
Taehee Jung, Dongyeop Kang, Hua Cheng, Lucas Mentch and Thomas Schaaf
ACL 2020 [pdf | arxiv | code | bib]
xSLUE: A Benchmark and Analysis Platform for Cross-Style Language Understanding and Evaluation
Dongyeop Kang and Eduard Hovy
[arxiv | data+leaderboard | code | bib]
Earlier Isn't Always Better: Sub-aspect Analysis on Corpus and System Biases in Summarization
Dongyeop Kang*, Taehee Jung*, Lucas Mentch and Eduard Hovy (*equal contribution)
EMNLP 2019
[pdf | arxiv | data+leaderboard | code
| bib]
Recommendation as a Communication Game: Self-Supervised Bot-Play for Goal-oriented Dialogue
Dongyeop Kang, Anusha Balakrishnan, Pararth Shah, Paul Crook, Y-Lan Boureau and Jason Weston
EMNLP 2019
[pdf | arxiv | data | bib]
(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Annotated Stylistic Language Dataset with Multiple Personas
Dongyeop Kang, Varun Gangal and Eduard Hovy
EMNLP 2019
[pdf | arxiv | data+code | slides | bib]
Linguistic Versus Latent Relations for Modeling Coherent Flow in Paragraphs
Dongyeop Kang, Hiroaki Hayashi, Alan W Black, and Eduard Hovy
EMNLP 2019
[pdf | arxiv | code | bib]
Bridging Knowledge Gaps in Neural Entailment via Symbolic Models
Dongyeop Kang, Tushar Khot, Ashish Sabharwal and Peter Clark
EMNLP 2018
[pdf | arxiv | code | bib]
AdvEntuRe: Adversarial Training for Textual Entailment with Knowledge-Guided Examples
Dongyeop Kang, Tushar Khot, Ashish Sabharwal and Eduard Hovy
ACL 2018
[pdf | arxiv | code | bib]
A Dataset of Peer Reviews (PeerReaD): Collection, Insights and NLP Applications
Dongyeop Kang, Waleed Ammar, Bhavana Dalvi, Madeleine van Zuylen, Sebastian Kohlmeier, Eduard Hovy, Roy Schwartz
NAACL 2018
[pdf | arxiv | data+code | bib]
Actionable email intent modeling with reparametrized RNN
Chu-Cheng Lin, Dongyeop Kang, Michael Gamon, Madian Khabsa, Ahmed Hassan Awadallah, Patrick Pantel
AAAI 2018
[pdf | arxiv | bib]
Detecting and Explaining Causes From Text For a Time Series Event
Dongyeop Kang, Varun Gangal, Ang Lu, Zheng Chen, Eduard Hovy
EMNLP 2017
[pdf | arxiv | dataset | bib]
News2Images: Automatically Summarizing News Articles into Image-Based Contents via Deep Learning
Jung-Woo Ha, Dongyeop Kang, Hyuna Pyo, Jeonghee Kim
RecSys Workshop 2015
[pdf | bib]
Eventera: Real-time Event Recommendation System from Massive Heterogeneous Online Media
Dongyeop Kang, DongGyun Han, Na Hea Park, Sangtae Kim, U Kang, Soobin Lee
ICDM 2014 (Demo)
[pdf | bib | project page | demo]
Data/Feature Distributed Stochastic Coordinate Descent for Logistic Regression
Dongyeop Kang, Woosang Lim, Kijung Shin, Sael Lee, U Kang
CIKM 2014
[pdf | bib | appendix | slides]
Hetero-Labeled LDA: A partially supervised topic model with heterogeneous label information
Dongyeop Kang, Youngja Park, Suresh Chari
ECML/PKDD 2014
[pdf | bib | slides]
Multidimensional Mining of Large-Scale Search Logs: A Topic-Concept Cube Approach
Dongyeop Kang, Daxin Jiang, Jian Pei, Zhen Liao, Xiaohui Sun, Ho-Jin Choi
WSDM 2011
[pdf | bib | journal version]
University of California, Berkeley, postdoctoral scholar (PI: Marti A. Hearst), 2020-Present, CA, USA
CMU, research assistant (advisor: Eduard Hovy), 2015-2020, PA, USA
Facebook AI, research intern (collaborators: Anusha Balakrishnan, Pararth Shah, Rajen Subba, Paul Crook, Y-Lan Boureau, Jason Weston), 2018, CA, USA
Allen Institute for Artificial Intelligence (AI2), research intern (collaborators: Tushar Khot, Ashish Sabharwal, Peter Clark), 2017, WA, USA
Microsoft Research, research intern (collaborators: Michael Gamon, Patrick Pantel), 2016, WA, USA
Eventrader, founder, 2015, Seoul, Korea
Naver Labs, researcher, 2014-2015, Seoul, Korea
Oracle Labs, research intern (mentor: Sungpack Hong), 2014, CA, USA.
IBM TJ Watson Research, research intern (mentor: Youngja Park), 2013, NY, USA.
KAIST Institute, researcher, 2012-2014, Seoul, Korea
CMU / PanOptus, research assistant (advisor: Eric Xing), 2011-2012, PA, USA
HKUST, research assistant (advisor: Sung Kim), 2011, Hong Kong.
Microsoft Research, research intern (mentor: Ed Nightingale), 2010, WA, USA.
Microsoft Research Asia, research intern (mentor: Daxing Jiang, Hang Li), 2009-2010, Beijing, China.
ETRI, intern, 2008, Daejeon, Korea.
DML/CDSN/KECI Labs at KAIST, research assistant, 2007-2009, Daejeon, Korea.
Program Committee / Reviewer of ICLR21
Program Committee / Reviewer of ICLR20, ACL20 (Generation track), NeurIPS20, EMNLP20 (Generation track)
Program Committee / Reviewer of ICLR19, ICML19, NAACL19 (Style track), ACL19 (Machine Learning / Generation / Question Answering / Sentence-level Semantics / Applications track), EMNLP19 (Lexical Semantics track), NeurIPS19, W-NUT19, Scientometrics19
Program Committee / Reviewer of NeurIPS18 (top-30% reviewer), EMNLP18 (Discourse track), ACL18 (Machine Learning track) (top-reviewer), MRQA18
Talk at KAIST AI Colloquium, 2020 Fall
Talk at GIST EECS Colloquium, 2020 Fall
Talk at Berkeley AI Research (BAIR) Workshop, 2020 Summer
Talk at CMU LTI Seminar, 2020 Summer
Talk at POSTECH AI Seminar, 2020 Summer
Lecture on "Relational Semantics on Word Embeddings" at Computational Semantics for NLP, 2019 Spring
CMU, teaching assistant, Computational Semantics for NLP (instructor: Eduard Hovy, Teruko Mitamura), 2018 Spring
Lecture on "Distributional Compositionality and Logic" at Computational Semantics for NLP, 2018 Spring
CMU, teaching assistant, Machine Translation and Sequence to Sequence Models (instructor: Graham Neubig), 2017 Spring
Last updated in Nov 2020