Full publications

Asterisks (“*”) denote equal contribution.

2022

Read, Revise, Repeat: A System Demonstration for Human-in-the-loop Iterative Text Revision
Wanyu Du*, Zae Myung Kim*, Vipul Raheja, Dhruv Kumar, Dongyeop Kang
In2Writing @ACL 2022 · Paper · Video · Bibtex

Understanding Iterative Revision from Human-Written Text
Wanyu Du, Vipul Raheja, Dhruv Kumar, Zae Myung Kim, Melissa Lopez, Dongyeop Kang
ACL 2022 · Paper · Bibtex

What Makes Better Augmentation Strategies? Augment Difficult but Not too Different
Jaehyung Kim, Dongyeop Kang, Sungsoo Ahn, Jinwoo Shin
ICLR 2022 · Paper · Bibtex


2021

Understanding Out-of-distribution: A Perspective of Data Dynamics
Dyah Adila, Dongyeop Kang
ICBINB at NeurIPS 2021 · Paper · Bibtex

Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica
Shirley Anugrah Hayati, Dongyeop Kang, Lyle Ungar
EMNLP 2021 · Paper · Code · Bibtex

Modeling Mathematical Notation Semantics in Academic Papers
Hwiyeol Jo, Dongyeop Kang, Andrew Head, Marti A. Hearst
EMNLP 2021 Findings · Paper · Bibtex

Visualizing Cross-Lingual Discourse Relations in Multilingual TED Corpora
Zae Myung Kim, Vassilina Nikoulina, Dongyeop Kang, Didier Schwab, Laurent Besacier
CODI at EMNLP 2021 · Paper · Code · Bibtex

Zero-Shot Natural Language Video Localization
Jinwoo Nam, Daechul Ahn, Dongyeop Kang, Seong Jong Ha, Jonghyun Choi
ICCV 2021 Oral · Paper · Bibtex

Style is NOT a single variable: Case Studies for Cross-Stylistic Language Understanding
Dongyeop Kang, Eduard Hovy
ACL 2021 · Paper · Code · Project Page · Bibtex

Augmenting Scientific Papers with Just-in-Time, Position-Sensitive Definitions of Terms and Symbols
Andrew Head, Kyle Lo, Dongyeop Kang, Raymond Fok, Sam Skjonsberg, Daniel S. Weld, Marti A. Hearst
CHI 2021 · Paper · Code · Video · Project Page · Bibtex


2020

GenAug: Data Augmentation for Finetuning Text Generators
Steven Y. Feng, Varun Gangal, Dongyeop Kang, Teruko Mitamura, Eduard Hovy
DeeLIO Workshop @EMNLP 2020 · Paper · Code · Bibtex

Document-Level Definition Detection in Scholarly Documents: Existing Models, Error Analyses, and Future Directions
Dongyeop Kang, Andrew Head, Risham Sidhu, Kyle Lo, Daniel Weld, Marti A. Hearst
SDP Workshop @EMNLP 2020 · Paper · Code · Bibtex

Plan ahead: Self-Supervised Text Planning for Paragraph Completion Task
Dongyeop Kang, Eduard Hovy
EMNLP 2020 · Paper · Bibtex

INSPIRED: Toward Sociable Recommendation Dialog Systems
Shirley Anugrah Hayati, Dongyeop Kang, Qingxiaoyang Zhu, Weiyan Shi, Zhou Yu
SDP Workshop @EMNLP 2020 · Paper · Code · Bibtex

Linguistically Informed Language Generation: A Multifaceted Approach
Dongyeop Kang
PhD Dissertation 2020 · Bibtex

Posterior Calibrated Training on Sentence Classification Tasks
Taehee Jung, Dongyeop Kang, Hua Cheng, Lucas Mentch, Thomas Schaaf
ACL 2020 · Paper · Code · Bibtex


2019

Recommendation as a Communication Game: Self-Supervised Bot-Play for Goal-oriented Dialogue
Dongyeop Kang, Anusha Balakrishnan, Pararth Shah, Paul Crook, Y-Lan Boureau, Jason Weston
EMNLP 2019 · Paper · Dataset · Bibtex

(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Annotated Stylistic Language Dataset with Multiple Personas
Dongyeop Kang, Varun Gangal, Eduard Hovy
EMNLP 2019 · Paper · Data+Code · Bibtex

Linguistic Versus Latent Relations for Modeling Coherent Flow in Paragraphs
Dongyeop Kang, Hiroaki Hayashi, Alan W Black, Eduard Hovy
EMNLP 2019 · Paper · Code · Bibtex


2018

Bridging Knowledge Gaps in Neural Entailment via Symbolic Models
Dongyeop Kang, Tushar Khot, Ashish Sabharwal, Peter Clark
EMNLP 2018 · Paper · Code · Bibtex

AdvEntuRe: Adversarial Training for Textual Entailment with Knowledge-Guided Examples
Dongyeop Kang, Tushar Khot, Ashish Sabharwal, Eduard Hovy
ACL 2018 · Paper · Code · Bibtex

A Dataset of Peer Reviews (PeerReaD): Collection, Insights and NLP Applications
Dongyeop Kang, Waleed Ammar, Bhavana Dalvi, Madeleine van Zuylen, Sebastian Kohlmeier, Eduard Hovy, Roy Schwartz
NAACL 2018 · Paper · Data+Code · Bibtex

Actionable email intent modeling with reparametrized RNN
Chu-Cheng Lin, Dongyeop Kang, Michael Gamon, Madian Khabsa, Ahmed Hassan Awadallah, Patrick Pantel
AAAI 2018 · Paper · Bibtex


2017

Detecting and Explaining Causes From Text For a Time Series Event
Dongyeop Kang, Varun Gangal, Ang Lu, Zheng Chen, Eduard Hovy
EMNLP 2017 · Code · Bibtex


2014

Eventera: Real-time Event Recommendation System from Massive Heterogeneous Online Media
Dongyeop Kang, DongGyun Han, Na Hea Park, Sangtae Kim, U Kang, Soobin Lee
ICDM 2014 · Paper · Video · Project Page · Bibtex

Data/Feature Distributed Stochastic Coordinate Descent for Logistic Regression
Dongyeop Kang, Woosang Lim, Kijung Shin, Sael Lee, U Kang
CIKM 2014 · Paper · Appendix · Bibtex

Hetero-Labeled LDA: A partially supervised topic model with heterogeneous label information
Dongyeop Kang, Youngja Park, Suresh Chari
ECML 2014 · Paper · Bibtex


2011

Multidimensional Mining of Large-Scale Search Logs: A Topic-Concept Cube Approach
Dongyeop Kang, Daxin Jiang, Jian Pei, Zhen Liao, Xiaohui Sun, Ho-Jin Choi
WSDM 2011 · Journal Version · Bibtex