Generating Phrasal and Sentential Paraphrases: A Survey of Data-Driven Methods

TitleGenerating Phrasal and Sentential Paraphrases: A Survey of Data-Driven Methods
Publication TypeJournal Articles
Year of Publication2010
AuthorsMadnani N, Dorr BJ
JournalComputational Linguistics
Volume36
Issue3
Pagination341 - 387
Date Published2010///
ISBN Number0891-2017
Abstract

The task of paraphrasing is inherently familiar to speakers of all languages. Moreover, the task of automatically generating or extracting semantic equivalences for the various units of language—words, phrases, and sentences—is an important part of natural language processing (NLP) and is being increasingly employed to improve the performance of several NLP applications. In this article, we attempt to conduct a comprehensive and application-independent survey of data-driven phrasal and sentential paraphrase generation methods, while also conveying an appreciation for the importance and potential use of paraphrases in the field of NLP research. Recent work done in manual and automatic construction of paraphrase corpora is also examined. We also discuss the strategies used for evaluating paraphrase generation techniques and briefly explore some future trends in paraphrase generation.

URLhttp://dx.doi.org/10.1162/coli_a_00002
DOI10.1162/coli_a_00002