Topic-focused multi-document summarization using an approximate oracle score
Title | Topic-focused multi-document summarization using an approximate oracle score |
Publication Type | Conference Papers |
Year of Publication | 2006 |
Authors | Conroy JM, Schlesinger JD, O'Leary DP |
Conference Name | Proceedings of the COLING/ACL on Main conference poster sessions |
Date Published | 2006 |
Publisher | Association for Computational Linguistics |
Conference Location | Stroudsburg, PA, USA |
Abstract | We consider the problem of producing a multi-document summary given a collection of documents. Since most successful methods of multi-document summarization are still largely extractive, in this paper, we explore just how well an extractive method can perform. We introduce an "oracle" score, based on the probability distribution of unigrams in human summaries. We then demonstrate that with the oracle score, we can generate extracts which score, on average, better than the human summaries, when evaluated with ROUGE. In addition, we introduce an approximation to the oracle score which produces a system with the best known performance for the 2005 Document Understanding Conference (DUC) evaluation. |
URL | http://dl.acm.org/citation.cfm?id=1273073.1273093 |
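
The abstract describes an "oracle" score built from the distribution of unigrams in human summaries. The following is a minimal sketch of one way such a score could be computed, not the authors' implementation. It assumes whitespace tokenization, estimates each term's probability as the fraction of human summaries containing it, and scores a sentence by averaging those probabilities over its distinct terms. The function names `term_probabilities`, `oracle_score`, and `extract` are hypothetical.

```python
from collections import Counter


def term_probabilities(human_summaries):
    """Estimate P(t) as the fraction of human summaries that contain term t.

    Assumption: lowercase whitespace tokenization stands in for whatever
    term extraction the paper actually uses.
    """
    doc_freq = Counter()
    for summary in human_summaries:
        # Count each term at most once per summary.
        doc_freq.update(set(summary.lower().split()))
    n = len(human_summaries)
    return {term: count / n for term, count in doc_freq.items()}


def oracle_score(sentence, probs):
    """Average P(t) over the distinct terms of a sentence (0.0 if empty)."""
    terms = set(sentence.lower().split())
    if not terms:
        return 0.0
    return sum(probs.get(t, 0.0) for t in terms) / len(terms)


def extract(sentences, probs, k=1):
    """Return the k highest-scoring sentences, best first."""
    return sorted(sentences, key=lambda s: oracle_score(s, probs), reverse=True)[:k]
```

In this sketch, `extract` simply ranks candidate sentences by score. A real summarizer would also need to handle redundancy and a length limit, neither of which the abstract specifies.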