skip to main content
10.1145/2009916.2010015acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Summarizing the differences in multilingual news

Published:24 July 2011Publication History

ABSTRACT

There usually exist many news articles written in different languages about a hot news event. The news articles in different languages are written in different ways to reflect different standpoints. For example, the Chinese news agencies and the Western news agencies have published many articles to report the same news of "Liu Xiaobo's Nobel Prize" in Chinese and English languages, respectively. The Chinese news articles and the English news articles share something about the news fact in common, but they focus on different aspects in order to reflect different standpoints about the event. In this paper, we investigate the task of multilingual news summarization for the purpose of finding and summarizing the major differences between the news articles about the same event in the Chinese and English languages. We propose a novel constrained co-ranking (C-CoRank) method for addressing this special task. The C-CoRank method adds the constraints between the difference score and the common score of each sentence to the co-ranking process. Evaluation results on the manually labeled test set with 15 news topics show the effectiveness of our proposed method, and the constrained co-ranking method can outperform a few baselines and the typical co-ranking method.

References

  1. A. Aker, T. Cohn, and R. Gaizauskas. Multi-document summarization using A* search and discriminative training. In Proceedings of EMNLP2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. R. Amini, P. Gallinari. The Use of Unlabeled Data to Improve Supervised Learning for Text Summarization. In Proceedings of SIGIR2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. F. Boudin, M. El-Bèze, J.-M. Torres-Moreno. The LIA update summarization systems at TAC-2008. In Proceedings of TAC2008.Google ScholarGoogle Scholar
  4. J. Carbonell and J. Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of SIGIR1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. G. de Chalendar, R. Besançon, O. Ferret, G. Grefenstette, and O. Mesnard. Crosslingual summarization with thematic extraction, syntactic sentence simplification, and bilingual generation. In Workshop on Crossing Barriers in Text Summarization Research, 5th International Conference on Recent Advances in Natural Language Processing (RANLP2005).Google ScholarGoogle Scholar
  6. A. Celikyilmaz and D. Hakkani-Tur. A hybrid hierarchical model for multi-document summarization. In Proceedings of ACL2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. H. T. Dang and K. Owczarzak. Overview of the TAC 2008 update summarization task. In Proceedings of TAC2008.Google ScholarGoogle Scholar
  8. P. Du, J. Guo, J. Zhang, X. Cheng. Manifold ranking with sink points for update summarization. In Proceedings of CIKM2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. G. ErKan, D. R. Radev. LexPageRank. Prestige in Multi-Document Text Summarization. In Proceedings of EMNLP2004.Google ScholarGoogle Scholar
  10. S. Fisher and B. Roark. Query-focused supervised sentence ranking for update summaries. In Proceeding of TAC2008.Google ScholarGoogle Scholar
  11. S. Harabagiu and F. Lacatusu. Topic themes for multi-document summarization. In Proceedings of SIGIR2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. H. D. Kim and C. Zhai. Generating comparative summaries of contradictory opinions in text. In Proceedings of CIKM2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Kupiec, J. Pedersen, F. Chen. A.Trainable Document Summarizer. In Proceedings of SIGIR1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Leuski, C.-Y. Lin, L. Zhou, U. Germann, F. J. Och, E. Hovy. Cross-lingual C*ST*RD: English access to Hindi information. ACM Transactions on Asian Language Information Processing, 2(3): 245--269, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. W. Li, F. Wei Q. Lu and Y. He. PNR2: Ranking sentences with positive and negative reinforcement for query-oriented update summarization. In Proceedings of COLING2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. Y. Lin, E. Hovy. The Automated Acquisition of Topic Signatures for Text Summarization. In Proceedings of COLING2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. C..-Y. Lin and E.. H. Hovy. From Single to Multi-document Summarization: A Prototype System and its Evaluation. In Proceedings of ACL2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. C.-Y. Lin and E.H. Hovy. Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics. In Proceedings of HLT-NAACL -2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. C.-Y. Lin, L. Zhou, and E. Hovy. Multilingual summarization evaluation 2005: automatic evaluation report. In Proceedings of MSE (ACL2005 Workshop).Google ScholarGoogle Scholar
  20. M. Litvak, M. Last, and M. Friedman. A new approach to improving multilingual summarization using a genetic algorithm. In Proceedings of ACL2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. H. P. Luhn. The Automatic Creation of literature Abstracts. IBM Journal of Research and Development, 2(2), 1969. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. I. Mani and E. Bloedorn. Summarizing similarities and differences among related documents. Information Retrieval, 1: 35--67, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. R. Mihalcea, P. Tarau. TextRank: Bringing Order into Texts. In Proceedings of EMNLP2004.Google ScholarGoogle Scholar
  24. R. Mihalcea and P. Tarau. A language independent algorithm for single and multiple document summarization. In Proceedings of IJCNLP-2005.Google ScholarGoogle Scholar
  25. V. Nastase, K. Filippova, S. P. Ponzetto. Generating update summaries with spreading activation. In Proceedings of TAC2008.Google ScholarGoogle Scholar
  26. A. Nenkova and A. Louis. Can you summarize this? Identifying correlates of input difficulty for generic multi-document summarization. In Proceedings of ACL-2008:HLT.Google ScholarGoogle Scholar
  27. M. J. Paul, C. Zhai, and R. Girju. Summarizing contrastive viewpoints in opinionated text. In Proceedings of EMNLP2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. D. R. Radev, H. Y. Jing, M. Stys and D. Tam. Centroid-based summarization of multiple documents. Information Processing and Management, 40: 919--938, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. A. Siddharthan and K. McKeown. Improving multilingual summarization: using redundancy in the input to correct MT errors. In Proceedings of HLT/EMNLP-2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. X. Wan. Towards a unified approach to simultaneous single-document and multi-document summarizations. In Proceedings of COLING2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. X. Wan, H. Li and J. Xiao. Cross-language document summarization based on machine translation quality prediction. In Proceedings of ACL2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. X. Wan and J. Yang. Multi-document summarization using cluster-based link analysis. In Proceedings of SIGIR-2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. X. Wan, J. Yang and J. Xiao. Towards an Iterative Reinforcement Approach for Simultaneous Document Summarization and Keyword Extraction. In Proceedings of ACL2007.Google ScholarGoogle Scholar
  34. X. Wan, J. Yang and J. Xiao. Manifold-ranking based topic-focused multi-document summarization. In Proceedings of IJCAI-2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. D. Wang, S. Zhu, T. Li, and Y. Gong. Comparative document summarization via discriminative sentence selection. In Proceedings of CIKM2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. D. Wang, T. Li. Document update summarization using incremental hierarchical clustering. In Proceedings of CIKM2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. K.-F. Wong, M. Wu and W. Li. Extractive summarization using supervised and semi-supervised learning. In Proceedings of COLING-2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. H. Y. Zha. Generic Summarization and Keyphrase Extraction Using Mutual Reinforcement Principle and Sentence Clustering. In Proceedings of SIGIR2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Y. Zhang, X. Ji, C.-H. Chu, and H. Zha. Correlating summarization of multi-source news with K-way graph bi-clustering. SIGKDD Explorations, 6(2), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Summarizing the differences in multilingual news

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGIR '11: Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
          July 2011
          1374 pages
          ISBN:9781450307574
          DOI:10.1145/2009916

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 24 July 2011

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate792of3,983submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader