ABSTRACT
Transcripts of meetings are a document genre characterized by a complex narrative structure. The essence is not only what is said, but also by whom and to whom. This paper investigates whether semantic annotations, such as the speaker, can be used to capture this debate structure as well as the related content of the debate. The structure is visualized in a graph, while the content is condensed into word clouds that are created using a parsimonious language model. Evaluation shows that both tools adequately capture the structure and content of the debate at an aggregated level.
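To illustrate how a parsimonious language model can select terms for a word cloud, the following is a minimal sketch, not the implementation used in the paper. It assumes the common EM formulation, in which a document model is re-estimated against a background corpus model so that probability mass moves away from terms the background already explains. The function names, the mixing weight `lam`, the pruning `threshold`, and the toy data are all illustrative assumptions.

```python
from collections import Counter


def parsimonious_lm(doc_tokens, corpus_tokens, lam=0.1, iterations=50, threshold=1e-4):
    """Estimate a parsimonious term distribution for one document (sketch).

    lam is the assumed weight of the document model in the mixture;
    terms whose probability falls below threshold are pruned.
    """
    tf = Counter(doc_tokens)
    cf = Counter(corpus_tokens)
    doc_total = sum(tf.values())
    corpus_total = sum(cf.values())
    if doc_total == 0 or corpus_total == 0:
        return {}

    # Start from the maximum-likelihood document model.
    p_doc = {t: n / doc_total for t, n in tf.items()}

    for _ in range(iterations):
        # E-step: expected count of each term attributed to the document model.
        expected = {}
        for t, p in p_doc.items():
            p_bg = cf[t] / corpus_total
            expected[t] = tf[t] * (lam * p) / ((1 - lam) * p_bg + lam * p)
        total = sum(expected.values())

        # M-step: normalize, prune low-probability terms, renormalize.
        kept = {t: e / total for t, e in expected.items() if e / total >= threshold}
        norm = sum(kept.values())
        if norm == 0:
            break
        p_doc = {t: p / norm for t, p in kept.items()}

    return p_doc


def top_terms(model, k=10):
    """Return the k most probable terms, e.g. as input for a word cloud."""
    return sorted(model.items(), key=lambda item: item[1], reverse=True)[:k]


# Toy example: one speaker's words against a background of all speech.
speaker = ["the"] * 6 + ["pension"] * 3 + ["budget"]
background = speaker + ["the"] * 94 + ["of"] * 96
model = parsimonious_lm(speaker, background)
cloud_terms = top_terms(model, k=2)
```

In this toy example the frequent but uninformative term "the" loses its probability mass to the background model, while "pension" and "budget" remain as candidates for the cloud; the resulting probabilities could then be mapped to font sizes.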
Index Terms
- Who said what to whom?: capturing the structure of debates