Visual analysis of news streams with article threads

Miloš Krstajic, Enrico Bertini, Florian Mansmann, Daniel A. Keim

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    The analysis of large quantities of news is an emerging area in the field of data analysis and visualization. International agencies collect thousands of news every day from a large number of sources and making sense of them is becoming increasingly complex due to the rate of the incoming news, as well as the inherent complexity of analyzing large quantities of evolving text corpora. Current visual techniques that deal with temporal evolution of such complex datasets, together with research efforts in related domains like text mining and topic detection and tracking, represent early attempts to understand, gain insight and make sense of these data. Despite these initial propositions, there is still a lack of techniques dealing directly with the problem of visualizing news streams in a "on-line" fashion, that is, in a way that the evolution of news can be monitored in real-time by the operator. In this paper we propose a purely visual technique that permits to see the evolution of news in real-time. The technique permits to show the stream of news as they enter into the system as well as a series of important threads which are computed on the fly. By merging single articles into threads, the technique permits to offload the visualization and retain only the most relevant information. The proposed technique is applied to the visualization of news streams generated by a news aggregation system that monitors over 4000 sites from 1600 key news portals world-wide and retrieves over 80000 reports per day in 43 languages.

    Original languageEnglish (US)
    Title of host publicationProc. of the 1st International Workshop on Novel Data Stream Pattern Mining Techniques, StreamKDD'10, Held in Conjunction with the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
    Pages39-46
    Number of pages8
    DOIs
    StatePublished - 2010
    Event1st International Workshop on Novel Data Stream Pattern Mining Techniques, StreamKDD'10, Held in Conjunction with the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining - Washington, DC, United States
    Duration: Jul 25 2010Jul 25 2010

    Publication series

    NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

    Other

    Other1st International Workshop on Novel Data Stream Pattern Mining Techniques, StreamKDD'10, Held in Conjunction with the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
    Country/TerritoryUnited States
    CityWashington, DC
    Period7/25/107/25/10

    Keywords

    • Data streaming
    • News analysis
    • Visual analytics

    ASJC Scopus subject areas

    • Software
    • Information Systems

    Fingerprint

    Dive into the research topics of 'Visual analysis of news streams with article threads'. Together they form a unique fingerprint.

    Cite this