Skip to main navigation Skip to search Skip to main content

Discriminative bi-term topic modelfor headline-based social news clustering

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

34 Scopus citations

Abstract

Social news are becoming increasingly popular. News organizations and popular journalists are starting to use social media more and more heavily for broadcasting news. The major challenge in social news clustering lies in the fact that textual content is only a headline, which is much shorter than the fulltext. Previous works showed that the bi-term topic model (BTM) is effective in modeling short text such as tweets. However, the drawback is that all non-stop terms are considered equally in forming the bi-terms. In this paper, a discriminative bi-term topic model (d-BTM) is presented, which tries to exclude less indicative bi-terms by discriminating topical terms from general and documentspecific ones. Experiments on TDT4 and Reuter-21578 show that using merely headlines, the d-BTM model is able to induce latent topics that are nearly as good as that are generated by LDA using news fulltext as evidence. The major contribution of this work lies in the empirical study on the reliability of topic modeling using merely news headlines.

Original languageEnglish
Title of host publicationProceedings of the 28th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2015
EditorsWilliam Eberle, Ingrid Russell
PublisherAAAI press
Pages311-316
Number of pages6
ISBN (Electronic)9781577357308
StatePublished - 2015
Externally publishedYes
Event28th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2015 - Hollywood, United States
Duration: 18 May 201520 May 2015

Publication series

NameProceedings of the 28th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2015

Conference

Conference28th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2015
Country/TerritoryUnited States
CityHollywood
Period18/05/1520/05/15

Bibliographical note

Publisher Copyright:
Copyright © 2015, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Discriminative bi-term topic modelfor headline-based social news clustering'. Together they form a unique fingerprint.

Cite this