Abstract
Social news are becoming increasingly popular. News organizations and popular journalists are starting to use social media more and more heavily for broadcasting news. The major challenge in social news clustering lies in the fact that textual content is only a headline, which is much shorter than the fulltext. Previous works showed that the bi-term topic model (BTM) is effective in modeling short text such as tweets. However, the drawback is that all non-stop terms are considered equally in forming the bi-terms. In this paper, a discriminative bi-term topic model (d-BTM) is presented, which tries to exclude less indicative bi-terms by discriminating topical terms from general and documentspecific ones. Experiments on TDT4 and Reuter-21578 show that using merely headlines, the d-BTM model is able to induce latent topics that are nearly as good as that are generated by LDA using news fulltext as evidence. The major contribution of this work lies in the empirical study on the reliability of topic modeling using merely news headlines.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 28th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2015 |
| Editors | William Eberle, Ingrid Russell |
| Publisher | AAAI press |
| Pages | 311-316 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781577357308 |
| State | Published - 2015 |
| Externally published | Yes |
| Event | 28th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2015 - Hollywood, United States Duration: 18 May 2015 → 20 May 2015 |
Publication series
| Name | Proceedings of the 28th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2015 |
|---|
Conference
| Conference | 28th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2015 |
|---|---|
| Country/Territory | United States |
| City | Hollywood |
| Period | 18/05/15 → 20/05/15 |
Bibliographical note
Publisher Copyright:Copyright © 2015, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
ASJC Scopus subject areas
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'Discriminative bi-term topic modelfor headline-based social news clustering'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver