Bi-LSTM-CRF sequence labeling for keyphrase extraction from scholarly documents

  • Rabah A. Al-Zaidy
  • , Cornelia Caragea
  • , C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

178 Scopus citations

Abstract

In this paper, we address the keyphrase extraction problem as sequence labeling and propose a model that jointly exploits the complementary strengths of Conditional Random Fields that capture label dependencies through a transition parameter matrix consisting of the transition probabilities from one label to the neighboring label, and Bidirectional Long Short Term Memory networks that capture hidden semantics in text through the long distance dependencies. Our results on three datasets of scholarly documents show that the proposed model substantially outperforms strong baselines and previous approaches for keyphrase extraction.

Original languageEnglish
Title of host publicationThe Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019
PublisherAssociation for Computing Machinery, Inc
Pages2551-2557
Number of pages7
ISBN (Electronic)9781450366748
DOIs
StatePublished - 13 May 2019
Externally publishedYes
Event2019 World Wide Web Conference, WWW 2019 - San Francisco, United States
Duration: 13 May 201917 May 2019

Publication series

NameThe Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019

Conference

Conference2019 World Wide Web Conference, WWW 2019
Country/TerritoryUnited States
CitySan Francisco
Period13/05/1917/05/19

Bibliographical note

Publisher Copyright:
© 2019 IW3C2 (International World Wide Web Conference Committee), published under Creative Commons CC-BY 4.0 License.

Keywords

  • Deep learning
  • Keyphrase extraction
  • Sequence labeling

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'Bi-LSTM-CRF sequence labeling for keyphrase extraction from scholarly documents'. Together they form a unique fingerprint.

Cite this