Reducing Wrong Labels using Conflict Score in Distant Supervision for Relation Extraction in Bangla Language

  • Tanzim Mahfuz
  • Tasneem Farhana Suha
  • Md Musfique Anwar

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Scopus citations

Abstract

The research area of information extraction (IE) aims to extract structured information, such as entity types and the relations between them, from unstructured text such as newswire, blogs, and government documents. Relation extraction (RE) deals with the automatic detection of relationships between concepts mentioned in free text. Knowledge-based distant supervision (DS) uses structured data to heuristically label a training corpus. However, this heuristic can produce noisily labeled data. In this paper, we propose a method that uses a conflict score in DS to reduce the number of wrong labels for Bangla sentences.

Original language: English
Title of host publication: 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering, CSDE 2020
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781665419741
State: Published - 16 Dec 2020
Externally published: Yes
Event: 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering, CSDE 2020 - Gold Coast, Australia
Duration: 16 Dec 2020 to 18 Dec 2020

Publication series

Name: 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering, CSDE 2020

Conference

Conference: 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering, CSDE 2020
Country/Territory: Australia
City: Gold Coast
Period: 16/12/20 to 18/12/20

Bibliographical note

Publisher Copyright:
© 2020 IEEE.

Keywords

  • Conflict score
  • Distant supervision
  • Knowledge base
  • Noisy pattern
  • Relation extraction

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems and Management
  • Health Informatics
  • Communication
