Semantic annotation tool for annotating arabic web documents

Saeed Al-Bukhitan, Tarek Helmy, Mohammed Al-Mulhem

Research output: Contribution to journalConference articlepeer-review

9 Scopus citations

Abstract

The vision of semantic Web is to have a Web of data instead of Web of documents in a form that can be processed by machines. This vision could be achieved in the existing Web using semantic annotation. Due to exponential growth and huge size of the Web sources, there is a need to have a fast and automatic semantic annotation of Web documents. Arabic language has received less attention in semantic Web research compared to Latin languages especially in the field of semantic annotation. In this paper, we present an automatic annotation tool that supports the semantic annotation of Arabic language Web documents. The tool takes a URL of Web document and the corresponding ontology then produces an external annotation of the Web document using Resource Description Framework (RDF) language. The annotation tool's output could be used by semantic search engines to achieve higher recall and precision. To evaluate the performance of the tool, three domain ontologies of food, nutrition and health were used with manually annotated documents related to those domains. The initial results show a promising performance which will support the research in the semantic Web with respect to Arabic language.

Original languageEnglish
Pages (from-to)429-436
Number of pages8
JournalProcedia Computer Science
Volume32
DOIs
StatePublished - 2014

Bibliographical note

Funding Information:
This research is supported by King Abdulaziz City for Science and Technology (KACST) through King Fahd University of Petroleum & Minerals (KFUPM) Science & Technology Unit under the project No. 10-INF1381-04 as part of the National Science, Technology and Innovation Plan.

Keywords

  • Arabic Language
  • Ontology
  • Semantic Annotation
  • Semantic Web

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Semantic annotation tool for annotating arabic web documents'. Together they form a unique fingerprint.

Cite this