A systematic metadata harvesting workflow for analysing scientific networks

Bilal H. Butt*, Muhammad Rafi, Muhammad Sabih

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

One of the disciplines behind the science of science is the study of scientific networks. This work treats scientific networks as social networks with different kinds of nodes and connections. Nodes can represent authors, articles or journals, while connections can represent citation, co-citation or co-authorship. One of the challenges in creating scientific networks is the lack of a publicly available, comprehensive data set, which limits the variety of analyses that can be run on the same set of nodes across different scientific networks. To supplement such analyses, we have worked with publicly available citation metadata from Crossref and OpenCitations. Using this data, we developed a workflow to create scientific networks. Analysis of these networks gives insights into academic research and scholarship. Different techniques of social network analysis have been applied in the literature to study these networks, including centrality analysis, community detection and clustering coefficient. We use the metadata of the journal Scientometrics as a case study to present our workflow, and we did a sample run of the proposed workflow to identify prominent authors using centrality analysis. This work is not a bibliometric study of any field; rather, it presents replicable Python scripts to perform network analysis. With the increasing popularity of open access and open metadata, we hypothesise that this workflow will provide an avenue for understanding scientific scholarship in multiple dimensions.
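To make the idea of a co-authorship network and centrality analysis concrete, here is a minimal sketch in Python. It is not the authors' script: the record layout, the placeholder author names, and the choice of degree centrality (one of several centrality measures) are assumptions made purely for illustration, and the standard library is used so the example runs without extra dependencies.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical records: each maps an article identifier to its author list.
# These are placeholders, not data harvested from Crossref or OpenCitations.
records = {
    "article-1": ["Author A", "Author B", "Author C"],
    "article-2": ["Author A", "Author D"],
    "article-3": ["Author B", "Author C"],
}


def build_coauthorship_network(records):
    """Return an undirected adjacency map: author -> set of co-authors."""
    graph = defaultdict(set)
    for authors in records.values():
        unique = sorted(set(authors))
        for author in unique:
            graph[author]  # touch the key so single-author papers add a node
        for a, b in combinations(unique, 2):
            graph[a].add(b)
            graph[b].add(a)
    return dict(graph)


def degree_centrality(graph):
    """Degree of each node divided by (n - 1), the maximum possible degree."""
    n = len(graph)
    if n <= 1:
        return {node: 0.0 for node in graph}
    return {node: len(neighbours) / (n - 1) for node, neighbours in graph.items()}


network = build_coauthorship_network(records)
centrality = degree_centrality(network)

# Rank authors from most to least central; ties are broken alphabetically.
ranking = sorted(centrality, key=lambda author: (-centrality[author], author))
for author in ranking:
    print(f"{author}: {centrality[author]:.2f}")
```

In this toy data set, "Author A" has co-authored with every other author and therefore ranks first. A citation or co-citation network could be built the same way by changing what the nodes and edges represent.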

Original language: English
Pages (from-to): 1-19
Number of pages: 19
Journal: PeerJ Computer Science
Volume: 7
State: Published - Mar 2021
Externally published: Yes

Bibliographical note

Publisher Copyright:
Copyright 2021 Butt et al.

Keywords

  • Centrality measures
  • Citation network
  • Collaboration network
  • Crossref
  • Digital libraries
  • Ego network
  • Influence
  • Network analysis
  • OpenCitations
  • Python

ASJC Scopus subject areas

  • General Computer Science
