Provenance Based Intelligent Search Framework for Cloud Storage

Project: Research

Project Details


Cloud computing is the on-demand delivery of IT resources such as network, servers and storage via the internet. Instead of buying, owning and maintaining physical datacenters and servers, it provides access to technology services such as computing power and storage. Cloud storage offers secure and scalable storage for immutable data such as images, text, videos or any other file format. However, most of the data does not naturally present itself in a meaningful way to the outside world e.g. to business applications. Therefore it becomes challenging to search for data in a meaningful way inside data centers. Each and every data item contains some essential information related to data called metadata. Metadata provides meaningful insights to the original data. In general, data centers are designed to simply store data and not communicate with outside applications. This is where intelligent tools are required to process such huge amount of data using the meaningful insights of metadata. However, metadata contains limited information regarding the data item, and it does not contain information regarding the ancestry of the data such as how, where and why the data item was produced. To answer such questions and gain full insights to the relationships of data items, one important development in research is the use of data provenance. Data provenance describes the history of data item, where it came from, and how it came to be in its present state. Adding the provenance functionality to the datacenters can link data together. Such linking of data is extremely helpful in answering questions related to the demands of the end users. Data driven decision is one of the key concepts in big data such as cloud storage for business and research domain. To enable such decision making, the digital provenance (presenting the data based on ancestry) has aroused as an important type of data in grid and cloud environments. Many database applications such as integrity, safety, security, access polices etc. are based on the utilization of provenance data. Provenance requires ranking data based on the contents and the relationships that exists between files, with authors, and with processes that produced such data. The purpose of this research is to provide a method where provenance is utilized for searching data inside cloud environment. Various queries based on provenance data are further utilized for decision making based on the ancestry of data In summary, Cloud storage is utilized by businesses, industry and research domain on daily basis. The economy of cloud based companies is more than some countries these days. However, the increasing amount of data is becoming a challenge for end users for effective and efficient searching and decision making. This research is planned to produce a model for provenance based search
Effective start/end date1/02/2231/12/22


Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.