Data deduplication approaches: concepts, strategies, and challenges


Bibliographic details
Other authors: Thwel, Tin Thein (editor), Sinha, G. R. 1975- (editor)
Format: Electronic e-book
Language: English
Published: Amsterdam: Academic Press, 2021
Online access: DE-706 (full text)
Summary: In the age of data science, the rapid growth of data is a major concern for many computing and data storage applications. Duplicated or redundant data is a major challenge in data science research. Data Deduplication Approaches: Concepts, Strategies, and Challenges shows readers the various methods that can be used to eliminate multiple copies of the same files as well as duplicated segments or chunks of data within those files. As data duplication continues to grow, deduplication has become an especially useful field of research for storage environments, in particular persistent data storage. Data Deduplication Approaches provides readers with an overview of the concepts and background of data deduplication approaches, then demonstrates in technical detail the strategies and challenges of real-time implementations for handling big data, data science, data backup, and recovery. The book also includes future research directions, case studies, and real-world applications of data deduplication, focusing on reduced storage, backup, recovery, and reliability.
Contents:
1. Introduction to data deduplication approaches
2. Data deduplication concepts
3. Concepts, strategies, and challenges of data deduplication
4. Existing mechanisms for data deduplication
5. Classification criteria for data deduplication methods
6. File chunking approaches
7. Study of data deduplication for file chunking approaches
8. Essentials of data deduplication using open-source toolkit
9. Efficient data deduplication scheme for scale-out distributed storage
10. Identification of duplicate bug reports in software bug repositories: a systematic review, challenges and future scope
11. A survey and critical analysis on energy generation from datacenter
12. Review of MODIS EVI and NDVI data for data mining applications
13. Performance modeling for secure migration processes of legacy systems to the cloud computing
14. DedupCloud: an optimized efficient virtual machine deduplication algorithm in cloud computing environment
15. Data deduplication for cloud storage
16. Data duplication using Amazon Web Services cloud storage
17. Game-theoretic analysis of encrypted cloud data deduplication
18. Data deduplication applications in cognitive science and computer vision research
Physical description: 1 online resource, illustrations
ISBN: 9780128233955, 0128233958
DOI: 10.1016/C2020-0-00104-0