A popularity prediction and dynamic data replication study for the ATLAS distributed data management / by Thomas Beermann

Beermann, Thomas

Titelaufnahme

Titel
A popularity prediction and dynamic data replication study for the ATLAS distributed data management / by Thomas Beermann
Verfasser
Beermann, Thomas
Körperschaft
Bergische Universität Wuppertal
Erschienen
Wuppertal, July 5, 2017
Ausgabe
Elektronische Ressource
Umfang
1 Online-Ressource (xi, 139 Seiten)
Hochschulschrift
Bergische Universität Wuppertal, Dissertation, 2017
Sprache
Englisch
Dokumenttyp
Dissertation
URN
urn:nbn:de:hbz:468-20170707-130937-5

Zugriffsbeschränkung

Das Dokument ist frei verfügbar

Links

Dateien

A popularity prediction and dynamic data replication study for the ATLAS distributed data management [pdf 5.65 mb] RIS

Klassifikation

Klassifikation (DDC) → Naturwissenschaften und Mathematik → Physik → Physik

English

ATLAS (A Toroidal LHC Apparatus) is one of several experiments of at the Large Hadron Collider (LHC) at CERN in Geneva, Switzerland. The LHC is the largest and most powerful particle accelerator in the world, which is able to operate at unprecedented energy levels. Because of this, ATLAS is able to observe physical phenomena and massive particles that were not observable before. The detectors at the LHC itself create vast amount of data that need to be accessible to physicists for their analysis. For this reason a worldwide computing grid (WLCG) was created that connects hundreds of computing centres across the planet. The experiments constantly create new data but older data has to be kept as well. The available resources are limited, which requires a smart management of the storage space. This thesis presents a method to dynamically create new replicas and delete unused replicas based on a prediction of data popularity to improve user waiting times. The first part gives an general introduction of the LHC, ATLAS and the WLCG, a description of the computing model and systems used by the ATLAS experiment and finally the motivation for this work. The second part concentrates on the popularity prediction, introducing how the access data from the grid can be transformed to be used with different prediction methods. The evaluation describes typical usage patterns followed by a discussion of the advantages and disadvantages of the prediction algorithms, which then leads to the hybrid prediction, where two methods are combined to improve the results. The third part then first introduces the redistribution algorithms that then uses the popularity prediction to delete and add new replicas. After that a grid simulator is described that was developed to study the impact of the redistribution on different workloads. Finally, the evaluation shows the impact of the redistribution on waiting times for user analysis jobs on the grid. The last part summarises the results and gives an outlook for further developments.

Inhalt

Inhalt des Werkes

Statistik

Das PDF-Dokument wurde 3 mal heruntergeladen.

Lizenz-/Rechtehinweis

Urheberrechtsschutz

Detailsuche

Bibliotheken

Projekt

Impressum

Datenschutz

Impressum

Datenschutz

Titelaufnahme

English