de
en
Schliessen
Detailsuche
Bibliotheken
Projekt
Impressum
Datenschutz
de
en
Schliessen
Impressum
Datenschutz
zum Inhalt
Detailsuche
Schnellsuche:
OK
Ergebnisliste
Titel
Titel
Inhalt
Inhalt
Seite
Seite
Im Dokument suchen
New approaches in user centric job monitoring on the LHC computing grid : application of remote debugging and real time data selection techniques / by Tim dos Santos. 2011
Inhalt
I Introduction
Context: On High Energy Physics (HEP)
Current research in HEP
The Standard Model
Examples for open questions
CERN and the LHC
The Large Hadron Collider
The ATLAS Experiment
Data flow in ATLAS
Real-time data reduction: Triggers
Software in HEP
High-performance maths and core services: ROOT
Event generators and detector simulation tools
ATLAS' main physics analysis framework: Athena
Grid Computing
Overview
Definition of the term ``Grid Computing''
Virtual Organisations
Components and services of a Grid
Security in the Grid
The WLCG
Overview
The middleware: gLite
Computing model
Data storage and distribution
gLite Grid jobs
Input- and outputdata
Grid job life cycle
Job failures
WLCG software
Pilot jobs and the pilot factory
The user interfaces: pAthena and Ganga
Conclusion
II Job monitoring
Overview
Site monitoring
User-centric monitoring of Grid jobs
The Job Execution Monitor
Concept
Architecture
User interface component
Worker node component
Data transmission
Inter-process communication
Acquisition of monitoring data
System metrics monitor (``Watchdog'')
Script wrappers
User interface
Command-line usage
Built-in interface
Integration into Ganga
Deployment strategy
Shortcomings of this version of the software
Conclusion
III Tracing the execution of binaries
Concept and requirements
Event notification
Symbol resolving and identifier lookup
Application memory inspection
Publishing of the gathered data
User code prerequisites
Architecture and implementation
Event notification
Symbol and value resolving
A victim-thread for safe memory inspection
Concept and architecture
Usage by the CTracer
Resulting monitoring data
Usage
Stand-alone execution for custom binaries
Integration into JEM
Configuration and invocation
Insertion of CTracer-data into JEMs data stream
Augmentation of the JEM-Ganga-Integration
Application for HEP Grid jobs
Preparation of the user application
Activation and configuration in Ganga
Results and interpretation in an example run
Performance impact
Conclusion
IV A real time trigger mechanism
Concept and requirements
Extendible chunk format for monitoring data
Chunk backlog and tagging
Inter-process communication in JEM revised
Architecture and implementation
General JEM architecture changes
High-throughput shared ring buffer
Working principle
Ring buffer operations
Triggers and event handling
Trigger architecture
Trigger scripting APIs
Example trigger scripts
Memory management
Management of shared memory
Shared identifier cache
Application in JEM
Changes in JEM execution
Refactored Ganga-JEM integration
Refactored CTracer
Testing
Functional tests
Performance tests
Conclusion
V Summary
Use cases and testing
Testing framework
Unit tests
User tests
Use cases
User perspective: Hanging Grid job
Admin perspective: Excess dCache mover usage
Outlook
Open questions
Further development
Conclusion
VI Appendices
Module structure
Example trigger implementations
List of Figures
List of Tables
List of Listings
Acronyms and abbreviations
References