๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

[ACM Press the third international workshop - San Jose, California, USA (2011.06.08-2011.06.08)] Proceedings of the third international workshop on Large-scale system and application performance - LSAP '11 - Multi-scale analysis of large distributed computing systems

โœ Scribed by Schnorr, Lucas Mello; Legrand, Arnaud; Vincent, Jean-Marc


Book ID
121205034
Publisher
ACM Press
Year
2011
Weight
716 KB
Category
Article
ISBN
1450307035

No coin nor oath required. For personal study only.

โœฆ Synopsis


Large scale distributed systems are composed of many thousands of computing units. Today's examples of such systems are grid, volunteer and cloud computing platforms. Generally, their analyses are done through monitoring tools that gather resource information like processor or network utilization, providing high-level statistics and basic resource usage traces. Such approaches are recognized as rather scalable but are unfortunately often insufficient to detect or fully understand unexpected behavior. In this paper, we investigate the use of more detailed tracing techniques -commonly used in parallel computing-in distributed systems. Finely analyzing the behavior of such systems comprising thousands of resources over several months may seem infeasible. Yet, we show that the resulting trace can be analyzed using tools that enable to easily zoom in and out on selected area of space and time. We use the BOINC volunteer computing system as a basis of this study. Since detailed activity traces of the BOINC clients are not available yet, we rely instead on traces obtained through a BOINC simulator developed with the SimGrid toolkit and which uses as input real availability trace files from the Seti@Home BOINC project. We show that the analysis of such detailed resource utilization traces provides several non-trivial insights about the whole system and enables the discovery of unexpected behavior.


๐Ÿ“œ SIMILAR VOLUMES


[ACM Press the third international works
โœ Raicu, Ioan; Foster, Ian T.; Beckman, Pete ๐Ÿ“‚ Article ๐Ÿ“… 2011 ๐Ÿ› ACM Press โš– 815 KB

Exascale computers will enable the unraveling of significant scientific mysteries. Predictions are that 2019 will be the year of exascale, with millions of compute nodes and billions of threads of execution. The current architecture of high-end computing systems is decades-old and has persisted as w

[ACM Press the 2nd international worksho
โœ Bresnahan, John; Keahey, Kate; LaBissoniere, David; Freeman, Tim ๐Ÿ“‚ Article ๐Ÿ“… 2011 ๐Ÿ› ACM Press โš– 368 KB

Amazon's S3 protocol has emerged as the de facto interface for storage in the commercial data cloud. However, it is closed source and unavailable to the numerous science data centers all over the country. Just as Amazon's Simple Storage Service (S3) provides reliable data cloud access to commercial