𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Managing clusters of geographically distributed high-performance computers

✍ Scribed by Brune, Matthias; Gehring, Jörn; Keller, Axel; Reinefeld, Alexander


Publisher
John Wiley and Sons
Year
1999
Tongue
English
Weight
562 KB
Volume
11
Category
Article
ISSN
1040-3108

No coin nor oath required. For personal study only.

✦ Synopsis


We present a software system for the management of geographically distributed highperformance computers. It consists of three components: 1. The Computing Center Software (CCS) is a vendor-independent resource management software for local HPC systems. It controls the mapping and scheduling of interactive and batch jobs on massively parallel systems; 2. The Resource and Service Description (RSD) is used by CCS for specifying and mapping hardware and software components of (meta-)computing environments. It has a graphical user interface, a textual representation and an object-oriented API; 3. The Service Coordination Layer (SCL) co-ordinates the co-operative use of resources in autonomous computing sites. It negotiates between the applications' requirements and the available system services.


📜 SIMILAR VOLUMES


High-Performance Heterogeneous Computing
✍ Lastovetsky, Alexey L.; Dongarra, Jack J. 📂 Article 📅 2009 🏛 John Wiley & Sons, Inc. 🌐 English ⚖ 896 KB

an Analytical Overview Of The State Of The Art, Open Problems, And Future Trends In Heterogeneous Parallel And Distributed Computing this Book Provides An Overview Of The Ongoing Academic Research, Development, And Uses Of Heterogeneous Parallel And Distributed Computing In The Context Of Scientifi

Special Issue: Grid Computing, High Perf
✍ María S. Pérez; Pilar Herrero; Dennis Gannon; Daniel S. Katz 📂 Article 📅 2010 🏛 John Wiley and Sons 🌐 English ⚖ 65 KB 👁 2 views

In the recent decades we have witnessed a major revolution in the computer field. The major challenges posed by applications in fields of bioinformatics, earth sciences or weather forecasting, among others, have caused the proliferation of complex solutions, such as grid, cloud and highperformance c

Optimal Recovery Schemes for High-Availa
✍ Lars Lundberg; Charlie Svahnberg 📂 Article 📅 2001 🏛 Elsevier Science 🌐 English ⚖ 368 KB

Clusters and distributed systems offer two important advantages, viz. fault tolerance and high performance through load sharing. When all computers are up and running, we would like the load to be evenly distributed among the computers. When one or more computers break down the load on these compute

Telegraphos: A Substrate for High-Perfor
✍ Manolis G.H. Katevenis; Evangelos P. Markatos; George Kalokerinos; Apostolos Dol 📂 Article 📅 1997 🏛 Elsevier Science 🌐 English ⚖ 322 KB

Networks of workstations and high-performance microcomputers have been rarely used for running parallel applications, because, although they have significant aggregate computing power, they lack the support for efficient message-passing and sharedmemory communication. In this paper we present Telegr

Managing Multiple Communication Methods
✍ Ian Foster; Jonathan Geisler; Carl Kesselman; Steven Tuecke 📂 Article 📅 1997 🏛 Elsevier Science 🌐 English ⚖ 352 KB

Modern networked computing environments and applications often require-or can benefit from-the use of multiple communication substrates, transport mechanisms, and protocols, chosen according to where communication is directed, what is communicated, or when communication is performed. We propose tech