poster FDSI SPADES.pptx - Agence Nationale de la Recherche

Transcription

poster FDSI SPADES.pptx - Agence Nationale de la Recherche
FONDEMENT DES SYSTÈMES
INFORMATIQUES
SPADES:
Servicing Petascale Architectures
and Distributed System
SEGI 2008
Overview!
Today's emergence of Petascale
architecture and the evolution of both
research and production grids
increase a lot the number of potential
r e s o u r c e s . H o w e v e r, e x i s t i n g
infrastructures and access rules do
not allow to fully take advantage of
these resources. One key idea of the
SPADES project is to propose a nonintrusive but highly dynamic
environment able to take advantage of
available resources without disturbing
their native use. "
Goals:!
•  Support platforms at a "
very large scale (Petas-"
cale environments"
Means: !
•  Distributed Schedulers "
based on the Desktop "
Computing paradigm."
Co-scheduling0
The$scheduling$
en--es$($$$$$$$$$$$$$$)$$
Scheduler0
Scheduler0
have$to$cooperate$$
with$both$Service$
Discovery$Agents$($$$$$$$$$)$$
SDA0
and$Batch$Schedulers$($$$$$$$$)$
BS0
ObjecGves:0
• $Determining$the$
BS0
Best$mapping$of$the$
jobs$on$resources.$
• $Designing$reac-ve$$
Scheduling$heuris-cs.$$
Approaches:0
• $Adap-ng$strategies$coming$from$
Desktop$Compu-ng$$
• $Replica-on$strategies$$
• $Coopera-on$between$several$batch$
schedulers$$
• $OnEdemand$job$migra-on$
COORDINATEURS:
PARTENAIRES:
[email protected]
Tel:004.72.72.80.040
h9p//graal.ens-lyon.fr/SPADES0
P2P Service Discovery in a large
scale and distributed context."
Constraints:!
•  Platforms heterogeneity"
•  System dynamicity"
Objectives:!
•  Reliability of infrastructure"
•  Efficiency of" SDA0
Fault-tolerance mechanisms:!
•  Self-stabilisation"
•  Replication. "
Petascale Runtimes:
Runtimes hitherto proposed do
not fit any more with the 100.000
resources
Problems:
•  Deployment and launching
step duration
•  Scaling data structures
•  Failure detection broadcasting
•  Failure recovery
Issues:
•  Unifying resources
•  Efficient communication
Means:
•  P2P approach
•  Overlay networks
BS0
Expected results:
-  n u m e r o u s a n d
frequent failures
Scheduler0
SDAs0
BS0
BS0
Batch Scheduler Interactions!
To allow reactive Petascale architectures, fine interactions between
Batch Schedulers ( BS0 ) and upper
layers is a key issue."
Approachs: !
•  Using standards API (OGSA-BES,
DRMAA) hiding heterogeneity."
•  Customizing a BS and executing it in
parallel with other BS."
Applications
Target applications:
•  Cosmic ray simulator
•  Coupled climate model.
Test platform:
•  IBM Blue Gene
•  CRAY XT of PRACE
•  DEISA grids
Benefits:
•  Increase model resolution
•  Reduce model time step
•  Make replications
Expected results: Observe low scale
Phenomena as hurricanes.