Post on 17-Apr-2020
1
Grid'5000 and Grid eXplorer 1GdXGdXGrid’5000
Grid Grid eXplorereXplorer&
Franck CappelloINRIA
fci@lri.frAvec tous les membres participants
Plates-formes de Grilles expérimentalesmutualisées à l’échelle nationale
ACI GRID & ACI MD
Grid’5000
ACI Masse de DonnéesACI GRID
Grid'5000 and Grid eXplorer 2GdXGdXGrid’5000
Agenda
• Rational
• Grid’5000
• Grid eXplorer
2
Grid'5000 and Grid eXplorer 3GdXGdXGrid’5000
Grid raises a lot of research issues:Security, Performance, Fault tolerance, Scalability, Load Balancing, Coordination, Message passing, Data storage, Programming, Algorithms, Communication protocols and architecture, Deployment, etc.
Theoretical models and simulators cannot capture reallife or real size conditionsProduction platforms have strong difficulties to reproduce experimental conditions
How to test and compare?• Fault tolerance protocols• Security mechanisms• Networking protocols• etc.
Grid experimental platformsrational
Grid'5000 and Grid eXplorer 4GdXGdXGrid’5000
log(cost)
log(realism)
math simulation emulation live systems
Models:Sys, apps,Platforms,conditions
Real systemsReal applicationsReal platformsReal conditions
Tools for Distributed System Studies To investigate Distributed System issues, we need:1) Tools (model, simulators, emulators, experi. Platforms)2) Strong interaction between these research tools
Tools for Large Scale Distributed Systems
Real systemsReal applications“In-lab” platformsSynthetic conditions
Key system mecas.Algo, app. kernelsVirtual platformsSynthetic conditions
3
Grid'5000 and Grid eXplorer 5GdXGdXGrid’5000
Grids are complex dist. Systems.• Systems
– nodes, OS, – distributed systems mechanisms (resource discovery, storage,
scheduling, etc.), – middleware, runtimes,
– Fault (crash, transient)– Workload (multiple users/multiple applications)– Heterogeneity (resource diversity, performance)– Malicious users/behaviors
• Networks– routers, links, topology,– protocols, – Theoretical features: synchronous, pseudo synchronous or
asynchronous
– Disconnection– Packet loss– Congestion
Static
Static
Dyn.
Dyn.
Grid'5000 and Grid eXplorer 6GdXGdXGrid’5000
Existing Grid Research Tools• SimGRid and SimGrid2
• Discrete event simulation with trace injection• Originally dedicated to scheduling studies
• GridSim• Australian competitor of SimGrid• Dedicated to scheduling (with deadline)
• Titech Bricks• Discrete event simulation for scheduling
and replication studies
• MicroGrid•Emulator with MPI communications•Not dynamic
àNo emulator or real life experimental platformàThese tools do not scale (limited to ~100 grid nodes)àThey do not consider the network issues (almost)
4
Grid'5000 and Grid eXplorer 7GdXGdXGrid’5000
log(cost)
log(realism)
math simulation emulation live systems
SimGridMicroGridBricksNS, etc.Model
Protocol proof
Grid eXplorerWANinLabEmulab
Grid’5000TERAGridPlanetLabNaregi Testbed
We need Grid experimental toolsAccording to the current knowledge:There is no large scale testbed dedicated to Grid experiments
à Grid’5000 as a real life systemà Grid eXplorer as a large scale emulator
Grid'5000 and Grid eXplorer 8GdXGdXGrid’5000Grid’5000A New Generation of Research Tools
5
Grid'5000 and Grid eXplorer 9GdXGdXGrid’5000 NAREGI Middleware
Development Infrastructure
• Under Procurement – Installation planned in Dec. 2003– 3 SMPs, 128 procs total– 6 x 128-proc clusters, with different interconnects– 1 File Server– Multi-gigabit networking to simulate Grid Environment– NOT a production system (c.f. TeraGrid) – Mainly geared
towards R&D, but could be used partially for experimental production
– ~5 Teraflops– To form a Grid with the IMS NAREGI application testbed
infrastructure (~ 10 Teraflops, March 2004), and other national centers(voluntary basis) via SuperSINET
Grid'5000 and Grid eXplorer 10GdXGdXGrid’5000
Netbed (Emulab)Comme « Modelnet »
Description de la topologie => NS scriptUtilisation de DummynetOutils de Mapping routeur_logique => machine physiqueUtilisation du simulateur NSE (ns emulation)Utilisation de noeuds exterieurs client (40 DSL)
3 Sites
6
Grid'5000 and Grid eXplorer 11GdXGdXGrid’5000
GNET-1
Grid'5000 and Grid eXplorer 12GdXGdXGrid’5000
1) Remotely controllable Grid nodes installed in geographically distributed laboratories
2) A « Controllable » and « Monitorable » Network between theGrid nodes à (may be unrealistic in some cases)
3) A middleware infrastructure connecting allowing users to access the Grid sites
4) A playground to prepare experiments
5) A toolkit to deploy, run, monitor, control experiments and collect results
What do we need for Grid experiments ?
We need these components for a nation wide experimental platform
7
Grid'5000 and Grid eXplorer 13GdXGdXGrid’5000
Agenda
• Rational
• Grid’5000
• Grid eXplorer
Grid'5000 and Grid eXplorer 14GdXGdXGrid’5000
1) Building a nation wide experimental platform for Grid researches (like a particle accelerator for the computerscientists)• 8/10 geographically distributed sites• every site hosts a cluster (from 256 CPUs to 1K CPUs)• All sites are connected by RENATER (French Res. and Edu. Net.)• RENATER hosts probes to trace network load conditions• Design and develop a system/middleware environment
for safely test and repeat experiments
2) Use the platform for Grid experiments in real life conditions• Address critical issues of Grid system/middleware:
• Programming, Scalability, Fault Tolerance, Scheduling• Address critical issues of Grid Networking
• High performance transport protocols, Qos• Port and test applications• Investigate original mechanisms
• P2P resources discovery, Desktop Grids
The Grid’5000 Project
8
Grid'5000 and Grid eXplorer 15GdXGdXGrid’5000 Participants
-Franck Cappello (animateur)-Thierry Priol (Directeur directeur ACI Grid)-Brigitte Plateau (Directrice CS ACI Grid)-Dani Vandrome (Renater)-Frédéric Desprez (Lyon)-Michel Daydé (Toulouse)-Yvon Jégou (Rennes)-Stéphane Lantéri (Sophia)-Raymond Namyst (Bordeaux)-Pascale Primet (Lyon)-Olivier Richard (Grenoble)
Comité de Pilotage (11) :
Jean-Luc ANTHOINE Jean-Claude BarbetPierrette BarbarescoNicolas CapitEddy CaronChristophe CérinOlivier CoulaudGeorges Da-CostaYves DenneulinBenjamin DexheimerAurélien DumezGilles Gallot David GeldreichSébastien GeorgetOlivier Gluck Claude InglebertJulien LeducCyrille MartinJean-Francois MéhautJean-Christophe MignotThierry MonteilGuillaume MornetAlain NaudVincent NériGaetan PeaquinFranck SimonSebastien VarretteJean-Marc Vincent
Comité Techniqueétendu (28) :
Grid’5000
Forums:Deployment/exploitation: Franck Cappello (AS1, RTP8)Programming models:Raymond Namyst (AS2, RTP8)
Grid'5000 and Grid eXplorer 16GdXGdXGrid’5000
www.grid5000.org
9
Grid'5000 and Grid eXplorer 17GdXGdXGrid’5000
Grid'5000 and Grid eXplorer 18GdXGdXGrid’5000Grid’5000 in November’2004
Funding: ACI Grid, ACI MD, INRIA, CNRS, Reg.
Grid 5000 nodes
à Connexion des sites à 1 Gbits3
Grid eXplorerPau
~4M€ for hardware only (including 2004 fund)
10
Grid'5000 and Grid eXplorer 19GdXGdXGrid’5000
Grid’5000 Hardware
Call for proposals
Sept03
Selection of 7 sites
Nov03
ACI GRIDFunding
Jan04
Call for ExpressionOf Interest
March04
Vendorselection
Jun/July 04
Instal.First tests
Spt 04
Final review
Oct 04
FisrtDemo(SC04)
Nov 04
Grid’5000 System/middleware Forum
Security Prototypes
Control Prototypes
Grid’5000 Programming Forum
Grid’5000
Grid’5000 Experiments
Planning
Renater connection
Demo preparation
today
SwitchProto toGrid’5000
Grid’5000
Grid'5000 and Grid eXplorer 20GdXGdXGrid’5000
Summary of Grid5000 experimentsof Grid’5000 members
• Networking– End host communication layer (interference with local communications)– High performance long distance protocols (improved TCP)– High Speed Network Emulation
• Middleware / OS– Grid’5000 control/access– Grid’5000 experiment automation– Scheduling / data distribution in Grid– Fault tolerance in Grid– Resource management– Grid SSI OS and Grid I/O– Desktop Grid/P2P systems
• Programming– Component programming for the Grid (Java, Corba)– GRID-RPC– GRID-MPI– Code Coupling
• Applications– Multi-parametric applications (Climate modeling/Functional Genomic)– Large scale experimentation of distributed applications
(Electromagnetism, multi-material fluid mechanics, parallel optimization algorithms, CFD, astrophysics
– Medical images, Collaborating tools in virtual 3D environment
11
Grid'5000 and Grid eXplorer 21GdXGdXGrid’5000 Grid’5000 Design goal:
Experimenting all layers of the Gridsoftware stack
Algorithms
Runtime
Middleware
Operating System
Programming Environments
Networking
Application
Grid'5000 and Grid eXplorer 22GdXGdXGrid’5000 Visions utilisateur
de Grid’5000
• Comme un grand cluster de clusters– Pas de nécessité de gérer la sécurité(tous les ports sont ouverts entre ressources Grid’5000 à
rsh, etc. fonctionnelles, possibilité de tester NFS, etc.)àTester les exécutifs et les applications sans interférence
vis-à-vis de la sécurité (isolation des phénomènes)
• Comme un ensemble de clusters dans des domaines d’administration distincts.– Nécessité de gérer la sécurité (au minimum SSH sur
tous les nœuds), idéalement :possibilité de sélectionner le middleware de gestion de la sécurité (Globus 2, 3, 4?)
à Tester les exécutifs et les applications dans des conditions de sécurité réalistes.
12
Grid'5000 and Grid eXplorer 23GdXGdXGrid’5000
Modes d’utilisation
• Partagé (préparation des expériences, taille S)– Pas de ressources dédiées (les utilisateurs se loguent sur
les nœuds et utilisent le système par défaut, pas de middleware, etc.)
• Réservé (à la Planet-Lab, taille M)– Ressources réservées non coordonnées (les utilisateurs
peuvent changer l’OS sur les nœuds qu’ils ont réservés)
• Batch (automatique, taille L ou XL) – Les ressources de Grid’000 sont réservées par les
utilisateurs et les expériences sont en mode automatique)
• Avec planification possible
Grid'5000 and Grid eXplorer 24GdXGdXGrid’5000 Architecture for Grid’5000 prototype
(adressage privé)
Switch Grid
controleur
LABO
Routeur RENATER
MPLS
vlans
Firewall/nat
Switch/routeur labo
Front-endlocal
Routeur RENATER
Routeur RENATER
Routeur RENATER
Site grid5000
Controleur = serveur de boot, replicat LDAP?, DNS secondaire, ordonnancement…Front-end local = routes statiques vers grid5000 et default vers Firewall/natFirewall/nat = 2 interfaces, redirige accès port 22 vers front-end-local
Cluster banaliséAdresses privées
RENATER
13
Grid'5000 and Grid eXplorer 25GdXGdXGrid’5000
Control commands
Lab’sNetwork
LAB/FirewallRouter
Test Cluster
ControlMaster
Site 1
Site 2
Site 3
Users(ssh loggin
+ password)
Firewall/nat
ControlSlave
Test Cluster
Boot server + dhcp
-rsync (kernel,dist)-orders (boot, reset)
ControlSlave
user site
Grid’5000 prototype (Control)
System kernels and distributionsare downloaded from a boot server.They are uploaded by the users assystem images.
Grid'5000 and Grid eXplorer 26GdXGdXGrid’5000 Middleware1(XP)Grid5000
• Grid’5000 control- Computing Environment deployment (Ka-tools)- Experiment automation (security and control)- VGrid « mapping a virtual Grid on a real testbed »- Monitoring, benchmarking, performance characterization and analysis
• Scheduling / distribution- Scheduling : Data transfers, global communications, work stealing,...- Data re-distribution in Grid- Task distribution and load balancing in heterogeneous Grid- Mixed Parallelism (task and data parallelism) - Mixing data management and task scheduling- Hierarchical and Distributed Scheduling
• Fault tolerance- Fault tolerant Grid-RPC (RPC-V)- Hierarchical Fault tolerant MPI (MPICH-V)- Fault tolerant in data-flow approach (Athapascan)
XP: eXPeriments on
14
Grid'5000 and Grid eXplorer 27GdXGdXGrid’5000 Middleware2(XP)Grid5000
• Management - AROMA tool : resources management over a Grid of clusters with
different classes of services- Mobile agents for open Grid management- Management of Grids and hosted services (security, QoS, monitoring &
control, dynamic configuration, …)- Optimization for wide area distributed query processing- Virtualization of data storage on Grids- Automatic Deployment of GridRPC middle tier.- Multiclusters and lightweights Grid resource management (OAR/CIGRI)
• Global Computing/P2P Middleware- Executing Web Services on Desktop Grid Workers (XtremWeb)- Distributing the Coordination in Desktop Grids (XtremWeb)- Harnessing Clusters as parallel Workers- Probabilistic certification in peer-to-peer systems- Large Scale Data Sharing Service based on JXTA (JuxMem)- Experimenting management services for textual document in P2P
systems
• Grid SSI OS and Grid I/O- Grid file system (NFSG)- Grid-aware OS (Kerrighed)- Coupling Computational Grid with Reality Center
Grid'5000 and Grid eXplorer 28GdXGdXGrid’5000
• End Host Communication layer
- Intelligent Usage of NICs for local and wide area communications
- Direct file access over Myrinet : ORFA/NFS and ORFA/LUSTRE
• High performance long distance protocols
- Alternative Transport for very high speed networks (backpressure)
- Differentiated transport with delay control on WAN
- Reliable active and non active Multicast
- Network Bandwidth optimization in Grid (VTHD++, Paco++).
• High Speed Network Emulation
- Automatic Deployment of emulated high speed domains
- Experiment design for grid flow interactions studies
• Grid Networking Layer
- Network Resource and QoS on demand
- Grid Overlay and Programmable Routers
- Measurement Services for network aware middleware
Network(XP)Grid5000
15
Grid'5000 and Grid eXplorer 29GdXGdXGrid’5000
• Component programming on the grid - ProActive : a JAVA library (parallel, distributed, concurrent
computing with security and mobility)- Assessment of scalability, deployment, security and fault
tolerance issues- Hierarchical components architecture- PadicoTM/Paco++ combining parallel and distributed computing
• RPC Environment- Large scale experimentation of the DIET platform (Distributed
Interactive Engineering Toolbox)- Client/Agent/Server model following the GridRPC standard with
distributed scheduling agents
• MPI Environment- Time sharing Grid resources- Migration over Clusters with heterogeneous high speed networks
• Code Coupling- Application coupling with Athapascan- Communication / method invocation rescheduling into ORB (HOMA)- Fluid transfer simulation and geological code with PadicoTM/Paco++
Programming(XP)Grid5000
Grid'5000 and Grid eXplorer 30GdXGdXGrid’5000
Applications(XP)Grid5000• Multi-parametric applications
- ACI GRID-TLSE Project : expertise site for sparse linear algebra- Climate modeling and Global Change- DataGène Project : Functional genomic
• Large scale experimentation of distributed message passing applications– Computational Electromagnetism application (JEM3D)– MECAGRID (ACI GRID project, Smash project-team)
• Massively parallel computations in multi-material fluid mechanics• Study of numerical algorithms for heterogeneous computing platforms
– Grid computing for medical applications (Epidaure project-team)• Interoperable medical image registration grid service
– Optimal design of complex systems (Coprin project-team)• Evaluation of parallel optimization algorithms based on interval
analysis techniques• Study of load balancing strategies on heterogeneous resources
- CFD, astrophysics,… applications- Collaborating tools in virtual 3D environment.
- New algorithms (iterative asynchronous methods)
16
Grid'5000 and Grid eXplorer 31GdXGdXGrid’5000
• 5 démos déclanchées à la demande :– Infrastructure G5k : on/off/reboot, OAR, etc.– Expériences réseaux– Installations/modifications middleware (Globus)– Environnements (DIET, Padico, PM2, Pro-active, XW, etc.)– Applications
• 3 objectifs– Expérimenter/éprouver l’infrastructure Grid’5000– Démo pour les extérieurs/présentations/politiques– Démo pour les grandes conférences (SC)
• 2 contraintes– Démos très visuelles– Démontrant les capacités de Grid5000 en plus de celles des
logiciels
Demos
Grid'5000 and Grid eXplorer 32GdXGdXGrid’5000
Other experiments on Grid’5000
• Grid’5000 will be opened to other French Grid researchers (certainly through a selection procedure) in 2005
• Grid’5000 may be connected to the EU CoreGrid testbed and may be used as an experimental platform for CoreGridresearchers (still through a kind of selection procedure)
• Meeting before or after September 16-17, gathering ACI MD project members with Grid’5000 participants.
17
Grid'5000 and Grid eXplorer 33GdXGdXGrid’5000
Agenda
• Rational
• Grid’5000
• Grid eXplorer (GdX)
Grid'5000 and Grid eXplorer 34GdXGdXGrid’5000
log(cost)
log(realism)
math simulation emulation live systems
SimGridMicroGridBricksNS, etc.Model
Protocol proof
Grid eXplorer
Grid’5000
Grid’5000 + Grid eXplorerCombining two Grid research instruments
Relax RealLife Conditions
Relax ConditionsReproducibility
18
Grid'5000 and Grid eXplorer 35GdXGdXGrid’5000 Grid eXplorer
Analogy with physic instruments
Measurement units
ExperimentalConditions database/injector
1k Cpu Emulator
The Cosmotron. This was thefirst accelerator in the world to send particles to energies in the billion electron volt, or GeV, region, 1953.
Grid'5000 and Grid eXplorer 36GdXGdXGrid’5000
Grid experiments undersynthetic reproducible conditions
1) Build the instrument:- 1K CPU cluster (may be only 600 depending on the budget) - configurable network (Ethernet, Myrinet, others?) - configurable OS (kernel, distribution, etc.)- A set of emulation/simulation tools (existing + new ones)- Multi-users- Located/managed by IDRIS
2) Study impact of Scale in Grid/P2P systems - Address critical issues of Grid system/middleware:
• Programming, Scalability, Fault Tolerance, Scheduling- Address critical issues of Networking :
• High performance transport protocols, Monitoring, Qos- Port and test applications- Investigate original mechanisms
• P2P resources discovery, Desktop Grids
GridGrideXplorereXplorer
19
Grid'5000 and Grid eXplorer 37GdXGdXGrid’5000 Grid eXplorer
Big picture
An experimentalconditions data base
Emulator CoreHardware + Soft:Emulation &Simulation
A set of toolsfor analysis
A set of sensorsin Grid’5000(Renater)
Validation onGrid’5000
Emulab cluster
Grid'5000 and Grid eXplorer 38GdXGdXGrid’5000 Grid eXplorer
Funding plan
Secured:ACI Masse de données: 750 K€ TTC ACI Grid’5000 2004 : 155 K€ TTCINRIA Rocquencourt: 150 K€ TTCINRIA Futurs: 150 K€ TTC
Submitted:SESAME Ile de France: 900 K€ TTCASTRE 2005: 300 K€ TTC
à1.2 M€ securedàPossibly 2,4 M€ in total.
20
Grid'5000 and Grid eXplorer 39GdXGdXGrid’5000 Grid eXplorer
Architecture objectivesEmulation of 1) large scale distributed systems2) Grid made of clusters of clusters
1 K CPU 64 bits+ 64 to 100 Network emulators +Gigabit Ethernet between all nodes (125 Mbits/s)+128 nodes High speed network (Myrinet, Infiniband)+10 to 20 Gbits/s links between cluster partitions+128 nodes High speed network (Infiniband, Myrinet)
128 Noeuds
Switch 48 ports
1Gb/s
10 Gb/s
10 Gb/s
128 Noeuds
1Gb/s
Switch 48 ports
Infiniband
Ethernet
Myrinet
Grid'5000 and Grid eXplorer 40GdXGdXGrid’5000
IRISA, Institut de Recherche en Informatique et Systèmes Aléatoires
CEA, Direction des Technologies de l'Information (Saclay)
IBCP (UMR5086), Institut de Biologie et Chimie des Protéines
LABRI (UMR 5800), Laboratoire Bordelais de Recherche en Informatique
LIP6 (UMR 7606), Laboratoire d'Informatique de Paris 6
INRIA Sophia Antipolis, UNSA, I3S-CNRS
LIFL (ESA 8022), Laboratoire d’Informatique Fondamentale de Lille
LIP-ENS Lyon (URM 5668), Laboratoire de l'Informatique du Parallélisme
LORIA (UMR 7503), Laboratoire lorrain de recherche en informatique et sesapplications
LAAS-CNRS (UPR 8001), Laboratoire d'Analyse et d'Architecture des Systèmes
LRI (UMR 8623), Laboratoire de Recherche en Informatique, Université de Paris-sud
LaRIA (UPRES EA 2083), Laboratoire de Recherche en Informatique d’Amiens, Université de Picardie Jules Verne
IMAG, ID (UMR 5132), Laboratoire d’Informatique et Distribution, Université de Grenoble
ACI Masse de Données DGdX13 Labs
21
Grid'5000 and Grid eXplorer 41GdXGdXGrid’5000
Alain Lecluse (IBCP),Alexandre Genoud, (Projet OASIS, INRIA Sophia Antipolis)Antoine Vernois, (IBCP)Arnaud Contes, (Projet OASIS, INRIA Sophia Antipolis)Aurélien Bouteiller, (LRI),Bénedicte Legrand (LIP6)Brice Goglin (doctorant), (INRIA LIP RESO),Brigitte Rozoy (LRI)Cécile Germain (LRI) Christophe Blanchet, (IBCP)Christophe Cérin, (Amiens, Laria)Christophe Chassot, (LAAS-ENSICA),Colette Johnen (LRI)CongDuc Pham, (LIP)Cyril Randriamaro, (LaRIA)Denis Caromel, (Projet OASIS, INRIA Sophia Antipolis)Eddy Caron, (LIP/ENS Lyon), Emmanuel Jeannot, (Loria)Eric Totel (Supélec Rennes)Fabrice Huet, (Projet OASIS, INRIA Sophia Antipolis)Faycal Bouhaf (DEA)(INRIA LIP RESO), Franck Cappello, (LRI)Françoise Baude, (Projet OASIS, INRIA Sophia Antipolis)Frédéric Desprez, (LIP/INRIA Rhône-Alpes), Frédéric Magniette, (LRI)Gabriel Antoniu, (IRISA/INRIA Rennes), George Bosilca, (LRI)Georges Da Costa, (ID-IMAG),Géraud Krawezik (LRI)Gil Utard, (LaRIA)Gilles Fedak, (LRI)Grégory Mounié (ID-IMAG)Guillaume Auriol, (LAAS-ENSICA),Guillaume Mercier, (LaBRI),Guy Bergère, (LIFL, GrandLarge INRIA Futur)Haiwu He, (LIFL, GrandLarge INRIA Futur)Isaac Scherson, (LIFL, GrandLarge, INRIA Futur)Jens Gustedt (LORIA & INRIA Lorraine) Joffroy Beauquier (LRI)Johanne Cohen, (Loria)Kavé Salamatian (LIP6), Lamine Aouad (LIFL, GrandLarge, INRIA Futur)
Laurent Baduel, (Projet OASIS, INRIA Sophia Antipolis)Laurent Dairaine, (LAAS)Luc Bougé, (IRISA/ENS CachanAntenne de Bretagne), Luciana Arantes (LIP6), Ludovic Mé, (Supélec Rennes)Luis Angelo Estefanel, (ID-IMAG)Marin Bertier (LIP6), Mathieu Goutelle, (KIP)Mathieu Jan, (IRISA) Michel Diaz, (LAAS-ENSICA),Michel Koskas (Amiens, Laria)Nicolas Lacorne, (IBCP)Nicolas Larrieu (LAAS-ENSICA), Nicolas Viovy (CEA-DSM-LSCE)Oleg Lodygensky, (LRI)Olivier Richard (ID-IMAG), Olivier Soyez, (LaRIA)Pascal Berthou, (LAAS-ENSICA),Pascale Primet (LIP), Pascale Vicat-Blanc Primet, (INRIA LIP RESO),Patrick Sénac, (LAAS-ENSICA),Philippe d'Anfray (CEA-DTI/SISC), Philippe Gauron, (LRI)Philippe Owezarski (LAAS)Pierre Fraigniaud, (LRI)Pierre Lemarini, (LRI)Pierre Sens (LIP6 / INRIA),Pierre-André Wacrenier, (LaBRI),Raymond Namyst, (LaBRI),Samir Djilali, (LRI)Sébastien Tixeuil (LRI)Serge Petiton, (LIFL, GrandLarge INRIA Futur)Stéphane Vialle (Supélec) Tanguy Pérennou (LAAS)Thierry Gayraud, (LAAS-ENSICA),Thierry Priol, (IRISA)Thomas Hérault, (LRI)Timur Friedman (LIP6)Vincent Danjean, (LaBRI),Vincent Néri (LRI)
Grid'5000 and Grid eXplorer 42GdXGdXGrid’5000
4 Research Topics
The 4 research topics and their leaders:
-Infrastructure (Hardware + system), Olivier Richard (ID-IMAG)-Emulation, Pierre Sens (LIP6)-Network, Pascale Primet (LIP, Inria RESO)-Applications. Christophe Cérin (Laria)
22
Grid'5000 and Grid eXplorer 43GdXGdXGrid’5000
XII.16 Grid coupled sys.XXXII.15 NG. Internet
XXXII.14 SecurityXXXII.13 Reliability (adversary)XXII.12 P2P storageXII.11 BioinformatiqueXXII.10 Cellul. automaton
XXII.9 Uni and multicastXXII.8 Data sharing
XII.7 Comm. Optimizat.XXII.6 SchedulingXXII.5 Data base
XII.4 DHTXXII.3 Fault tolerance
XXII.2 Mobile objectsXXXII.1 Engineering tech.
XXXI.8 Internet Emul.XI.7 Communication
XI.6 Heterogeneity emulXXXI.5 Network emul.
XI.4 Emul driven SimulXXI.3 Virt. Techniques XXI.2 Virtual Grid
XXXXI.1 PlatformApplicationNetworkEmulationInfrastructureExperiences
Grid'5000 and Grid eXplorer 44GdXGdXGrid’5000
Budget: 754 000 € TTC (Procédure lancée Lundi 5 Avril).Tranche ferme : environ 400 processeurs (200 nœuds) Tranche conditionnelle 1 : réseau haute performance (Myrinet ouInfiniband)Tranche conditionnelle 2 : augmentation du nombre de processeurs
Financements complémentaires : Grid’5000 2004, INRIA ~450 K€ TTC
Détails tranche ferme :-Nœuds de calcul :
~400 processeurs (Opteron 2Ghz ou Xeon 64 bits à 3,6 Ghz)-Nœuds réseaux :
~32 nœuds monoprocesseurs (IA32 à 3Ghz) -Nœuds frontaux :
2 nœuds (biprocesseurs Opteron ou Xeon 64 bits, 1 To de disque)-Réseau :
Commutateur Gigabit/s (avec bande passante minimale de 125 Mbits par processeurs)
Matériel
23
Grid'5000 and Grid eXplorer 45GdXGdXGrid’5000
Question de faisabilité technique : exécuter 100 PC virtuels sur 1 PC réelà 10 K PCv sur 100 CPUs (LRI), 100K PCv sur 1K CPUs (GdX),~1M PCv sur (Grid5000) Non temps réel
Etude de Vserver : www.linux-vserver.org (virtual host bsd)
Principe pour n PCv: 1 seul noyau (drivers), n root file systems, z processus (z>n)La table des processus « inclut » un champ supplémentaire « PCv »
Emulation dans GdX
/vserver/vs1/…
/vserver/vs2/…
Root dir pour chaque Vserver
1 noyau
driver
Processus PCv1
Processus PCv2
Overhead de Vserver ?Le scheduleur à équité entre les PCv ?
Parties communesParties spécifiques
Grid'5000 and Grid eXplorer 46GdXGdXGrid’5000
Question de faisabilité technique : exécuter 100 PC virtuels sur 1 PC réel10 K PCv sur 100 CPUs (LRI), 100K PCv sur 1K CPUs (GdX),~1M PCv sur (Grid5000) Non temps réel
à Problème : pas assez d’adresses routables à adresses privéesNAT ou Vtun : vtun.sourceforge.net (tunnels virtuels par-dessus TCP/IP)Vtun implémente un « Vlan » entre Vservers de PC différents
Emulation dans GdX
noyau
driver
NIC129.175.7.124
192.168.100.2 192.168.100.99
Overhead de Vtun ou NAT ?Equité ??? à devra être vérifiée dans GdX
Eth0 :
devicetun0 :
192.168.100.1
Un Vservera une adresse IP
noyau
driver
NIC129.175.7.125
192.168.200.2 192.168.200.99
Eth0 :
devicetun0 :
192.168.200.1
24
Grid'5000 and Grid eXplorer 47GdXGdXGrid’5000
Question de faisabilité technique : exécuter 100 PC virtuels sur 1 PC réel10 K PCv sur 100 CPUs (LRI), 100K PCv sur 1K CPUs (GdX),~1M PCv sur (Grid5000) Non temps réel
à Problème : Emulation de la topologie (plusieurs tunnels) et des perfsréseaux (Dummynet et NistNet)
Emulation dans GdX
noyau
driver
NIC
DummyNetNistNetModifiés ??
Toujours les questions d’overhead et d’équité
noyau
driver
NIC129.175.7.124
192.168.100.2 192.168.100.99
Eth0 :
device tun0 :192.168.100.1
Un Vservera une adresse IP
Où placer DummyNet ou Nistnet ?-> sur un Vserver ???
Grid'5000 and Grid eXplorer 48GdXGdXGrid’5000
Question de faisabilité technique : exécuter 100 PC virtuels sur 1 PC réel10 K PCv sur 100 CPUs (LRI), 100K PCv sur 1K CPUs (GdX),~1M PCv sur (Grid5000)
Emulation dans GdX
Imaginons que ce soit techniquement faisable : Validation ? (méthode : quels paramètres mesurer, quels tests, etc.)
Facteur de ralentissement par rapport au temps réel ? (variation dynamique ? Compréhension de l’impact de chaque brique: Vserver, Vtun, Dummynet/NistNet ?, etc.)
25
Grid'5000 and Grid eXplorer 49GdXGdXGrid’5000 Grid eXplorer and Grid’5000
interactionsDesign
Test/CheckValidation under
Real Life ConditionsGrid’5000
DesignTest/Check Validation of
Scalability and underSynthetic Conditions
Grid eXplorer
Integration to standard middleware, Deployment,
Performance
Scalability, Fault tolerance
Grid'5000 and Grid eXplorer 50GdXGdXGrid’5000
Summary
• Researches in Grid and P2P need large scale platforms– To study protocols, systems, middleware, programming models
and applications in real life OR reproducible experimental conditions
• Grid’5000 and Grid eXplorer- Will be experimental platforms for Grid& Network researchers
(like particle accelerator for physicists)- A nation wide platform and a large scale emulator- Strong relations between this two projects (researchers are the
same persons for the two projects)- Hardware should be installed by November 2004- Prototypes (security and control) should work for November 2004- Might be opened for experiments in early 2005
26
Grid'5000 and Grid eXplorer 51GdXGdXGrid’5000
Q&A