Calcolo per LHCb Italia
Download
Report
Transcript Calcolo per LHCb Italia
The 2001 Tier-1
prototype for LHCb-Italy
Vincenzo Vagnoni
Genève, November 2000
Outline
2001 LHCb Italian Tier-1
prototype architecture
Linux diskless nodes howto
Network attached storage
Tests made in Bologna
Conclusions
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
2001 LHCb Italian Tier-1
protoype
15 CPU 2001 (700 SI95) Tier-1 computer
farm (at the beginning of the year, with
further increase if we can show that more
CPU power is needed for the italian groups)
15 single-processor motherboards, 256 MB
RAM each or more, rack-mounted, diskless, with
redundant power supply and cooling.
1 TB IDE disk array in RAID-5 configuration,
hosted in a NAS (Network Attached Storage)
unit (also to be increased if needed).
100 Mbps Ethernet Switch.
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
WAN
Router
Switch
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
Motherboards
NAS
Network Attached Storage
2001 Tier-1 Computer Farm Prototype
Components (Motherboards,
Racks, NAS, Switches)
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
Advantage of diskless nodes
Easy Operating System installation and maintainance
Adding a machine to the farm requires to run a simple script on
the disk server
The Operating Systems are centralized and accessible through
the file system of the disk server
Enhanced disk fault tolerance
The disk server can have large hard disk arrays with redundant
information by using a RAID 5
No need of UPS battery for the client nodes
Only the disk server needs UPS, no damage if a client is
powered off
System compactness
The nodes are basically simple motherboards with integrated
ethernet cards
They can be arranged in racks, ~25 nodes in only 1 m2.
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
Working concept (using for
example a PXE BootROM)
At boot time the diskless client sends its ethernet card MAC
address by making a bootp (or dhcp) broadcast request over the
LAN
The diskless client ethernet controller must be equipped with a
BootROM (for example a PXE, Pre-eXecution Environment,
compliant one)
The bootp (or dhcp) server replies assigning the IP address to the
client and passing the file name of the boot code on the server
Using a PXE BootROM the boot code is a small loader
(pxelinux.bin), provided with the syslinux package, also able to
read a configuration file on the server with some additional
information (i.e. the arguments to pass to the linux kernel)
The client makes a TFTP connection to the server asking to
download the PXE loader and executes it
The loader tftp-downloads the linux kernel, and executes it
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
Further details…
The linux kernel needs to be compiled with root-over-NFS
support to be able to mount the root filesystem through
NFS
The Ethernet Card driver must be compiled resident into
the kernel (in principle could be also provided by an initial
ramdisk)
The clients can share some directories of the linux
filesystem (/usr, /opt, …), but need some others to be
private (at least /etc, /dev, /var, /tmp)
Optionally some non-shareable directories (for example
/dev, /var, /tmp) can be mounted on ramdisks
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
Network Attached Storage
OpenNAS RS15 RaidZone
File Server supports NFS (also Windows SMB, but
fortunately we don’t need this guy)
Built-in Web Server and DNS Server
R3 RAID5 (Redundancy with Rapid Recovery)
Java-based remote interface for all OpenNAS functions,
including disk array configuration
15 hot-swappable drives bays, with 15 Ultra ATA/100 drives
(80 GB each) + 1 hot-swappable spare
Dual PIII 800MHz CPUs and 256 MB ECC RAM
Dual 100BaseT network connections (Gigabit optional)
Dual redundant 300W hot-swappable power supplies
Possibility of two RAIDZONE disk array expansions (up to 3
TB)
Cost: $20850 (RS15) + optional $19595 (1 TB expansion)
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
Tests made in Bologna
Hardware setup
4 PC HP Vectra, single processor PIII 500 MHz, 256 MB
RAM, one acting as root filesystem disk server and three
as diskless clients
1 PC HP Kayak, dual processor PIII 733 MHz, 256 MB
RAM, as an additional client with different architecture
Software setup
Linux RedHat 6.2, upgraded with kernel 2.2.16, or
optionally (to be choosen at boot time) Linux SuSE 6.4
Tests made to evaluate:
Complexity of installation and administration
Performance of the system (especially interesting due to
the absence of the system swap area, even if in principle
a network based one could be arranged)
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
Installation tests results
Installation trivial
Once the server system is configured, adding a new
client takes 1 minute
All the installations are by default identical
No additional network activity observed (monitored directly
into the switch)
The OSes rarely need to access non cached information
on the server (server disk almost completely inactive)
Robust system
Simulated network failures for few seconds do not
interfere with the OS work
Powering off the client machines (without a shutdown!)
doesn’t create any problem at all (no need to check the
disk data integrity at reboot)
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
Performance test results
Performance tested by running two HERA-B MonteCarlo
simulation (ARTE) jobs on a dual processor 733 MHz, 256
MB RAM, HP Kayak with diskless installation (running Linux
SuSE 6.4 in this case)
Performance compared with an identical machine with nondiskless installation and local disk swap area
Identical performances! The MC jobs were tuned to allocate
~100 MB RAM each
Even with 256 MB RAM, two heavy jobs can run together
without using a local disk swap area at all
Conclusion: no problem for a single-processor motherboard
equipped with 256 MB RAM running one single job
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni
Conclusions
The 2001 LHCb Tier-1 prototype
architecture is presented
Based on diskless nodes configuration with
a Network Attached Storage disk server
and a 100 Mbps switched ethernet
System proof of principle tested
This kind of system can be an interesting
contribution to the LHC computing
community
It saves time, room and money!
The 2001 Tier-1 prototype for LHCb-Italy
Vincenzo Vagnoni