Installing Worker Node WN - LCG system

James Werner

Installation tasks.

  1. Network configuration
  2. Adapt Scientific Linux to LCG
  3. Java installation from RPM
  4. Node synchronisation
  5. Configuration tool:YAIM
  6. Site configuration file
  7. Installing Apt-get
  8. Installing Torque Worker Nodes (WN)
  9. AFS configuration
  10. Configuration WNs


Network configuration

Add to /etc/hosts after the local definition the following hosts: vi hosts # Do not remove the following line, or various programs # that require network functionality will fail. 127.0.0.1 localhost.localdomain localhost 194.36.3.43 IS_host IS_ip IS_host IS_nick RB_ip RB_host RB_nick CE_ip CE_host CE_nick WN_ip WN_host WN_nick ... UI_ip UI_host UI_nick SE_ip SE_host SE_nick FileServer_ip FS_host FS_nick ...

CAP NFS auto mounting:

At cap.hep.man.ac.uk, as root user: vi /etc/exports exportfs -a exportfs -at each network computer: cd /etc vi auto.master # # $Id: auto.master,v 1.3 2003/09/29 08:22:35 raven Exp $ # # Sample auto.master file # This is an automounter map and it has the following format # key [ -mount-options-separated-by-comma ] location # For details of the format look at autofs(5). #/misc /etc/auto.misc --timeout=60 #/misc /etc/auto.misc #/net /etc/auto.net /exp_software /etc/auto.experiment -rw,intr,noquota,nosuid,noatime vi auto.experiment mkdir /exp_software ln -s /exp_software /nfs /etc/rc.d/init.d/autofs restart /sbin/service nfs restart Shutting down NFS mountd: [ OK ] Shutting down NFS daemon: [ OK ] Shutting down NFS quotas: [ OK ] Shutting down NFS services: [ OK ] Starting NFS services: [ OK ] Starting NFS quotas: [ OK ] Starting NFS daemon: [ OK ] Starting NFS mountd: [ OK ] ls /nfs/work 00_index.txt babar lost+found newstore SE0 store users This mount point will be used by kanaccess.conf to assembly data physical file name for babar experiment. The direct mounting of CAP can be done as: mkdir /nfs mkdir /nfs/work mount cap:/work /nfs/work ls /nfs/work 00_index.txt babar lost+found newstore SE0 store users

Adapt Scientific Linux to LCG

This is necessary to avoid conflict with message passing packages in LCG: rpm -e lam-6.5.9-1

Java installation from RPM

Verify if java is installed: ls /usr/java j2sdk1.4.2_10 ls /usr/java/j2sdk1.4.2_10 bin demo jre LICENSE README src.zip COPYRIGHT include lib man README.html THIRDPARTYLICENSEREADME.txt and get the address: /usr/java/j2sdk1.4.2_10 If java is not installed:
http://java.sun.com/j2se/1.4.2/download.html Download J2SE SDK 1.4.2_10 accept RPM in self extracting file: j2sdk-1_4_2_10-linux-i586-rpm.bin The file was saved in: cap.hep.man.ac.uk:/nfs/work/users/jamwer/GRID scp j2sdk-1_4_2_10-linux-i586-rpm.bin root@cap.hep.man.ac.uk:/nfs/work/users/jamwer/GRID j2sdk-1_4_2_10-linux-i586-rpm.bin 100% 34MB 4.2MB/s 00:08 cd /nfs/teste/users/jamwer/GRID chmod +x j2sdk-1_4_2_10-linux-i586-rpm.bin ls -l total 34508 -rwxr-xr-x 1 root root 35292766 Nov 1 14:48 j2sdk-1_4_2_10-linux-i586-rpm.bin cd /usr/local ls bin etc games include lib libexec sbin share src mkdir java cd java /nfs/teste/users/jamwer/GRID/j2sdk-1_4_2_10-linux-i586-rpm.bin Sun Microsystems, Inc. Binary Code License Agreement for the JAVATM 2 SOFTWARE DEVELOPMENT KIT (J2SDK), STANDARD EDITION, VERSION 1.4.2_X ... (LFI#129530/Form ID#011801) Do you agree to the above license terms? [yes or no] yes Unpacking... Checksumming... 0 0 Extracting... UnZipSFX 5.40 of 28 November 1998, by Info-ZIP (Zip-Bugs@lists.wku.edu). inflating: j2sdk-1_4_2_10-linux-i586.rpm Done. ls j2sdk-1_4_2_10-linux-i586.rpm cd /usr/local/java ls j2sdk-1_4_2_10-linux-i586.rpm mv j2sdk-1_4_2_10-linux-i586.rpm /nfs/work/users/jamwer/GRID/ cd .. rmdir java rpm -ivh /nfs/work/users/jamwer/GRID/j2sdk-1_4_2_10-linux-i586.rpm Preparing... ########################################### [100%] 1:j2sdk ########################################### [100%] ls /usr/java/j2sdk1.4.2_10 bin demo jre LICENSE README src.zip COPYRIGHT include lib man README.html THIRDPARTYLICENSEREADME.txt Set JVM address (verify the correct version): vi /etc/profile export JAVA_LOCATION=/usr/java/j2sdk1.4.2_10 export JAVA_LIBDIR=$JAVA_LOCATION/lib export JNI_LIBDIR=$JAVA_LIBDIR export JVM_LIBDIR=$JAVA_LIBDIR PATH=$JAVA_LOCATION/bin:$PATH . /etc/profile java Usage: java [-options] class [args...] (to execute a class) ...

Node synchronisation

ping ntp2a.mcc.ac.uk PING maverick.mcc.ac.uk (130.88.202.49) 56(84) bytes of data. 64 bytes from maverick.mcc.ac.uk (130.88.202.49): icmp_seq=0 ttl=63 time=0.419 ms mv ntp.conf ntp.conf.orig vi ntp.conf cat ntp.conf server ntp2a.mcc.ac.uk server ntp2b.mcc.ac.uk server ntp2c.mcc.ac.uk server ntp2d.mcc.ac.uk cat /etc/ntp/step-tickers ntp2a.mcc.ac.uk ntp2b.mcc.ac.uk ntp2c.mcc.ac.uk ntp2d.mcc.ac.uk Start ntp: ntpdate ntp2a.mcc.ac.uk 1 Nov 16:01:10 ntpdate[2426]: adjust time server 130.88.202.49 offset -0.370997 sec service ntpd start ntpd: Synchronizing with time server: [ OK ] Starting ntpd: [ OK ] chkconfig ntpd on

Configuration tool:YAIM

cd /nfs/work/users/jamwer/GRID rpm -ivh lcg-yaim-2.6.0-9.noarch.rpm Preparing... ########################################### [100%] 1:lcg-yaim ########################################### [100%] cd /opt/lcg ls bin yaim ls bin configure_node install_node cd /opt/lcg/yaim ls examples functions scripts

Site configuration file

vi wn-list.conf WN_host WN_host WN_host WN_host WN_host WN_host vi users.conf 24014:babar001:2739:babar:babar:: 24022:babar002:2739:babar:babar:: 24023:babar003:2739:babar:babar:: 24024:babar004:2739:babar:babar:: 24028:babar005:2739:babar:babar:: 24029:babar006:2739:babar:babar:: 24030:babar007:2739:babar:babar:: 24031:babar008:2739:babar:babar:: 24032:babar009:2739:babar:babar:: 24033:babar010:2739:babar:babar:: 24034:babar011:2739:babar:babar:: 24038:babar012:2739:babar:babar:: 24044:babar013:2739:babar:babar:: 24045:babar014:2739:babar:babar:: 24046:babar015:2739:babar:babar:: 24047:babar016:2739:babar:babar:: 24048:babar017:2739:babar:babar:: 24049:babar018:2739:babar:babar:: 24050:babar019:2739:babar:babar:: 24051:babar020:2739:babar:babar:: vi /nfs/work/users/jamwer/GRID/JSTestBed.def Verify the file
/nfs/work/users/jamwer/GRID/JSTestbed.def for complete implementation. The information necessary to configure the complete site is: Latitude and longitude: www.infoplease.com/ipa/A0001769.html MY_DOMAIN=hep.man.ac.uk CE_HOST=CE_nick.$MY_DOMAIN SE_HOST=SE_nick.$MY_DOMAIN RB_HOST=RB_nick.$MY_DOMAIN PX_HOST=IS_nick.$MY_DOMAIN #PX_HOST=lcgrbp01.gridpp.rl.ac.uk BDII_HOST=IS_nick.$MY_DOMAIN #BDII_HOST=lcgbdii02.gridpp.rl.ac.uk MON_HOST=IS_nick.$MY_DOMAIN #MON_HOST=bohr0002.tier2.hep.man.ac.uk REG_HOST=lcgic01.gridpp.rl.ac.uk # there is only 1 central registry for now WN_LIST=/opt/lcg/yaim/wn-list.conf USERS_CONF=/opt/lcg/yaim/users.conf FUNCTIONS_DIR=/opt/lcg/yaim/functions JAVA_LOCATION="/usr/java/j2sdk1.4.2_10" MYSQL_PASSWORD=carnaval SITE_NAME=JSTestBed SITE_LOC="Manchester, UK" SITE_LAT=53.30 SITE_LONG=2.15 SITE_WEB="http://www.hep.man.ac.uk" SITE_TIER="TIER 3" SITE_SUPPORT_SITE="tier2.hep.man.ac.uk" CE_CLOSE_SE="SE1" CE_CLOSE_SE1_HOST=$SE_HOST CE_CLOSE_SE1_ACCESS_POINT=/storage CE_CLOSE_SE2_HOST=another-se.$MY_DOMAIN CE_CLOSE_SE2_ACCESS_POINT=/somewhere BDII_HTTP_URL="http://IS_host/lcg-testbed.conf" #BDII_HTTP_URL="http://grid-deployment.web.cern.ch/grid-deployment/gis/lcg2-bdii/dteam/lcg2-all-sites.conf" VO_SW_DIR=/nfs/work/lcg VO_BABAR_SW_DIR=$VO_SW_DIR/babar VO_BABAR_DEFAULT_SE=$SE_HOST VO_BABAR_STORAGE_DIR=$CE_CLOSE_SE1_ACCESS_POINT/babar VO_BABAR_QUEUES="babar" VO_BABAR_USERS=ldap://babar-vo.gridpp.ac.uk/ou=babar,dc=gridpp,dc=ac,dc=uk

Installing Apt-get

Verify if tool already installed: apt-get apt 0.5.15cnc6 for linux i386 compiled on Jun 1 2004 22:58:58 Usage: apt-get [options] command apt-get [options] install|remove pkg1 [pkg2 ...] apt-get [options] source pkg1 [pkg2 ...] ... In this case, available in standard installation.

Installing Torque Worker Nodes (WN)

adduser globus passwd globus Installing sar monitor: http://linuxsoft.cern.ch/repository//i386/sysstat-5.0.5-5.rhel3.i386.html rpm -ivh /nfs/work/users/jamwer/GRID/sysstat-5.0.5-5.rhel3.i386.rpm warning: /nfs/work/users/jamwer/GRID/sysstat-5.0.5-5.rhel3.i386.rpm: V3 DSA signature: NOKEY, key ID a7048f8d Preparing... ########################################### [100%] 1:sysstat ########################################### [100%] cp /nfs/work/users/jamwer/GRID/monitor /bin cp /nfs/work/users/jamwer/GRID/anal1 /bin /opt/lcg/yaim/scripts/install_node /nfs/work/users/jamwer/GRID/JSTestBed.def lcg-WN_torque ... The AFS cache is configured for 100 MB. Edit the /usr/vice/etc/cacheinfo file to change this before running AFS for the first time. You should also set your home cell in /usr/vice/etc/ThisCell. Also, you may want to edit /etc/pam.d/login and possibly others there to get an AFS token on login. Put the line: auth sufficient /lib/security/pam_afs.so try_first_pass ignore_root before the one for pwdb. ... ------------------------------------------------------------------ Globus Toolkit 3.2.1 has been installed in /usr/local/globus-3.2.1 directory To set up Globus environment, first set your GLOBUS_LOCATION to /usr/local/globus-3.2.1 and second, depending on your shell, run {csh} source /usr/local/globus-3.2.1/etc/globus-user-env.csh {sh} . /usr/local/globus-3.2.1/etc/globus-user-env.sh For documentation on Globus Toolkit 3.2.1 please refer to http://www-unix.globus.org/toolkit/docs/3.2/index.html About SGE integration with GT 3.2 see http://www.lesc.ic.ac.uk/projects/epic-gt3-sge.html ------------------------------------------------------------------ ... Done.

AFS configuration

cat /usr/vice/etc/cacheinfo /afs:/usr/vice/cache:300000 cat /usr/vice/etc/ThisCell hep.man.ac.uk cat /usr/vice/etc/CellServDB >hep.man.ac.uk #Manchester HEP 194.36.2.3 #afs1.hep.man.ac.uk 194.36.2.4 #afs2.hep.man.ac.uk 194.36.2.6 #afs4.hep.man.ac.uk /etc/rc.d/init.d/afs restart ### AFS script config: ajusteAFS cp /nfs/work/users/jamwer/GRID/cacheinfo /usr/vice/etc/cacheinfo cp /nfs/work/users/jamwer/GRID/ThisCell /usr/vice/etc/ThisCell cp /nfs/work/users/jamwer/GRID/CellServDB /usr/vice/etc/CellServDB /etc/rc.d/init.d/afs restart

Configuration WNs

/opt/lcg/yaim/scripts/configure_node /nfs/work/users/jamwer/GRID/JSTestBed.def WN_torque Configuring config_upgrade ... Configuring config_ldconf ... /sbin/ldconfig: /opt/lcg/lib/liblcg-info-api-ldap.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpy.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpl.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigtcl8.so.0 is not a symbolic link Configuring config_sysconfig_edg ... Configuring config_sysconfig_globus ... Configuring config_sysconfig_lcg ... Configuring config_crl ... Configuring config_rfio ... rfiod already stopped: [FAILED] Configuring config_globus ... creating globus-sh-tools-vars.sh creating globus-script-initializer creating Globus::Core::Paths checking globus-hostname Done Creating... /opt/globus/etc/grid-info.conf Done Creating... /opt/globus/sbin/SXXgris /opt/globus/libexec/grid-info-script-initializer /opt/globus/libexec/grid-info-mds-core /opt/globus/libexec/grid-info-common /opt/globus/libexec/grid-info-cpu* /opt/globus/libexec/grid-info-fs* /opt/globus/libexec/grid-info-mem* /opt/globus/libexec/grid-info-net* /opt/globus/libexec/grid-info-platform* /opt/globus/libexec/grid-info-os* /opt/globus/etc/grid-info-resource-ldif.conf /opt/globus/etc/grid-info-resource-register.conf /opt/globus/etc/grid-info-resource.schema /opt/globus/etc/grid.gridftpperf.schema /opt/globus/etc/gridftp-resource.conf /opt/globus/etc/gridftp-perf-info /opt/globus/etc/grid-info-slapd.conf /opt/globus/etc/grid-info-site-giis.conf /opt/globus/etc/grid-info-site-policy.conf /opt/globus/etc/grid-info-server-env.conf /opt/globus/etc/grid-info-deployment-comments.conf Done Creating gatekeeper configuration file... Done Creating state file directory. Done. Reading gatekeeper configuration file... Determining system information... Creating job manager configuration file... Done Setting up fork gram reporter in MDS ----------------------------------------- Done Setting up pbs gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for qstat... /usr/bin/qstat updating cache /dev/null creating ./config.status creating /opt/globus/libexec/globus-script-pbs-queue Done Setting up condor gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for condor_q... no Setting up lsf gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for lsload... no loading cache ./config.cache checking for mpirun... (cached) /usr/bin/mpirun creating ./config.status creating fork.pm loading cache /dev/null checking for mpirun... /usr/bin/mpirun checking for qdel... /usr/bin/qdel checking for qstat... /usr/bin/qstat checking for qsub... /usr/bin/qsub checking for pbsnodes... /usr/bin/pbsnodes checking for ssh... /usr/bin/ssh updating cache /dev/null creating ./config.status creating /opt/globus/lib/perl/Globus/GRAM/JobManager/pbs.pm loading cache /dev/null checking for condor_submit... no loading cache /dev/null loading cache ./config.cache creating ./config.status creating grid-cert-request-config creating grid-security-config Configuring config_lcgenv ... Configuring config_replica_manager ... Configuring config_users ... Configuring config_sw_dir ... Configuring config_java ... Configuring config_rgma_client ... Welcome to the R-GMA setup utility ---------------------------------- Configuration written to: /opt/glite/etc/rgma/rgma.conf Configuring config_workload_manager_env ... Configuring config_fts_client ... Configuring config_glite_env ... Configuring config_gsissh ... Configuring gsi_openssh ------------------------------------------------------------ Executing... Notes: o Privilege separation is off. o GSI-OpenSSH website is . ------------------------------------------------------------ Finished configuring gsi_openssh. Configuring config_torque_client ... Stopping pbs_mom: [ OK ] Starting pbs_mom: [ OK ] Configuration Complete
Top

Last modified:
Copyright 2004 Manchester University
Feedback to: jamwer@hep.man.ac.uk