Rollout the minimal LCG system

James Werner

These are the installation procedures for a complete installation site LCG independent of any external resources (Deliverable A3.3 - 2005Q4).

Thanks to Sabah Salin and Alessandra Forti for support and discussions.

Installation tasks.

  1. Hardware architecture
  2. Requesting certificates
  3. Network configuration
  4. Adapt Scientific Linux to LCG
  5. Java installation from RPM
  6. Node synchronisation
  7. Configuration tool:YAIM
  8. Site configuration file
  9. Installing Apt-get
  10. Installing information system level 0 (BDII), monitor (MON), and long term proxy (PX)
  11. Installing Resource Broker (RB)
  12. Installing Computer Element (CE)
  13. Installing Torque Worker Nodes (WN)
  14. Installing Storage Element (SE)
  15. Installing User Interface (UI)
  16. AFS configuration
  17. Configuration BDII PX MON
  18. Configuration RB
  19. Configuration CE
  20. Configuration WNs
  21. Configuration UI
  22. Configuration SE
  23. Testing LCG middleware
  24. Operational procedures
  25. The firewall issue


1. Hardware/Software architecture.

Savannah portal:
http://savannah.cern.ch/ http://www.gridpp.ac.uk/deployment/links.html LCG rollout archives: http://www.listserv.rl.ac.uk/archives/lcg-rollout.html Download releases: http://lcg.web.cern.ch/LCG/Sites/releases.html Wiki: http://goc.grid.sinica.edu.tw/gocwiki/FrontPage LCG software is a set of specific functional computers running modules that interact with each other during the job submission, running and deliver processes. The processes are:
  1. User interface (UI): It is the module that allows users interact with grid. It contains programs to submit jobs and commands to the grid, check job status, recover outputs, etc. To install, perform tasks 1 to 9,14, 16, 21, and 23.
  2. Resource broker (RB): It is responsable to match the jobs with resources, implement policies, and send jobs for processing in a remote resource (Computer element).The definition of resources are written in the JDF file using ClassAds (see condor manual and LCG user manual for more details). To install, perform tasks 1 to 9,11, 16, 18, and 23.
  3. Computer element (CE): it is the batch manager (PBS) that allocates resources in the farm to run users' software. There are interactive jobs, parallel jobs, and simple jobs. To install, perform tasks 1 to 9,12, 16, 19, and 23.
  4. Worker Nodes (WN): it is the computer that will run your software. It contains the packages installed, libraries, and necessary local data for processing. To install, perform tasks 1 to 9,13, 16, 20, and 23.
  5. Storage Element (SE): it is the mass storage element in grid. To install, perform tasks 1 to 9,15, 16, 22, and 23.
  6. Berkeley Database (BDII): it is the catalog of resources published in the grid. It is distributed in two levels. The first catalog all CE BDII entries. The second, in the CE, contains the information about the farm managed by the CE.
  7. Proxy manager (PX): stores proxies for long time jobs in the grid.
CE and RB never could be installed in the same computer. There are installations where RB and BDII are together. These are the conputers with standard Scientific Linux 3.0.4 installation from www.scientificlinux.org. The log is available here. IP Hostname nick Function Cert 194.36.3.43 bf30.hep.man.ac.uk bf30 BDII PX MON * 194.36.3.44 bf31.hep.man.ac.uk bf31 RB * 194.36.3.45 bf32.hep.man.ac.uk bf32 CE * 194.36.3.46 bf33.hep.man.ac.uk bf33 WN 194.36.3.47 bf34.hep.man.ac.uk bf34 WN 194.36.3.48 bf35.hep.man.ac.uk bf35 WN 194.36.3.49 bf36.hep.man.ac.uk bf36 WN 194.36.3.50 bf37.hep.man.ac.uk bf37 WN 194.36.3.51 bf38.hep.man.ac.uk bf38 WN 194.36.3.52 bf39.hep.man.ac.uk bf39 UI 194.36.3.53 bf40.hep.man.ac.uk bf40 SE * * e-Science certificate are required in these machines The data server using NFS is cap.hep.man.ac.uk. Most of the current servers get delivered with two ethernet: eth0 should be connected to the public network (194.36.XX.XX) and eth1 to the private network (192.0.168.XX). The advantage is for example, when accessing data server (cap) will not share network connection with AFS. Another important point is: when writing data from your code, do not write records remotely (AFS, NFS, etc). Write outputs in a local disk, and at the end of your job copy it to a remote resource.

2. Requesting certificates

Verify if your certificate is installed:

At desktop pc73.hep.man.ac.uk, user jamwer: ls .globus usercert.pem userkey.pem If you do not have certificate, you have to obtain one before request hosts certificates.

Run firefox browser:

firefox Site: https:/ca.grid-support.ac.uk accept this certificate permanently (or you will not be able to recover the host certificate) Request a certificate Server certificate DSN is the server hostname type host These are the outputs, to confirm the information: E-Mail jamwer2000@hotmail.com DNS Name host/bf30.hep.man.ac.uk L HEP OU Manchester Fully qualified domain name bf30.hep.man.ac.uk User Data Role User Registration Authority Manchester HEP NOTBEFORE Mon Nov 7 09:02:15 2005 UTC PIN 553848908d51c23446b713a058c2f91ed656927e PROFILE UKHOST RA Manchester HEP ROLE User SERIAL 362016 SUBJECT_ALT_NAME DNS: bf30.hep.man.ac.uk TYPE SPKAC

Later, you will receive an email such as:

From : ca@ca.grid-support.ac.uk Sent : 07 November 2005 16:23:11 To : jamwer2000@hotmail.com Subject : UK eScience CA - New Issued Certificate (Serial: 5141) Dear Customer, Your certificate with the serial number 5141 and the DN: emailAddress=jamwer2000@hotmail.com,CN=host/bf30.hep.man.ac.uk,L=HEP,OU=Manchester,O=eScience,C=UK has just been generated by the UK e-Science CA. You can import it directly into the same browser you requested the certificate from by following this link: https://ca.grid-support.ac.uk/cgi-bin/pub/pki?cmd=getcert&key=5141&type=CERTIFICATE You can also do this by going to https://ca.grid-support.ac.uk/pub and clicking on "Import Certificate into Browser" and entering the serial number: 5141

Recovering the certificates:

Go in the same computer, account, and browser (firefox) https://ca.grid-support.ac.uk Import certificate into browser Serial number: (see number in the subject from ca email, in my case 5141) At firefox: Edit/Preferences/Advanced/Certificates/manage certificates Mark one certificate Backup type pem phrase WARNING: in the "Certificate backup password" and "...(again)" type any character and backspace (it will be empty!) Save as computer_name.p12

Converting p12 to pem format

See http://www.hep.man.ac.uk/local/grid/grid-cert-FAQ.html openssl pkcs12 -in bfXX.p12 -clcerts -nokeys -out bfXXcert.pem Enter Import Password: Just type enter! MAC verified OK openssl pkcs12 -in bfXX.p12 -nodes -nocerts -out bfXXkey.pem Enter Import Password: Just type enter! MAC verified OK

At each host computer:

[root@bf30 root]# cd /etc/grid-security/ [root@bf30 grid-security]# cp /nfs/work/users/jamwer/GRID/bfXXcert.pem hostcert.pem [root@bf30 grid-security]# cp /nfs/work/users/jamwer/GRID/bfXXkey.pem hostkey.pem [root@bf30 grid-security]# chmod 400 hostkey.pem [root@bf30 grid-security]# chmod 444 hostcert.pem where XX is 30, 31, 32, and 40.

3. Network configuration

Add to /etc/hosts after the local definition the following hosts: [root@bf30 etc]# vi hosts # Do not remove the following line, or various programs # that require network functionality will fail. 127.0.0.1 localhost.localdomain localhost 194.36.3.43 bf30.hep.man.ac.uk 194.36.3.222 cap.hep.man.ac.uk cap 194.36.3.43 bf30.hep.man.ac.uk bf30 194.36.3.44 bf31.hep.man.ac.uk bf31 194.36.3.45 bf32.hep.man.ac.uk bf32 194.36.3.46 bf33.hep.man.ac.uk bf33 194.36.3.47 bf34.hep.man.ac.uk bf34 194.36.3.48 bf35.hep.man.ac.uk bf35 194.36.3.49 bf36.hep.man.ac.uk bf36 194.36.3.50 bf37.hep.man.ac.uk bf37 194.36.3.51 bf38.hep.man.ac.uk bf38 194.36.3.52 bf39.hep.man.ac.uk bf39 194.36.3.53 bf40.hep.man.ac.uk bf40

CAP NFS auto mounting:

At cap.hep.man.ac.uk, as root user: [root@cap root]# cat /etc/exports # /work bf*.hep.man.ac.uk(rw,no_root_squash) /work bf*.tier2.hep.man.ac.uk(rw,no_root_squash) /work bohr*.tier2.hep.man.ac.uk(rw,no_root_squash) /work *.hep.man.ac.uk(rw,no_root_squash) [root@cap root]# exportfs -a [root@cap root]# exportfs /work bf*.hep.man.ac.uk /work bf*.tier2.hep.man.ac.uk /work bohr*.tier2.hep.man.ac.uk /work *.hep.man.ac.uk -at each network computer: [root@onegrid ~]# ssh -l root bfXX.hep.man.ac.uk [root@bf30 root]# cd /etc [root@bf30 etc]# vi auto.master # # $Id: auto.master,v 1.3 2003/09/29 08:22:35 raven Exp $ # # Sample auto.master file # This is an automounter map and it has the following format # key [ -mount-options-separated-by-comma ] location # For details of the format look at autofs(5). #/misc /etc/auto.misc --timeout=60 #/misc /etc/auto.misc #/net /etc/auto.net /exp_software /etc/auto.experiment -rw,intr,noquota,nosuid,noatime [root@bf30 etc]# vi auto.experiment work cap.hep.man.ac.uk:/work babardisk1 -rw,nosuid,noexec,rsize=65536,wsize=65536 celeste.hep.man.ac.uk:/babardisk1 babardisk2 -rw,nosuid,noexec,rsize=65536,wsize=65536 cornelius.hep.man.ac.uk:/babardisk2 babardisk3 -rw,nosuid,noexec,rsize=65536,wsize=65536 arthur.hep.man.ac.uk:/mnt/babardisk3 [root@bf30 etc]# mkdir /exp_software [root@bf30 etc]# ln -s /exp_software /nfs [root@bf30 etc]# /etc/rc.d/init.d/autofs restart [root@bf30 etc]# /sbin/service nfs restart Shutting down NFS mountd: [ OK ] Shutting down NFS daemon: [ OK ] Shutting down NFS quotas: [ OK ] Shutting down NFS services: [ OK ] Starting NFS services: [ OK ] Starting NFS quotas: [ OK ] Starting NFS daemon: [ OK ] Starting NFS mountd: [ OK ] [root@bf30 root]# ls /nfs/work 00_index.txt babar lost+found newstore SE0 store users This mount point will be used by kanaccess.conf to assembly data physical file name for babar experiment. The direct mounting of CAP can be done as: [root@bf30 root]# mkdir /nfs [root@bf30 root]# mkdir /nfs/work [root@bf30 root]# mount cap:/work /nfs/work [root@bf30 root]# ls /nfs/work 00_index.txt babar lost+found newstore SE0 store users

4. Adapt Scientific Linux to LCG

This is necessary to avoid conflict with message passing packages in LCG: [root@bf30 root]# rpm -e lam-6.5.9-1

5. Java installation from RPM

Verify if java is installed: [root@bf30 usr]# ls /usr/java j2sdk1.4.2_10 [root@bf30 root]# ls /usr/java/j2sdk1.4.2_10 bin demo jre LICENSE README src.zip COPYRIGHT include lib man README.html THIRDPARTYLICENSEREADME.txt and get the address: /usr/java/j2sdk1.4.2_10 If java is not installed:
http://java.sun.com/j2se/1.4.2/download.html Download J2SE SDK 1.4.2_10 accept RPM in self extracting file: j2sdk-1_4_2_10-linux-i586-rpm.bin The file was saved in: cap.hep.man.ac.uk:/nfs/work/users/jamwer/GRID [root@onegrid backup]# scp j2sdk-1_4_2_10-linux-i586-rpm.bin root@cap.hep.man.ac.uk:/nfs/work/users/jamwer/GRID root@cap.hep.man.ac.uk's password: j2sdk-1_4_2_10-linux-i586-rpm.bin 100% 34MB 4.2MB/s 00:08 [root@bf30 root]# cd /nfs/teste/users/jamwer/GRID [root@bf30 GRID]# chmod +x j2sdk-1_4_2_10-linux-i586-rpm.bin [root@bf30 GRID]# ls -l total 34508 -rwxr-xr-x 1 root root 35292766 Nov 1 14:48 j2sdk-1_4_2_10-linux-i586-rpm.bin root@bf30 GRID]# cd /usr/local [root@bf30 local]# ls bin etc games include lib libexec sbin share src [root@bf30 local]# mkdir java [root@bf30 local]# cd java [root@bf30 java]# /nfs/teste/users/jamwer/GRID/j2sdk-1_4_2_10-linux-i586-rpm.bin Sun Microsystems, Inc. Binary Code License Agreement for the JAVATM 2 SOFTWARE DEVELOPMENT KIT (J2SDK), STANDARD EDITION, VERSION 1.4.2_X ... (LFI#129530/Form ID#011801) Do you agree to the above license terms? [yes or no] yes Unpacking... Checksumming... 0 0 Extracting... UnZipSFX 5.40 of 28 November 1998, by Info-ZIP (Zip-Bugs@lists.wku.edu). inflating: j2sdk-1_4_2_10-linux-i586.rpm Done. [root@bf30 java]# ls j2sdk-1_4_2_10-linux-i586.rpm [root@bf30 root]# cd /usr/local/java [root@bf30 java]# ls j2sdk-1_4_2_10-linux-i586.rpm [root@bf30 java]# mv j2sdk-1_4_2_10-linux-i586.rpm /nfs/work/users/jamwer/GRID/ [root@bf30 java]# cd .. [root@bf30 local]# rmdir java [root@bf30 java]# rpm -ivh /nfs/work/users/jamwer/GRID/j2sdk-1_4_2_10-linux-i586.rpm Preparing... ########################################### [100%] 1:j2sdk ########################################### [100%] [root@bf30 root]# ls /usr/java/j2sdk1.4.2_10 bin demo jre LICENSE README src.zip COPYRIGHT include lib man README.html THIRDPARTYLICENSEREADME.txt Set JVM address (verify the correct version): vi /etc/profile export JAVA_LOCATION=/usr/java/j2sdk1.4.2_10 export JAVA_LIBDIR=$JAVA_LOCATION/lib export JNI_LIBDIR=$JAVA_LIBDIR export JVM_LIBDIR=$JAVA_LIBDIR PATH=$JAVA_LOCATION/bin:$PATH [root@bf30 root]# . /etc/profile [root@bf30 root]# java Usage: java [-options] class [args...] (to execute a class) ...

6. Node synchronisation

[root@bf30 yaim]# ping ntp2a.mcc.ac.uk PING maverick.mcc.ac.uk (130.88.202.49) 56(84) bytes of data. 64 bytes from maverick.mcc.ac.uk (130.88.202.49): icmp_seq=0 ttl=63 time=0.419 ms [root@bf30 etc]# mv ntp.conf ntp.conf.orig [root@bf30 etc]# vi ntp.conf [root@bf30 etc]# cat ntp.conf server ntp2a.mcc.ac.uk server ntp2b.mcc.ac.uk server ntp2c.mcc.ac.uk server ntp2d.mcc.ac.uk [root@bf30 etc]# cat /etc/ntp/step-tickers ntp2a.mcc.ac.uk ntp2b.mcc.ac.uk ntp2c.mcc.ac.uk ntp2d.mcc.ac.uk Start ntp: [root@bf30 etc]# ntpdate ntp2a.mcc.ac.uk 1 Nov 16:01:10 ntpdate[2426]: adjust time server 130.88.202.49 offset -0.370997 sec [root@bf30 etc]# service ntpd start ntpd: Synchronizing with time server: [ OK ] Starting ntpd: [ OK ] [root@bf30 etc]# chkconfig ntpd on

7. Configuration tool:YAIM

[root@bf30 teste]# cd /nfs/work/users/jamwer/GRID [root@bf30 GRID]# rpm -ivh lcg-yaim-2.6.0-9.noarch.rpm Preparing... ########################################### [100%] 1:lcg-yaim ########################################### [100%] [root@bf30 GRID]# cd /opt/lcg [root@bf30 lcg]# ls bin yaim [root@bf30 lcg]# ls bin configure_node install_node [root@bf30 lcg]# cd /opt/lcg/yaim [root@bf30 yaim]# ls examples functions scripts

8. Site configuration file

[root@bf30 yaim]# vi wn-list.conf bf33.hep.man.ac.uk bf34.hep.man.ac.uk bf35.hep.man.ac.uk bf36.hep.man.ac.uk bf37.hep.man.ac.uk bf38.hep.man.ac.uk [root@bf30 yaim]# vi users.conf 24014:babar001:2739:babar:babar:: 24022:babar002:2739:babar:babar:: 24023:babar003:2739:babar:babar:: 24024:babar004:2739:babar:babar:: 24028:babar005:2739:babar:babar:: 24029:babar006:2739:babar:babar:: 24030:babar007:2739:babar:babar:: 24031:babar008:2739:babar:babar:: 24032:babar009:2739:babar:babar:: 24033:babar010:2739:babar:babar:: 24034:babar011:2739:babar:babar:: 24038:babar012:2739:babar:babar:: 24044:babar013:2739:babar:babar:: 24045:babar014:2739:babar:babar:: 24046:babar015:2739:babar:babar:: 24047:babar016:2739:babar:babar:: 24048:babar017:2739:babar:babar:: 24049:babar018:2739:babar:babar:: 24050:babar019:2739:babar:babar:: 24051:babar020:2739:babar:babar:: [root@bf30 yaim]# vi /nfs/work/users/jamwer/GRID/JSTestBed.def Verify the file
/nfs/work/users/jamwer/GRID/JSTestbed.def for complete implementation. The information necessary to configure the complete site is: Latitude and longitude: www.infoplease.com/ipa/A0001769.html MY_DOMAIN=hep.man.ac.uk CE_HOST=bf32.$MY_DOMAIN SE_HOST=bf40.$MY_DOMAIN RB_HOST=bf31.$MY_DOMAIN PX_HOST=bf30.$MY_DOMAIN #PX_HOST=lcgrbp01.gridpp.rl.ac.uk BDII_HOST=bf30.$MY_DOMAIN #BDII_HOST=lcgbdii02.gridpp.rl.ac.uk MON_HOST=bf30.$MY_DOMAIN #MON_HOST=bohr0002.tier2.hep.man.ac.uk REG_HOST=lcgic01.gridpp.rl.ac.uk # there is only 1 central registry for now WN_LIST=/opt/lcg/yaim/wn-list.conf USERS_CONF=/opt/lcg/yaim/users.conf FUNCTIONS_DIR=/opt/lcg/yaim/functions JAVA_LOCATION="/usr/java/j2sdk1.4.2_10" MYSQL_PASSWORD=carnaval SITE_NAME=JSTestBed SITE_LOC="Manchester, UK" SITE_LAT=53.30 SITE_LONG=2.15 SITE_WEB="http://www.hep.man.ac.uk" SITE_TIER="TIER 3" SITE_SUPPORT_SITE="tier2.hep.man.ac.uk" CE_CLOSE_SE="SE1" CE_CLOSE_SE1_HOST=$SE_HOST CE_CLOSE_SE1_ACCESS_POINT=/storage CE_CLOSE_SE2_HOST=another-se.$MY_DOMAIN CE_CLOSE_SE2_ACCESS_POINT=/somewhere BDII_HTTP_URL="http://bf30.hep.man.ac.uk/lcg-testbed.conf" #BDII_HTTP_URL="http://grid-deployment.web.cern.ch/grid-deployment/gis/lcg2-bdii/dteam/lcg2-all-sites.conf" VO_SW_DIR=/nfs/work/lcg VO_BABAR_SW_DIR=$VO_SW_DIR/babar VO_BABAR_DEFAULT_SE=$SE_HOST VO_BABAR_STORAGE_DIR=$CE_CLOSE_SE1_ACCESS_POINT/babar VO_BABAR_QUEUES="babar" VO_BABAR_USERS=ldap://babar-vo.gridpp.ac.uk/ou=babar,dc=gridpp,dc=ac,dc=uk The YAIM entries to configure the VOMS server are: site-info.def: VO_BABAR_VOMS_SERVERS="vomss://voms.gridpp.ac.uk:8443/voms/babar?/babar/" VO_BABAR_VOMSES="'babar voms.gridpp.ac.uk 15002 /C=UK/O=eScience/OU=Manchester/L=HEP/CN=voms.gridpp.ac.uk/Email=hostmaster@hep.man.ac.uk babar'" groups.conf: "/VO=babar/GROUP=/babar":::: "/VO=babar/GROUP=/babar/ROLE=lcgadmin":::sgm:

9. Installing Apt-get

Verify if tool already installed: [root@bf30 yaim]# apt-get apt 0.5.15cnc6 for linux i386 compiled on Jun 1 2004 22:58:58 Usage: apt-get [options] command apt-get [options] install|remove pkg1 [pkg2 ...] apt-get [options] source pkg1 [pkg2 ...] ... In this case, available in standard installation.

10. Installing information system level 0 (BDII), monitor (MON), and long term proxy (PX)

To avoid the messages: warning: group globus does not exist - using root warning: user globus does not exist - using root you need to create an account globus: [root@bf30 root]# adduser globus [root@bf30 root]# passwd globus Changing password for user globus. New password: ?????? BAD PASSWORD: it is based on a dictionary word Retype new password: ?????? passwd: all authentication tokens updated successfully. [root@bf30 root]# /opt/lcg/yaim/scripts/install_node /nfs/work/users/jamwer/GRID/JSTestBed.def lcg-BDII lcg-PX lcg-MON Configuring config_apt_prefs Configuring config_apt ... 96:lcg-MON ########################################### [ 99%] 97:lcg-PX ########################################### [100%] Done.

11. Installing Resource Broker

[root@bf31 root]# adduser globus [root@bf31 root]# passwd globus Changing password for user globus. New password: ?????? BAD PASSWORD: it is based on a dictionary word Retype new password: ?????? passwd: all authentication tokens updated successfully. [root@bf31 GRID]# /opt/lcg/yaim/scripts/install_node /nfs/work/users/jamwer/GRID/JSTestBed.def lcg-RB ... The AFS cache is configured for 100 MB. Edit the /usr/vice/etc/cacheinfo file to change this before running AFS for the first time. You should also set your home cell in /usr/vice/etc/ThisCell. Also, you may want to edit /etc/pam.d/login and possibly others there to get an AFS token on login. Put the line: auth sufficient /lib/security/pam_afs.so try_first_pass ignore_root before the one for pwdb. ... Apache Ant 1.6.1 has been installed in /usr/local/apache-ant-1.6.1 directory ... 10:globus warning: user globus does not exist - using root ... PLEASE REMEMBER TO SET A PASSWORD FOR THE MySQL root USER ! To do so, start the server, then issue the following commands: /usr/bin/mysqladmin -u root password 'new-password' /usr/bin/mysqladmin -u root -h bf31.hep.man.ac.uk password 'new-password' See the manual for more instructions. Please report any problems with the /usr/bin/mysqlbug script! The latest information about MySQL is available on the web at http://www.mysql.com Support MySQL by buying support/licenses at https://order.mysql.com ... Done.

12. Installing Computer Element (CE)

[root@bf32 root]# adduser globus [root@bf32 root]# passwd globus Changing password for user globus. New password: ?????? BAD PASSWORD: it is based on a dictionary word Retype new password: ?????? passwd: all authentication tokens updated successfully. [root@bf32 GRID]# /opt/lcg/yaim/scripts/install_node /nfs/work/users/jamwer/GRID/JSTestBed.def lcg-CE_torque ... The AFS cache is configured for 100 MB. Edit the /usr/vice/etc/cacheinfo file to change this before running AFS for the first time. You should also set your home cell in /usr/vice/etc/ThisCell. Also, you may want to edit /etc/pam.d/login and possibly others there to get an AFS token on login. Put the line: auth sufficient /lib/security/pam_afs.so try_first_pass ignore_root before the one for pwdb. ... Done.

13. Installing Torque Worker Nodes (WN)

[root@bf33 root]# adduser globus [root@bf33 root]# passwd globus Changing password for user globus. New password: ?????? BAD PASSWORD: it is based on a dictionary word Retype new password: ?????? passwd: all authentication tokens updated successfully. Installing sar monitor: http://linuxsoft.cern.ch/repository//i386/sysstat-5.0.5-5.rhel3.i386.html [root@bf33 root]# rpm -ivh /nfs/work/users/jamwer/GRID/sysstat-5.0.5-5.rhel3.i386.rpm warning: /nfs/work/users/jamwer/GRID/sysstat-5.0.5-5.rhel3.i386.rpm: V3 DSA signature: NOKEY, key ID a7048f8d Preparing... ########################################### [100%] 1:sysstat ########################################### [100%] [root@bf33 root]# cp /nfs/work/users/jamwer/GRID/monitor /bin [root@bf34 root]# cp /nfs/work/users/jamwer/GRID/anal1 /bin [root@bf33 GRID]# /opt/lcg/yaim/scripts/install_node /nfs/work/users/jamwer/GRID/JSTestBed.def lcg-WN_torque ... The AFS cache is configured for 100 MB. Edit the /usr/vice/etc/cacheinfo file to change this before running AFS for the first time. You should also set your home cell in /usr/vice/etc/ThisCell. Also, you may want to edit /etc/pam.d/login and possibly others there to get an AFS token on login. Put the line: auth sufficient /lib/security/pam_afs.so try_first_pass ignore_root before the one for pwdb. ... ------------------------------------------------------------------ Globus Toolkit 3.2.1 has been installed in /usr/local/globus-3.2.1 directory To set up Globus environment, first set your GLOBUS_LOCATION to /usr/local/globus-3.2.1 and second, depending on your shell, run {csh} source /usr/local/globus-3.2.1/etc/globus-user-env.csh {sh} . /usr/local/globus-3.2.1/etc/globus-user-env.sh For documentation on Globus Toolkit 3.2.1 please refer to http://www-unix.globus.org/toolkit/docs/3.2/index.html About SGE integration with GT 3.2 see http://www.lesc.ic.ac.uk/projects/epic-gt3-sge.html ------------------------------------------------------------------ ... Done.

14. Installing Storage Element (SE)

[root@bf33 root]# adduser globus [root@bf33 root]# passwd globus Changing password for user globus. New password: ?????? BAD PASSWORD: it is based on a dictionary word Retype new password: ?????? passwd: all authentication tokens updated successfully. [root@bf40 GRID]# /opt/lcg/yaim/scripts/install_node /nfs/work/users/jamwer/GRID/JSTestBed.def lcg-SE_classic ... The AFS cache is configured for 100 MB. Edit the^M /usr/vice/etc/cacheinfo file to change this before^M running AFS for the first time. You should also^M set your home cell in /usr/vice/etc/ThisCell.^M ^M Also, you may want to edit /etc/pam.d/login and^M possibly others there to get an AFS token on login.^M Put the line:^M ^M auth sufficient /lib/security/pam_afs.so try_first_pass ignore_root^M ^M before the one for pwdb.^M ... Apache Ant 1.6.1 has been installed in ^M /usr/local/apache-ant-1.6.1 directory ^M ... 11:globus warning: user globus does not exist - using root^M ------------------------------------------------------------------^M Globus Toolkit 3.2.1 has been installed in ^M /usr/local/globus-3.2.1 directory ^M ^M To set up Globus environment, first set your GLOBUS_LOCATION^M to /usr/local/globus-3.2.1 and second, depending on your shell, run^M {csh} source /usr/local/globus-3.2.1/etc/globus-user-env.csh ^M {sh} . /usr/local/globus-3.2.1/etc/globus-user-env.sh ^M For documentation on Globus Toolkit 3.2.1 please refer to ^M http://www-unix.globus.org/toolkit/docs/3.2/index.html^M About SGE integration with GT 3.2 see^M http://www.lesc.ic.ac.uk/projects/epic-gt3-sge.html^M ------------------------------------------------------------------^M ... Done.

15. Installing User Interface (UI)

[root@bf39 root]# adduser globus [root@bf39 root]# passwd globus Changing password for user globus. New password: ?????? BAD PASSWORD: it is based on a dictionary word Retype new password: ?????? passwd: all authentication tokens updated successfully. [root@bf39 root]# /opt/lcg/yaim/scripts/install_node /nfs/work/users/jamwer/GRID/JSTestBed.def lcg-UI ... Done.

16. AFS configuration

[root@bf30 root]# cat /usr/vice/etc/cacheinfo /afs:/usr/vice/cache:300000 [root@bf30 root]# cat /usr/vice/etc/ThisCell hep.man.ac.uk [root@bf30 root]# cat /usr/vice/etc/CellServDB >hep.man.ac.uk #Manchester HEP 194.36.2.3 #afs1.hep.man.ac.uk 194.36.2.4 #afs2.hep.man.ac.uk 194.36.2.6 #afs4.hep.man.ac.uk [root@bf30 root]# /etc/rc.d/init.d/afs restart ### AFS script config: ajusteAFS cp /nfs/work/users/jamwer/GRID/cacheinfo /usr/vice/etc/cacheinfo cp /nfs/work/users/jamwer/GRID/ThisCell /usr/vice/etc/ThisCell cp /nfs/work/users/jamwer/GRID/CellServDB /usr/vice/etc/CellServDB /etc/rc.d/init.d/afs restart

17. Configuration BDII PX MON

[root@bf30 root]# mkdir .globus [root@bf30 root]# cp /nfs/work/users/jamwer/GRID/userkey.pem .globus [root@bf30 root]# cp /nfs/work/users/jamwer/GRID/usercert.pem .globus

Configure httpd server for lcg-siteinfo

[root@bf30 root]# ps -aux | grep httpd root 12759 0.0 0.0 3680 668 pts/1 S 10:46 0:00 grep httpd [root@bf30 root]# rpm -qa |grep http libghttp-1.0.9-8.1 [root@bf30 root]# less /etc/yum.conf [root@bf30 root]# yum install httpd Gathering header information file(s) from server(s) Server: SL 304 errata Server: SL 304 main Finding updated packages Downloading needed headers Resolving dependencies Dependencies resolved I will do the following: [install: httpd 2.0.46-46.3.ent.i386] Is this ok [y/N]: y Downloading Packages Getting httpd-2.0.46-46.3.ent.i386.rpm httpd-2.0.46-46.3.ent.i38 100% |=========================| 1.0 MB 00:02 Running test transaction: Test transaction complete, Success! httpd 100 % done 1/1 Installed: httpd 2.0.46-46.3.ent.i386 Transaction(s) Complete [root@bf30 root]# ls /var/www/ cgi-bin error html icons [root@bf30 root]# service httpd start Starting httpd: [ OK ] [root@bf30 root]# echo teste > /var/www/html/teste Go in any other computer browser, and access: http://bf30.hep.man.ac.uk/ --->> Red Hat Enterprise Linux Test Page http://bf30.hep.man.ac.uk/teste --->> teste The address of teste file at bf30.hep.man.ac.uk is: [root@bf30 root]# cat /var/www/html/teste teste Edit JSTestBed.conf file: BDII_HTTP_URL="http://grid-deployment.web.cern.ch/grid-deployment/gis/lcg2-bdii/dteam/lcg2-all-sites.conf" Open the file http://grid-deployment.web.cern.ch/grid-deployment/gis/lcg2-bdii/dteam/lcg2-all-sites.conf in the browser: # # Top Level BDII configuration file # --------------------------------- # created on Mon Nov 14 10:30:01 2005 # # This file is generated, DO NOT EDIT it directly # # # AsiaPacific # ----------- # # GOG-Singapore GOG-Singapore ldap://melon.ngpp.ngp.org.sg:2170/mds-vo-name=GOG-Singapore,o=grid # INDIACMS-TIFR INDIACMS-TIFR ldap://ce.indiacms.res.in:2170/mds-vo-name=INDIACMS-TIFR,o=grid # LCG_KNU LCG_KNU ldap://cluster50.knu.ac.kr:2170/mds-vo-name=LCG_KNU,o=grid ... where each entry is a level 2 BDII installed in CE, and has the format: # SITE_NAME SITE_NAME ldap://URL_BDII_HTTP_ADDRESS:2170/mds-vo-name=SITE_NAME,o=grid Create a file at BDII computer bf30.hep.man.ac.uk, pointing the http server: [root@bf30 root]# vi /var/www/html/lcg-testbed.conf [root@bf30 root]# cat /var/www/html/lcg-testbed.conf # Testbed for Easygrid development # JSTestBed JSTestBed ldap://bf32.hep.man.ac.uk:2170/mds-vo-name=JSTestBed,o=grid Change the site-config file to: BDII_HTTP_URL="http://bf30.hep.man.ac.uk/lcg-testbed.conf"

Running BDII configuration process.

[root@bf30 root]# /opt/lcg/yaim/scripts/configure_node /nfs/work/users/jamwer/GRID/JSTestBed.def BDII PX MON Configuring config_upgrade ... Configuring config_edgusers ... Configuring config_bdii ... Stopping BDII [ OK ] Starting BDII [ OK ] Configuring config_fmon_client ... Stopping edg-fmon-agent: [ OK ] Starting edg-fmon-agent: [ OK ] Stopping edg-fmon-server: [ OK ] Starting edg-fmon-server: [ OK ] Stopping Globus MDS for GridICE [ OK ] Starting Globus MDS for GridICE (gcc32dbgpthr) [ OK ] Configuring config_ldconf ... /sbin/ldconfig: /opt/glite/externals/lib/libswigpy.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpl.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigtcl8.so.0 is not a symbolic link Configuring config_sysconfig_edg ... Configuring config_sysconfig_globus ... Configuring config_java ... Configuring config_crl ... Configuring config_gip ... Setting up an R-GMA Gin... - Configuring a gip information provider - Configuring an fmon information provider - Not configuring a glite-ce information provider Wrote configuration to: /opt/glite/etc/rgma-gin/gin.conf All done Stopping rgma-gin: [ OK ] Starting rgma-gin: [ OK ] Configuring config_globus ... creating globus-sh-tools-vars.sh creating globus-script-initializer creating Globus::Core::Paths checking globus-hostname Done Creating... /opt/globus/etc/grid-info.conf Done Creating... /opt/globus/sbin/SXXgris /opt/globus/libexec/grid-info-script-initializer /opt/globus/libexec/grid-info-mds-core /opt/globus/libexec/grid-info-common /opt/globus/libexec/grid-info-cpu* /opt/globus/libexec/grid-info-fs* /opt/globus/libexec/grid-info-mem* /opt/globus/libexec/grid-info-net* /opt/globus/libexec/grid-info-platform* /opt/globus/libexec/grid-info-os* /opt/globus/etc/grid-info-resource-ldif.conf /opt/globus/etc/grid-info-resource-register.conf /opt/globus/etc/grid-info-resource.schema /opt/globus/etc/grid.gridftpperf.schema /opt/globus/etc/gridftp-resource.conf /opt/globus/etc/gridftp-perf-info /opt/globus/etc/grid-info-slapd.conf /opt/globus/etc/grid-info-site-giis.conf /opt/globus/etc/grid-info-site-policy.conf /opt/globus/etc/grid-info-server-env.conf /opt/globus/etc/grid-info-deployment-comments.conf Done Creating gatekeeper configuration file... Done Creating state file directory. Done. Reading gatekeeper configuration file... Determining system information... Creating job manager configuration file... Done Setting up fork gram reporter in MDS ----------------------------------------- Done Setting up pbs gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for qstat... no Setting up condor gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for condor_q... no Setting up lsf gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for lsload... no loading cache ./config.cache checking for mpirun... (cached) /usr/bin/mpirun creating ./config.status creating fork.pm loading cache /dev/null checking for mpirun... /usr/bin/mpirun checking for qdel... no loading cache /dev/null checking for condor_submit... no loading cache /dev/null loading cache ./config.cache creating ./config.status creating grid-cert-request-config creating grid-security-config Stopping Globus MDS [ OK ] Starting Globus MDS (gcc32dbgpthr) [ OK ] Configuring config_rgma_client ... Welcome to the R-GMA setup utility ---------------------------------- Configuration written to: /opt/glite/etc/rgma/rgma.conf Configuring config_rgma_server ... Info: using default install root: /opt/glite Info: using default configuration file: /opt/glite/etc/glite-security-trustmanager/config.properties Info: using default configuration directory: /opt/glite/etc/glite-security-trustmanager Warning: /opt/glite/etc/glite-security-trustmanager/server.xml already exists! Saving old one as /opt/glite/etc/glite-security-trustmanager/server.xml.old. Info: you can clean up using the following commands mv -f /etc/tomcat5/server.xml.old-glite /etc/tomcat5/server.xml rm -f /var/lib/tomcat5/server/lib/bcprov-jdk14-122.jar rm -f /var/lib/tomcat5/server/lib/log4j-1.2.8.jar rm -f /var/lib/tomcat5/server/lib/glite-security-trustmanager.jar rm -f /var/lib/tomcat5/server/lib/glite-security-util-java.jar rm -f /etc/tomcat5/log4j-trustmanager.properties rm -f /opt/glite/etc/glite-security-trustmanager/server.xml waiting for processes to exit [ OK ] waiting for processes to exit waiting for processes to exit waiting for processes to exit Starting tomcat5: /usr/bin/rebuild-jar-repository: error: JVM_LIBDIR -exports/usr does not exist or is not a directory /usr/bin/rebuild-jar-repository: error: JVM_LIBDIR -exports/usr does not exist or is not a directory /usr/bin/rebuild-jar-repository: error: JVM_LIBDIR -exports/usr does not exist or is not a directory /usr/bin/rebuild-jar-repository: error: JVM_LIBDIR -exports/usr does not exist or is not a directory /usr/bin/rebuild-jar-repository: error: JVM_LIBDIR -exports/usr does not exist or is not a directory /usr/bin/rebuild-jar-repository: error: JVM_LIBDIR -exports/usr does not exist or is not a directory /usr/bin/rebuild-jar-repository: error: JVM_LIBDIR -exports/usr does not exist or is not a directory [ OK ] Shutting down rgma-publish-site: [ OK ] Starting rgma-publish-site: [ OK ] Stopping rgma-servicetool: [ OK ] Starting rgma-servicetool: [ OK ] Configuring config_apel_rgma ... Configuring config_sysconfig_lcg ... Configuring config_rfio ... Configuring config_host_certs ... Configuring config_proxy_server ... Shutting down MyProxy Starting up MyProxy Configuration Complete

18. Configuration RB

[root@bf31 root]# /opt/lcg/yaim/scripts/configure_node /nfs/work/users/jamwer/GRID/JSTestBed.def RB Configuring config_upgrade ... Configuring config_ldconf ... /sbin/ldconfig: /opt/glite/externals/lib/libswigpy.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpl.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigtcl8.so.0 is not a symbolic link Configuring config_sysconfig_edg ... Configuring config_sysconfig_globus ... Configuring config_sysconfig_lcg ... Configuring config_crl ... Configuring config_rfio ... rfiod already stopped: [FAILED] Configuring config_host_certs ... Configuring config_users ... Configuring config_edgusers ... Configuring config_mkgridmap ... Configuring config_java ... Configuring config_rgma_client ... Welcome to the R-GMA setup utility ---------------------------------- Configuration written to: /opt/glite/etc/rgma/rgma.conf Configuring config_gip ... Setting up an R-GMA Gin... - Configuring a gip information provider - Not configuring an fmon information provider - Not configuring a glite-ce information provider Wrote configuration to: /opt/glite/etc/rgma-gin/gin.conf All done Stopping rgma-gin: [ OK ] Starting rgma-gin: [ OK ] Configuring config_globus ... creating globus-sh-tools-vars.sh creating globus-script-initializer creating Globus::Core::Paths checking globus-hostname Done Creating... /opt/globus/etc/grid-info.conf Done Creating... /opt/globus/sbin/SXXgris /opt/globus/libexec/grid-info-script-initializer /opt/globus/libexec/grid-info-mds-core /opt/globus/libexec/grid-info-common /opt/globus/libexec/grid-info-cpu* /opt/globus/libexec/grid-info-fs* /opt/globus/libexec/grid-info-mem* /opt/globus/libexec/grid-info-net* /opt/globus/libexec/grid-info-platform* /opt/globus/libexec/grid-info-os* /opt/globus/etc/grid-info-resource-ldif.conf /opt/globus/etc/grid-info-resource-register.conf /opt/globus/etc/grid-info-resource.schema /opt/globus/etc/grid.gridftpperf.schema /opt/globus/etc/gridftp-resource.conf /opt/globus/etc/gridftp-perf-info /opt/globus/etc/grid-info-slapd.conf /opt/globus/etc/grid-info-site-giis.conf /opt/globus/etc/grid-info-site-policy.conf /opt/globus/etc/grid-info-server-env.conf /opt/globus/etc/grid-info-deployment-comments.conf Done Creating gatekeeper configuration file... Done Creating state file directory. Done. Reading gatekeeper configuration file... Determining system information... Creating job manager configuration file... Done Setting up fork gram reporter in MDS ----------------------------------------- Done Setting up pbs gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for qstat... no Setting up condor gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for condor_q... no Setting up lsf gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for lsload... no loading cache ./config.cache checking for mpirun... (cached) /usr/bin/mpirun creating ./config.status creating fork.pm loading cache /dev/null checking for mpirun... /usr/bin/mpirun checking for qdel... no loading cache /dev/null checking for condor_submit... no loading cache /dev/null loading cache ./config.cache creating ./config.status creating grid-cert-request-config creating grid-security-config Stopping Globus MDS [ OK ] Starting Globus MDS (gcc32dbgpthr) [ OK ] Configuring config_fmon_client ... Stopping edg-fmon-agent: [ OK ] Starting edg-fmon-agent: [ OK ] Configuring config_replica_manager ... Configuring config_lbserver_db ... DB lbserver20 already exist Configuring config_condor ... Configuring config_workload_manager_server ... Stopping edg-wl-in.ftpd: [ OK ] Starting edg-wl-in.ftpd: [ OK ] Stopping edg-wl-interlogd and edg-wl-logd. Starting LocalLogger: edg-wl-interlogd and edg-wl-logd. [ OK ] bash: /root/.bashrc: Permission denied bash: /root/.bashrc: Permission denied [ OK ] This is LocalLogger, part of Workload Management System in EU DataGrid.Copyright (c) 2002 CERN, INFN and CESNET on behalf of the EU DataGrid. [9109] Initializing... [9109] Parse messages for correctness...[9109] yes. [9109] Send messages also to inter-logger...[9109] yes. [9109] Store messages with the filename prefix "/var/tmp/dg20logd_"...[9109] yes. USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND edguser 24733 0.0 0.4 8832 4180 ? S 13:46 0:00 /opt/edg/bin/edg-wl-log_monitor -c edg_wl.conf root 9053 0.0 0.1 4464 1380 ? S 14:25 0:00 ftpd: accepting connections on port 2811 root 9058 0.0 0.1 4224 1220 pts/0 S 14:25 0:00 /bin/sh /etc/rc.d/init.d/edg-wl-locallogger restart edguser 9104 0.0 0.1 24668 2016 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-interlogd -k /var/edgwl/logging/hostprox edguser 9110 0.0 0.1 4088 1920 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-logd -k /var/edgwl/logging/hostproxy.pem Stopping edg-wl-bkserverd. /etc/rc.d/init.d/edg-wl-lbserver: line 101: kill: (24674) - No such process /etc/rc.d/init.d/edg-wl-lbserver: line 101: kill: (24676) - No such process /etc/rc.d/init.d/edg-wl-lbserver: line 101: kill: (24683) - No such process Starting LBserver: edg-wl-bkserverd [ OK ] bash: /root/.bashrc: Permission denied USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND edguser 9147 0.0 0.3 9624 3876 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9148 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9150 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9153 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9156 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9158 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9160 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9162 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9164 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9166 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox edguser 9168 0.0 0.3 9744 3892 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-bkserverd -k /var/edgwl/logging/hostprox Stopping LogMonitor... [ OK ] Starting LogMonitor... [ OK ] Restarting JobController daemon(s) Stopping JobController... [ OK ] Stopping CondorG... [ OK ] Starting JobController... [ OK ] Starting CondorG... [ OK ] pidof: can't read sid for pid 9381 Stopping NetworkServer... [ OK ] Starting NetworkServer... [ OK ] stopping workload manager... ok starting workload manager... ok Stopping edg-wl-renewd. Starting ProxyRenewal Daemon: edg-wl-renewd. [ OK ] bash: /root/.bashrc: Permission denied USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND edguser 9541 0.0 0.2 7212 2364 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-renewd -r /opt/edg/var/spool/edg-wl-rene edguser 9542 0.0 0.2 7212 2324 ? S 14:25 0:00 /opt/edg/sbin/edg-wl-renewd -r /opt/edg/var/spool/edg-wl-rene Shutting down lcg-mon-js: [ OK ] Starting lcg-mon-js [ OK ] Configuration Complete

19. Configuration CE

[root@bf32 root]# /opt/lcg/yaim/scripts/configure_node /nfs/work/users/jamwer/GRID/JSTestBed.def CE_torque Configuring config_upgrade ... Configuring config_ldconf ... /sbin/ldconfig: /opt/glite/externals/lib/libswigpy.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpl.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigtcl8.so.0 is not a symbolic link Configuring config_sysconfig_edg ... Configuring config_sysconfig_globus ... Configuring config_sysconfig_lcg ... Configuring config_crl ... Configuring config_rfio ... rfiod already stopped: [FAILED] Configuring config_host_certs ... Configuring config_users ... Configuring config_edgusers ... Configuring config_mkgridmap ... Configuring config_java ... Configuring config_rgma_client ... Welcome to the R-GMA setup utility ---------------------------------- Configuration written to: /opt/glite/etc/rgma/rgma.conf Configuring config_gip ... Setting up an R-GMA Gin... - Configuring a gip information provider - Not configuring an fmon information provider - Not configuring a glite-ce information provider Wrote configuration to: /opt/glite/etc/rgma-gin/gin.conf All done Stopping rgma-gin: [ OK ] Starting rgma-gin: [ OK ] Configuring config_globus ... creating globus-sh-tools-vars.sh creating globus-script-initializer creating Globus::Core::Paths checking globus-hostname Done Creating... /opt/globus/etc/grid-info.conf Done Creating... /opt/globus/sbin/SXXgris /opt/globus/libexec/grid-info-script-initializer /opt/globus/libexec/grid-info-mds-core /opt/globus/libexec/grid-info-common /opt/globus/libexec/grid-info-cpu* /opt/globus/libexec/grid-info-fs* /opt/globus/libexec/grid-info-mem* /opt/globus/libexec/grid-info-net* /opt/globus/libexec/grid-info-platform* /opt/globus/libexec/grid-info-os* /opt/globus/etc/grid-info-resource-ldif.conf /opt/globus/etc/grid-info-resource-register.conf /opt/globus/etc/grid-info-resource.schema /opt/globus/etc/grid.gridftpperf.schema /opt/globus/etc/gridftp-resource.conf /opt/globus/etc/gridftp-perf-info /opt/globus/etc/grid-info-slapd.conf /opt/globus/etc/grid-info-site-giis.conf /opt/globus/etc/grid-info-site-policy.conf /opt/globus/etc/grid-info-server-env.conf /opt/globus/etc/grid-info-deployment-comments.conf Done Creating gatekeeper configuration file... Done Creating state file directory. Done. Reading gatekeeper configuration file... Determining system information... Creating job manager configuration file... Done Setting up fork gram reporter in MDS ----------------------------------------- Done Setting up pbs gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for qstat... /usr/bin/qstat updating cache /dev/null creating ./config.status creating /opt/globus/libexec/globus-script-pbs-queue Done Setting up condor gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for condor_q... no Setting up lsf gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for lsload... no loading cache ./config.cache checking for mpirun... (cached) /usr/bin/mpirun creating ./config.status creating fork.pm loading cache /dev/null checking for mpirun... /usr/bin/mpirun checking for qdel... /usr/bin/qdel checking for qstat... /usr/bin/qstat checking for qsub... /usr/bin/qsub checking for pbsnodes... /usr/bin/pbsnodes checking for ssh... /usr/bin/ssh updating cache /dev/null creating ./config.status creating /opt/globus/lib/perl/Globus/GRAM/JobManager/pbs.pm loading cache /dev/null checking for condor_submit... no loading cache /dev/null checking for mpirun... /usr/bin/mpirun checking for qdel... /usr/bin/qdel checking for qmsg... /usr/bin/qmsg checking for qstat... /usr/bin/qstat checking for qsub... /usr/bin/qsub checking for ssh... /usr/bin/ssh find-lcgpbs-tools: creating ./config.status config.status: creating /opt/globus/lib/perl/Globus/GRAM/JobManager/lcgpbs.pm checking for mpirun... /usr/bin/mpirun checking for bsub... no checking for condor_submit... no Setting up lcgpbs gram reporter in MDS ---------------------------------------- checking for qstat... /usr/bin/qstat find-lcgpbs-reporter-tools: creating ./config.status config.status: creating /opt/globus/libexec/globus-script-lcgpbs-queue Done Setting up lcglsf gram reporter in MDS ---------------------------------------- checking for lsload... no Setting up lcgcondor gram reporter in MDS ---------------------------------------- checking for condor_q... no loading cache ./config.cache creating ./config.status creating grid-cert-request-config creating grid-security-config Stopping Globus MDS [ OK ] Starting Globus MDS (gcc32dbgpthr) [ OK ] Stopping edg-gatekeeper: [ OK ] Starting edg-gatekeeper: [ OK ] Stopping globus-gridftp: [ OK ] Starting globus-gridftp: [ OK ] Shutting down lcg-mon-gridftp: [ OK ] Starting lcg-mon-gridftp [ OK ] Configuring config_fmon_client ... Stopping edg-fmon-agent: [ OK ] Starting edg-fmon-agent: [ OK ] Configuring config_lcgenv ... Configuring config_bdii ... Stopping BDII [ OK ] Starting BDII [ OK ] Configuring config_workload_manager_env ... Configuring config_wm_locallogger ... Stopping edg-wl-interlogd and edg-wl-logd. Starting LocalLogger: edg-wl-interlogd and edg-wl-logd. [ OK ] bash: /root/.bashrc: Permission denied bash: /root/.bashrc: Permission denied [ OK ] This is LocalLogger, part of Workload Management System in EU DataGrid.Copyright (c) 2002 CERN, INFN and CESNET on behalf of the EU DataGrid. [20487] Initializing... [20487] Parse messages for correctness...[20487] yes. [20487] Send messages also to inter-logger...[20487] yes. [20487] Store messages with the filename prefix "/var/tmp/dg20logd_"...[20487] yes. USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND root 20423 4.0 0.1 5292 1220 pts/1 S 14:33 0:00 /bin/sh /etc/rc.d edguser 20480 0.0 0.1 24660 2016 ? S 14:33 0:00 /opt/edg/sbin/edg edguser 20488 0.0 0.1 4068 1920 ? S 14:33 0:00 /opt/edg/sbin/edg Configuring config_apel_pbs ... Configuring config_lcmaps ... Configuring config_lcas ... Configuring config_torque_submitter_ssh ... Reloading sshd: [ OK ] Configuring config_torque_server ... Max open servers: 4 Max open servers: 4 Max open servers: 4 Max open servers: 4 Max open servers: 4 Max open servers: 4 Max open servers: 4 Max open servers: 4 Max open servers: 4 Stopping pbs_server with qterm -t quick: [ OK ] Starting pbs_server: [ OK ] Shutting down MAUI Scheduler: [ OK ] Starting MAUI Scheduler: [ OK ] Configuration Complete [root@bf32 examples]# qstat -q server: bf32.hep.man.ac.uk Queue Memory CPU Time Walltime Node Run Que Lm State ---------------- ------ -------- -------- ---- --- --- -- ----- atlas -- 48:00:00 72:00:00 -- 0 0 -- E R babar -- 48:00:00 72:00:00 -- 0 0 -- E R alice -- 48:00:00 72:00:00 -- 0 0 -- E R lhcb -- 48:00:00 72:00:00 -- 0 0 -- E R cms -- 48:00:00 72:00:00 -- 0 0 -- E R dteam -- 48:00:00 72:00:00 -- 0 0 -- E R sixt -- 48:00:00 72:00:00 -- 0 0 -- E R na48 -- 48:00:00 72:00:00 -- 0 0 -- E R --- --- 0 0 BDII can be reconfigured (if the tests did not work) using: [root@bf32 examples]# /opt/lcg/yaim/scripts/run_function /nfs/work/users/jamwer/GRID/JSTestBed.def config_bdii Configuring config_bdii Stopping BDII [ OK ] Starting BDII [ OK ] ldapsearch -x -H ldap://bf32.hep.man.ac.uk:2170 -b mds-vo-name=JSTestBed,o=grid ...lots of information ... [jamwer@bf39 jamwer]$ ldapsearch -x -H ldap://bf32.hep.man.ac.uk:2170 -b mds-vo-name=JSTestBed,o=grid | grep jobmanager-lcgpbs-babar GlueClusterService: bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar GlueForeignKey: GlueCEUniqueID=bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar GlueForeignKey: GlueCEUniqueID=bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar # bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar, JSTestBed, grid dn: GlueCEUniqueID=bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar,mds-vo-name GlueCEUniqueID: bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar GlueCEInfoContactString: bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar # bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar, JSTestBed, grid GlueCESEBindGroupCEUniqueID: bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar # babar, bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar, JSTestBed, grid GlueChunkKey: GlueCEUniqueID=bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar # bf40.hep.man.ac.uk, bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar, JSTest .hep.man.ac.uk:2119/jobmanager-lcgpbs-babar,mds-vo-name=JSTestBed,o=grid GlueCESEBindCEUniqueID: bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar

20. Configuration WNs

[root@bf33 root]# /opt/lcg/yaim/scripts/configure_node /nfs/work/users/jamwer/GRID/JSTestBed.def WN_torque Configuring config_upgrade ... Configuring config_ldconf ... /sbin/ldconfig: /opt/lcg/lib/liblcg-info-api-ldap.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpy.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpl.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigtcl8.so.0 is not a symbolic link Configuring config_sysconfig_edg ... Configuring config_sysconfig_globus ... Configuring config_sysconfig_lcg ... Configuring config_crl ... Configuring config_rfio ... rfiod already stopped: [FAILED] Configuring config_globus ... creating globus-sh-tools-vars.sh creating globus-script-initializer creating Globus::Core::Paths checking globus-hostname Done Creating... /opt/globus/etc/grid-info.conf Done Creating... /opt/globus/sbin/SXXgris /opt/globus/libexec/grid-info-script-initializer /opt/globus/libexec/grid-info-mds-core /opt/globus/libexec/grid-info-common /opt/globus/libexec/grid-info-cpu* /opt/globus/libexec/grid-info-fs* /opt/globus/libexec/grid-info-mem* /opt/globus/libexec/grid-info-net* /opt/globus/libexec/grid-info-platform* /opt/globus/libexec/grid-info-os* /opt/globus/etc/grid-info-resource-ldif.conf /opt/globus/etc/grid-info-resource-register.conf /opt/globus/etc/grid-info-resource.schema /opt/globus/etc/grid.gridftpperf.schema /opt/globus/etc/gridftp-resource.conf /opt/globus/etc/gridftp-perf-info /opt/globus/etc/grid-info-slapd.conf /opt/globus/etc/grid-info-site-giis.conf /opt/globus/etc/grid-info-site-policy.conf /opt/globus/etc/grid-info-server-env.conf /opt/globus/etc/grid-info-deployment-comments.conf Done Creating gatekeeper configuration file... Done Creating state file directory. Done. Reading gatekeeper configuration file... Determining system information... Creating job manager configuration file... Done Setting up fork gram reporter in MDS ----------------------------------------- Done Setting up pbs gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for qstat... /usr/bin/qstat updating cache /dev/null creating ./config.status creating /opt/globus/libexec/globus-script-pbs-queue Done Setting up condor gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for condor_q... no Setting up lsf gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for lsload... no loading cache ./config.cache checking for mpirun... (cached) /usr/bin/mpirun creating ./config.status creating fork.pm loading cache /dev/null checking for mpirun... /usr/bin/mpirun checking for qdel... /usr/bin/qdel checking for qstat... /usr/bin/qstat checking for qsub... /usr/bin/qsub checking for pbsnodes... /usr/bin/pbsnodes checking for ssh... /usr/bin/ssh updating cache /dev/null creating ./config.status creating /opt/globus/lib/perl/Globus/GRAM/JobManager/pbs.pm loading cache /dev/null checking for condor_submit... no loading cache /dev/null loading cache ./config.cache creating ./config.status creating grid-cert-request-config creating grid-security-config Configuring config_lcgenv ... Configuring config_replica_manager ... Configuring config_users ... Configuring config_sw_dir ... Configuring config_java ... Configuring config_rgma_client ... Welcome to the R-GMA setup utility ---------------------------------- Configuration written to: /opt/glite/etc/rgma/rgma.conf Configuring config_workload_manager_env ... Configuring config_fts_client ... Configuring config_glite_env ... Configuring config_gsissh ... Configuring gsi_openssh ------------------------------------------------------------ Executing... Notes: o Privilege separation is off. o GSI-OpenSSH website is . ------------------------------------------------------------ Finished configuring gsi_openssh. Configuring config_torque_client ... Stopping pbs_mom: [ OK ] Starting pbs_mom: [ OK ] Configuration Complete

21. Configuration UI

[root@bf39 root]# /opt/lcg/yaim/scripts/configure_node /nfs/work/users/jamwer/GRID/JSTestBed.def UI Configuring config_upgrade ... Configuring config_ldconf ... /sbin/ldconfig: /opt/lcg/lib/liblcg-info-api-ldap.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpy.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpl.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigtcl8.so.0 is not a symbolic link Configuring config_sysconfig_edg ... Configuring config_sysconfig_globus ... Configuring config_sysconfig_lcg ... Configuring config_crl ... Configuring config_rfio ... rfiod already stopped: [FAILED] Configuring config_globus ... creating globus-sh-tools-vars.sh creating globus-script-initializer creating Globus::Core::Paths checking globus-hostname Done Creating... /opt/globus/etc/grid-info.conf Done Creating... /opt/globus/sbin/SXXgris /opt/globus/libexec/grid-info-script-initializer /opt/globus/libexec/grid-info-mds-core /opt/globus/libexec/grid-info-common /opt/globus/libexec/grid-info-cpu* /opt/globus/libexec/grid-info-fs* /opt/globus/libexec/grid-info-mem* /opt/globus/libexec/grid-info-net* /opt/globus/libexec/grid-info-platform* /opt/globus/libexec/grid-info-os* /opt/globus/etc/grid-info-resource-ldif.conf /opt/globus/etc/grid-info-resource-register.conf /opt/globus/etc/grid-info-resource.schema /opt/globus/etc/grid.gridftpperf.schema /opt/globus/etc/gridftp-resource.conf /opt/globus/etc/gridftp-perf-info /opt/globus/etc/grid-info-slapd.conf /opt/globus/etc/grid-info-site-giis.conf /opt/globus/etc/grid-info-site-policy.conf /opt/globus/etc/grid-info-server-env.conf /opt/globus/etc/grid-info-deployment-comments.conf Done Creating gatekeeper configuration file... Done Creating state file directory. Done. Reading gatekeeper configuration file... Determining system information... Creating job manager configuration file... Done Setting up fork gram reporter in MDS ----------------------------------------- Done Setting up pbs gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for qstat... /usr/bin/qstat updating cache /dev/null creating ./config.status creating /opt/globus/libexec/globus-script-pbs-queue Done Setting up condor gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for condor_q... no Setting up lsf gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for lsload... no loading cache ./config.cache checking for mpirun... (cached) /usr/bin/mpirun creating ./config.status creating fork.pm loading cache /dev/null checking for mpirun... /usr/bin/mpirun checking for qdel... /usr/bin/qdel checking for qstat... /usr/bin/qstat checking for qsub... /usr/bin/qsub checking for pbsnodes... /usr/bin/pbsnodes checking for ssh... /usr/bin/ssh updating cache /dev/null creating ./config.status creating /opt/globus/lib/perl/Globus/GRAM/JobManager/pbs.pm loading cache /dev/null checking for condor_submit... no loading cache /dev/null loading cache ./config.cache creating ./config.status creating grid-cert-request-config creating grid-security-config Configuring config_lcgenv ... Configuring config_replica_manager ... Configuring config_java ... Configuring config_rgma_client ... Welcome to the R-GMA setup utility ---------------------------------- Configuration written to: /opt/glite/etc/rgma/rgma.conf Configuring config_workload_manager_client ... Configuring config_fts_client ... Configuring config_glite_env ... Configuring config_gsissh ... Configuring gsi_openssh ------------------------------------------------------------ Executing... Notes: o Privilege separation is off. o GSI-OpenSSH website is . ------------------------------------------------------------ Finished configuring gsi_openssh. Configuration Complete

22. Configuration SE

[root@bf40 grid-security]# cp /nfs/work/users/jamwer/GRID/bf40cert.pem hostcert.pem [root@bf40 grid-security]# cp /nfs/work/users/jamwer/GRID/bf40key.pem hostkey.pem [root@bf40 grid-security]# chmod 400 hostkey.pem [root@bf40 grid-security]# chmod 444 hostcert.pem [root@bf40 grid-security]# ls -l total 124 drwxr-xr-x 2 root root 8192 Nov 8 10:24 certificates drwxrwxr-x 2 root edguser 8192 Nov 8 10:26 gridmapdir -rw-r--r-- 1 root root 91386 Nov 8 10:28 grid-mapfile -r--r--r-- 1 root root 2396 Nov 8 14:32 hostcert.pem -r-------- 1 root root 1931 Nov 8 14:33 hostkey.pem drwxr-xr-x 2 root root 4096 Nov 2 12:37 vomsdir [root@bf40 root]# /opt/lcg/yaim/scripts/configure_node /nfs/work/users/jamwer/GRID/JSTestBed.def SE_classic Configuring config_upgrade ... Configuring config_ldconf ... /sbin/ldconfig: /opt/glite/externals/lib/libswigpy.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigpl.so.0 is not a symbolic link /sbin/ldconfig: /opt/glite/externals/lib/libswigtcl8.so.0 is not a symbolic link Configuring config_sysconfig_edg ... Configuring config_sysconfig_globus ... Configuring config_sysconfig_lcg ... Configuring config_crl ... Configuring config_rfio ... Stopping rfiod: [ OK ] Starting rfiod: [ OK ] Configuring config_host_certs ... Configuring config_users ... Configuring config_edgusers ... Configuring config_mkgridmap ... Configuring config_java ... Configuring config_rgma_client ... Welcome to the R-GMA setup utility ---------------------------------- Configuration written to: /opt/glite/etc/rgma/rgma.conf Configuring config_gip ... Setting up an R-GMA Gin... - Configuring a gip information provider - Not configuring an fmon information provider - Not configuring a glite-ce information provider Wrote configuration to: /opt/glite/etc/rgma-gin/gin.conf All done Stopping rgma-gin: [ OK ] Starting rgma-gin: [ OK ] Configuring config_globus ... creating globus-sh-tools-vars.sh creating globus-script-initializer creating Globus::Core::Paths checking globus-hostname Done Creating... /opt/globus/etc/grid-info.conf Done Creating... /opt/globus/sbin/SXXgris /opt/globus/libexec/grid-info-script-initializer /opt/globus/libexec/grid-info-mds-core /opt/globus/libexec/grid-info-common /opt/globus/libexec/grid-info-cpu* /opt/globus/libexec/grid-info-fs* /opt/globus/libexec/grid-info-mem* /opt/globus/libexec/grid-info-net* /opt/globus/libexec/grid-info-platform* /opt/globus/libexec/grid-info-os* /opt/globus/etc/grid-info-resource-ldif.conf /opt/globus/etc/grid-info-resource-register.conf /opt/globus/etc/grid-info-resource.schema /opt/globus/etc/grid.gridftpperf.schema /opt/globus/etc/gridftp-resource.conf /opt/globus/etc/gridftp-perf-info /opt/globus/etc/grid-info-slapd.conf /opt/globus/etc/grid-info-site-giis.conf /opt/globus/etc/grid-info-site-policy.conf /opt/globus/etc/grid-info-server-env.conf /opt/globus/etc/grid-info-deployment-comments.conf Done Creating gatekeeper configuration file... Done Creating state file directory. Done. Reading gatekeeper configuration file... Determining system information... Creating job manager configuration file... Done Setting up fork gram reporter in MDS ----------------------------------------- Done Setting up pbs gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for qstat... no Setting up condor gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for condor_q... no Setting up lsf gram reporter in MDS ---------------------------------------- loading cache /dev/null checking for lsload... no loading cache ./config.cache checking for mpirun... (cached) /usr/bin/mpirun creating ./config.status creating fork.pm loading cache /dev/null checking for mpirun... /usr/bin/mpirun checking for qdel... no loading cache /dev/null checking for condor_submit... no loading cache /dev/null loading cache ./config.cache creating ./config.status creating grid-cert-request-config creating grid-security-config Stopping Globus MDS [ OK ] Starting Globus MDS (gcc32dbgpthr) [ OK ] Stopping globus-gridftp: [ OK ] Starting globus-gridftp: [ OK ] Shutting down lcg-mon-gridftp: [ OK ] Starting lcg-mon-gridftp [ OK ] Configuring config_seclassic ... Configuring config_fmon_client ... Stopping edg-fmon-agent: [ OK ] Starting edg-fmon-agent: [ OK ] Configuring config_lcmaps ... Configuring config_lcas ... Configuration Complete

23. Testing LCG middleware

[root@bf39 GRID]# adduser jamwer [root@bf39 GRID]# passwd jamwer [root@bf39 GRID]# mkdir /home/jamwer/.globus [root@bf39 GRID]# cp usercert.pem /home/jamwer/.globus [root@bf39 GRID]# cp userkey.pem /home/jamwer/.globus [root@bf39 GRID]# chown jamwer.jamwer /home/jamwer/.globus/* [root@bf39 GRID]# chmod 400 /home/jamwer/.globus/userkey.pem [root@bf39 GRID]# ls -l /home/jamwer/.globus total 8 -r--r--r-- 1 jamwer jamwer 2224 Nov 8 11:11 usercert.pem -r-------- 1 jamwer jamwer 1920 Nov 8 11:13 userkey.pem [root@bf39 GRID]# exit exit Login as jamwer: [jamwer@bf39 jamwer]$ pwd /home/jamwer [jamwer@bf39 jamwer]$ grid-proxy-init Your identity: /C=UK/O=eScience/OU=Manchester/L=HEP/CN=james werner Enter GRID pass phrase for this identity: Creating proxy .................................................. Done Your proxy is valid until: Tue Nov 8 23:14:57 2005

Job submission using Globus Toolkit

[jamwer@bf39 jamwer]$ globus-job-run bf32.hep.man.ac.uk /bin/hostname bf32.hep.man.ac.uk

Job submission using EDG

[jamwer@bf39 GRID]$ vi testjob.jdl Executable = "testjob.sh"; StdOutput = "testjob.out"; StdError = "testjob.err"; InputSandbox = {"./testjob.sh"}; OutputSandbox = {"testjob.out","testjob.err"}; [jamwer@bf39 jamwer]$ lcg-infosites --vo babar ce --is bf32.hep.man.ac.uk **************************************************************** These are the related data for babar: (in terms of queues and CPUs) **************************************************************** #CPU Free Total Jobs Running Waiting ComputingElement ---------------------------------------------------------- 12 12 0 0 0 bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar vi testjob.sh #!/bin/bash date hostname echo "**************************************" echo "env | sort" echo "**************************************" env | sort [jamwer@bf39 jamwer]$ edg-job-list-match --vo babar testjob.jdl Selected Virtual Organisation name (from --vo option): babar Connecting to host bf31.hep.man.ac.uk, port 7772 *************************************************************************** COMPUTING ELEMENT IDs LIST The following CE(s) matching your job requirements have been found: *CEId* bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar *************************************************************************** [jamwer@bf39 GRID]$ vi testjob.jdl Executable = "testjob.sh"; StdOutput = "testjob.out"; StdError = "testjob.err"; InputSandbox = {"./testjob.sh"}; OutputSandbox = {"testjob.out","testjob.err"}; Requirements = other.GlueCEUniqueID == "bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar"; [jamwer@bf39 jamwer]$ edg-job-list-match --vo babar testjob.jdl Selected Virtual Organisation name (from --vo option): babar Connecting to host bf31.hep.man.ac.uk, port 7772 *************************************************************************** COMPUTING ELEMENT IDs LIST The following CE(s) matching your job requirements have been found: *CEId* bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar *************************************************************************** [jamwer@bf39 jamwer]$ edg-job-submit --vo babar testjob.jdl Selected Virtual Organisation name (from --vo option): babar Connecting to host bf31.hep.man.ac.uk, port 7772 Logging to host bf31.hep.man.ac.uk, port 9002 ********************************************************************************************* JOB SUBMIT OUTCOME The job has been successfully submitted to the Network Server. Use edg-job-status command to check job current status. Your job identifier (edg_jobId) is: - https://bf31.hep.man.ac.uk:9000/01fW-PAe9TleVqAkt5eVxQ ********************************************************************************************* [jamwer@bf39 jamwer]$ edg-job-status https://bf31.hep.man.ac.uk:9000/01fW-PAe9TleVqAkt5eVxQ ************************************************************* BOOKKEEPING INFORMATION: Status info for the Job : https://bf31.hep.man.ac.uk:9000/01fW-PAe9TleVqAkt5eVxQ Current Status: Scheduled Status Reason: Job successfully submitted to Globus Destination: bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar reached on: Mon Nov 14 16:22:00 2005 ************************************************************* [jamwer@bf39 jamwer]$ edg-job-status https://bf31.hep.man.ac.uk:9000/01fW-PAe9TleVqAkt5eVxQ ************************************************************* BOOKKEEPING INFORMATION: Status info for the Job : https://bf31.hep.man.ac.uk:9000/01fW-PAe9TleVqAkt5eVxQ Current Status: Done (Success) Exit code: 0 Status Reason: Job terminated successfully Destination: bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar reached on: Mon Nov 14 16:26:47 2005 ************************************************************* [jamwer@bf39 jamwer]$ edg-job-get-output --dir . https://bf31.hep.man.ac.uk:9000/01fW-PAe9TleVqAkt5eVxQ Retrieving files from host: bf31.hep.man.ac.uk ( for https://bf31.hep.man.ac.uk:9000/01fW-PAe9TleVqAkt5eVxQ ) ********************************************************************************* JOB GET OUTPUT OUTCOME Output sandbox files for the job: - https://bf31.hep.man.ac.uk:9000/01fW-PAe9TleVqAkt5eVxQ have been successfully retrieved and stored in the directory: /home/jamwer/jamwer_01fW-PAe9TleVqAkt5eVxQ ********************************************************************************* [jamwer@bf39 jamwer]$ ls edglog.log lixoi testjob.jdl jamwer_01fW-PAe9TleVqAkt5eVxQ testfile.txt testjob.sh [jamwer@bf39 jamwer]$ cd jamwer_01fW-PAe9TleVqAkt5eVxQ/ [jamwer@bf39 jamwer_01fW-PAe9TleVqAkt5eVxQ]$ ls testjob.err testjob.out [jamwer@bf39 jamwer_01fW-PAe9TleVqAkt5eVxQ]$ cat testjob.err [jamwer@bf39 jamwer_01fW-PAe9TleVqAkt5eVxQ]$ cat testjob.out Tue Nov 15 15:58:41 GMT 2005 bf38.hep.man.ac.uk ************************************** env | sort ************************************** APEL_HOME=/opt/glite _=/bin/env CLASSADJ_INSTALL_PATH=/usr COG_INSTALL_PATH=/usr DPM_HOST=bf40 DPNS_HOST=bf40 EDG_LOCATION=/opt/edg EDG_LOCATION_VAR=/opt/edg/var EDG_TMP=/tmp EDG_WL_JOBID=https://bf31.hep.man.ac.uk:9000/0817If8iQIbGWgPDwxhfZw EDG_WL_LOCATION=/opt/edg EDG_WL_LOCATION_VAR=/opt/edg/var EDG_WL_LOG_DESTINATION=bf32.hep.man.ac.uk EDG_WL_RB_BROKERINFO=/home/babar001/globus-tmp.bf38.20969.0/WMS_bf38_021439_https_3a_2f_2fbf31.hep.man.ac.uk_3a9000_2f0817If8iQIbGWgPDwxhfZw/.BrokerInfo EDG_WL_SEQUENCE_CODE=UI=000003:NS=0000000003:WM=000004:BH=0000000000:JSS=000003:LM=000000:LRMS=000002:APP=000000 EDG_WL_TMP=/var/edgwl EDG_WL_USER=edguser ENVIRONMENT=BATCH G_BROKEN_FILENAMES=1 GLITE_LOCATION_LOG=/opt/glite/log GLITE_LOCATION=/opt/glite GLITE_LOCATION_TMP=/opt/glite/tmp GLITE_LOCATION_VAR=/opt/glite/var GLOBUS_GASS_CACHE_DEFAULT=/home/babar001/globus-tmp.bf38.20969.0/globus-tmp.bf38.20969.0 GLOBUS_GRAM_JOB_CONTACT=https://bf32.hep.man.ac.uk:20001/3556/1132070271/ GLOBUS_GRAM_MYJOB_CONTACT=URLx-nexus://bf32.hep.man.ac.uk:20003/ GLOBUS_LOCATION=/opt/globus GLOBUS_PATH=/opt/globus GLOBUS_REMOTE_IO_URL=/tmp/globus-tmp.bf38.20969.2 GLOBUS_TCP_PORT_RANGE=20000 25000 GPT_LOCATION=/opt/gpt HISTSIZE=1000 HOME=/home/babar001/globus-tmp.bf38.20969.0 HOSTNAME=bf38.hep.man.ac.uk INPUTRC=/etc/inputrc JAVA_INSTALL_PATH=/usr/java/j2sdk1.4.2_04 JAVA_LOCATION=/usr/java/j2sdk1.4.2_10 LANG=en_US.UTF-8 LCG_GFAL_INFOSYS=bf30.hep.man.ac.uk:2170 LCG_LOCATION=/opt/lcg LCG_LOCATION_VAR=/opt/lcg/var LCG_TMP=/tmp LD_LIBRARY_PATH=/opt/globus/lib:/opt/lcg/lib:/opt/edg/lib:/usr/local/lib:/opt/glite/lib:/opt/glite/externals/lib:/opt/d-cache/dcap/lib LESSOPEN=|/usr/bin/lesspipe.sh %s LIBPATH=/opt/globus/lib:/usr/lib:/lib LOG4J_INSTALL_PATH=/usr LOGNAME=babar001 LS_COLORS= MAIL=/var/spool/mail/babar001 MANPATH=/opt/globus/man::/opt/edg/share/man:/opt/glite/share/man:/opt/lcg/share/man:/opt/edg/man MYPROXY_SERVER=bf30.hep.man.ac.uk PATH=/opt/globus/bin:/opt/globus/sbin:/usr/java/j2sdk1.4.2_10/bin:/opt/lcg/bin:/usr/kerberos/bin:/opt/edg/bin:/usr/local/bin:/usr/java/j2sdk1.4.2_10/bin:/bin:/usr/bin:/usr/X11R6/bin:/opt/glite/bin:/opt/glite/externals/bin:/opt/gpt/sbin:/opt/d-cache/srm/bin:/opt/d-cache/dcap/bin:/opt/edg/sbin PBS_ENVIRONMENT=PBS_BATCH PBS_JOBCOOKIE=579CC5F51FA702CF3CC8683360D0E5ED PBS_JOBID=5.bf32.hep.man.ac.uk PBS_JOBNAME=STDIN PBS_MOMPORT=15003 PBS_NODEFILE=/var/spool/pbs/aux/5.bf32.hep.man.ac.uk PBS_NODENUM=0 PBS_O_HOME=/home/babar001 PBS_O_HOST=bf32.hep.man.ac.uk PBS_O_LANG=C PBS_O_LOGNAME=babar001 PBS_O_PATH=/sbin:/usr/sbin:/bin:/usr/bin:/usr/X11R6/bin PBS_O_QUEUE=babar PBS_O_SHELL=/bin/bash PBS_O_WORKDIR=/home/babar001 PBS_QUEUE=babar PBS_TASKNUM=1 PERLLIB=/opt/edg/lib/perl:/opt/glite/lib/perl5:/opt/lcg/lib/perl PWD=/home/babar001/globus-tmp.bf38.20969.0/WMS_bf38_021439_https_3a_2f_2fbf31.hep.man.ac.uk_3a9000_2f0817If8iQIbGWgPDwxhfZw PYTHONPATH=/opt/edg/lib:/opt/edg/lib/python:/opt/lcg/lib/python:/opt/glite/lib/python RGMA_HOME=/opt/glite SASL_PATH=/opt/globus/lib/sasl SCRATCH_DIRECTORY=/home/babar001/globus-tmp.bf38.20969.1 SHELL=/bin/sh SHLIB_PATH=/opt/globus/lib SHLVL=4 SRM_PATH=/opt/d-cache/srm SSH_ASKPASS=/usr/libexec/openssh/gnome-ssh-askpass TMPDIR=/var/spool/pbs/5.bf32.hep.man.ac.uk USER=babar001 VO_ALICE_DEFAULT_SE=bf40.hep.man.ac.uk VO_ALICE_SW_DIR=/nfs/work/lcg/alice VO_ATLAS_DEFAULT_SE=bf40.hep.man.ac.uk VO_ATLAS_SW_DIR=/nfs/work/lcg/atlas VO_BABAR_DEFAULT_SE=bf40.hep.man.ac.uk VO_BABAR_SW_DIR=/nfs/work/lcg/babar VO_CMS_DEFAULT_SE=bf40.hep.man.ac.uk VO_CMS_SW_DIR=/nfs/work/lcg/cms VO_DTEAM_DEFAULT_SE=bf40.hep.man.ac.uk VO_DTEAM_SW_DIR=/nfs/work/lcg/dteam VO_LHCB_DEFAULT_SE=bf40.hep.man.ac.uk VO_LHCB_SW_DIR=/nfs/work/lcg/lhcb VO_NA48_DEFAULT_SE=bf40.hep.man.ac.uk VO_NA48_SW_DIR=/nfs/work/lcg/na48 VO_SIXT_DEFAULT_SE=bf40.hep.man.ac.uk VO_SIXT_SW_DIR=/nfs/work/lcg/sixt X509_USER_PROXY=/tmp/globus-tmp.bf38.20969.0 Following execution in the Computer element during the processing: [root@bf32 root]# qstat [root@bf32 root]# qstat Job id Name User Time Use S Queue ---------------- ---------------- ---------------- -------- - ----- 3.bf32 STDIN babar001 0 Q babar [root@bf32 root]# qstat Job id Name User Time Use S Queue ---------------- ---------------- ---------------- -------- - ----- 3.bf32 STDIN babar001 0 R babar [root@bf32 root]# qstat [root@bf32 root]#

SE tests

[jamwer@bf39 jamwer]$ lcg-infosites --vo babar --is bf30.hep.man.ac.uk se ************************************************************** These are the related data for babar: (in terms of SE) ************************************************************** Avail Space(Kb) Used Space(Kb) Type SEs ---------------------------------------------------------- 3394016 5213972 bf40.hep.man.ac.uk [jamwer@bf39 jamwer]$ edg-gridftp-ls gsiftp://bf40.hep.man.ac.uk/ tmp etc var storage home root opt usr lib bin sbin dev proc boot tftpboot work misc initrd mnt afscache lost+found afs [jamwer@bf39 jamwer]$ lcg-infosites --vo babar closeSE --is bf30.hep.man.ac.uk Name of the CE: bf32.hep.man.ac.uk:2119/jobmanager-lcgpbs-babar Name of the close SE: bf40.hep.man.ac.uk

RGMA Test

Client: [root@bf32 root]# $RGMA_HOME/bin/rgma-client-check *** Running R-GMA client tests on bf32.hep.man.ac.uk *** Checking C API: Success Checking C++ API: Success Checking Python API: Success Checking Java API: Success Checking for safe arrival of tuples, please wait... Success *** R-GMA client test successful *** Server: [root@bf30 grid-security]# $RGMA_HOME/bin/rgma-server-check *** Running R-GMA server tests on bf30.hep.man.ac.uk *** Checking Tomcat is running on the local machine... Successfully connected to Tomcat. Java VM version: 1.4.2_10 (OK) Connecting to http://lcgic01.gridpp.rl.ac.uk:8080/R-GMA/SchemaServlet... Successfully connected to Schema. Using PongServlet (1) on http://lcgic01.gridpp.rl.ac.uk:8080/R-GMA/PongServlet. Checking other servlets... Connecting to http://bf30.hep.man.ac.uk:8080/R-GMA/ConsumerServlet:OK Connecting to streaming port 8088 on bf30.hep.man.ac.uk:OK Checking clock synchronization: OK Connecting to http://bf30.hep.man.ac.uk:8080/R-GMA/StreamProducerServlet:OK Checking clock synchronization: OK Connecting to http://bf30.hep.man.ac.uk:8080/R-GMA/LatestProducerServlet:OK Checking clock synchronization: OK Connecting to http://bf30.hep.man.ac.uk:8080/R-GMA/DBProducerServlet:OK Checking clock synchronization: OK Connecting to http://bf30.hep.man.ac.uk:8080/R-GMA/CanonicalProducerServlet:OK Checking clock synchronization: OK Connecting to http://bf30.hep.man.ac.uk:8080/R-GMA/ArchiverServlet:OK Checking clock synchronization: OK *** R-GMA server test successful ***

Pbs tests

References: http://www.cs.ubc.ca/~pramanik/pbsInstall.htm http://www.uic.edu/depts/accc/hardware/argo/qsub.html http://www.sao.nrc.ca/~gabriel/pbs/pbs_user.html [root@bf32 root]# pbsnodes -a bf33.hep.man.ac.uk state = free np = 2 properties = lcgpro ntype = cluster status = arch=linux,uname=Linux bf33.hep.man.ac.uk 2.4.21-27.0.2.ELsmp #1 SMP Tue Jan 18 19:25:35 CST 2005 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=3369,totmem=3121864kb,availmem=2155956kb,physmem=1025392kb,ncpus=2,loadave=0.00,rectime=1131964456 bf34.hep.man.ac.uk state = free np = 2 properties = lcgpro ntype = cluster status = arch=linux,uname=Linux bf34.hep.man.ac.uk 2.4.21-27.0.2.ELsmp #1 SMP Tue Jan 18 19:25:35 CST 2005 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=1107684,totmem=3121864kb,availmem=2165704kb,physmem=1025392kb,ncpus=2,loadave=0.00,rectime=1131964468 bf35.hep.man.ac.uk state = free np = 2 properties = lcgpro ntype = cluster status = arch=linux,uname=Linux bf35.hep.man.ac.uk 2.4.21-27.0.2.ELsmp #1 SMP Tue Jan 18 19:25:35 CST 2005 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=1107712,totmem=3121864kb,availmem=2164056kb,physmem=1025392kb,ncpus=2,loadave=0.00,rectime=1131964458 bf36.hep.man.ac.uk state = free np = 2 properties = lcgpro ntype = cluster status = arch=linux,uname=Linux bf36.hep.man.ac.uk 2.4.21-27.0.2.ELsmp #1 SMP Tue Jan 18 19:25:35 CST 2005 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=1106229,totmem=3121864kb,availmem=2163452kb,physmem=1025392kb,ncpus=2,loadave=0.00,rectime=1131964456 bf37.hep.man.ac.uk state = free np = 2 properties = lcgpro ntype = cluster status = arch=linux,uname=Linux bf37.hep.man.ac.uk 2.4.21-27.0.2.ELsmp #1 SMP Tue Jan 18 19:25:35 CST 2005 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=1106028,totmem=3121864kb,availmem=2163304kb,physmem=1025392kb,ncpus=2,loadave=0.00,rectime=1131964467 bf38.hep.man.ac.uk state = free np = 2 properties = lcgpro ntype = cluster status = arch=linux,uname=Linux bf38.hep.man.ac.uk 2.4.21-27.0.2.ELsmp #1 SMP Tue Jan 18 19:25:35 CST 2005 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=1106362,totmem=3121864kb,availmem=2163628kb,physmem=1025392kb,ncpus=2,loadave=0.00,rectime=1131964478

24. Operational procedures

Restarting bdii in the bdii computer and CE:

[root@bf30 root]# /etc/init.d/bdii restart Stopping BDII [ OK ] Starting BDII [ OK ] [root@bf32 root]# /etc/init.d/bdii restart Stopping BDII [ OK ] Starting BDII [ OK ]

Restarting globus-mds in the CE and SE:

[root@bf32 root]# /etc/init.d/globus-mds restart Stopping Globus MDS [ OK ] Starting Globus MDS (gcc32dbgpthr) [ OK ] [root@bf40 root]# /etc/init.d/globus-mds restart Stopping Globus MDS [ OK ] Starting Globus MDS (gcc32dbgpthr) [ OK ]

Losing contact with all your farm

When you lose contact with all of your machines, please login to the console and do: [root@bf32 root]# ifconfig -a [root@bf32 root]# netstat -rn and send the output to your site DNS person. For example, we had our machines showing they are in subnet 194.36.2.XX instead of subnet 194.36.3.XX.

25. The firewall issue.

Using grid from your laptop/desktop requires some ports are open in the firewall. The description about it are available at: LCG application, middleware & security - last 2 pages are important, and not able to be printed! Globus requirements Security and Firewall Settings Reporting on experiences operating Globus through firewall.

Top

Last modified:
Copyright 2004 Manchester University
Feedback to: jamwer@hep.man.ac.uk