This page last changed on Jan 14, 2014 by mbabik.
Summary
| Start Date | 8 December 2012 |
| Release Date | 28 October 2013 |
| Status | Released |
| Validation Steps | SAM-3251 |
| Validation Status | Validated |
Description
This release focuses mainly on the integration of EMI probes. In addition, it contains several fixes for bugs identified during the deployment of SAM Update 20.
The following probes were integrated:
- ARC probes (nordugrid-arc-nagios-plugins-1.6.1-1.rc1.el5)
- new ARGUS probes (nagios-plugins-argus-1.1.0-2.el5)
- BDII probes (nagios-plugins-bdii-1.0.14-1.el5)
- CREAMCE probes (emi-cream-nagios-1.0.1-4.el5.sam)
- FTS probes (nagios-plugins-fts-1.0.1-1.el5)
- GLEXEC probe (nagios-plugins-emi.glexec-0.3.0-1.sl5)
- LFC probes (nagios-plugins-lfc-0.9.5-1.el5)
- new MPI probes (egi-mpi-nagios-0.0.5-1.el5)
- SRM probes (emi.dcache.srm-probes-1.0.0-1.el5)
- UNICORE probes (unicore-nagios-plugins-2.2.1-1.sl5)
- WMS probes (emi-wms-nagios-3.5.0-3.sl5)
- WN replication probes (nagios-plugins-wn-rep-1.0.0-1.sl5)
The full list of metric changes is available at: SAM Doc FAQs
Installation and Configuration
SAM-Nagios
The new installation guide is available at: New SAM-Nagios install guide
| An upgrade from previous SAM versions is not possible for this release. We strongly recommend installing SAM-Update 22 starting from a base operating system. |
The database backup is no longer performed automatically as part of yaim. The following yaim function can be executed manually to create a backup of your database:
/opt/glite/yaim/bin/yaim -r -d 6 -s /etc/yaim/site-info.def -n SAM_NAGIOS -f config_mysql_backup
Added YAIM variables in this release:
| Component | Name | Description | Default | Mandatory | Example |
| DB | DB_TMP_DIR | tmp directory for MySQL | Yes | No | "/var/tmp" |
Removed YAIM variables in this release:
| In order to support transparent migration of SAM to CNRS please add the following yaim variables to your site-info:
ATP_ROOT_URL="http://mon.egi.eu/atp"
POEM_SYNC_URLS="http://mon.egi.eu/poem/api/0.1/json/" |
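The two variables above can simply be added with an editor. As a minimal sketch, assuming a POSIX shell and that site-info refers to the file passed to yaim with -s (for example /etc/yaim/site-info.def, as in the backup command on this page), the hypothetical helper set_yaim_var below appends a variable only if the file does not already define it:

```shell
# Hypothetical helper, not part of yaim: append KEY="VALUE" to a site-info
# file unless a line starting with KEY= is already present.
# Usage: set_yaim_var FILE KEY VALUE
set_yaim_var() {
    sv_file=$1
    sv_key=$2
    sv_value=$3
    if grep -q "^${sv_key}=" "$sv_file" 2>/dev/null; then
        # Never overwrite an existing definition; leave it for manual review.
        echo "$sv_key already defined in $sv_file, leaving it unchanged"
    else
        printf '%s="%s"\n' "$sv_key" "$sv_value" >> "$sv_file"
    fi
}

# Example calls (the file path is an assumption, adjust to your setup):
# set_yaim_var /etc/yaim/site-info.def ATP_ROOT_URL "http://mon.egi.eu/atp"
# set_yaim_var /etc/yaim/site-info.def POEM_SYNC_URLS "http://mon.egi.eu/poem/api/0.1/json/"
```

An existing definition is deliberately left untouched, so a site that already points at another URL has to change the value by hand.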
| In order for ARC SRM and LFC tests to work, the following needs to be done:
1. The yaim variable JOBSUBMIT_WN_SE_REP_FILE must be set to the file in which hr.srce.GoodSEs will store the list of working SRM endpoints, e.g.
JOBSUBMIT_WN_SE_REP_FILE=GOOD_SES
2. The global attribute LFC_HOST must be set to the LFC in localdb, e.g.
GLOBAL_ATTRIBUTE!LFC_HOST!prod-lfc-shared-central.cern.ch
ARC SRM tests require additional configuration in the file /etc/nagios/plugins/arcnagios-local.ini. Because the directory is not the same on all SEs, admins must manually define the directory for each SE that might be used. Details can be found in the Switch section of the documentation: http://git.nbi.ku.dk/downloads/NorduGridARCNagiosPlugins/arcce.html#custom-substitutions-in-job-test-sections. Alternatively, se_host and se_test_dir can be used to define a single SE for ARC SRM tests. |
SAM-Gridmon
The new installation guide is available at: New SAM-Gridmon install guide
The database deployment is not performed automatically as part of yaim. The following yaim function should be executed manually:
/opt/glite/yaim/bin/yaim -r -s /etc/lcg-quattor-site-info.def -n sam_gridmon -f config_database
Added YAIM variables in this release:
| Component | Name | Description | Default | Mandatory | Example |
| MyWLCG | MYWLCG_REPORT_VO_ALL_SITES_PROFILES | Profiles to be used on VO All sites report | Yes | No | "atlas_critical cms_critical" |
Known Issues
- For NGIs monitoring ARC or using ARC probes, there is a missing dependency that needs to be installed manually:
$ yum install nordugrid-arc-plugins-globus
- If you use the latest CentOS 5.10 (or SL5.10), be aware that the base repository now contains mysql51 packages. Since base has a higher priority than the sam repo, please modify the exclude setting for base and updates accordingly:
[base]
priority=2
protect=1
exclude = perl-DBI mysql51*
[updates]
priority=2
protect=1
exclude = perl-DBI mysql51*
- If, after restoring the database from a backup, yaim fails with the output below, please apply the patch atp_service_type_update.patch*:
INFO: Creating database schema
Existing DB schema and versions:
- atp is currently 1.19
- metricstore is currently 1.17
- mddb is currently 1.1
- poem_sync is currently 1.3
Upgrading atp DB to version 1.20
ERROR: deploy_dbschema.pl failed, check /var/log/sam-db.log.
ERROR: Configuration error !
(*credits to Jan Astalos for the fix)
- eu.egi.mpi.complexjob.CREAMCE-JobState-/ops/Role=lcgadmin fails with the error "SMPGranularity and HostNumber are mutually exclusive when WholeNodes allocation is not requested: wrong combination of values" (more information at https://ggus.eu/ws/ticket_info.php?ticket=98851).
In /usr/libexec/grid-monitoring/probes/eu.egi.mpi/complexjob/jdl.template
replace:
HostNumber = 2;
with:
CPUNumber = 4;
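The JDL replacement above can be made by hand or scripted. The following is a minimal sketch and not part of the SAM packages: the function name patch_mpi_jdl is hypothetical, and it assumes the template contains the line as `HostNumber = 2;` (optionally indented, with flexible spacing around `=`). A copy of the original file is kept next to it with a .bak suffix.

```shell
# Hypothetical helper: replace "HostNumber = 2;" with "CPUNumber = 4;" in a
# JDL template, keeping the original as <template>.bak.
# Usage: patch_mpi_jdl TEMPLATE
patch_mpi_jdl() {
    pj_template=$1
    # Keep a backup first; stop if the template cannot be copied.
    cp -p "$pj_template" "$pj_template.bak" || return 1
    # Read from the backup and rewrite the template, preserving indentation.
    sed 's/^\([[:space:]]*\)HostNumber[[:space:]]*=[[:space:]]*2;/\1CPUNumber = 4;/' \
        "$pj_template.bak" > "$pj_template"
}

# Example call, using the path given in the known issue above:
# patch_mpi_jdl /usr/libexec/grid-monitoring/probes/eu.egi.mpi/complexjob/jdl.template
```

If the template does not contain a matching line, the file is rewritten unchanged, so it is worth comparing it with the .bak copy afterwards.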
Package List
The full list of Update-22 packages and dependencies is available at: SAM Update-22 repository
SAM-Nagios changes
- glite-yaim-nagios-1.10.31-1.el5
- ncg-metric-config-1.3.13-1.el5
- grid-monitoring-config-gen-0.93.6-1.el5
- grid-monitoring-probes-hr.srce-0.37.0-1.el5
- grid-monitoring-probes-ch.cern.sam-1.6.14-1.el5
- grid-monitoring-probes-eu.egi.sec-1.0.10-2.el5
- grid-monitoring-probes-cadist-0.5.0-1.el5
- mywlcg-atp-web-1.26.2-3.el5
- perl-GridMon-1.0.73-1.el5
- poem-0.9.84-1.el5 and poem-sync-0.9.84-1.el5
- python-GridMon-1.1.13-1.el5
SAM-Gridmon changes
- glite-yaim-nagios-1.10.31-1.el5
- msg-consume2db-1.0.23-1.el5
- mywlcg-atp-web-1.26.2-3.el5
- poem-0.9.84-1.el5 and poem-sync-0.9.84-1.el5
Tickets List