This page last changed on Jun 18, 2013 by mbabik.

Release: Update-17

Summary

Start Date: 13 February 2012
End Date: 02 May 2012
Status: This is an internal release; please do not install it.
Release Date: 03 Jul 2012
Release Manager: Wojciech Lapka
Main Activities: Integration with POEM

Validation Steps performed

Outlined in SAM-2388

Note

Please also check the release notes and configuration changes of Update-16,
as it was an internal release.

List of packages updated in this release

Node sam-nagios

atp-1.23.9-1.el5
atp-web-1.23.9-1.el5
mywlcg-1.2.8-2.el5
poem-0.9.5-1.el5
poem-sync-0.9.5-1.el5
mrs-1.7.6-4.el5
grid-monitoring-config-gen-0.89.7-1.el5
grid-monitoring-probes-ch.cern.sam-1.6.2-1.el5
grid-monitoring-probes-eu.egi.sec-1.0.6-1.el5
grid-monitoring-probes-org.ndgf-0.10-2.el5
grid-monitoring-probes-org.sam-0.4.1-1.el5
glite-yaim-nagios-1.7.40-7.el5
sam-sync-1.0.6-1.el5
sam-nagios-1.17.2-1.el5
nagios-3.3.1-1.el5.rf.1oat
nagios-plugins-dg-1.0.1-1.el5
ncg-metric-config-1.0.5-1.el5
sam-release-1.17.0-1.el5
msg-nagios-bridge-1.0.63-1.el5
unicore-nagios-plugins-2.1.0-1
unicore-ucc6-5.0.0-1.sl5
unicore-uvos-clc-1.6.0-0.sl5
perl-TOM-2.3-1.el5
perl-Directory-Queue-1.2-1.el5
perl-Net-STOMP-Client-1.2-1.el5
voms2htpasswd-1.12.2-1.el5

New dependencies (not included in the sa1 repository)
java-1.6.0-openjdk
tzdata-java

Dependencies removed
unicore-monitoring-probes

Node sam-gridmon

atp-1.23.9-1.el5
atp-web-1.23.9-1.el5
poem-0.9.5-1.el5
poem-sync-0.9.5-1.el5
ace-0.1.37-1.el5
mywlcg-1.2.8-2.el5
mrs-1.7.6-4.el5
dax-1.0.5-1.el5
glite-yaim-nagios-1.7.40-7.el5
openreports-3.2.07-2
rgf-1.0.4-1
sam-gridmon-1.17.2-1.el5
sam-release-1.17.0-1.el5
ncg-metric-config-1.0.5-1.el5
sqlalchemy-0.7.5-4.el5
perl-TOM-2.3-1.el5
perl-Directory-Queue-1.2-1.el5
perl-Net-STOMP-Client-1.2-1.el5
voms2htpasswd-1.12.2-1.el5

New dependencies (not included in the sa1 repository)
PyXML
curl
libidn
python-curl

Release Notes

  • ACE
    • New Nagios probes (for ops-monitor) for monitoring of ACE behaviour.
    • Use associations between ATP groups (Tiers/Sites).
    • Bug fixes
  • ATP
    • Keep associations between Tiers/PhysicalSites.
    • Bug fixes.
  • DAX
    • New component: Data Transfer Computation Engine - generation of FTS graphs in MyWLCG portal
  • glite-yaim-nagios
    • SAM concurrency improvements
    • Enable filtering of metrics from remote Nagioses
  • grid-monitoring-probes-ch.cern.sam
    • Probe for checking of deployed SAM-Nagios version
  • grid-monitoring-probes-eu.egi.sec
    • Ability to ignore expired CRLs on CRL check
  • MRS
    • Decommissioning of Nagios metric 'org.egee.MrsCheckMissingProbes'
    • Integration with POEM - phase 2/2
      • Computation logic based on POEM
      • New bootstrapping mechanism
    • Bug fixes.
  • MyWLCG
    • Integration of Data Transfers (only sam-gridmon)
    • Hide site names in Gridmap view
  • NCG
    • Integration with POEM
    • Service ncg is replaced by sam-sync (see SAM-2518). The Yaim variable NAGIOS_NCG_ENABLE_CRON is removed and Yaim will always switch on the sam-sync service. To switch off automatic config generation, switch off sam-sync after each Yaim run.
    • The script ncg.reload.sh no longer executes external components (i.e. the atp, mddb and mrs synchronizers). It can now be used for simple changes to the NCG config (e.g. localdb changes) without waiting for the synchronizers to finish.
    • NCG::LocalMetrics::Hash is disabled and only NCG::LocalMetrics::POEM is used. The exceptions are the site and security roles. The Yaim variables NCG_HASH_CONFIG_PROFILES and NCG_PROFILE_FQAN_* are removed. Profiles and FQAN mappings should be defined in the POEM profile.
    • The NCG::LocalMetrics::Hash_local module is obsolete. Metric configuration files and POEM profiles should be used instead.
    • DesktopGrid probes are integrated into SAM (see SAM-2421). For the probes to be configured properly, the URL field in GOCDB must point to the URL where the XML reports are stored (e.g. http://edgi-bridge.ibercivis.es/3gbridge_report_dir). No additional steps on the SAM box are needed for the probes to work.
    • Starting from Update-17, the packages unicore-ucc and unicore-uvos-clc needed for the UNICORE probes are distributed as part of SAM, and manual installation is NOT needed.
  • POEM
    • Service poem-sync replaces mddb-sync to synchronize profiles and metrics.
    • For NGI-Nagios, the migration is transparent and no changes need to be applied.
    • For VO-Nagioses, a namespace needs to be established with a profile that determines how Nagios, MRS and MyEGI are configured. Please follow the VO-Nagios section of the Installing SAM/Nagios guide.
    • A POEM User's Guide is available.
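
The NCG note above says that sam-sync has to be switched off again after every Yaim run if automatic config generation is not wanted. A minimal sketch of that step follows. It assumes that sam-sync is a standard SysV init service managed with `service` and `chkconfig` on EL5, which these release notes do not state explicitly. The function name `disable_sam_sync` and the `RUN` dry-run prefix are hypothetical helpers introduced only for illustration.

```shell
#!/bin/bash
# Sketch only: assumes sam-sync is a SysV init service (not confirmed by the release notes).
# RUN is a hypothetical dry-run prefix: set RUN=echo to print the commands instead of running them.
RUN="${RUN:-}"

disable_sam_sync() {
    # Stop the service now, then keep it from being started at boot.
    $RUN service sam-sync stop
    $RUN chkconfig sam-sync off
}

# Usage (as root, after each Yaim run):
#   disable_sam_sync
```

Running it once with `RUN=echo` shows which commands would be executed before anything is changed on the box.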

Configuration changes (common)

New Yaim configuration variables:

POEM_WEB_ENABLE - enable the POEM web instance (enabled by default on SAM-Gridmon)
POEM_NAMESPACE - namespace of the POEM web instance
POEM_ATP_ROOT_URL - ATP URL used by the POEM web instance
POEM_IMPORT_FROM_MDDB - if True, bootstrap profiles from MDDB; otherwise use a fixture file
POEM_DEBUG - enable debugging of the POEM web instance
POEM_ADMIN_NAME - admin name of the POEM web instance
POEM_ADMIN_EMAIL - admin e-mail of the POEM web instance
POEM_SYNC_URLS - URLs to synchronize from (pointing to POEM web instances; SAM/Nagios defaults to grid-monitoring; SAM-Gridmon defaults to localhost)
POEM_SYNC_NS_RESTRICT - restrict synchronization to the given profiles and namespaces (space-separated namespace!profile values)
MYEGI_DEFAULT_PROFILE - default profile in MyEGI
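
As an illustration of how the variables above might look in a Yaim configuration file, here is a hypothetical fragment. Every value is a placeholder (the example.org hosts, the namespace and the profile names are invented), and the `True`/`False` spelling for booleans is an assumption based on the wording of POEM_IMPORT_FROM_MDDB. The format of POEM_SYNC_URLS is not documented here, so a single URL is shown.

```shell
# Hypothetical Yaim fragment; all values are placeholders, not documented defaults.
POEM_WEB_ENABLE=True                      # boolean spelling is an assumption
POEM_NAMESPACE="example-namespace"
POEM_ATP_ROOT_URL="http://atp.example.org/atp"
POEM_IMPORT_FROM_MDDB=False               # False: bootstrap from a fixture file
POEM_DEBUG=False
POEM_ADMIN_NAME="SAM Admin"
POEM_ADMIN_EMAIL="sam-admin@example.org"
POEM_SYNC_URLS="http://poem.example.org/poem"
# Space-separated namespace!profile pairs; single quotes keep the '!' literal.
POEM_SYNC_NS_RESTRICT='example-namespace!EXAMPLE_PROFILE other-namespace!OTHER_PROFILE'
MYEGI_DEFAULT_PROFILE="EXAMPLE_PROFILE"
```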

Configuration changes (sam-gridmon)

New Yaim configuration variables:

DAX_MSG_HOST - name of the broker host used in the consumer configuration of the DAX component (Default: "dashb-mb")
MYWLCG_DATA_TRANSFER - enables the Data Transfer Module in MyWLCG (Default: False)
MYWLCG_DT_VO_OTHERS_LIMIT - place VOs in category 'Others' when the total aggregated data transfer or avg. throughput is less than MYWLCG_DT_VO_OTHERS_LIMIT (Default: 2)
MYWLCG_DT_SRCSITE_OTHERS_LIMIT - place Source Sites in category 'Others' when the total aggregated data transfer or avg. throughput is less than MYWLCG_DT_SRCSITE_OTHERS_LIMIT (Default: 2)
MYWLCG_DT_DSTSITE_OTHERS_LIMIT - place Destination Sites in category 'Others' when the total aggregated data transfer or avg. throughput is less than MYWLCG_DT_DSTSITE_OTHERS_LIMIT (Default: 5)
OPENREPORTS_ADMIN - admin user for openreports.
OPENREPORTS_ADMIN_PASS - admin password for openreports.
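
For reference, the sam-gridmon variables above could be written out as follows. The first five values are the defaults stated in the list. The two OpenReports values are placeholders, since no defaults are documented for them.

```shell
# Yaim fragment for sam-gridmon using the defaults listed above.
DAX_MSG_HOST="dashb-mb"
MYWLCG_DATA_TRANSFER=False
MYWLCG_DT_VO_OTHERS_LIMIT=2
MYWLCG_DT_SRCSITE_OTHERS_LIMIT=2
MYWLCG_DT_DSTSITE_OTHERS_LIMIT=5
# Placeholders only: no defaults are documented for the OpenReports credentials.
OPENREPORTS_ADMIN="admin"
OPENREPORTS_ADMIN_PASS="change-me"
```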

Configuration changes (sam-nagios)

New Yaim configuration variables:

NCG_POEM_ROOT_URL - URL of the POEM sync that NCG will use (default: "http://localhost/poem_sync")
NCG_REMOTE_NAGIOS_HOSTS - list of hosts from which results will be imported; used only on a site instance if NCG_REMOTE_USE_NAGIOS is true
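
A hypothetical fragment for a site instance that imports results from remote Nagios hosts might look like this. The host names are invented, and the space-separated list format is an assumption, as the notes only say "list of hosts".

```shell
# Hypothetical Yaim fragment for sam-nagios (site instance).
NCG_POEM_ROOT_URL="http://localhost/poem_sync"   # documented default
NCG_REMOTE_USE_NAGIOS=true                       # must be true for the host list to be used
# Invented host names; the separator (space) is an assumption.
NCG_REMOTE_NAGIOS_HOSTS="nagios1.example.org nagios2.example.org"
```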

Removed YAIM variables

NCG_MDDB_SUPPORTED_PROFILES
NCG_HASH_CONFIG_PROFILES
NCG_PROFILE_FQAN_*
MDDB_SYNC_TIMEOUT
NAGIOS_NCG_ENABLE_CRON
NCG_TOPOLOGY_USE_SAM
NCG_TOPOLOGY_USE_ENOC
NCG_REMOTE_USE_ENOC

Removed localdb configuration options (definition of metrics in localdb):

ADD_PROFILE_SERVICE_METRIC!profile!service!metric
METRIC_PROBE!metric!probe
METRIC_METRICSET!metric!metricset
METRIC_DOCURL!metric!url
METRIC_NATIVE!metric!native
METRIC_CONFIG!metric!config!value
METRIC_DEPENDENCY!metric!metricParent!value
METRIC_ATTRIBUTE!metric!attribute!value
METRIC_FLAG!metric!flag
METRIC_PARENT!metric!parent
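
The removed localdb options can be located in an existing localdb file in a similar way. This is a sketch under the assumption that each directive starts at the beginning of a line and is followed by `!`, as in the list above. The function name is hypothetical and the file path is passed in by the caller.

```shell
#!/bin/bash
# Print localdb lines that use directives removed in this release.
# find_removed_localdb_options is a hypothetical helper, not part of NCG.
find_removed_localdb_options() {
    local file="$1"
    grep -nE '^[[:space:]]*(ADD_PROFILE_SERVICE_METRIC|METRIC_(PROBE|METRICSET|DOCURL|NATIVE|CONFIG|DEPENDENCY|ATTRIBUTE|FLAG|PARENT))!' "$file"
}
```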

The new version of the ARC probes provides a new config file, /etc/grid-monitoring/org.ndgf.conf. After the upgrade, make sure that the new version is used:

mv /etc/grid-monitoring/org.ndgf.conf.rpmnew /etc/grid-monitoring/org.ndgf.conf
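
A slightly more defensive variant of the move above is sketched here: it acts only if the .rpmnew file exists and keeps a copy of the previous config. The `.bak` suffix, the function name `activate_ndgf_conf` and the optional directory argument are illustrative choices, not part of the release.

```shell
#!/bin/bash
# Sketch: activate the packaged org.ndgf.conf only if an .rpmnew file is present,
# keeping a backup of the previous file. Names and the .bak suffix are illustrative.
activate_ndgf_conf() {
    local dir="${1:-/etc/grid-monitoring}"
    local conf="$dir/org.ndgf.conf"
    if [ -f "$conf.rpmnew" ]; then
        # Preserve any local edits before overwriting.
        [ -f "$conf" ] && cp -p "$conf" "$conf.bak"
        mv "$conf.rpmnew" "$conf"
        echo "activated $conf"
    else
        echo "no $conf.rpmnew found; nothing to do"
    fi
}

# Usage (as root):
#   activate_ndgf_conf
```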

The new probes require a new test file for the LFC service. Create an ops VOMS proxy and execute the following commands as a non-privileged user:

. /etc/grid-monitoring/org.ndgf.conf
hostname -f > /tmp/testfile
ngcp file:///tmp/testfile lfc://$LFC_PHYSICAL_URL/testfile@$LFC_HOST$LFC_LOGICAL_PATH/testfile
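
The ngcp command above depends on three variables being defined by org.ndgf.conf. A small check like the following, run after sourcing the file, can catch an empty variable before the copy is attempted. `check_lfc_vars` is a hypothetical helper, and it relies on bash indirect expansion, so it assumes the commands are run in bash.

```shell
#!/bin/bash
# Sketch: confirm the variables used by the ngcp command are set.
# check_lfc_vars is a hypothetical helper; source org.ndgf.conf before calling it.
check_lfc_vars() {
    local name missing=0
    for name in LFC_PHYSICAL_URL LFC_HOST LFC_LOGICAL_PATH; do
        # ${!name} is bash indirect expansion: the value of the variable named by $name.
        if [ -z "${!name}" ]; then
            echo "missing: $name" >&2
            missing=1
        fi
    done
    return "$missing"
}

# Usage:
#   . /etc/grid-monitoring/org.ndgf.conf
#   check_lfc_vars && echo "LFC variables are set"
```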

List of new metrics

dg.CREAM-CE:

dg.ARC-CE:

dg.TargetSystemFactory:

Known Issues

For machines running the latest version of glite-UI (3.2.10-1 or higher):
Please restart Nagios after Yaim execution. Otherwise you may see problems similar to SAM-1693.
service nagios restart

Machines running the DAX component (sam-gridmon only) need to be registered with the Msg Broker host so that the consumer can consume FTS messages.

List of Issues fixed in this release

Document generated by Confluence on Feb 27, 2014 10:19