This page last changed on Jan 14, 2013 by tarragon.
Summary
Start Date | 25 June 2012 |
Release Date | 23 November 2012 |
Status | Released |
Release Manager | Wojciech Lapka |
Validation Steps | SAM-2660 and SAM-2764 |
Description
This release is devoted to documentation and to the MyEGI component, with improvements to many aspects of its visualization. Please note that it is highly recommended to update to SAM Update-17 before upgrading to this release.
Technical details:
- 287 tickets resolved
- Status and Availability computation:
  - Improved availability re-computation algorithm and status computation bootstrapping
  - Logging of the execution status of MySQL events
  - Improved logging mechanism
- Topology aggregation:
  - New ATP API package integrated in MyWLCG
  - VOFeed validation logs added to ATP probe
  - Bug fixes
- Profile Management:
  - Added tagging capability and improved the user interface
  - Changes to public Web API
  - Service types have been deprecated and are currently mapped to service flavors
- MyEGI changes:
  - Major style and layout changes
  - Added a new view for availability and reliability reporting
  - Trends dashboard: availability/reliability trends, Experiment usage
  - Public API documentation revised
  - Added MyEGI user and admin guides
- Changed to Django-1.3 to improve security and functionality of several components (POEM, MyEGI, ATP)
- Updated MySQL to a non-vulnerable version (5.1.63) and improved the MySQL database dump
- Developer documentation for all components
- Nagios configuration:
  - Removed resource BDII from SAM/Nagios
  - Consume VO Nagios results in a Site Nagios instance
  - Removed probe 'org.nagios.NCGPidFile'
  - Added probe 'org.nagiosexchange.NCGLogFiles'
- Probes integration and changes:
  - Added probe ch.cern.sam.MrsCheckSpool
  - Repackaged the perl-gridmon probe development framework
  - Integrated QCG/MAPPER probes
  - Integrated UNICORE Job and unicore6.StorageFactory
  - Fixed compatibility with the EMI WN probe
  - Fixed EMI version detection in the WN probe
  - Fixed a configuration issue with perl-Net-STOMP-Client-1.2.1
- SAM configuration changes (glite-yaim-nagios):
  - Removed MDDB configuration
Package List
SAM-Nagios
SAM-Gridmon
Configuration Changes
Common
- New Yaim configuration variables:
Component | Name | Description | Default | Mandatory | Example |
all | DB_PASS_R | Database password (reader) | No | Yes | my_reader_password |
all | DB_PASS_W | Database password (writer) | No | Yes | my_writer_password |
all | DB_USER_R | Database user (reader) | Yes | Yes | mrs |
all | DB_USER_W | Database user (writer) | Yes | Yes | mrs |
MRS | MRS_BOOTSTRAPPER_TIMEOUT | Timeout (in seconds) for MRS bootstrapper | Yes | Yes | 1800 |
MRS | MRS_LOG_LEVEL | Log level for MRS (possible values: DEBUG, WARNING or ERROR) | Yes | Yes | ERROR |
MyWLCG | MYWLCG_ACCESS_CONTROL_ALLOW_ORIGIN | Set Access-Control-Allow-Origin header | Yes | Yes | * |
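As a rough illustration of the table above: reading the Default column as "has a default value", DB_PASS_R and DB_PASS_W are the two variables that must always be set by the site. The sketch below assumes the variables are kept in a shell-syntax Yaim configuration file; the helper name `check_required` is hypothetical and not part of SAM or Yaim, and the values are simply the ones from the Example column.

```shell
# Hypothetical helper (not part of SAM or Yaim): report which of the
# given variable names are unset or empty. Returns 1 if any is missing.
check_required() {
  missing=0
  for name in "$@"; do
    # Look up the value of the variable whose name is stored in $name.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "missing: $name"
      missing=1
    fi
  done
  return "$missing"
}

# Sample values copied from the Example column; replace with real credentials.
DB_USER_R="mrs"
DB_USER_W="mrs"
DB_PASS_R="my_reader_password"
DB_PASS_W="my_writer_password"
MRS_BOOTSTRAPPER_TIMEOUT=1800
MRS_LOG_LEVEL="ERROR"
MYWLCG_ACCESS_CONTROL_ALLOW_ORIGIN="*"   # quoted so the shell does not expand it

check_required DB_PASS_R DB_PASS_W && echo "all mandatory passwords set"
```

A check of this kind could be run before Yaim so that a missing password is reported early rather than during configuration.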
SAM-Gridmon
- New Yaim configuration variables:
Component | Name | Description | Default | Mandatory | Example |
MyWLCG | MYWLCG_REPORTS | To enable/disable Reports | Yes | Yes | False |
MyWLCG | MYWLCG_REPORT_AUTHENTIC_DNS | List of authentic DNs that can generate official reports (separated by commas with no space) | No | No | |
MyWLCG | MYWLCG_REPORT_EGI_PROFILES | List of allowed profiles for EGI Reports | Yes | Yes | ROC_CRITICAL |
MyWLCG | MYWLCG_REPORT_WLCG_TIER1_HISTORY_PROFILES | List of allowed profiles for WLCG Tier1 History Reports | Yes | Yes | WLCG_CREAM_LCGCE_CRITICAL |
MyWLCG | MYWLCG_REPORT_WLCG_TIER1_SUMMARY_PROFILES | List of allowed profiles for WLCG Tier1 Summary Reports | Yes | Yes | WLCG_CREAM_LCGCE_CRITICAL |
MyWLCG | MYWLCG_REPORT_WLCG_TIER1_VO_PROFILES | List of allowed profiles for WLCG Tier1 VO Reports | Yes | Yes | WLCG_CREAM_LCGCE_CRITICAL ALICE_CRITICAL ATLAS_CRITICAL CMS_CRITICAL LHCb_CRITICAL |
MyWLCG | MYWLCG_REPORT_WLCG_TIER2_PROFILES | List of allowed profiles for WLCG Tier2 Reports | Yes | Yes | WLCG_CREAM_LCGCE_CRITICAL |
MyWLCG | MYWLCG_TOPOLOGY | To turn on/off Topology module in MyWLCG | Yes | Yes | False |
MyWLCG | MYWLCG_EGI_EMAILS | List of emails to which EGI Reports will be sent | Yes | Yes | egi-league-results@hellasgrid.gr |
MyWLCG | MYWLCG_WLCG_TIER1_EMAILS | List of emails to which WLCG Tier1 Reports will be sent | Yes | Yes | worldwide-lcg-management-board@cern.ch |
MyWLCG | MYWLCG_WLCG_TIER2_EMAILS | List of emails to which WLCG Tier2 Reports will be sent | Yes | Yes | project-wlcg-cb@cern.ch |
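The report-related variables above could be set along the following lines. This is only a sketch: the values are taken from the Example column, while the two DNs and the helper `join_dns` are hypothetical and serve only to illustrate the "commas with no space" format described for MYWLCG_REPORT_AUTHENTIC_DNS.

```shell
# Hypothetical helper: join all arguments with commas and no surrounding
# spaces. Spaces inside a single DN (e.g. in a CN) are preserved.
join_dns() {
  ( IFS=,; printf '%s\n' "$*" )
}

MYWLCG_REPORTS="False"   # example value from the table
# The DNs below are made up for illustration.
MYWLCG_REPORT_AUTHENTIC_DNS="$(join_dns \
  "/DC=org/DC=example/CN=Jane Doe" \
  "/DC=org/DC=example/CN=John Roe")"
MYWLCG_REPORT_EGI_PROFILES="ROC_CRITICAL"
# The example value for this variable lists several profiles separated by spaces.
MYWLCG_REPORT_WLCG_TIER1_VO_PROFILES="WLCG_CREAM_LCGCE_CRITICAL ALICE_CRITICAL ATLAS_CRITICAL CMS_CRITICAL LHCb_CRITICAL"

echo "$MYWLCG_REPORT_AUTHENTIC_DNS"
```

Building the DN list with a helper avoids accidentally typing a space after a comma, which the variable description does not allow.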
SAM-Nagios
- New Yaim configuration variables:
Component | Name | Description | Default | Mandatory | Example |
grid-monitoring-probes-ch.cern.sam | MRS_S_crit_recalc | critical_recalc parameter of MrsCheckSpool probe | Yes | Yes | 50000 |
grid-monitoring-probes-ch.cern.sam | MRS_S_crit_spool | critical_spool parameter of MrsCheckSpool probe | Yes | Yes | 10000 |
grid-monitoring-probes-ch.cern.sam | MRS_S_path_recalc | web_service_path_recalc parameter of MrsCheckSpool probe | Yes | Yes | myegi/sam-pi/num_records_metricdataforrecalculation |
grid-monitoring-probes-ch.cern.sam | MRS_S_path_spool | web_service_path_spool parameter of MrsCheckSpool probe | Yes | Yes | myegi/sam-pi/num_records_metricdata_spool |
grid-monitoring-probes-ch.cern.sam | MRS_S_warn_recalc | warn_recalc parameter of MrsCheckSpool probe | Yes | Yes | 25000 |
grid-monitoring-probes-ch.cern.sam | MRS_S_warn_spool | warn_spool parameter of MrsCheckSpool probe | Yes | Yes | 5000 |
NCG | NCG_INCLUDE_PROXY_CHECKS | Switch on/off internal tests for generating proxy credential | Yes | No | 1 |
NCG | NCG_SEND_TO_DASHBOARD | Enables interaction with Dashboard on VO instances | Yes | Yes | false |
NCG | HOST_KEYSTORE_PASS | Password used for encrypting host certificate keystore | Yes | Yes | password |
MyWLCG | MYWLCG_EGI_EMAILS | List of emails to which EGI Reports will be sent | Yes | Yes | egi-league-results@hellasgrid.gr |
MyWLCG | MYWLCG_WLCG_TIER1_EMAILS | List of emails to which WLCG Tier1 Reports will be sent | Yes | Yes | worldwide-lcg-management-board@cern.ch |
MyWLCG | MYWLCG_WLCG_TIER2_EMAILS | List of emails to which WLCG Tier2 Reports will be sent | Yes | Yes | project-wlcg-cb@cern.ch |
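To give a feel for how the MrsCheckSpool warning and critical thresholds above relate to each other, the sketch below classifies a record count using the standard Nagios plugin exit codes. It is not the probe's code: the function name `classify_count` is hypothetical, and the assumption that a count at or above a threshold triggers that state may differ from what the probe actually does.

```shell
# Standard Nagios plugin exit codes: 0 = OK, 1 = WARNING, 2 = CRITICAL.
# Assumption: a count at or above a threshold triggers that state.
classify_count() {
  count=$1
  warn=$2
  crit=$3
  if [ "$count" -ge "$crit" ]; then
    echo "CRITICAL"
    return 2
  elif [ "$count" -ge "$warn" ]; then
    echo "WARNING"
    return 1
  fi
  echo "OK"
  return 0
}

# Example values from the table above.
MRS_S_warn_spool=5000
MRS_S_crit_spool=10000
MRS_S_warn_recalc=25000
MRS_S_crit_recalc=50000

# 7200 is an arbitrary sample count; "|| true" keeps a non-zero status
# from aborting a script that runs with "set -e".
classify_count 7200 "$MRS_S_warn_spool" "$MRS_S_crit_spool" || true
```

With the example values, the recalculation thresholds are five times the spool thresholds, so the same count can be critical for the spool and still fine for recalculation.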
Known Issues
| Please upgrade to SAM Update-17 first as the upgrade from SAM Update-15 directly to SAM Update-19 is not supported. |
| For machines running the latest version of glite-UI (3.2.10-1 or higher):
Please restart Nagios after running Yaim; otherwise you may see problems similar to SAM-1693.
|
| Upgrading a node with yum requires a package exclusion, e.g.:
- on sam-nagios
yum update --exclude sam-gridmon
- on sam-gridmon
yum update --exclude sam-nagios
|
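The exclusion rule above can be captured in a small helper so that the right command is chosen per node. This is a sketch only: the function name `yum_update_cmd` is hypothetical, and it prints the command rather than running it so that it can be reviewed first.

```shell
# Hypothetical helper: print the yum command for a given node role.
# A sam-nagios node excludes sam-gridmon, and the other way round.
yum_update_cmd() {
  case "$1" in
    sam-nagios)  echo "yum update --exclude sam-gridmon" ;;
    sam-gridmon) echo "yum update --exclude sam-nagios" ;;
    *)
      echo "unknown role: $1" >&2
      return 1
      ;;
  esac
}

yum_update_cmd sam-nagios
```

The printed command can then be run by hand on the node once it has been checked.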
| The new package grid-monitoring-probes-org.sam-0.5.7-1.el5 solves the problem of running the org.sam.WN* checks on the SL6 platform. More details can be found here: https://tomtools.cern.ch/jira/browse/SAM-2999. This package contains a Nagios binary that cannot be executed on 32-bit architectures and fails on some older WN platforms behind the CE service type. Deploying this package will cause CE service type tests to become UNKNOWN. |
| Because of MRS bug SAM-3098, metrics are not added to the metricdata tables in MySQL.
To fix this, please download the attached MRS-patch.sql and apply it from a mysql session connected to your database with:
source MRS-patch.sql; |
| The Perl module Directory::Queue was upgraded in Update-17. The new dirqueue structure is not backward compatible with the previous version. After running Yaim, please run:
rm -rf /var/spool/nagios2metricstore/*/*
rm -rf /var/spool/msg-nagios-bridge/*/*
If these steps are not performed, SAM will not publish results to the central SAM-GridMon and the test org.egee.SendToMsg will fail with the following error:
**ePN /usr/libexec/grid-monitoring/plugins/nagios/send_to_msg: "Directory::Queue::Simple: cannot link(/var/spool/msg-nagios-bridge/outgoing-messages/00000000/50b3452588432f, /var/spool/msg-nagios-bridge/outgoing-messages/00000000/50b3452588432f.lck): Operation not permitted". |
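The two clean-up commands above can be wrapped in a helper that skips spool directories that do not exist on a given node. This is a minimal sketch: the function name `clean_spool` is hypothetical, and the demonstration works on a temporary directory rather than on the real spool.

```shell
# Hypothetical helper: remove the contents of every queue sub-directory
# under the given spool directories, mirroring "rm -rf <dir>/*/*" but
# skipping directories that are not present.
clean_spool() {
  for dir in "$@"; do
    if [ -d "$dir" ]; then
      rm -rf "$dir"/*/*
      echo "cleaned: $dir"
    else
      echo "skipped (not found): $dir"
    fi
  done
}

# Demonstration on a throw-away directory that imitates the spool layout.
demo=$(mktemp -d)
mkdir -p "$demo/outgoing-messages/00000000"
touch "$demo/outgoing-messages/00000000/50b3452588432f"
clean_spool "$demo"
rm -rf "$demo"
```

On a real node the call would be `clean_spool /var/spool/nagios2metricstore /var/spool/msg-nagios-bridge`, run after Yaim as described above.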
Tickets List
Without Nightly Validation
During Nightly Validation