
Metric templates

List of metric templates

The Metric templates page is accessible from the menu on the left side. The page is shown in the image below.

SuperAdmin metric templates

Metric templates may be filtered by name, probe, type (active or passive), or tag. Clicking a metric template's name opens that metric template's page. Clicking a probe's name opens the page of that particular probe version.
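The filtering described above can be pictured as a combination of optional criteria that must all match. The sketch below is only an illustration in Python, not POEM's implementation: the record keys (`name`, `probe`, `type`, `tags`), the case-insensitive substring matching, and the sample records are all assumptions made for the example.

```python
def filter_metric_templates(templates, name=None, probe=None, mtype=None, tag=None):
    """Return the templates that match every criterion that is not None.

    Assumed (hypothetical) record shape:
    {"name": str, "probe": str, "type": "Active" | "Passive", "tags": [str]}
    """
    result = []
    for template in templates:
        # Name and probe are matched as case-insensitive substrings (an assumption).
        if name is not None and name.lower() not in template["name"].lower():
            continue
        if probe is not None and probe.lower() not in template["probe"].lower():
            continue
        # Type and tag must match exactly (type is compared case-insensitively).
        if mtype is not None and template["type"].lower() != mtype.lower():
            continue
        if tag is not None and tag not in template["tags"]:
            continue
        result.append(template)
    return result


# Illustrative data only; these names are made up for the example.
templates = [
    {"name": "example.web-check", "probe": "check_web (1.0)", "type": "Active", "tags": ["internal"]},
    {"name": "example.passive-result", "probe": "", "type": "Passive", "tags": []},
    {"name": "example.old-check", "probe": "check_old (0.3)", "type": "Active", "tags": ["deprecated"]},
]

active_names = [t["name"] for t in filter_metric_templates(templates, mtype="active")]
```

Here `active_names` holds the names of the two active templates; passing several criteria at once narrows the result further.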

It is also possible to delete multiple metric templates at once. To do so, select the metric templates to be deleted from POEM and click the Delete button (shown in the figure below).

SuperAdmin metric template bulk delete

The user is then asked to confirm the deletion of the selected metric templates. Once the user confirms, the metric templates and their histories are deleted. Metrics imported to tenants, along with their histories, are also deleted and removed from metric profiles. The user is informed of the successful deletion (figure below). If an error occurs, the user is informed in the same way.

SuperAdmin metric template deleted
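The cascade triggered by a bulk delete can be summarised with a small in-memory model. This is a sketch under stated assumptions, not POEM's actual data model or code: the `store` layout and its key names are hypothetical and exist only to show which pieces of data disappear together.

```python
def delete_metric_templates(store, names):
    """Delete the named templates and everything derived from them.

    `store` is a hypothetical in-memory model:
      store["templates"]        : {template name: definition}
      store["template_history"] : {template name: [versions]}
      store["tenant_metrics"]   : {tenant: {metric name: definition}}
      store["tenant_history"]   : {tenant: {metric name: [versions]}}
      store["profiles"]         : {profile name: set of metric names}
    """
    for name in names:
        # The template itself and its history.
        store["templates"].pop(name, None)
        store["template_history"].pop(name, None)
        # Metrics imported to tenants, and their histories.
        for metrics in store["tenant_metrics"].values():
            metrics.pop(name, None)
        for history in store["tenant_history"].values():
            history.pop(name, None)
        # References in metric profiles.
        for metric_names in store["profiles"].values():
            metric_names.discard(name)
```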

Metric template details

Pages for individual metric templates differ slightly depending on the metric template type. An active metric template must have a probe assigned to it, while a passive metric template does not.

Active metric templates

The page of an active metric template is shown in the image below.

SuperAdmin active metric template

Field descriptions

Mandatory fields for an active metric template:

  • Name - metric template name;
  • Probe - probe name and version; this is an autocomplete field, and the user must choose from the available probes;
  • Type - type (active or passive; active in this example);
  • Tag - tag;
  • Probe executable - executable file;
  • Config
    • maxCheckAttempts - the number of times Nagios will retry the service check command if it returns any state other than OK; setting this value to 1 causes Nagios to generate an alert without retrying the service check;
    • timeout - timeout (in seconds) for the probe;
    • path - path to the executable file;
    • interval - the time (in minutes) to wait before scheduling the next "regular" check of the service; "regular" checks are those that occur when the service is in an OK state, or when the service is in a non-OK state but has already been rechecked maxCheckAttempts times;
    • retryInterval - the time (in minutes) to wait before scheduling a re-check of the service; services are rescheduled at the retry interval when they have changed to a non-OK state; once the service has been retried maxCheckAttempts times without a change in its status, it reverts to being scheduled at its "normal" rate as defined by the interval value.
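
The interplay between maxCheckAttempts, interval, and retryInterval can be illustrated with a small Python sketch. It is a simplified reading of the descriptions above, not Nagios or POEM code; the `CheckConfig` class, the `next_check_delay` function, and the convention that `attempt` counts the checks already run in the current non-OK streak are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class CheckConfig:
    """Hypothetical container for the scheduling values of the Config field."""
    max_check_attempts: int  # maxCheckAttempts
    interval: int            # minutes between "regular" checks
    retry_interval: int      # minutes between re-checks


def next_check_delay(config: CheckConfig, state_ok: bool, attempt: int) -> int:
    """Return the number of minutes to wait before the next check.

    `attempt` is the number of checks already performed in the current
    non-OK streak (1 for the first failing check). It is ignored when the
    service is OK.
    """
    # Regular schedule: the service is OK, or the retries are exhausted.
    if state_ok or attempt >= config.max_check_attempts:
        return config.interval
    # Otherwise the service is non-OK and still being retried.
    return config.retry_interval
```

With `CheckConfig(max_check_attempts=3, interval=5, retry_interval=3)`, the first and second failing checks are followed by a re-check after 3 minutes, while the third falls back to the regular 5-minute schedule. With `max_check_attempts=1`, the first failure already uses the regular interval, which matches the "no retry" behaviour described above.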

Tags

Metric templates can be tagged with any tag. Three tags, however, are special: internal, deprecated, and eol. So-called internal metrics are tagged with the internal tag. They check internal processes that are necessary for all tenants, and they are automatically imported to a tenant's POEM when a new tenant is created.

Metric templates tagged with the deprecated or eol tag are no longer maintained and are not actively used.

The Tag field is a multiple-select field with autocomplete and the option to create new entries. When a SuperAdmin user starts typing, the existing tags are listed, filtered by the letters typed. If the desired tag does not exist, the user can create a new one by clicking create entry.

Attributes

Attributes are part of the metric template definition. They are used to pass values as arguments to the metric command (e.g. the TCP port on which a service is listening, SE path, CE queue...).
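To make the idea of passing attribute values as command arguments concrete, here is a minimal Python sketch. It assumes that each attribute is paired with a command-line option and that its value is looked up at configuration time; that pairing, the function name `build_attribute_args`, and the option letters and values in the usage example are assumptions for illustration, not taken from POEM.

```python
def build_attribute_args(attribute_options, attribute_values):
    """Turn attributes into a list of command-line arguments.

    attribute_options: {attribute name: command-line option}, e.g. {"BDII_PORT": "-p"}
    attribute_values:  {attribute name: value known for the host being checked}

    Attributes without a known value are skipped.
    """
    args = []
    for name, option in attribute_options.items():
        value = attribute_values.get(name)
        if value is None:
            continue
        args.extend([option, str(value)])
    return args


# Hypothetical options and values, for illustration only.
example_args = build_attribute_args(
    {"BDII_PORT": "-p", "BDII_DN": "-b", "SITE_BDII": "-s"},
    {"BDII_PORT": 2170, "BDII_DN": "o=grid"},
)
```

In this example `SITE_BDII` has no value, so only the port and base DN end up in `example_args`.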

The attributes are described in the table below.

| Attribute | Description |
|---|---|
| AGORA_PASSWORD | Password for Agora Catalogue service. |
| AGORA_USERNAME | Username for Agora Catalogue service. |
| ARC_CE_MEMORY_LIMIT | Memory limit used by jobs in bytes for ARC-CE metrics. |
| ARC_GOOD_SES | Value of ARC_GOOD_SES variable for ARC-CE metrics. |
| ARC_NAGIOS_SERVICE | ARC metric being used to check Nagios freshness. |
| ARGOUI_AR_PATH | Path used for checking the ARGO UI A/R report availability. |
| ARGOUI_AR_REGEX | Regex used for checking the ARGO UI A/R report availability. |
| ARGOUI_STATUS_PATH | Path used for checking the ARGO UI status report availability. |
| ARGOUI_STATUS_REGEX | Regex used for checking the ARGO UI status report availability. |
| ARGO_AMS_PROJECT | Project name used for checking the ARGO AMS availability. |
| ARGO_AMS_TOKEN | Token for authentication against ARGO AMS. |
| ARGO_API_TOKEN | Token for authentication against ARGO Web-API. |
| ARGO_OIDC_SP_URL | URL of the Service Provider's AUTHENTICATION link for OIDC. |
| ARGO_SAML_SP_URL | URL of the Service Provider's AUTHENTICATION link for SAML. |
| ARGO_STATUS_URL | URL of endpoint containing service status. |
| ARGO_TENANTS_TOKEN | POEM token(s) for ARGO tenant(s). |
| ARGO_WEBDAV_OPS_URL | WebDAV URL containing also the VO ops folder. |
| ARGO_WEBDAV_SKIP_DIR_TEST | Use this flag to skip the directory listing test for WebDAV. |
| ARGO_WEBODV_SECRET | Token for authentication against WebODV service. |
| ARGO_XML_XPATH | XPath to check in the XML response. |
| ARGO_XML_XPATH_OK_VALUE | XPath value which will return OK status when checking XML response. |
| ARGO_XROOTD_OPS_URL | XRootD base SURL to test (the path where ops VO has write access). |
| ARGUS_SALT | Salt for the dynamic DN from Argus. |
| B2HANDLE_PREFIX | B2HANDLE prefix. |
| B2STAGE_API_PASSWORD | Password for B2STAGE. |
| B2STAGE_API_USERNAME | Username for B2STAGE. |
| BDII_PORT | BDII port. |
| BDII_DN | BDII base DN. |
| BROKER_PORT | Broker port. |
| CREAM_PORT | CREAM CE port. |
| CREAM_QUEUE | CREAM CE queue for a given VO. |
| CVMFS-Stratum-1_PORT | CVMFS-Stratum-1 port. |
| EDUGAIN_PASSWORD | EDUGAIN password. |
| EDUGAIN_USER | EDUGAIN username. |
| EGISSO_PASSWORD | EGI SSO password. |
| EGISSO_USER | EGI SSO username. |
| EOSCCORE_HELPDESK_TOKEN | Token for EOSCCORE helpdesk API. |
| FTS_PORT | FTS port. |
| GITLAB_URL_SECRET | Gitlab URL containing token. |
| GLUE2_BDII_DN | GLUE2 LDAP base. |
| GRAM_PORT | GRAM Gatekeeper port. |
| GRIDFTP_PORT | GridFTP port. |
| GRIDPROXY_NAGIOS_SERVICE | GRIDPROXY metric being used to check Nagios freshness. |
| GSISSH_PORT | GSISSH port. |
| HOSTDN | Host DN. |
| KEYCLOAK_CLIEND_ID | Client ID for Keycloak service. |
| KEYCLOAK_CLIENT_SECRET | Client secret for Keycloak service. |
| KEYSTORE | Location of Java keystore with host certificate. Default value is /etc/nagios/globus/keystore.jks. |
| LB_PORT | LB port. |
| MYPROXY_NAME | Name of MyProxy credential to use. |
| MYPROXY_PORT | MYPROXY port. |
| MYPROXY_SERVER | Name or IP address of MyProxy host to check. |
| MYPROXY_USER | Name of MyProxy account under which the credential was stored. |
| NAGIOS_B2ACCESS_LOGIN | B2ACCESS username. |
| NAGIOS_B2ACCESS_PASSWORD | B2ACCESS password. |
| NAGIOS_FRESHNESS_PASSWORD | Nagios password. |
| NAGIOS_FRESHNESS_USERNAME | Nagios username. |
| NAGIOS_HOST_CERT | Location of host certificate with nagios user ownership. Value is /etc/nagios/globus/hostcert.pem. |
| NAGIOS_HOST_KEY | Location of host certificate key with nagios user ownership. Value is /etc/nagios/globus/hostkey.pem. |
| NAGIOS_UI_CREDENTIALS | Credentials for Nagios UI. |
| OCCI_PORT | OCCI port. |
| OIDC_ACCESS_TOKEN | OIDC access token. |
| OIDC_CLIENT_ID | OIDC client ID. |
| OIDC_CLIENT_SECRET | OIDC client secret. |
| OIDC_REFRESH_TOKEN | OIDC refresh token. |
| OS_APPDB_IMAGE | OpenStack AppDB image. |
| OS_KEYSTONE_HOST | OpenStack Keystone host. |
| OS_KEYSTONE_PORT | OpenStack Keystone port. |
| OS_KEYSTONE_URL | OpenStack Keystone URL. |
| OS_REGION | OpenStack region. |
| PATH | Path extracted from URL. |
| PORT | Port extracted from URL. |
| PROXY_LIFETIME | Lifetime of generated proxy in hours. |
| QCG-BROKER_PORT | QCG Broker port. |
| QCG-COMPUTING_PORT | QCG Computing port. |
| QCG-NOTIFICATION_PORT | QCG Notification port. |
| RM_PORT | RM port. |
| ROBOT_CERT | Location of robot certificate. |
| ROBOT_KEY | Location of robot certificate key. |
| SENSU_HOST_CERT | Location of certificate used by Sensu. |
| SE_PATH | Storage element path for a given VO. |
| <serviceType>_URL | URL extracted from the topology database (e.g. GOCDB attribute URL); the serviceType prefix is used to avoid collisions when multiple serviceTypes run on the same host. |
| SITE_BDII | Hostname of Site BDII. |
| SITENAME | Site name. |
| SRM1_PORT | SRM1 port. |
| SSH_PORT | SSH port. |
| SSL | Attribute is set if URL scheme is https://. |
| SURL | SRM base SURL to test. |
| TOP_BDII | Hostname of Top BDII used for generating configuration. |
| TRUSTSTORE | Location of Java truststore with the complete IGTF CA bundle. Default value is /etc/nagios/globus/truststore.ts. |
| URL | URL extracted from the topology database (e.g. GOCDB attribute URL). |
| VOMS_PORT | VOMS port. |
| VONAME | VO name. |
| VO_FQAN | VO FQAN. |
| WMPROXY_PORT | WMPROXY port. |
| X509_USER_PROXY | Location of x509 certificate proxy file. |

Flags

Flags are part of the metric definition and control how metrics are configured on the mon-box. The currently supported flags are described below.

| Flag | Description |
|---|---|
| NOHOSTNAME | Metric's command is configured without the -H parameter. |
| NOTIMEOUT | Metric's command is configured without the -t parameter. |
| NRPE | Metric is configured to be executed via NRPE on the service node. |
| NOPUBLISH | Metric results are not published to AMS. |
| PASSIVE | If defined, the metric is part of a complex check. NCG configures the metric as a passive check only. Results for the metric should be generated by the parent metric or received via the message bus. |
| VO | Metric is VO dependent. The metric is configured multiple times, once for each VO or VO FQAN defined. If the VO is not supported on a given host, the metric is not generated at all. |
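The effect of the NOHOSTNAME and NOTIMEOUT flags on the configured command can be sketched as follows. This is an illustrative Python sketch based only on the flag descriptions above, not the code that generates the mon-box configuration; the function name `build_command` and the executable, hostname, and timeout values used with it are hypothetical.

```python
def build_command(executable, hostname, timeout, flags=()):
    """Assemble a metric command line, honouring NOHOSTNAME and NOTIMEOUT.

    By default the command gets `-H <hostname>` and `-t <timeout>`;
    each flag removes the corresponding parameter.
    """
    command = [executable]
    if "NOHOSTNAME" not in flags:
        command += ["-H", hostname]
    if "NOTIMEOUT" not in flags:
        command += ["-t", str(timeout)]
    return command


# Hypothetical executable and host, for illustration only.
default_cmd = build_command("check_example", "host.example.org", 30)
no_host_cmd = build_command("check_example", "host.example.org", 30, flags=["NOHOSTNAME"])
```

The other flags (NRPE, NOPUBLISH, PASSIVE, VO) change where or how the metric is configured rather than its parameters, so they are left out of this sketch.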

Passive metric templates

The page of a passive metric template is considerably simpler; it is shown in the image below.

SuperAdmin passive metric template

Mandatory fields for a passive metric template:

  • Name - metric template name;
  • Type - type (active or passive; passive in this example);
  • Flags
    • PASSIVE flag is mandatory for passive metrics; it is added automatically when the passive type is chosen through the UI, and it is disabled to prevent users from accidentally deleting it.
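
The difference between the mandatory fields of active and passive metric templates can be expressed as a small validation routine. The Python sketch below only mirrors the two field lists in this section; the dictionary keys (`name`, `type`, `probe`, `tags`, `probe_executable`, `config`, `flags`) and the error messages are assumptions and do not describe POEM's internal representation.

```python
def validate_metric_template(template):
    """Return a list of problems; an empty list means the template looks valid.

    `template` is a hypothetical dict with keys such as "name", "type",
    "probe", "tags", "probe_executable", "config" and "flags".
    """
    errors = []
    mtype = template.get("type")
    if mtype == "Active":
        # Mirrors the mandatory fields listed for active metric templates.
        required = ["name", "probe", "tags", "probe_executable", "config"]
    elif mtype == "Passive":
        # Passive templates need no probe, but must carry the PASSIVE flag.
        required = ["name"]
        if "PASSIVE" not in template.get("flags", []):
            errors.append("passive metric template must carry the PASSIVE flag")
    else:
        return ["type must be 'Active' or 'Passive'"]
    for field in required:
        if not template.get(field):
            errors.append(f"missing mandatory field: {field}")
    return errors
```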

Metric template cloning

A metric template can be cloned by clicking the Clone button. A new form then opens with its fields prefilled with the values defined in the source metric template.

Metric template history

Clicking the History button shows all versions of the metric template, as in the image below.

SuperAdmin metric template history

The user may view a particular version of the metric template by clicking on it, and see the differences between two versions by selecting them and clicking the Compare button.
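Conceptually, comparing two versions amounts to a field-by-field diff. The following Python sketch shows one way such a comparison could work, assuming each version is available as a flat dictionary of field values; the function name `diff_versions` and the sample fields are hypothetical and not taken from POEM.

```python
def diff_versions(old, new):
    """Return {field: (old value, new value)} for every field that differs.

    Fields present in only one version are reported with None on the other side.
    """
    changes = {}
    for field in sorted(set(old) | set(new)):
        if old.get(field) != new.get(field):
            changes[field] = (old.get(field), new.get(field))
    return changes


# Illustrative versions only.
version_1 = {"probe": "example-probe (1.0)", "timeout": 30, "flags": []}
version_2 = {"probe": "example-probe (1.1)", "timeout": 30, "flags": ["NOHOSTNAME"]}
changes = diff_versions(version_1, version_2)
```

Here `changes` reports the probe and flags fields, while the unchanged timeout is omitted.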