wiki:MonitoringDcache
Last modified on 04/30/09 12:28:45

Global services

SRM

The S2 SRM-2.2 VO/CCRC08 test results page.

Plot of the number of instances of each dCache version, as discovered via srmPing and the WLCG Info System. Note that the plot shows only SRM-2.2 end-points.

http://wn3.epcc.ed.ac.uk/srm/bar_graphs/srm_versions_bar http://wn3.epcc.ed.ac.uk/srm/pie_graphs/srm_versions_pie

(Source: Greig Cowan's srmPing monitoring page, which uses Brian Bockelman's GraphTool)
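The per-version counts behind such a plot can be sketched as a simple tally. This is only an illustration, not the code used by the monitoring page: the end-point names and version strings below are made-up sample data, and `count_versions` is a hypothetical helper.

```python
from collections import Counter


def count_versions(endpoints):
    """Count how many end-points report each dCache version.

    `endpoints` maps an SRM end-point name to the version string it
    reported (hypothetical input format, assumed for this sketch).
    """
    return Counter(endpoints.values())


# Made-up sample data; real values would come from srmPing results.
sample = {
    "srm://se1.example.org": "1.9.0",
    "srm://se2.example.org": "1.8.0",
    "srm://se3.example.org": "1.9.0",
}

counts = count_versions(sample)
for version, n in counts.most_common():
    print(version, n)
```

The resulting counts are what a bar or pie chart of versions would display, one bar or slice per version.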

SAM tests

http://wn3.epcc.ed.ac.uk/sam_dcache_month.png http://wn3.epcc.ed.ac.uk/sam_dcache_t1_month.png

(Source: Greig Cowan's dCache monitoring page, which uses Brian Bockelman's GraphTool)

SAM (Service Availability Monitoring) is a system that checks whether the services at each site are running correctly. In addition to the views below, SAM also provides an interface to query stored data directly.

Available views of SAM data:

FTS

The File Transfer Service (FTS) groups the transfer of multiple files into transfer jobs and provides a service for managing them. It schedules transfers into different channels to manage bandwidth usage. See also: GridPP info.

FTS transfer failures come in three flavours: "source", "destination" and "transfer" errors. A "source" or "destination" error is reported when the problem is clearly at the source or destination site (e.g., an out-of-space error). A "transfer" error is returned when the cause is unclear (e.g., when a connection has timed out).
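When summarising failures for a site, it can help to tally them by flavour. The following is a minimal sketch under stated assumptions: the `(flavour, message)` input format, the `FailureFlavour` enum and the `tally_failures` helper are all hypothetical and do not reflect the actual FTS log or API format.

```python
from collections import Counter
from enum import Enum


class FailureFlavour(Enum):
    """The three failure flavours described above."""
    SOURCE = "source"
    DESTINATION = "destination"
    TRANSFER = "transfer"


def tally_failures(failures):
    """Count failures per flavour.

    `failures` is an iterable of (flavour, message) pairs; this input
    format is an assumption made for the sketch. An unknown flavour
    string raises ValueError.
    """
    counts = Counter({flavour: 0 for flavour in FailureFlavour})
    for flavour, _message in failures:
        counts[FailureFlavour(flavour.lower())] += 1
    return counts


# Made-up sample failures for illustration only.
sample_failures = [
    ("destination", "no space left on device"),
    ("transfer", "connection timed out"),
    ("transfer", "connection timed out"),
]

tally = tally_failures(sample_failures)
for flavour, n in tally.items():
    print(flavour.value, n)
```

A high count of "source" or "destination" errors points at a specific site, whereas "transfer" errors usually need further investigation at both ends.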

CERN

Work is ongoing on providing monitoring information.

RAL

RAL FTS service supports the UK sites. Available information includes:

IN2P3

Information is available, links to follow...

GridKa

In the last 12 hours:

LHC Experiments

Experiments provide services, usually on top of FTS.

ATLAS

The DDM (Distributed Data Management) system includes DQ2 (Don Quijote v2), the component that supports transfers of datasets. DDM monitoring is provided through a dashboard.

CMS

CMS uses PhEDEx for managing transfers. Monitoring information is available through their own dashboard. This includes:

LHCb

LHCb has DIRAC Service Monitoring pages. These include:

Alice

Alice uses MonALISA for monitoring. Information is available at the MonALISA dashboard:

Some additional information is available from their ARDA dashboard.

Regional monitoring

These are usually country-based views of storage.

UK

Available monitoring:

Germany

Other pages

GridPP's collection of monitoring links.