Global services
SRM
The S2 SRM-2.2 VO/CCRC08 test results page.
Plot of number of instances of each dCache version discovered via srmPing and WLCG Info System. NB shows only SRM-2.2 end-points.
(Source: Greig Cowan's srmPing monitoring page, which uses Brian Bockleman's GraphTool)
SAM tests
(Source: Greig Cowan's dCache monitoring page, which uses Brian Bockleman's GraphTool)
SAM (Service Availability Monitoring) is a service that checks whether services are running correctly at sites. In addition to the views below, SAM also provides an interface to query stored data directly.
Available views of SAM data:
- Above graphs, from Greig Cowan's overview of dCache instances in GLUE,
- GridMap view, which can show SE status,
- GoC overview of Prod. system,
- GridView history view of SAM test results,
- a plot of UK SE-availability over time.
FTS
File Transfer Service : groups the transfer of multiple files into transfer Jobs and provides a service for managing their transfer. It schedules transfers into different channels to manage its bandwidth usage. See also: GridPP info.
FTS transfer failures come in three flavours: "source", "destination" and "transfer" errors. "Source" and "destination" errors are when the problem is clearly at either the source or destination sites (e.g., out-of-space error) and "transfer" error is returned when it is unclear of the cause (e.g. when a connection has timed-out).
CERN
Work is ongoing on providing monitoring information.
RAL
RAL FTS service supports the UK sites. Available information includes:
- various Ganglia plots (e.g., all VOs for past day),
- Information on FTS Jobs, for example:
- all failed { jobs, files, transfers} in past hour,
- grouping repeated failed transfers in past hour ordered by frequency.
IN2P3
Information is available, links to follow...
GridKa
In the last 12 hours:
- Overview of channel activity,
- List of failed jobs,
- List of failed files.
LHC Experiments
Experiments provide services, usually on top of FTS.
ATLAS
The DDM (Distributed Data Management) includes DQ2 (Don Quijote v2), which is the component that supports transfers of datasets. DDM monitoring is provided through a dashboard
CMS
CMS uses PhEDEx for managing transfers. Monitoring information is available through their own dashboard. This includes:
- plots, over time, of failed transfers,
- detailed information about recent transfer errors.
LHCb
LHCb has a Dirac Service Monitoring pages. This includes:
- An overview of transfers.
- Plots, for the past 24 hours, of failed transfers:
Alice
Alice uses MonALISA for monitoring. Information is available at the MonALISA dashboard:
- an overview of current status,
- transfers by state,
- various Atom feeds for event monitoring (e.g., FTD/FTS, [ttp://pcalimonitor.cern.ch/atom.jsp?set=10 Storage Elements]).
Some additional information is available from their ARDA dashboard.
Regional monitoring
Usually country-based views of storage.
UK
Available monitoring:
Germany
- GridKa:
Other pages
GridPP's collection of monitoring links.


