Last modified 2 years ago Last modified on 11/06/18 15:12:13

dCache Tier I meeting November 6, 2016

[part of a series of meetings]

Present, Paul), IN2P3(), Sara(), Triumf(), BNL(), NDGF(), PIC(), KIT(Xavier), Fermi(), CERN()


(see box on the other side)

Site reports


Things are running fine since Monday.

Two weeks ago, updated CMS instance to 3.2.

Wasn't fully working until Monday. An issue at CERN; IPv6 packets from Helsinki or RAL were not routed correctly.

Cancelled downtime for tomorrow (planned for ATLAS) ... shifted until December.

In CMS instance, two times, httpd domain crashed in 9537.

Xavier couldn't find the upload directory for DESY-Cloud, to send the heap-dump.

Paul to check it exists and send Xavier a link.

One pool-manager died; one core domains also died.

Xavier to send the heap-dump for these two domains, too.

These domains are surviving ~24--48 hours.

pool size

pools configured 8 EiB

RT 9516

Numbers reported by the pool in the same way.

total and reported /

Not fix output in 3.2 due to risk of breaking monitoring scripts, but look at standardising this in 5.0

Maybe add more information in the info service.

But also check what is available in 4.2 and frontend's monitoring API.

Xavier to open a new ticket.

SSH admin interface

RT 9418

Paul to remind Dmitry he promised to have a look at this ticket.

Support tickets for discussion

[Items are added here automagically]


Same time, next week.