wiki:developers-meeting-20110427

[part of a series of meetings]

Participants

Agenda

[see box on the right-hand side]

Postcards

Up to two minutes (uninterrupted) per person where they can answer two questions:

  • What I did last week (since the last meeting),
  • What I plan to do in the next week.

No questions until we get through everyone :)

Plans for patch-releases

Should we make a new patch release?

Trunk activity

Progress with new features...

Workshops

Lund

First: workshop in Lund; make sure you have hotels and flights booked.

dCache developers

Need someone to take care that there's an agenda, etc .. a caretaker.

Dmitry has volunteered to take on this role.

Patrick will give Dmitry access to the agenda page so he is free to change the slots.

Topics include: review gPlazma; plan for the next year.

Important to work out the planning what sessions we shall have.

Patrick to send Dmitry the list of participants.

Tanja to find a nice restaurant: preferably one light one and a more heavy one, too.

emails and mailing lists

Two questions about NFS .. no answers yet.

Do I need to restart dCache after changing

If you change the exports file then you must restart dCache server.

Are you sure it's NFS v4? Yes. Ah, that's a surprise.

What about uid / gid mapping on the client host.

You must run the nfsidmapd daemon.

With Unix-permissions, the linux kernel checks if the ID string contains only digits. If so, then it treats the string as a UID / GID.

Working on integrating Kerberos support

I run only NFSv4 or NFSv3 but not both. Can this be fixed?

Yes; the problem is that they are started as separate services. The both try to open the same port-number, so cannot work. There is support for running both within the same cell, but this is awkward to make the choice between NFS-3, NFS-41 or NFS-3+NFS-41 with Spring.

This is for a demo of NFSv4.1/pNFS at Fermi.

The current behaviour is that the NFS server prevents clients from staging file. This is a deliberate policy decision, but hard-coded.

NFS has a time-out of 30 seconds: if hard-mounted then client will hang until file is available; if soft-mounted then the client will keep retrying.

Patrick: if portmapper is running then dCache doesn't work with NFS.

Needs a configuration in /etc/sysconf/rpcbind to tell it that dCache is allowed to register with port number > 1024.

Are you happy to do exercises with Scientific Linux 6?

Yes, that should be OK.

The SL-5 builds is something Tigran supports; Today, learnt that Fedora-core 16 kernels support NFS 4.1 out-of-the-box. Installing the FC-16 kernel in SL-6 then NFSv4.1/pNFS should just work.

Issues from yesterday's Tier-1 meeting

News about enstore / pnfs problem seems to be solved with a kernel upgrade.

ON client running the kernel 2.6.18-2.38.5.1. Cat on the name of file (via a dot-command). Since encp (a lot of stuff) is "monitoring" '(nameof)' and similar monitoring.

The file ID has changed.

The funny thing was all reads were sent back were OK; none returned stale-file-handle.

Infinite loop on open on the file didn't crash the kernel.

The encp crashes the kernel almost straight away.

encp does open on '.(nameof)('

Could be the file-size doesn't match the size of the file.

We can either match the kernel-patch to our code or we say "it's solved".

Pragmatically, we can point anyone with the problem to the newer kernel version.

Issues from EMI

Nothing really to reported, except the metrics.

Metrics

EMI produces metrics: SLOC count, FindBugs?, etc.

Tigran pointed out the code-branch that is srmclient-only.

Patrick suggested they take it as one component: client+server. Take the SLOC-count of both.

Do they have exclude filters? Probably; what for? For a specific error; e.g., "your java name isn't compliant" (e.g., from PMD or FindBugs?). We exclude stupid cases (e.g., due to auto-generated).

Some things like taking Thread.start out of cells would take a lot of effort, so we won't change it. It's too dangerous to change it only to decrease the FindBug? counts.

Do they run FindBugs? on other projects? Yup.

Tigran wants to drop FindBugs? from our source-tree.

Outstanding RT Tickets

[This is an auto-generated item. Don't add items here directly]

 RT 6167: bringonline failed due to pinning failed:finding read pool failed

Gerd added to the agenda.

Patch to fix this problem has been committed to 1.9.5 branch (and all others, too).

Do we want to do a release of 1.9.5 to get the fix out? Tigran: yes.

Ticket #6218

Yesterday Dmitry says it works with v1.9.5, so if 1.9.5 uses the same jline version as dCache v1.9.10 then it's our bug and needs to be fixed.

If the problem is in jline then we can patch it and submit the patch upstream.

If not, then we can fix our code ;-)

Tigran will try not to forget about it.

Review of RB requests

Please (Paul) look at the GSS code (it's for 1.9.12 ;-)

DTNM

Same time, next week.