:: [DNG] NFS Stale file handle for reg…
Top Page
Delete this message
Reply to this message
Author: Jackman
Date:  
To: DNG
Subject: [DNG] NFS Stale file handle for regular user, not root
I don't even know how to Google for this.

Hosts involved:
storage0: NFS/KRB5 server
dorito: NFS client

As root, I can mount, list files, view files, and so on. When I try to list
files on the mount, I get a stale file handle error.

➜  ~ mount /mnt/backup && ls -hal /mnt/backup
ls: cannot open directory '/mnt/backup': Stale file handle
➜  ~ ls /mnt -hal
ls: cannot access '/mnt/backup': Permission denied
total 24K
drwxr-xr-x  7 root root 4.0K Feb 24 12:10 .
drwxr-xr-x 22 root root 4.0K Feb 23 15:56 ..
d?????????  ? ?    ?       ?            ? backup


Note the difference with root:

dorito :: ~ » mount /mnt/backup && ls -hal /mnt/backup
total 57K
drwxr-xr-x 12 root    root      12 Feb 23 03:36 .
drwxr-xr-x  7 root    root    4.0K Feb 24 12:10 ..
drwxr-xr-x  3 jackman jackman    3 Feb 11 04:21 backup_dorito_20210211
drwxr-xr-x  4 jackman jackman    4 Feb 23 06:16 dorito_20210223-0336
dorito :: ~ » ls -hal /mnt
total 26K
drwxr-xr-x  7 root root 4.0K Feb 24 12:10 .
drwxr-xr-x 22 root root 4.0K Feb 23 15:56 ..
drwxr-xr-x 12 root root   12 Feb 23 03:36 backup


Cute, eh?

This is not a problem on the storage server itself:

storage0 :: /srv » mount /mnt/backup && ls -hal /mnt/backup
total 57K
drwxr-xr-x 12 root    root      12 Feb 23 03:36 .
drwxr-xr-x  8 root    root    4.0K Feb 24 14:16 ..
drwxr-xr-x  3 jackman jackman    3 Feb 11 04:21 backup_dorito_20210211
drwxr-xr-x  4 jackman jackman    4 Feb 23 06:16 dorito_20210223-0336


Here are some relevant things:

➜ ~ mount | grep nfs
storage0:/srv/backup on /mnt/backup type nfs4
(rw,nosuid,nodev,noexec,relatime,vers=4.2,rsize=1048576,wsize=1048576,namlen=255,hard,proto=tcp,timeo=600,retrans=2,sec=krb5,clientaddr=10.1.0.110,local_lock=none,addr=10.1.0.100,user=jackman)

➜  ~ cat /etc/fstab | grep backup
storage0:/srv/backup    /mnt/backup     nfs4
 noauto,rw,noexec,user,sec=krb5  0       0


storage0 :: /srv » cat /etc/exports | grep backup
/srv/backup     10.0.0.0/8(rw,no_subtree_check,sec=krb5)


storage0 :: /srv » cat /etc/default/nfs-kernel-server
RPCNFSDCOUNT=32
RPCNFSDPRIORITY=0
RPCMOUNTDOPTS="--manage-gids"
NEED_SVCGSSD="yes"
RPCSVCGSSDOPTS=""

storage0 :: /srv » cat /etc/default/nfs-common
NEED_STATD=
STATDOPTS=
NEED_IDMAPD="yes"
NEED_GSSD="yes"

Networking is static:

storage0 :: /var/log/kerberos » cat /etc/hosts
127.0.0.1       localhost
::1     localhost ip6-localhost ip6-loopback
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
10.1.0.100      storage0.jackman.local storage0
10.1.0.110      dorito.jackman.local dorito


The mount appears to execute cleanly:

➜ ~ mount -v /mnt/backup
mount.nfs4: timeout set for Wed Feb 24 14:27:58 2021
mount.nfs4: trying text-based options
'sec=krb5,vers=4.2,addr=10.1.0.100,clientaddr=10.1.0.110'

I don't see anything in the system logs on either machine that look at all
relevant.

Nothing odd (IMHO) is happening in the KRB5 logs, just successful grants.

I thought this problem was exclusive to dorito (the client), so I nuked and
re-installed Devuan. I have since installed Xubuntu on my desktop and it
has the same issue now, too, but I'm politely setting that machine aside as
it's not a Devuan machine.

BTW, I've tried with 'noac', with no apparent change in behavior.

I'm happy to RTFM if I knew what I was looking for.

As always, any helpful souls are welcome to solicit on behalf of their beer
fund.

Andrew Jackman
kd7nyq@???