Todo/Status List for Linux-NFS
* denotes to be done;
o denotes draft implementation, possibly commented out
- denotes done,
+ denotes done and tested
------------------------------------------------------------------
RPC:
* Server-side AUTH_DES authentication
NFS:
* stat() calls don't check whether the cached attrs are stil valid
(this is a problem in the VFS).
- NFS_ROOT stuff needs fixing.
o Swapping over NFS.
Issues of swapout:
* Avoid recursion in low memory situations where
kmalloc may call try_to_swap_out etc ad inf.
* Don't do async I/O on swap files.
For special-casing related to NFS swap I/O, flag swap file
semantics in inode->i_flags. In swapfile.c, change functions
to call readpage/writepage if available, otherwise proceed
as usual.
- Write-back support.
* Disable page cache invalidation/flushing for locked file
regions.
- Directory caching (we now have page-sized dircache entries
which could easily be organized into a linked list). These
dircache pages come along as a linked list that can be copied
almost 1-to-1 into a dirent struct. If this is put into the
VFS, other remote fs's will also benefit.
[Note: I just increased the readdir cache to hold more than
one directory. With this, the exclusive lock on readdir goes
away, too. With a larger cache, it may also be worth to think
about directory readahead...]
* Better lookup caching?
* When a read lock is present, don't time out attr cache or
page cache for that region. Likewise, if a write lock is present,
be lazy on write-back.
* Implement CTO.
- BUG: Invalidate readdir cache after remove/rename/unlink
* Automatic `mounting' when the server crosses mount points
transparently (some IRIX machines seem to do this when
using -nohide).
* NFSv3 support. This requires careful design to maximize
code sharing between NFSv2 and NFSv3.
* More robust rename handling (see comment before nfs_rename).
* Add Miquel's O_EXCL hack for file creation.
* Performance improvement: When a complete reply is received, and
the (async) task is woken up, don't put it on rpciod's scheduling
queue, but add it to a `fast scheduler queue.' The fast scheduler
could be a special handler that's registered on the tq_scheduler task
queue. This queue is fired by the kernel scheduler as soon as
the other bottom halves have been run.
Note that implementing this for sync tasks is even trickier than
for async tasks, because you have to make sure you do the right
thing in rpc_sleep_on().
* writeback of writable mmaps. Dirty pages are not subject to
writeback scheduling. Also, msync should make sure pages are
written with O_SYNC on.
nfsd:
* uid/gid mapping, and rpc.ugidd support
- Don't read/write a file that might have mandatory locks.
* Implement secure/kerberos export options (take care of lockd
fopen() calls--most clients seem to use NULL creds for lockd).
- there's a bug in readdir wrt large directories. Try mounting
the linux source tree and do an ls on include/linux...
* Support for UNIX socket creation.
* Someone should look over the error return codes. I tend to
mix up EPERM and EACCES.
* NFSv3 support.
- Refuse to look up inodes in procfs (security issues).
o Delayed writes (delay syncing of file data when nfsd handles
several write requests for the same file concurrently).
(Draft - see nfsd_write in fs/nfsd/write.c. Needs benchmarking).
* Faster read operations (single copy): mmap the file region
to be read into VM, and pass the VMA to the xdr routines
which pass the region's VM address into sock->ops->writemsg.
This copies the file data directly from the page cache into
the network buffer.
Release the vma region after encoding.
* Faster write operations (single copy, with IPv6 net layout):
Get the unfragmented UDP datagram, pull the header and
do normal processing. Then mmap the file, copy the write data,
and release VMA.
- Clear setuid/setgid bit after write().
* Quota support.
lockd:
* Server should run on privileged port.
* Testing reclaim support.
* HP lockd accepts our GRANT_MSG callback and passes on the grant
to the blocking process, but doesn't reply with a GRANT_RES.
It's not clear to me why it would do this.
* Unregister hosts (SM_UNMON) with rpc.statd when appropriate.
mountd
* Unregister service from portmapper upon exit/SIGTERM
mount
* If available, use version 3 of the mount protocol and
obtain pathconf data (fill in data->bsize).
documentation:
- Manpages need to be written