[Krypton-users] Krypton /home file system load + some advice
Kent Engström
kent at nsc.liu.se
Tue Oct 8 11:18:22 CEST 2013
Dear Krypton Users,
> we've received reports about the Krypton file system being
> more sluggish than normal.
We found a job that is likely to have been the immediate cause this time
(started at the "correct" time, using a run directory under /home,
etc).
We discussed it with the user, he stopped the job, and the /home load
seems to have dropped to normal levels at once.
We will use this opportunity to give you some advice on file system use
on Krypton:
1) Avoid flat directories with tens of thousands of files in them. That
will be bad regardless of filesystem. Use directory hierarchies.
[Probably not involved in today's problems, but we have been hurt by
that before.]
2) If your job needs to do a lot of manipulation on small files, and it
is contained within a node, and not too large, please use the node-local
scratch file system. It is autocreated when a job starts with the name
present in the $SNIC_TMP environment variable on the nodes. It is
removed when the job exits. The amount of space available is ~ 420 GiB
on normal nodes and ~ 840 GiB on fat nodes.
3) For other situations, use your /nobackup filesystems for your data
processing, including work/run directories.
4) Use /home for configuration files and precious data that you need to
have backed up on tape. That is, copy stuff to /home to protect it.
Or use symlinks to put part of you stuff on /home and part of it on
/nobackup, as appropriate.
5) If you can choose between processing data as a few large files or
many small files, choose the former. Handling huge amounts of small
files will hurt on both /nobackup and /home, to varying degrees.
--
Kent Engström, National Supercomputer Centre
kent at nsc.liu.se, +46 13 28 4444
More information about the Krypton-users
mailing list