Hello,
Sorry I didn't notice your e-mail earlier. We ran into that exact issue
on KLAT2, and I debugged it (at least to my satisfaction) back in an
earlier version of LAM. However, my patches didn't solve
the problem in all cases. Basically the problem is that NFS will rename
a file rather than delete it if there is a live file-descriptor for the
file (i.e. it is still "open" by something). The details are beyond
the scope of this e-mail, but suffice to say, there wasn't an elegant
solution for patching LAM as it was then (summer of 2001).
My current best solution for diskless nodes with LAM, is to use tmpfs
to create a RAM based /tmp. It's not ideal, since /tmp would now be
very limited in size, and any files there would directly compete with your
parallel program for real RAM space in the node.
In RedHat 7.x and later, you can look at the /dev/shm entry in /etc/fstab
for an example use of tmpfs. You'll need to add a line to /etc/fstab
that looks like this:
none /tmp tmpfs defaults 0 0
I don't know if the upcoming 7.0 version really fixes this issue,
but at least with the option to change the location of those special
files, more elegant solutions might present themselves.
--
Tim Mattox - tmattox_at_[hidden] - http://home.earthlink.net/~timattox
http://aggregate.org/KAOS/ - http://advogato.org/person/tmattox/
On Tue, 29 Apr 2003, Jerome BENOIT wrote:
> Hello LAM,
>
> >
> >> * Is /tmp on a network file system?
> >
>
>
> Can we impose an another `/tmp' directory ?
>
>
> Thanks,
> Jerome
|