[Lustre-discuss] Extremely high load and hanging processes on a Lustre client

Bernd Schubert bs_lists at aakef.fastmail.fm
Fri Mar 5 16:38:17 PST 2010


On Friday 05 March 2010, Götz Waschk wrote:
> Hi everyone,
> 
> I have a critical problem on one of my Lustre client machines running
> Scientific Linux 5.4 and the patchless Lustre 1.8.2 client. After a
> few days of usage, some processes like cp and kswapd0 start to use
> 100% CPU. Only 180k of swap space are in use though.
> 
> Processes that try to access Lustre use a lot of CPU and seem to hang.
> 
> There is some output in the kernel log I'll attach to this mail.
> 
> Do you have any idea what to test before rebooting the machine?

Don't reboot, but disable LRU resizing. 

for i in /proc/fs/lustre/ldlm/namespaces/*; do echo 800 > ${i}/lru_size; done


At least that helped all the time before when we had that problem. I hoped it 
would be fixed in 1.8.2, but seems it is not. Please open a bug report.


Thanks,
Bernd

-- 
Bernd Schubert
DataDirect Networks



More information about the lustre-discuss mailing list