[Lustre-discuss] page allocation failure
Wang lu
wanglu at ihep.ac.cn
Sun Nov 30 23:52:55 PST 2008
Dear Brain,
There are two kinds of threads that "may be" ost thread
ll_ost_** and ll_ost_io_**. Which one is ost thread? I am asking this question
to make sure the module option works.
in /etc/modprobe.conf:
options ost oss_num_threads=200
[root at boss03 ~]# ps -ef | grep ll_ost|wc -l
405
[root at boss03 ~]# ps -ef | grep ll_ost_io|wc -l
201
If ll_ost_io is ost thread, then what is ll_ost?
Brian J. Murrell 写:
> On Fri, 2008-11-28 at 16:22 +0800, Wang lu wrote:
>>
>> Nov 28 16:12:50 boss01 kernel: LustreError: 2129:0:(filter_io_26.c:779:
>> filter_commitrw_write()) besfs-OST0005: slow direct_io 53s
>> Nov 28 16:12:50 boss01 kernel: LustreError: 871:0:(lustre_fsfilt.h:318:
>> fsfilt_commit_wait()) besfs-OST0005: slow journal start 53s
>> Nov 28 16:12:50 boss01 kernel: LustreError: 871:0:(filter_io_26.c:792:
>> filter_commitrw_write()) besfs-OST0005: slow commitrw commit 53s
>> Nov 28 16:13:24 boss01 kernel: LustreError: 838:0:(filter_io_26.c:714:
>> filter_commitrw_write()) besfs-OST0002: slow i_mutex 123s
>> Nov 28 16:13:24 boss01 kernel: LustreError: 838:0:(filter_io_26.c:714:
>> filter_commitrw_write()) Skipped 84 previous similar messages
>> Nov 28 16:13:24 boss01 kernel: LustreError: 2107:0:(lustre_fsfilt.h:262:
>> fsfilt_brw_start_log()) besfs-OST0002: slow journal start 123s
>> Nov 28 16:13:24 boss01 kernel: LustreError: 2107:0:(lustre_fsfilt.h:262:
>> fsfilt_brw_start_log()) Skipped 10 previous similar messages
>> Nov 28 16:13:24 boss01 kernel: LustreError: 2107:0:(filter_io_26.c:727:
>> filter_commitrw_write()) besfs-OST0002: slow brw_start 123s
>> Nov 28 16:13:24 boss01 kernel: LustreError: 2107:0:(filter_io_26.c:727:
>> filter_commitrw_write()) Skipped 11 previous similar messages
>> Nov 28 16:13:24 boss01 kernel: LustreError: 2093:0:(lustre_fsfilt.h:227:
>> fsfilt_start_log()) besfs-OST0002: slow journal start 122s
>> Nov 28 16:13:24 boss01 kernel: LustreError: 2175:0:(lustre_fsfilt.h:318:
>> fsfilt_commit_wait()) besfs-OST0002:
>
> Your storage is too slow for the OSS load you are throwing at it. Try
> reducing the number of OST threads on your OSSes. I don't recall
> exactly the name of the module option but it is an ost module option.
> modinfo should tell you. I'm sure the operations manual covers this as
> well.
>
> b.
>
More information about the lustre-discuss
mailing list