<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
I think it was this one:<br>
<br>
<a class="moz-txt-link-freetext" href="https://git.whamcloud.com/?p=fs/lustre-release.git;a=commit;h=878561880d2aba038db95e199f82b186f22daa45">https://git.whamcloud.com/?p=fs/lustre-release.git;a=commit;h=878561880d2aba038db95e199f82b186f22daa45</a><br>
<br>
<div class="moz-cite-prefix">On 07.03.2022 09.05, Hans Henrik Happe
via lustre-discuss wrote:<br>
</div>
<blockquote type="cite"
cite="mid:cbc33f29-0f86-dee8-4ca8-0157b9e5faab@nbi.dk">
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
Hi Thomas,<br>
<br>
They should work together, but there are other requirements that
need to be fulfilled:<br>
<br>
<a class="moz-txt-link-freetext"
href="https://wiki.lustre.org/Lustre_2.12.8_Changelog"
moz-do-not-send="true">https://wiki.lustre.org/Lustre_2.12.8_Changelog</a><br>
<br>
I guess your servers are CentOS 7.9 as required for 2.12.8.<br>
<br>
I had an issue with Rocky 8.5 and the latest kernel with 2.12.8.
While RHEL 8.5 is supported there was something new after
4.18.0-348.2.1.el8_5, which caused problems. I found an LU fixing
it post 2.12.8 (can't remember the number), but downgrading to
4.18.0-348.2.1.el8_5 was the quick fix.<br>
<br>
Cheers,<br>
Hans Henrik<br>
<br>
<div class="moz-cite-prefix">On 03.03.2022 08.40, Thomas Roth via
lustre-discuss wrote:<br>
</div>
<blockquote type="cite"
cite="mid:8e842c06-bf75-3864-2fd1-5e935065bb9f@gsi.de">Dear all,
<br>
<br>
this might be just something I forgot or did not read
thoroughly, but shouldn't a 2.12.7-client work with 2.12.8 -
servers? <br>
<br>
The 2.12.8-changelog has the standard disclaimer <br>
<blockquote type="cite">Interoperability Support: <br>
Clients & Servers: Latest 2.10.X and Latest 2.11.X <br>
</blockquote>
<br>
<br>
<br>
I have this test cluster that I upgraded recently to 2.12.8 on
the servers. <br>
<br>
The fist client I attached now is a fresh install of rhel 8.5
(Alma). <br>
I installed 'kmod-lustre-client' and `lustre-client` from <a
class="moz-txt-link-freetext"
href="https://downloads.whamcloud.com/public/lustre/lustre-2.12.8/el8.5.2111/"
moz-do-not-send="true">https://downloads.whamcloud.com/public/lustre/lustre-2.12.8/el8.5.2111/</a>
<br>
I copied a directory containing ~5000 files - no visible issues
<br>
<br>
<br>
The next client was also installed with rhel 8.5 (Alma), but now
using 'lustre-client-2.12.7-1' and 'lustre-client-dkms-2.12.7-1'
from <br>
<a class="moz-txt-link-freetext"
href="https://downloads.whamcloud.com/public/lustre/lustre-2.12.7/el8/client/RPMS/x86_64/"
moz-do-not-send="true">https://downloads.whamcloud.com/public/lustre/lustre-2.12.7/el8/client/RPMS/x86_64/</a>
<br>
<br>
As on my first client, I copied a directory containing ~5000
files. The copy stalled, and the OSTs exploded in my face <br>
<br>
<blockquote type="cite">kernel: LustreError:
23345:0:(events.c:310:request_in_callback()) event type 2,
status -103, </blockquote>
service ost_io <br>
<blockquote type="cite">kernel: LustreError:
40265:0:(pack_generic.c:605:__lustre_unpack_msg()) message
length 0 too small </blockquote>
for magic/version check <br>
<blockquote type="cite">kernel: LustreError:
40265:0:(sec.c:2217:sptlrpc_svc_unwrap_request()) error
unpacking request from </blockquote>
12345-10.20.2.167@o2ib6 x1726208297906176 <br>
<blockquote type="cite">kernel: LustreError:
23345:0:(events.c:310:request_in_callback()) event type 2,
status -103, </blockquote>
service ost_io <br>
<br>
<br>
The latter message is repeated ad infinitum. <br>
<br>
The client log blames the network: <br>
<blockquote type="cite">Request sent has failed due to network
error <br>
Connection to was lost; in progress operations using this
service will wait for recovery to complete <br>
</blockquote>
<br>
<blockquote type="cite">LustreError:
181316:0:(events.c:205:client_bulk_callback()) event type 1,
status -103, desc0000000086e248d6 <br>
LustreError: 181315:0:(events.c:205:client_bulk_callback())
event type 1, status -5, desc </blockquote>
00000000e569130f <br>
<br>
<br>
<br>
There is also a client running Debian 9 and Lustre 2.12.6
(compiled from git) - no trouble at all. <br>
<br>
<br>
The I switched those two rhel8.5-clients: reinstalled the OS,
gave the first one the 2.12.7 -packages, the second on the
2.12.8 - and the error followed: again the client running with
'lustre-client-dkms-2.12.7-1' immedeately ran into trouble,
causing the same error messages in the logs. <br>
So this is not a network problem in the sense of broken hardware
etc. <br>
<br>
<br>
What did I miss? <br>
Some important Jira I did not read? <br>
<br>
<br>
Regards <br>
Thomas <br>
<br>
<br>
</blockquote>
<br>
<br>
<fieldset class="moz-mime-attachment-header"></fieldset>
<pre class="moz-quote-pre" wrap="">_______________________________________________
lustre-discuss mailing list
<a class="moz-txt-link-abbreviated" href="mailto:lustre-discuss@lists.lustre.org">lustre-discuss@lists.lustre.org</a>
<a class="moz-txt-link-freetext" href="http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org">http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org</a>
</pre>
</blockquote>
<br>
</body>
</html>