<div dir="ltr">For information, arpwatch can be used to alert on duplicated addresses<br><br><a href="https://en.wikipedia.org/wiki/Arpwatch">https://en.wikipedia.org/wiki/Arpwatch</a></div><br><div class="gmail_quote gmail_quote_container"><div dir="ltr" class="gmail_attr">On Fri, 31 Oct 2025 at 13:13, Michael DiDomenico via lustre-discuss <<a href="mailto:lustre-discuss@lists.lustre.org">lustre-discuss@lists.lustre.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">unfortunately i don't think so. we're pretty good about assigning<br>
addresses, but still human. i don't see any evidence of a dup'd<br>
address, but i'll keep looking<br>
<br>
thanks<br>
<br>
On Thu, Oct 30, 2025 at 8:10 PM Mohr, Rick <<a href="mailto:mohrrf@ornl.gov" target="_blank">mohrrf@ornl.gov</a>> wrote:<br>
><br>
> Michael,<br>
><br>
> It might be a long shot, but is there any chance another machine has the same IP address as the one having problems?<br>
><br>
> --Rick<br>
><br>
><br>
><br>
> On 10/30/25, 3:09 PM, "lustre-discuss on behalf of Michael DiDomenico via lustre-discuss" wrote:<br>
> our network is running 2.15.6 everywhere on rhel9.5, we recently built a new machine using 2.15.7 on rhel9.6 and i'm seeing a strange problem. the client is ethernet connected to ten lnet routers which bridge ethernet to infiniband. i can mount the client just fine, read/write data, but then several hours later, the client marks all the routers offline. the only recovery is to lazy unmount, lustre_rmmod, and then restart the lustre mount nothing unusual comes out in the journal/dmesg logs. to lustre it "looks" like someone pulled the network cable, but there's no evidence that this has happened physically or even at the switch/software layers we upgraded two other machine to see if the problem replicates, but so far it hasn't. the only significant difference between the three machines is the one with the problem has heavy container (podman) usage, the others have zero. i'm not sure if this is an cause or just a red herring any suggestions<br>
><br>
><br>
_______________________________________________<br>
lustre-discuss mailing list<br>
<a href="mailto:lustre-discuss@lists.lustre.org" target="_blank">lustre-discuss@lists.lustre.org</a><br>
<a href="http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org" rel="noreferrer" target="_blank">http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org</a><br>
</blockquote></div>