[Lustre-discuss] clients gets EINTR from time to time

Francois Chassaing fch at weborama.com
Thu Feb 24 05:16:40 PST 2011


Well, as I understand your point and I do also understand that this signal is not a malfunction, 
my question was regarding to the intrinsic "why" (and when) does this signal is sent to the client.

Thnaks

line
weborama	line	François Chassaing Directeur Technique - CTO 
weborama.com - fch at weborama.com 
T : +33 (0)1 53 19 21 51 F : +33 (0)1 53 19 21 41 
Weborama - 15 rue Clavel 75019 Paris 

----- Mail Original -----
De: "Brian J. Murrell" <brian at whamcloud.com>
À: lustre-discuss at lists.lustre.org
Envoyé: Jeudi 24 Février 2011 13h17:33 GMT +01:00 Amsterdam / Berlin / Berne / Rome / Stockholm / Vienne
Objet: Re: [Lustre-discuss] clients gets EINTR from time to time

On 11-02-24 05:50 AM, Francois Chassaing wrote:
> Dear list members, 

Hi,

> We are experiencing issues with an application that gets a EINTR when trying to write to a file. 

If I understand that errno properly, that is to be expected.

> Those errors happens "randomly" on both clients,

Well, not "randomly".  It happens when a signal arrives.

> So my app treats the error as if the file was full

This is wrong.  Your app is broken and needs to be fixed.

> I've tryed to change the IB switch, so it is most probably not coming from here (while it is a "cheap" switch). I've also tried to change the client mount options, changed the stripping policy from -1 to 1, but it did not change anything neither. 

None of this is going to resolve your problem.  Yours is a problem of
application programming defect, not a system fault.

> I would really appreciate pointers or suggestions to debug this issue. 

Maybe some understanding of how signals can affect system calls.  A
quick google found this for me:

http://www.gnu.org/s/libc/manual/html_node/Interrupted-Primitives.html#Interrupted-Primitives

Probably there is more detailed text out there to help you and your
application programmer to handle this application programming fault
better.  But alas, it is an application programming problem and not a
Lustre filesystem or equipment problem.

b.

-- 
Brian J. Murrell
Senior Software Engineer
Whamcloud, Inc.


_______________________________________________
Lustre-discuss mailing list
Lustre-discuss at lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss



More information about the lustre-discuss mailing list