[lustre-discuss] LBUG: 2.5.16, EL8 Linux 4.18.0-553.30.1 in 'll_truncate_inode_pages

Aurelien Degremont adegremont at nvidia.com
Thu Jan 2 05:26:17 PST 2025


Hello Peter,

This is indeed this bug, and as the ticket said, this is not fixed in 2.15.x, only 2.16.0
I've no idea if this will ever make 2.15.x


Aurélien

________________________________
De : lustre-discuss <lustre-discuss-bounces at lists.lustre.org> de la part de Peter Grandi <pg at lustre.list.sabi.co.UK>
Envoyé : jeudi 2 janvier 2025 13:45
À : list Linux fs Lustre <lustre-discuss at lists.Lustre.org>
Objet : [lustre-discuss] LBUG: 2.5.16, EL8 Linux 4.18.0-553.30.1 in 'll_truncate_inode_pages

External email: Use caution opening links or attachments


Relatively rarely across a 200-machine cluster I get an LBUG on the
clients which seems triggered by specific access patterns (most jobs do
not trigger it) and looks quite similar to:

  https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fjira.whamcloud.com%2Fbrowse%2FLU-16637&data=05%7C02%7Cadegremont%40nvidia.com%7Cf5d4d3826d8a48b637a508dd2b2bc296%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638714189150988150%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=D4tRyR%2BoN2euGk8W5zjL1Y3ksJ6j0iFuJ7%2FS%2BYidmIg%3D&reserved=0<https://jira.whamcloud.com/browse/LU-16637>
  https://nam11.safelinks.protection.outlook.com/?url=http%3A%2F%2Flists.lustre.org%2Fpipermail%2Flustre-devel-lustre.org%2F2023-April%2F011016.html&data=05%7C02%7Cadegremont%40nvidia.com%7Cf5d4d3826d8a48b637a508dd2b2bc296%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638714189151026923%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=Qwm%2Blb7Wb4nGaO%2Fb6wW6LPUuW22DsLlwnUGG2R5wtik%3D&reserved=0<http://lists.lustre.org/pipermail/lustre-devel-lustre.org/2023-April/011016.html>
  https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgit.whamcloud.com%2F%3Fp%3Dfs%2Flustre-release.git%3Ba%3Dcommit%3Bh%3D7bb1e211d217d5a82ac2d5e4edad5ae018090761&data=05%7C02%7Cadegremont%40nvidia.com%7Cf5d4d3826d8a48b637a508dd2b2bc296%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638714189151046567%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=8JLPFy1CyyoJn9OrW8i6mnSo%2BW6Q3D2JSK0ZNKHCn%2Fg%3D&reserved=0<https://git.whamcloud.com/?p=fs/lustre-release.git;a=commit;h=7bb1e211d217d5a82ac2d5e4edad5ae018090761>

Since the LBUG is fatal all I get is the backtrace from the crash dump:

  lbug_with_loc.cxold.8+0x18
  ll_truncate_inode_pages_final+0xab
  vvp_prune+0x181
  cl_object_prune+0x58
  lov_layout_change.isra.49+0x1ba
  lov_conf_set+0x391
  cl_conf_set+0x60
  ll_layout_conf+0x14b
  ? _ptlrpc_req_finished+0x54d
  ll_layout_lock_set+0x3df
  ? ll_take_md_lock+0x148
  ll_layout_refresh+0x1cc
  vvp_io_init+0x22e
  cl_io_init0.isra.14+0x86
  ll_file_io_generic+0x388
  ? file_update_time+0x62
  ? srso_return_thunk+0x5
  ? __generic_file_write_iter+0x102
  ll_file_write_iter+0x558
  ? kmem_cache_freee+0x116
  new_sync_write+0x112
  vfs_write+0x5a

If this is a manifestation of LU-16637 there is a fix, but I have
checked the changelogs and LU-16637 is listed as applied to 2.16.0 but
it does not seem to be listed in the 2.15.[1-6] changelogs.
_______________________________________________
lustre-discuss mailing list
lustre-discuss at lists.lustre.org
https://nam11.safelinks.protection.outlook.com/?url=http%3A%2F%2Flists.lustre.org%2Flistinfo.cgi%2Flustre-discuss-lustre.org&data=05%7C02%7Cadegremont%40nvidia.com%7Cf5d4d3826d8a48b637a508dd2b2bc296%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638714189151061516%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=5zgL9DVZ6HA1M630A1e4RSajZnPBoCfQA%2BuryjBnbfk%3D&reserved=0<http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20250102/9288fbaa/attachment.htm>


More information about the lustre-discuss mailing list