← Back to team overview

canonical-ubuntu-qa team mailing list archive

[Bug 2121866] Re: ubuntu_kselftests_net:nl_netdev.py regression plucky nsim_queue_stop [netdevsim]

 

This bug is awaiting verification that the linux/6.14.0-36.36 kernel in
-proposed solves the problem. Please test the kernel and update this bug
with the results. If the problem is solved, change the tag
'verification-needed-plucky-linux' to 'verification-done-plucky-linux'.
If the problem still exists, change the tag 'verification-needed-plucky-
linux' to 'verification-failed-plucky-linux'.


If verification is not done by 5 working days from today, this fix will
be dropped from the source code, and this bug will be closed.


See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how
to enable and use -proposed. Thank you!


** Tags added: kernel-spammed-plucky-linux-v2 verification-needed-plucky-linux

-- 
You received this bug notification because you are a member of Canonical
Platform QA Team, which is subscribed to ubuntu-kernel-tests.
https://bugs.launchpad.net/bugs/2121866

Title:
  ubuntu_kselftests_net:nl_netdev.py regression plucky nsim_queue_stop
  [netdevsim]

Status in ubuntu-kernel-tests:
  Invalid
Status in linux package in Ubuntu:
  Invalid
Status in linux source package in Plucky:
  In Progress

Bug description:
  [SRU Justification]

  == Impact ==
  Since SRU cycle 2025.08.11 we saw the ubuntu_kselftest_net suite failing
  more or less harshly. The reason was that running nl_netdev.py was
  causing a crash:

  [ 5769.558910] RIP: 0010:mutex_lock+0x1c/0x50
  [ 5769.568304] napi_disable+0x1d/0x50
  [ 5769.568661] nsim_queue_stop+0x27/0x80 [netdevsim]
  [ 5769.569140] netdev_rx_queue_restart+0x14a/0x260
  [ 5769.569603] nsim_qreset_write+0x10d/0x150 [netdevsim]
  [ 5769.570113] full_proxy_write+0x5e/0xa0
  [ 5769.570499] vfs_write+0x10a/0x460

  == Fix ==
  It was found that this got introduced by
   "netdevsim: allow normal queue reset while down"
  Apparently it should be allowed but stopping and starting the queue(s) is not allowed while a device is not running.
  This was handled with the following upstream change:
   net: devmem: don't call queue stop / start when the interface is down
  However the submission was explicitly stating that is was not inteneded as a bug fix but a clarification. Therefor it never was picked into upstream stable.

  == Testcase ==
  Running ubuntu_kselftest_net (specifically nl_netdev.py) should no longer fail and produce an oops.

  == Regression Potential ==
  Potentially the behaviour of netdev_queue_restart() (resetting network devices) is impacted.

  ---
  With recent upstream changes (6.14.0-30.30), ubuntu_kselftests_net fails due to regression in netdevsim module. Because this is related to netdevsim, it should not block us from releasing the kernel.

  Possibly offending commit:
  --------------------------
  netdevsim: allow normal queue reset while down

  BugLink: https://bugs.launchpad.net/bugs/2115678

  [ Upstream commit 285b3f78eabd951e59e98f01f86abaaa6c76cd44 ]

  Resetting queues while the device is down should be legal.
  Allow it, test it. Ideally we'd test this with a real device
  supporting devmem but I don't have access to such devices.

  Failing test cases:
  -------------------
   - net:nl_netdev.py (this manipulates the state of netdevsim (down, up) and
     causes the regression, afterwards other tests fail (timeout) because rtnl_lock()
     is held before the panic and network is not responsive)
   - net:srv6_end_dt46_l3vpn_test.sh
   - net:srv6_end_dt4_l3vpn_test.sh
   - sut-test

  Regression:
  -----------
  kt-p-l-gen-6-14-u-kselftests-net-amd64 login:
  [ 5585.609562] ip_local_port_range: prefer different parity for start/end values.
  [ 5769.554010] BUG: kernel NULL pointer dereference, address: 0000000000000a20
  [ 5769.554781] #PF: supervisor write access in kernel mode
  [ 5769.555294] #PF: error_code(0x0002) - not-present page
  [ 5769.555796] PGD 1157cd067 P4D 1157cd067 PUD 1075e3067 PMD 0
  [ 5769.556387] Oops: Oops: 0002 [#1] PREEMPT SMP NOPTI
  [ 5769.556872] CPU: 0 UID: 0 PID: 395903 Comm: python3 Not tainted 6.14.0-32-generic #32-Ubuntu
  [ 5769.557686] Hardware name: OpenStack Foundation OpenStack Nova, BIOS 1.16.3-debian-1.16.3-2~backport22.04.202412060920~ubuntu22.04.1 04/01/2014
  [ 5769.558910] RIP: 0010:mutex_lock+0x1c/0x50
  [ 5769.560030] Code: 90 90 90 90 90 90 90 90 90 90 90 90 90 90 0f 1f 44 00 00 55 48 89 e5 53 48 89 fb e8 fe d6 ff ff 65 48 8b 15 a6 ae cf 56 31 c0 <f0> 48 0f b1 13 75 10 48 8b 5d f8 c9 31 c0 31 d2 31 ff e9 ad 74 00
  [ 5769.561801] RSP: 0018:ffffcf8381803a88 EFLAGS: 00010246
  [ 5769.562317] RAX: 0000000000000000 RBX: 0000000000000a20 RCX: 0000000000000000
  [ 5769.563005] RDX: ffff8b7146032a00 RSI: 0000000000000000 RDI: 0000000000000000
  [ 5769.563694] RBP: ffffcf8381803a90 R08: 0000000000000000 R09: 0000000000000000
  [ 5769.564395] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
  [ 5769.565085] R13: ffff8b71402accd0 R14: 0000000000000001 R15: 0000000000000000
  [ 5769.565779] FS:  0000788a7e85e780(0000) GS:ffff8b717bc00000(0000) knlGS:0000000000000000
  [ 5769.566558] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [ 5769.567120] CR2: 0000000000000a20 CR3: 000000010a1c2000 CR4: 0000000000350ef0
  [ 5769.567813] Call Trace:
  [ 5769.568079]  <TASK>
  [ 5769.568304]  napi_disable+0x1d/0x50
  [ 5769.568661]  nsim_queue_stop+0x27/0x80 [netdevsim]
  [ 5769.569140]  netdev_rx_queue_restart+0x14a/0x260
  [ 5769.569603]  nsim_qreset_write+0x10d/0x150 [netdevsim]
  [ 5769.570113]  full_proxy_write+0x5e/0xa0
  [ 5769.570499]  vfs_write+0x10a/0x460
  [ 5769.570843]  ? srso_return_thunk+0x5/0x5f
  [ 5769.571241]  ? arch_exit_to_user_mode_prepare.isra.0+0x22/0xd0
  [ 5769.571812]  ? srso_return_thunk+0x5/0x5f
  [ 5769.572221]  ? syscall_exit_to_user_mode+0x38/0x1d0
  [ 5769.572705]  ksys_write+0x70/0xf0
  [ 5769.573042]  __x64_sys_write+0x19/0x30
  [ 5769.573418]  x64_sys_call+0x2a3/0x2310
  [ 5769.573795]  do_syscall_64+0x7e/0x170
  [ 5769.574165]  ? srso_return_thunk+0x5/0x5f
  [ 5769.574565]  ? putname+0x60/0x80
  [ 5769.574894]  ? srso_return_thunk+0x5/0x5f
  [ 5769.575334]  ? do_sys_openat2+0xa4/0xf0
  [ 5769.575721]  ? do_syscall_64+0x8a/0x170
  [ 5769.576118]  ? srso_return_thunk+0x5/0x5f
  [ 5769.576518]  ? srso_return_thunk+0x5/0x5f
  [ 5769.576916]  ? srso_return_thunk+0x5/0x5f
  [ 5769.577682]  ? arch_exit_to_user_mode_prepare.isra.0+0x22/0xd0
  [ 5769.578584]  ? srso_return_thunk+0x5/0x5f
  [ 5769.579289]  ? syscall_exit_to_user_mode+0x38/0x1d0
  [ 5769.580086]  ? srso_return_thunk+0x5/0x5f
  [ 5769.580766]  ? do_syscall_64+0x8a/0x170
  [ 5769.581428]  ? srso_return_thunk+0x5/0x5f
  [ 5769.582114]  ? srso_return_thunk+0x5/0x5f
  [ 5769.582780]  ? arch_exit_to_user_mode_prepare.isra.0+0x22/0xd0
  [ 5769.583619]  ? srso_return_thunk+0x5/0x5f
  [ 5769.584299]  ? syscall_exit_to_user_mode+0x38/0x1d0
  [ 5769.585053]  ? srso_return_thunk+0x5/0x5f
  [ 5769.585706]  ? do_syscall_64+0x8a/0x170
  [ 5769.586339]  ? srso_return_thunk+0x5/0x5f
  [ 5769.586992]  ? do_syscall_64+0x8a/0x170
  [ 5769.587617]  ? srso_return_thunk+0x5/0x5f
  [ 5769.588251]  ? do_syscall_64+0x8a/0x170
  [ 5769.588853]  ? sysvec_call_function_single+0x57/0xc0
  [ 5769.589568]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
  [ 5769.590297] RIP: 0033:0x788a7e69eba6
  [ 5769.590887] Code: 00 00 48 8b 15 53 12 17 00 64 89 02 48 c7 c2 ff ff ff ff 48 8b 5d f8 c9 48 89 d0 c3 0f 1f 84 00 00 00 00 00 48 8b 45 10 0f 05 <48> 63 d0 3d 00 f0 ff ff 77 10 48 8b 5d f8 48 89 d0 c9 c3 0f 1f 80
  [ 5769.593223] RSP: 002b:00007ffca0364020 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
  [ 5769.594207] RAX: ffffffffffffffda RBX: 0000788a7e85e780 RCX: 0000788a7e69eba6
  [ 5769.595150] RDX: 0000000000000003 RSI: 0000000022a02f00 RDI: 0000000000000006
  [ 5769.596115] RBP: 00007ffca0364030 R08: 0000000000000000 R09: 0000000000000000
  [ 5769.597057] R10: 0000000000000000 R11: 0000000000000202 R12: 0000000000000000
  [ 5769.598002] R13: 0000788a7e85e6e8 R14: 0000000000000006 R15: 0000000000a814f0
  [ 5769.598953]  </TASK>
  [ 5769.599440] Modules linked in: netdevsim act_gact cls_bpf xt_HL amt act_tunnel_key bareudp sch_netem xfrm_interface cls_matchall ib_core ip6t_rpfilter mpls_gso mpls_iptunnel mpls_router l2tp_ip6 l2tp_eth l2tp_ip l2tp_netlink l2tp_core br_netfilter ip6t_REJECT nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_tcpudp sch_etf sch_fq dccp_ipv6 dccp_ipv4 dccp nft_chain_nat xt_nat nf_nat algif_hash af_alg mptcp_diag xsk_diag vsock_diag tcp_diag udp_diag raw_diag inet_diag unix_diag af_packet_diag netlink_diag xt_conntrack nf_conntrack nf_defrag_ipv4 nf_defrag_ipv6 ip_vti ip6_vti xfrm6_tunnel fou6 sit ipip tunnel4 geneve act_csum act_pedit cls_flower sch_prio xt_mark sch_ingress act_mirred cls_basic esp6 authenc echainiv xt_policy nft_compat nf_tables bonding esp4_offload ptp_mock psample esp4 xfrm_user xfrm_algo macsec fou vxlan ip6_udp_tunnel udp_tunnel vrf 8021q garp mrp bridge stp llc ip6_gre ip6_tunnel tunnel6 ip_gre ip_tunnel gre cls_u32 sch_htb dummy veth aria_aesni_avx2_x86_64 aria_aesni_avx_x86_64 aria_generic ccm
  [ 5769.599603]  sm4_generic sm4_aesni_avx2_x86_64 sm4_aesni_avx_x86_64 sm4 poly1305_generic libpoly1305 poly1305_x86_64 chacha_generic chacha_x86_64 libchacha chacha20poly1305 tls qrtr cfg80211 binfmt_misc intel_rapl_msr intel_rapl_common nls_iso8859_1 kvm_amd ccp kvm irqbypass qxl drm_ttm_helper i2c_piix4 ttm i2c_smbus drm_exec joydev input_leds mac_hid sch_fq_codel efi_pstore msr dm_multipath nfnetlink vsock_loopback vmw_vsock_virtio_transport_common vmw_vsock_vmci_transport vsock vmw_vmci dmi_sysfs qemu_fw_cfg ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 linear polyval_clmulni psmouse polyval_generic ghash_clmulni_intel sha256_ssse3 sha1_ssse3 serio_raw virtio_rng floppy pata_acpi aesni_intel crypto_simd cryptd [last unloaded: br_netfilter]
  [ 5769.620077] CR2: 0000000000000a20
  [ 5769.620767] ---[ end trace 0000000000000000 ]---
  [ 5769.621605] RIP: 0010:mutex_lock+0x1c/0x50
  [ 5769.622389] Code: 90 90 90 90 90 90 90 90 90 90 90 90 90 90 0f 1f 44 00 00 55 48 89 e5 53 48 89 fb e8 fe d6 ff ff 65 48 8b 15 a6 ae cf 56 31 c0 <f0> 48 0f b1 13 75 10 48 8b 5d f8 c9 31 c0 31 d2 31 ff e9 ad 74 00
  [ 5769.624961] RSP: 0018:ffffcf8381803a88 EFLAGS: 00010246
  [ 5769.625879] RAX: 0000000000000000 RBX: 0000000000000a20 RCX: 0000000000000000
  [ 5769.626975] RDX: ffff8b7146032a00 RSI: 0000000000000000 RDI: 0000000000000000
  [ 5769.628086] RBP: ffffcf8381803a90 R08: 0000000000000000 R09: 0000000000000000
  [ 5769.629269] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
  [ 5769.630400] R13: ffff8b71402accd0 R14: 0000000000000001 R15: 0000000000000000
  [ 5769.631577] FS:  0000788a7e85e780(0000) GS:ffff8b717bc00000(0000) knlGS:0000000000000000
  [ 5769.632816] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [ 5769.633831] CR2: 0000000000000a20 CR3: 000000010a1c2000 CR4: 0000000000350ef0
  [ 5769.634976] note: python3[395903] exited with irqs disabled

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/2121866/+subscriptions



References