canonical-ubuntu-qa team mailing list archive

[Bug 2103713] [NEW] ubuntu_dgx_mofed_build.dkms fails on nvidia-dgx-2 running bionic:linux

 

Public bug reported:

The ubuntu_dgx_mofed_build.dkms test assumes that the MOFED DKMS modules
are the only DKMS modules installed.

From autotest-client-tests/ubuntu_dgx_mofed_build/check-mofed-modules.sh:

# This test assumes that the only modules installed here are the MOFED ones.
# If other DKMS packages are installed that will throw it off.
echo "INFO: Scanning for available MOFED kernel modules..."
ls /lib/modules/$(uname -r)/updates/dkms | sort > ${actual}

if diff -u ${expected} ${actual}; then
    echo "INFO: Success: Actual module list matches expected module list."
    exit 0
fi

echo "ERROR: Actual modules list does not match expected modules list" 1>&2
exit 1

This assumption does not hold when testing bionic:linux on nvidia-dgx-2,
where the NVIDIA driver DKMS modules are installed as well:

20:32:39 DEBUG| --- /home/ubuntu/autotest/client/tests/ubuntu_dgx_mofed_build/expected-mofed-modules/4.9-4.1.7.0-bionic.lst	2024-09-09 18:52:26.633244076 +0000
20:32:39 DEBUG| +++ /tmp/tmp.iXHAbsWK90	2025-03-19 20:32:39.725257853 +0000
20:32:39 DEBUG| @@ -19,6 +19,11 @@
20:32:39 DEBUG|  mlxfw.ko
20:32:39 DEBUG|  mst_pci.ko
20:32:39 DEBUG|  mst_pciconf.ko
20:32:39 DEBUG| +nvidia-drm.ko
20:32:39 DEBUG| +nvidia-modeset.ko
20:32:39 DEBUG| +nvidia-peermem.ko
20:32:39 DEBUG| +nvidia-uvm.ko
20:32:39 DEBUG| +nvidia.ko
20:32:39 DEBUG|  rdma_cm.ko
20:32:39 DEBUG|  rdma_ucm.ko
20:32:39 DEBUG|  rshim.ko
20:32:39 DEBUG| stderr:
20:32:39 DEBUG| ERROR: Actual modules list does not match expected modules list
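
One possible way to make the check tolerate unrelated DKMS packages is to
verify only that every expected module is present, and to ignore any extra
ones. The sketch below is not the fix adopted by ubuntu-kernel-tests; the
function name missing_modules is hypothetical, and it assumes the expected
list contains one module file name per line. The trade-off is that
unexpected extra MOFED modules would no longer be detected.

```shell
#!/bin/sh
# Hypothetical helper (not part of check-mofed-modules.sh).
# Prints every expected module that is missing from a module directory.
#   $1: file listing expected module names, one per line
#   $2: directory to scan, e.g. /lib/modules/$(uname -r)/updates/dkms
missing_modules() {
    expected_sorted=$(mktemp)
    actual_sorted=$(mktemp)
    # comm requires both inputs to be sorted in the same collation order.
    LC_ALL=C sort -u "$1" > "$expected_sorted"
    ls "$2" | LC_ALL=C sort -u > "$actual_sorted"
    # comm -23 prints only the lines unique to the first file, i.e. the
    # expected modules that were not found. Extra modules are ignored.
    LC_ALL=C comm -23 "$expected_sorted" "$actual_sorted"
    rm -f "$expected_sorted" "$actual_sorted"
}

# Possible use in the test script (assumes ${expected} is set as before):
#   missing=$(missing_modules "${expected}" "/lib/modules/$(uname -r)/updates/dkms")
#   [ -z "${missing}" ] && exit 0
```

With this approach the nvidia*.ko entries shown in the diff above would not
cause a failure, because only entries missing from the expected list are
reported.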

** Affects: ubuntu-kernel-tests
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Canonical
Platform QA Team, which is subscribed to ubuntu-kernel-tests.
https://bugs.launchpad.net/bugs/2103713
