group.of.nepali.translators team mailing list archive

[Bug 1553185] Re: PCI RoCe IB perftest Aborted (core dumped)

 

** Also affects: perftest (Ubuntu Zesty)
   Importance: Undecided
       Status: New

** Changed in: perftest (Ubuntu Zesty)
       Status: New => In Progress

** Changed in: perftest (Ubuntu Zesty)
   Importance: Undecided => High

** No longer affects: perftest (Ubuntu Yakkety)

** No longer affects: perftest (Ubuntu Xenial)

** Changed in: perftest (Ubuntu Zesty)
     Assignee: (unassigned) => Dimitri John Ledkov (xnox)

** Changed in: perftest (Ubuntu Zesty)
    Milestone: None => zesty-updates

** Changed in: perftest (Ubuntu Zesty)
       Status: In Progress => Triaged

-- 
You received this bug notification because you are a member of नेपाली
भाषा समायोजकहरुको समूह (Group of Nepali Translators), which is subscribed to Xenial.
Matching subscriptions: Ubuntu 16.04 Bugs
https://bugs.launchpad.net/bugs/1553185

Title:
  PCI RoCe IB perftest Aborted (core dumped)

Status in Ubuntu on IBM z Systems:
  Confirmed
Status in perftest package in Ubuntu:
  Fix Released
Status in perftest source package in Zesty:
  Triaged

Bug description:
  SRU:
  ====

  [Impact]

   * the perftest tools (ib_*) included in the perftest package cannot be used at all; they always core dump, on all platforms
   * a backport is required to get a working perftest package / tool set again
   * the fix was officially provided by Mellanox and corrects the previously broken version comparison (which partly used integer and partly string compares)
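
  To illustrate why mixing the two kinds of compare is fragile, here is a minimal sketch in POSIX shell. It is not perftest code: the helper names str_less, int_less and compare_both are hypothetical, and the sketch only shows that a string compare and an integer compare can give opposite answers for the same version components.

```shell
#!/bin/sh
# Hypothetical helpers, NOT taken from perftest. They only illustrate how a
# lexicographic (string) compare and a numeric (integer) compare can disagree.

# True (exit 0) if $1 sorts strictly before $2 as a byte string.
str_less() {
    first=$(printf '%s\n%s\n' "$1" "$2" | LC_ALL=C sort | head -n 1)
    [ "$1" != "$2" ] && [ "$first" = "$1" ]
}

# True (exit 0) if $1 is strictly smaller than $2 as an integer.
int_less() {
    [ "$1" -lt "$2" ]
}

# Print the verdict of both compares for one pair of version components.
# The body runs in a subshell so the helper variables stay local.
compare_both() (
    s=no
    i=no
    if str_less "$1" "$2"; then s=yes; fi
    if int_less "$1" "$2"; then i=yes; fi
    echo "string:$s int:$i"
)

compare_both 10 9    # prints: string:yes int:no
compare_both 2 10    # prints: string:no int:yes
```

  For the pair 10 and 9 the string compare reports "less" while the integer compare does not, so code that uses both styles in different places can disagree with itself about which version is newer.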

  [Test Case]

   * the bug can easily be reproduced on two systems, both with RoCE cards installed,
     by starting a perftest run like this:
     on one machine as 'server': sudo ./ib_read_bw -d mlx4_0 -a
     and on a second machine as 'client', pointing to the server's IP address: sudo ./ib_read_bw <server IP> -d mlx4_0
     
  Detailed instructions on how to reproduce the bug:

   * install the perftest package including all dependencies
   * configure the RoCE devices as network devices using a private network range in /etc/network/interfaces, like this:
  	# The 1st RoCE interface configuration
  	auto enP1p0s0
  	iface enP1p0s0 inet static
  	        address 192.168.1.141
  	        netmask 255.255.255.0
  	        network 192.168.1.0
  	        broadcast 192.168.1.255
   * test that the network is okay with ping (or rping, udaddy, rdma_client/rdma_server)
   * then run the Test Case above
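
  As a sanity check for the interfaces stanza above, the network and broadcast values can be derived from the address and netmask. The following is a minimal sketch in POSIX shell; the function names are hypothetical and it assumes well-formed dotted-quad IPv4 input.

```shell
#!/bin/sh
# Hypothetical helpers: derive the "network" and "broadcast" lines of an
# /etc/network/interfaces stanza from its "address" and "netmask" lines.
# Assumes valid dotted-quad input; no error handling.

# Dotted quad -> 32-bit integer. Runs in a subshell so the IFS change
# and the positional parameters do not leak into the caller.
ip_to_int() (
    IFS=.
    set -- $1
    echo $(( ($1 << 24) | ($2 << 16) | ($3 << 8) | $4 ))
)

# 32-bit integer -> dotted quad.
int_to_ip() {
    echo "$(( ($1 >> 24) & 255 )).$(( ($1 >> 16) & 255 )).$(( ($1 >> 8) & 255 )).$(( $1 & 255 ))"
}

# Network address: address AND netmask.
ipv4_network() {
    int_to_ip $(( $(ip_to_int "$1") & $(ip_to_int "$2") ))
}

# Broadcast address: address OR inverted netmask, truncated to 32 bits.
ipv4_broadcast() {
    int_to_ip $(( ($(ip_to_int "$1") | ~$(ip_to_int "$2")) & 4294967295 ))
}

ipv4_network   192.168.1.141 255.255.255.0   # prints 192.168.1.0
ipv4_broadcast 192.168.1.141 255.255.255.0   # prints 192.168.1.255
```

  Both results match the network and broadcast lines in the example stanza, so a mismatch on a real system would point to a typo in the configuration rather than to perftest.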

  [Regression Potential]

   * the regression potential is low because the current tools in the perftest package just segfault and are unusable
   * the target for this patched perftest package is Zesty (and higher), because only Zesty has sufficiently up-to-date versions of the packages it depends on

   * people may ask to SRU this to Xenial as well, but that would
  require updating several other packages ...

  --------%<----------------%<----------------%<----------------%<--------

  == Comment: #0 - Helmut Grauer - 2016-03-04 06:46:50 ==
  Hi
  Configure IB for perftest
  Ethernet Interface
  enp0s0   Link encap:Ethernet  HWaddr 82:01:14:32:f0:90
            inet addr:10.100.80.2  Bcast:10.100.255.255  Mask:255.255.0.0
            inet6 addr: fe80::8001:14ff:fe32:f090/64 Scope:Link
            inet6 addr: fd00:10:100::ff:80:2/80 Scope:Global
            inet6 addr: fd00:10:100:0:8001:14ff:fe32:f090/64 Scope:Global
            UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
            RX packets:25938 errors:0 dropped:0 overruns:0 frame:0
            TX packets:253 errors:0 dropped:0 overruns:0 carrier:0
            collisions:0 txqueuelen:1000
            RX bytes:8228251 (8.2 MB)  TX bytes:21494 (21.4 KB)

  Installing related packages for dapltest

  librdmacm-dev                                   install
  librdmacm1                                      install
  librdmacm1-dbg                                  install
  dapl2-utils                                     install
  libibumad3                                      install
  libibverbs-dev                                  install
  libibverbs1                                     install
  libmlx4-1                                       install
  libmlx4-1-dbg                                   install
  libmlx4-dev                                     install
  libmlx5-1                                       install
  libmlx5-1-dbg                                   install
  libmlx5-dev                                     install
  perftest                                        install

  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  +++         PCI-Overview:          +++
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++

  ======================================
  DEVICE_List:
  -------------
  0000:00:00.0
  0001:00:00.0

  ======================================
  FunctionID_List:
  -----------------
  0x0000003e
  0x0000003f

  ======================================
  PCHID_List:
  ------------
  0x0108
  0x013c

  ======================================
  Interface_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/net/:
  enp0s0
  enp0s0d1

  /sys/bus/pci/devices/0001:00:00.0/net/:
  enP1p0s0
  enP1p0s0d1

  ======================================
  Infiniband_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/infiniband/:
  mlx4_0

  /sys/bus/pci/devices/0001:00:00.0/infiniband/:
  mlx4_1

  --------------------------------------------------------------------------

  Server

  root@s83lp02:~# dpkg -S /etc/dat.conf
  libdapl2: /etc/dat.conf
  root@s83lp02:~# ib_read_bw -d mlx4_0 -a

  ************************************
  * Waiting for client to connect... *
  ************************************
  *** stack smashing detected ***: ib_read_bw terminated
  Aborted (core dumped)

  -----------------------------------------------------------------------------
  root@s83lp18:~# ./xpci.sh

  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  +++         PCI-Overview:          +++
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++

  ======================================
  DEVICE_List:
  -------------
  0000:00:00.0
  0001:00:00.0

  ======================================
  FunctionID_List:
  -----------------
  0x00000040
  0x00000041

  ======================================
  PCHID_List:
  ------------
  0x0108
  0x013c

  ======================================
  Interface_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/net/:
  ens40
  ens40d1

  /sys/bus/pci/devices/0001:00:00.0/net/:
  enP1s41
  enP1s41d1

  ======================================
  Infiniband_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/infiniband/:
  mlx4_0

  /sys/bus/pci/devices/0001:00:00.0/infiniband/:
  mlx4_1

  Client

  root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
  Couldn't connect to 10.100.80.2:18515
  Unable to open file descriptor for socket connection Unable to init the socket connection
  root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
  *** stack smashing detected ***: ib_read_bw terminated
  Aborted (core dumped)

  I will add the sosreport and dbginfo.sh output.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-z-systems/+bug/1553185/+subscriptions