group.of.nepali.translators team mailing list archive
Message #13969
[Bug 1553185] Re: PCI RoCe IB perftest Aborted (core dumped)
** Also affects: perftest (Ubuntu Zesty)
Importance: Undecided
Status: New
** Changed in: perftest (Ubuntu Zesty)
Status: New => In Progress
** Changed in: perftest (Ubuntu Zesty)
Importance: Undecided => High
** No longer affects: perftest (Ubuntu Yakkety)
** No longer affects: perftest (Ubuntu Xenial)
** Changed in: perftest (Ubuntu Zesty)
Assignee: (unassigned) => Dimitri John Ledkov (xnox)
** Changed in: perftest (Ubuntu Zesty)
Milestone: None => zesty-updates
** Changed in: perftest (Ubuntu Zesty)
Status: In Progress => Triaged
--
You received this bug notification because you are a member of नेपाली
भाषा समायोजकहरुको समूह, which is subscribed to Xenial.
Matching subscriptions: Ubuntu 16.04 Bugs
https://bugs.launchpad.net/bugs/1553185
Title:
PCI RoCe IB perftest Aborted (core dumped)
Status in Ubuntu on IBM z Systems:
Confirmed
Status in perftest package in Ubuntu:
Fix Released
Status in perftest source package in Zesty:
Triaged
Bug description:
SRU:
====
[Impact]
* the perftest tools (ib_*) included in the perftest package cannot be used at all; they always core dump on all platforms
* a backport is required to get a working perftest package / tool set again
* the fix was officially provided by Mellanox and corrects the version comparison, which was broken before (it partly used int and partly string compares)
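To illustrate the class of bug named in the last bullet, here is a minimal sketch of why mixing string and integer comparisons of version numbers gives inconsistent answers. It is a generic illustration, not the Mellanox patch: the version strings and the `parse_version` helper are made up for this example.

```python
def parse_version(text):
    """Split a dotted version string into a tuple of ints (hypothetical helper)."""
    return tuple(int(part) for part in text.split("."))


# Illustrative version strings, not taken from perftest.
older, newer = "5.9", "5.10"

# String comparison is lexicographic: "1" sorts before "9",
# so "5.10" is considered smaller than "5.9".
string_says_newer = newer > older

# Integer-tuple comparison treats each component as a number: (5, 10) > (5, 9).
int_says_newer = parse_version(newer) > parse_version(older)

print("string compare says newer:", string_says_newer)
print("int compare says newer:   ", int_says_newer)
```

The two comparisons disagree for the same pair of versions, which is why code that uses both styles in different places can take contradictory branches.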
[Test Case]
* the bug can easily be reproduced on two systems, both with RoCE cards installed,
by starting a perftest run like this:
on one machine as 'server': sudo ./ib_read_bw -d mlx4_0 -a
and on a second machine as 'client', pointing to the server's IP address: sudo ./ib_read_bw <server IP> -d mlx4_0
Detailed instructions to reproduce the bug:
* install the perftest package including all dependencies
* configure the RoCE devices as network devices, using a private network range, in /etc/network/interfaces like this:
# The 1st RoCE interface configuration
auto enP1p0s0
iface enP1p0s0 inet static
address 192.168.1.141
netmask 255.255.255.0
network 192.168.1.0
broadcast 192.168.1.255
* test that the network is okay with ping (or rping, udaddy, rdma_client/rdma_server)
* and run the Test Case above
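As a sanity check on the static configuration in the steps above, the network and broadcast values can be derived from the address and netmask. This is a minimal sketch using Python's standard `ipaddress` module; using Python here is an assumption of this example, since the bug report itself only uses shell commands.

```python
import ipaddress

# Address and netmask from the example stanza for enP1p0s0.
iface = ipaddress.ip_interface("192.168.1.141/255.255.255.0")

# Derive the values that the stanza lists as "network" and "broadcast".
network = str(iface.network.network_address)
broadcast = str(iface.network.broadcast_address)

print("network  ", network)
print("broadcast", broadcast)
# The instructions ask for a private range; this flag confirms it.
print("private  ", iface.ip.is_private)
```

If the derived values differ from the ones written in /etc/network/interfaces, the stanza is inconsistent and worth fixing before running the test case.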
[Regression Potential]
* the regression potential is low because the tools currently shipped in the perftest package just segfault and are unusable
* the target for this patched perftest package is Zesty (and later), because only Zesty has the packages it depends on in an up-to-date version
* people may ask for an SRU to Xenial as well, but that would require updating several other packages ...
--------%<----------------%<----------------%<----------------%<--------
== Comment: #0 - Helmut Grauer - 2016-03-04 06:46:50 ==
Hi
Configure IB for perftest
Ethernet Interface
np0s0 Link encap:Ethernet HWaddr 82:01:14:32:f0:90
inet addr:10.100.80.2 Bcast:10.100.255.255 Mask:255.255.0.0
inet6 addr: fe80::8001:14ff:fe32:f090/64 Scope:Link
inet6 addr: fd00:10:100::ff:80:2/80 Scope:Global
inet6 addr: fd00:10:100:0:8001:14ff:fe32:f090/64 Scope:Global
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:25938 errors:0 dropped:0 overruns:0 frame:0
TX packets:253 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:8228251 (8.2 MB) TX bytes:21494 (21.4 KB)
Installing related packages for dapltest
librdmacm-dev install
librdmacm1 install
librdmacm1-dbg install
dapl2-utils install
libibumad3 install
libibverbs-dev install
libibverbs1 install
libmlx4-1 install
libmlx4-1-dbg install
libmlx4-dev install
libmlx5-1 install
libmlx5-1-dbg install
libmlx5-dev install
perftest install
++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++++++++++++++++++++
+++ PCI-Overview: +++
++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++++++++++++++++++++
======================================
DEVICE_List:
-------------
0000:00:00.0
0001:00:00.0
======================================
FunctionID_List:
-----------------
0x0000003e
0x0000003f
======================================
PCHID_List:
------------
0x0108
0x013c
======================================
Interface_List:
----------------
/sys/bus/pci/devices/0000:00:00.0/net/:
enp0s0
enp0s0d1
/sys/bus/pci/devices/0001:00:00.0/net/:
enP1p0s0
enP1p0s0d1
======================================
Infiniband_List:
----------------
/sys/bus/pci/devices/0000:00:00.0/infiniband/:
mlx4_0
/sys/bus/pci/devices/0001:00:00.0/infiniband/:
mlx4_1
--------------------------------------------------------------------------
server
root@s83lp02:~# dpkg -S /etc/dat.conf
libdapl2: /etc/dat.conf
root@s83lp02:~# ib_read_bw -d mlx4_0 -a
************************************
* Waiting for client to connect... *
************************************
*** stack smashing detected ***: ib_read_bw terminated
Aborted (core dumped)
-----------------------------------------------------------------------------
root@s83lp18:~# ./xpci.sh
++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++++++++++++++++++++
+++ PCI-Overview: +++
++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++++++++++++++++++++
======================================
DEVICE_List:
-------------
0000:00:00.0
0001:00:00.0
======================================
FunctionID_List:
-----------------
0x00000040
0x00000041
======================================
PCHID_List:
------------
0x0108
0x013c
======================================
Interface_List:
----------------
/sys/bus/pci/devices/0000:00:00.0/net/:
ens40
ens40d1
/sys/bus/pci/devices/0001:00:00.0/net/:
enP1s41
enP1s41d1
======================================
Infiniband_List:
----------------
/sys/bus/pci/devices/0000:00:00.0/infiniband/:
mlx4_0
/sys/bus/pci/devices/0001:00:00.0/infiniband/:
mlx4_1
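The Interface_List and Infiniband_List sections above come from sysfs directories under /sys/bus/pci/devices. The xpci.sh script itself is not included in this report, so the following is only a sketch of how the same mapping could be collected; the function name `list_pci_children` is hypothetical.

```python
import os


def list_pci_children(sysfs_root="/sys/bus/pci/devices"):
    """Map each PCI address to the net and infiniband names found below it."""
    result = {}
    if not os.path.isdir(sysfs_root):
        return result
    for address in sorted(os.listdir(sysfs_root)):
        entry = {}
        for kind in ("net", "infiniband"):
            path = os.path.join(sysfs_root, address, kind)
            # Not every PCI function exposes both subdirectories.
            entry[kind] = sorted(os.listdir(path)) if os.path.isdir(path) else []
        result[address] = entry
    return result


if __name__ == "__main__":
    for address, entry in list_pci_children().items():
        print(address, "net:", entry["net"], "infiniband:", entry["infiniband"])
```

This makes it easy to confirm which mlx4_N device belongs to which network interface before choosing the value passed to -d.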
Client
root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
Couldn't connect to 10.100.80.2:18515
Unable to open file descriptor for socket connection Unable to init the socket connection
root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
*** stack smashing detected ***: ib_read_bw terminated
Aborted (core dumped)
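The first client attempt above fails with "Couldn't connect to 10.100.80.2:18515", which is a plain TCP connection failure and separate from the later abort. A minimal sketch for checking whether the server side is listening on that port before starting the client; the helper name `tcp_port_open` is hypothetical, and the host and port are simply the ones shown in the log.

```python
import socket


def tcp_port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


# Example call with the values from the log above:
#   tcp_port_open("10.100.80.2", 18515)
```

A False result here points at the server process not running (for example because it already aborted) or at a network problem, rather than at the client.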
I will add the sosreport and dbginfo.sh output.
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-z-systems/+bug/1553185/+subscriptions