← Back to team overview

group.of.nepali.translators team mailing list archive

[Bug 1652018] Re: PowerNV: PCI Slot is invalid after fencedPHB Error injection

 

This bug was fixed in the package linux - 4.4.0-62.83

---------------
linux (4.4.0-62.83) xenial; urgency=low

  [ Thadeu Lima de Souza Cascardo ]

  * Release Tracking Bug
    - LP: #1657430

  * Backport DP MST fixes to i915 (LP: #1657353)
    - SAUCE: i915_bpo: Fix DP link rate math
    - SAUCE: i915_bpo: Validate mode against max. link data rate for DP MST

  * Ubuntu xenial - 4.4.0-59-generic i3 I/O performance issue (LP: #1657281)
    - blk-mq: really fix plug list flushing for nomerge queues

linux (4.4.0-61.82) xenial; urgency=low

  [ Thadeu Lima de Souza Cascardo ]

  * Release Tracking Bug
    - LP: #1656810

  * Xen MSI setup code incorrectly re-uses cached pirq (LP: #1656381)
    - SAUCE: xen: do not re-use pirq number cached in pci device msi msg data

  * nvme drive probe failure (LP: #1626894)
    - nvme: revert NVMe: only setup MSIX once

linux (4.4.0-60.81) xenial; urgency=low

  [ John Donnelly ]

  * Release Tracking Bug
    - LP: #1656084

  * Couldn't emulate instruction 0x7813427c (LP: #1634129)
    - KVM: PPC: Book3S PR: Fix illegal opcode emulation

  * perf: 24x7: Eliminate domain name suffix in event names (LP: #1560482)
    - powerpc/perf/hv-24x7: Fix usage with chip events.
    - powerpc/perf/hv-24x7: Display change in counter values
    - powerpc/perf/hv-24x7: Display domain indices in sysfs
    - powerpc/perf/24x7: Eliminate domain suffix in event names

  * i386 ftrace tests hang on ADT testing (LP: #1655040)
    - ftrace/x86_32: Set ftrace_stub to weak to prevent gcc from using short jumps
      to it

  * VMX module autoloading if available (LP: #1651322)
    - powerpc: Add module autoloading based on CPU features
    - crypto: vmx - Convert to CPU feature based module autoloading

  * ACPI probe support for AD5592/3 configurable multi-channel converter
    (LP: #1654497)
    - SAUCE: iio: dac: ad5592r: Add ACPI support
    - SAUCE: iio: dac: ad5593r: Add ACPI support

  * Xenial update to v4.4.40 stable release (LP: #1654602)
    - btrfs: limit async_work allocation and worker func duration
    - Btrfs: fix tree search logic when replaying directory entry deletes
    - btrfs: store and load values of stripes_min/stripes_max in balance status
      item
    - Btrfs: fix qgroup rescan worker initialization
    - USB: serial: option: add support for Telit LE922A PIDs 0x1040, 0x1041
    - USB: serial: option: add dlink dwm-158
    - USB: serial: kl5kusb105: fix open error path
    - USB: cdc-acm: add device id for GW Instek AFG-125
    - usb: hub: Fix auto-remount of safely removed or ejected USB-3 devices
    - usb: gadget: f_uac2: fix error handling at afunc_bind
    - usb: gadget: composite: correctly initialize ep->maxpacket
    - USB: UHCI: report non-PME wakeup signalling for Intel hardware
    - ALSA: usb-audio: Add QuickCam Communicate Deluxe/S7500 to
      volume_control_quirks
    - ALSA: hiface: Fix M2Tech hiFace driver sampling rate change
    - ALSA: hda/ca0132 - Add quirk for Alienware 15 R2 2016
    - ALSA: hda - ignore the assoc and seq when comparing pin configurations
    - ALSA: hda - fix headset-mic problem on a Dell laptop
    - ALSA: hda - Gate the mic jack on HP Z1 Gen3 AiO
    - ALSA: hda: when comparing pin configurations, ignore assoc in addition to
      seq
    - clk: ti: omap36xx: Work around sprz319 advisory 2.1
    - Btrfs: fix memory leak in reading btree blocks
    - Btrfs: bail out if block group has different mixed flag
    - Btrfs: return gracefully from balance if fs tree is corrupted
    - Btrfs: don't leak reloc root nodes on error
    - Btrfs: fix memory leak in do_walk_down
    - Btrfs: don't BUG() during drop snapshot
    - btrfs: make file clone aware of fatal signals
    - block_dev: don't test bdev->bd_contains when it is not stable
    - ptrace: Capture the ptracer's creds not PT_PTRACE_CAP
    - crypto: caam - fix AEAD givenc descriptors
    - ext4: fix mballoc breakage with 64k block size
    - ext4: fix stack memory corruption with 64k block size
    - ext4: use more strict checks for inodes_per_block on mount
    - ext4: fix in-superblock mount options processing
    - ext4: add sanity checking to count_overhead()
    - ext4: reject inodes with negative size
    - ext4: return -ENOMEM instead of success
    - ext4: do not perform data journaling when data is encrypted
    - f2fs: set ->owner for debugfs status file's file_operations
    - loop: return proper error from loop_queue_rq()
    - mm/vmscan.c: set correct defer count for shrinker
    - fs: exec: apply CLOEXEC before changing dumpable task flags
    - exec: Ensure mm->user_ns contains the execed files
    - usb: gadget: composite: always set ep->mult to a sensible value
    - blk-mq: Do not invoke .queue_rq() for a stopped queue
    - dm flakey: return -EINVAL on interval bounds error in flakey_ctr()
    - dm crypt: mark key as invalid until properly loaded
    - dm space map metadata: fix 'struct sm_metadata' leak on failed create
    - ASoC: intel: Fix crash at suspend/resume without card registration
    - CIFS: Fix a possible memory corruption during reconnect
    - CIFS: Fix missing nls unload in smb2_reconnect()
    - CIFS: Fix a possible memory corruption in push locks
    - kernel/watchdog: use nmi registers snapshot in hardlockup handler
    - kernel/debug/debug_core.c: more properly delay for secondary CPUs
    - tpm xen: Remove bogus tpm_chip_unregister
    - xen/gntdev: Use VM_MIXEDMAP instead of VM_IO to avoid NUMA balancing
    - arm/xen: Use alloc_percpu rather than __alloc_percpu
    - xfs: set AGI buffer type in xlog_recover_clear_agi_bucket
    - driver core: fix race between creating/querying glue dir and its cleanup
    - ppp: defer netns reference release for ppp channel
    - Linux 4.4.40

  * igb i210 probe of pci device failed with error -2 (LP: #1639810)
    - SAUCE: igb: Workaround for igb i210 firmware issue.
    - SAUCE: igb: add i211 to i210 PHY workaround

  * PowerNV: PCI Slot is invalid after fencedPHB Error injection (LP: #1652018)
    - powerpc/powernv: Call opal_pci_poll() if needed

  * mfd: intel-lpss: Add default I2C device properties for Apollo Lake
    (LP: #1635177)
    - mfd: intel-lpss: Add default I2C device properties for Apollo Lake

  * Xenial update to v4.4.39 stable release (LP: #1650609)
    - powerpc/eeh: Fix deadlock when PE frozen state can't be cleared
    - parisc: Purge TLB before setting PTE
    - parisc: Remove unnecessary TLB purges from flush_dcache_page_asm and
      flush_icache_page_asm
    - parisc: Fix TLB related boot crash on SMP machines
    - zram: restrict add/remove attributes to root only
    - locking/rtmutex: Prevent dequeue vs. unlock race
    - locking/rtmutex: Use READ_ONCE() in rt_mutex_owner()
    - perf/x86: Fix full width counter, counter overflow
    - crypto: mcryptd - Check mcryptd algorithm compatibility
    - can: raw: raw_setsockopt: limit number of can_filter that can be set
    - can: peak: fix bad memory access and free sequence
    - arm64: futex.h: Add missing PAN toggling
    - m68k: Fix ndelay() macro
    - batman-adv: Check for alloc errors when preparing TT local data
    - hotplug: Make register and unregister notifier API symmetric
    - crypto: rsa - Add Makefile dependencies to fix parallel builds
    - Linux 4.4.39

  * Xenial update to v4.4.38 stable release (LP: #1650607)
    - virtio-net: add a missing synchronize_net()
    - net: check dead netns for peernet2id_alloc()
    - ip6_tunnel: disable caching when the traffic class is inherited
    - net: sky2: Fix shutdown crash
    - af_unix: conditionally use freezable blocking calls in read
    - rtnetlink: fix FDB size computation
    - l2tp: fix racy SOCK_ZAPPED flag check in l2tp_ip{,6}_bind()
    - net: dsa: bcm_sf2: Ensure we re-negotiate EEE during after link change
    - net, sched: respect rcu grace period on cls destruction
    - net/sched: pedit: make sure that offset is valid
    - netlink: Call cb->done from a worker thread
    - netlink: Do not schedule work from sk_destruct
    - net/dccp: fix use-after-free in dccp_invalid_packet
    - net: bcmgenet: Utilize correct struct device for all DMA operations
    - sh_eth: remove unchecked interrupts for RZ/A1
    - geneve: avoid use-after-free of skb->data
    - net: ping: check minimum size on ICMP header length
    - sparc32: Fix inverted invalid_frame_pointer checks on sigreturns
    - sparc64: Fix find_node warning if numa node cannot be found
    - sparc64: fix compile warning section mismatch in find_node()
    - constify iov_iter_count() and iter_is_iovec()
    - Don't feed anything but regular iovec's to blk_rq_map_user_iov
    - ipv6: Set skb->protocol properly for local output
    - ipv4: Set skb->protocol properly for local output
    - esp4: Fix integrity verification when ESN are used
    - esp6: Fix integrity verification when ESN are used
    - Linux 4.4.38

  * Xenial update to v4.4.37 stable release (LP: #1650604)
    - ARC: Don't use "+l" inline asm constraint
    - zram: fix unbalanced idr management at hot removal
    - kasan: update kasan_global for gcc 7
    - x86/traps: Ignore high word of regs->cs in early_fixup_exception()
    - rcu: Fix soft lockup for rcu_nocb_kthread
    - PCI: Export pcie_find_root_port
    - PCI: Set Read Completion Boundary to 128 iff Root Port supports it (_HPX)
    - mwifiex: printk() overflow with 32-byte SSIDs
    - pwm: Fix device reference leak
    - arm64: cpufeature: Schedule enable() calls instead of calling them via IPI
    - arm64: mm: Set PSTATE.PAN from the cpu_enable_pan() call
    - arm64: suspend: Reconfigure PSTATE after resume from idle
    - Linux 4.4.37

  * Xenial update to v4.4.36 stable release (LP: #1650601)
    - iommu/vt-d: Fix PASID table allocation
    - iommu/vt-d: Fix IOMMU lookup for SR-IOV Virtual Functions
    - KVM: x86: check for pic and ioapic presence before use
    - usb: chipidea: move the lock initialization to core file
    - USB: serial: cp210x: add ID for the Zone DPMX
    - USB: serial: ftdi_sio: add support for TI CC3200 LaunchPad
    - Fix USB CB/CBI storage devices with CONFIG_VMAP_STACK=y
    - scsi: mpt3sas: Fix secure erase premature termination
    - tile: avoid using clocksource_cyc2ns with absolute cycle count
    - cfg80211: limit scan results cache size
    - NFSv4.x: hide array-bounds warning
    - parisc: Fix races in parisc_setup_cache_timing()
    - parisc: Fix race in pci-dma.c
    - parisc: Also flush data TLB in flush_icache_page_asm
    - mpi: Fix NULL ptr dereference in mpi_powm() [ver #3]
    - drm/radeon: Ensure vblank interrupt is enabled on DPMS transition to on
    - mei: me: disable driver on SPT SPS firmware
    - mei: me: fix place for kaby point device ids.
    - mei: fix return value on disconnection
    - scsi: mpt3sas: Unblock device after controller reset
    - Linux 4.4.36

  * Miscellaneous Ubuntu changes
    - [Debian] consider renames in gen-auto-reconstruct

 -- Thadeu Lima de Souza Cascardo <cascardo@xxxxxxxxxxxxx>  Wed, 18 Jan
2017 09:34:07 -0200

** Changed in: linux (Ubuntu Xenial)
       Status: Fix Committed => Fix Released

** Changed in: linux (Ubuntu Yakkety)
       Status: Fix Committed => Fix Released

-- 
You received this bug notification because you are a member of नेपाली
भाषा समायोजकहरुको समूह, which is subscribed to Xenial.
Matching subscriptions: Ubuntu 16.04 Bugs
https://bugs.launchpad.net/bugs/1652018

Title:
  PowerNV: PCI Slot is invalid after fencedPHB Error injection

Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Xenial:
  Fix Released
Status in linux source package in Yakkety:
  Fix Released

Bug description:
  == Comment: #0 - Pridhiviraj Paidipeddi <ppaidipe@xxxxxxxxxx> - 2016-12-21 01:16:41 ==
  ---Problem Description---
  PCI Slot is in invalid state after fencedPHB Error injection Test.
   
  Contact Information = ppaidipe@xxxxxxxxxx 
   
  ---uname output---
  Linux brigstrat1p1 4.4.0-57-generic #78-Ubuntu SMP Fri Dec 9 23:46:13 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux
   
  Machine Type = PowerNV CSE-829U 
   
  ---Debugger---
  A debugger is not configured
   
  ---Steps to Reproduce---
   1. Boot the system to runtime.
  2. Inject fencedPHB Error.
  echo 0x8000000000000000 > /sys/kernel/debug/powerpc/PCI0002/err_injct_outbound

  
  dmesg:
  [42725.641368] EEH: PHB#2 failure detected, location: N/A
  [42725.641450] CPU: 8 PID: 898 Comm: kworker/u320:1 Not tainted 4.4.0-57-generic #78-Ubuntu
  [42725.641461] Workqueue: i40e i40e_service_task [i40e]
  [42725.641464] Call Trace:
  [42725.641469] [c00000000407f9e0] [c000000000b13b4c] dump_stack+0xb0/0xf0 (unreliable)
  [42725.641474] [c00000000407fa20] [c0000000000376e0] eeh_dev_check_failure+0x200/0x580
  [42725.641477] [c00000000407fac0] [c000000000037ae4] eeh_check_failure+0x84/0xd0
  [42725.641485] [c00000000407fb00] [d000000035845710] i40e_service_task+0x17b0/0x1a30 [i40e]
  [42725.641489] [c00000000407fc50] [c0000000000dde10] process_one_work+0x1e0/0x5a0
  [42725.641492] [c00000000407fce0] [c0000000000de364] worker_thread+0x194/0x680
  [42725.641496] [c00000000407fd80] [c0000000000e6e60] kthread+0x110/0x130
  [42725.641499] [c00000000407fe30] [c000000000009538] ret_from_kernel_thread+0x5c/0xa4
  [42725.641509] EEH: Detected error on PHB#2
  [42725.641514] EEH: This PCI device has failed 1 times in the last hour
  [42725.641516] EEH: Notify device drivers to shutdown
  [42725.641523] i40e 0002:01:00.0: i40e_pci_error_detected: error 2
  [42725.641907] i40e 0002:01:00.0: VSI seid 396 Tx ring 0 disable timeout
  [42725.642144] i40e 0002:01:00.0: VSI seid 396 Rx ring 0 disable timeout
  [42725.666205] i40e 0002:01:00.1: i40e_pci_error_detected: error 2
  [42725.666499] i40e 0002:01:00.2: i40e_pci_error_detected: error 2
  [42725.666533] i40e 0002:01:00.0: ARQ event error -32
  [42725.666601] i40e 0002:01:00.3: i40e_pci_error_detected: error 2
  [42725.666700] EEH: Collect temporary log
  [42725.666702] PHB3 PHB#2 Diag-data (Version: 1)
  [42725.666703] brdgCtl:     0000ffff
  [42725.666704] UtlSts:      00100000 00000000 00000000
  [42725.666706] RootSts:     ffffffff ffffffff ffffffff ffffffff 0000ffff
  [42725.666707] RootErrSts:  ffffffff ffffffff ffffffff
  [42725.666708] RootErrLog:  ffffffff ffffffff ffffffff ffffffff
  [42725.666709] RootErrLog1: ffffffff 0000000000000000 0000000000000000
  [42725.666711] nFir:        0000808000000000 0030006e00000000 0000800000000000
  [42725.666712] PhbSts:      0000001800000000 0000001800000000
  [42725.666713] Lem:         8000020000800000 42498e367f502eae 8000000000000000
  [42725.666715] OutErr:      8000002000000000 8000000000000000 120800600003fffe 402002a800000000
  [42725.666716] InBErr:      0000000040000000 0000000040000000 0000080000000000 000c10c010010000
  [42725.666718] EEH: Reset without hotplug activity
  [42730.052455] EEH: Notify device drivers the completion of reset
  [42730.053334] EEH: Notify device driver to resume
  [42730.184457] i40e 0002:01:00.0 enP2p1s0f0: NIC Link is Down
  [42731.568230] i40e 0002:01:00.0 enP2p1s0f0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None

  
  OPAL LOG:
  [42990.475630456,7] PHB#0002: CRESET: Starts
  [42990.482717333,7] PHB#0002: CRESET: No pending transactions
  [42991.023963215,7] PHB#0002: CRESET: Reinitialization
  [42991.023964143,7] PHB#0002: Initializing PHB...
  [42991.075167078,7] PHB#0002: Core revision 0xa30005
  [42991.075171529,7] PHB#0002: Default system config: 0x421100fc30000000
  [42991.075172655,7] PHB#0002: New system config    : 0x421000fc30000000
  [42991.075174000,7] PHB#0002: PHB_RESET is 0x2000000000000000
  [42991.075410938,7] PHB#0002: Waiting for DLP PG reset to complete...
  [42991.083713914,7] PHB#0002: Initialization complete
  [42991.136599535,7] PHB#0002: FRESET: Starts
  [42991.136600954,7] PHB#0002: FRESET: Prepare for link down
  [42991.136602933,7] PHB#0002: FRESET: Assert
  [42992.138625290,7] PHB#0002: FRESET: Deassert
  [42993.140657592,7] PHB#0002: LINK: Start polling
  [42993.193893558,7] PHB#0002: LINK: Electrical link detected
  [42993.247138072,7] PHB#0002: LINK: Link is up
  [42993.247174237,3] PCI-SLOT-0000000000000002 Invalid state 00000000

  == Comment: #2 - VIPIN K. PARASHAR <viparash@xxxxxxxxxx> - 2016-12-22
  04:57:28 ==

  $ git log fbce44d0ed42e465317 -1
  commit fbce44d0ed42e4653172376f4dfeaa5710f06a27
  Author: Gavin Shan <gwshan@xxxxxxxxxxxxxxxxxx>
  Date:   Fri Jun 24 16:44:19 2016 +1000

      powerpc/powernv: Call opal_pci_poll() if needed
      
      When issuing PHB reset, OPAL API opal_pci_poll() is called to drive
      the state machine in OPAL forward. However, we needn't always call
      the function under some circumstances like reset deassert.
      
      This avoids calling opal_pci_poll() when OPAL_SUCCESS is returned
      from opal_pci_reset(). Except the overhead introduced by additional
      one unnecessary OPAL call, I didn't run into real issue because of
      this.
      
      Reported-by: Pridhiviraj Paidipeddi <ppaiddipe@xxxxxxxxxx>
      Signed-off-by: Gavin Shan <gwshan@xxxxxxxxxxxxxxxxxx>
      Signed-off-by: Michael Ellerman <mpe@xxxxxxxxxxxxxx>

  $ git tag --contains  fbce44d0e
  v4.9
  v4.9-rc1
  v4.9-rc2
  v4.9-rc3
  v4.9-rc4
  v4.9-rc5
  v4.9-rc6
  v4.9-rc7
  v4.9-rc8
  $ 
    
  This issue is fixed by commit # fbce44d0ed4, available in kernel version 4.9.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1652018/+subscriptions