← Back to team overview

kernel-packages team mailing list archive

[Bug 1491797] Re: Shuts down when supposed to suspend as a reaction to self-caused overheat, session lost

 

Attached is a spreadsheet with the above data and some graphs.

The first graph compares normalized data from the ACPI thermal zones
(acpiz),  CPU freq and CPU utilization.   This shows a good correlation
on CPU utilization and temperature (as we would expect). CPU frequencies
are scaling according to the load and looks OK to me.

The second graph is a comparison of ACPI thermal zones and CPU
utilization in absolute terms (e.g. degrees C and CPU %).  Note that
the CPU is loaded never less than 20% and that at this level the
temperature seems to be levelling out to around 70 degrees C which seems
rather high.  The CPU is bouncing around 800Mhz - 1.8GHz at that point,
which I guess is to be expected if one has so many context switches and
IRQs occurring.

So. Some interesting data:

1. The IRQ rate does seem high.
2.  The Context switch rate seems high too.
3. The machine is not that idle.
4.  Even at a low-ish load, the system is rather hot.  (This makes me wonder if there is something physically wrong between the CPU and the heatpipe/fan unit.)
5.  Dropping from fully loaded to partially loaded we see heat being dissipated but not that well, e.g. it drops from nearly 100 C down to 70-65 C, which is surprising and I expected this to go lower. 

Hence I think it may be a physical hardware issue with the cooling.


** Attachment added: "LibreOffice spreadsheet"
   https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1491797/+attachment/4456965/+files/powerstat-lenovo-W500-2015.09.03.ods

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1491797

Title:
  Shuts down when supposed to suspend as a reaction to self-caused
  overheat, session lost

Status in linux package in Ubuntu:
  In Progress

Bug description:
  Error:
  Kernel foolishly shuts down the computer when it overheats.
  /var/log/kern.log
  W500 kernel: [1448.648529] thermal thermal_zone1: critical temperature reached (100 C), shutting down

  Consequence:
  Shutting down destroys session in Ubuntu, Gnome, and all applications that can't remember their latest conscious state (most applications).

  Attempted repair, failed:
  Laptop has suspending ability, but I can't find the setting for the kernel to make the computer suspend instead of shutting down.

  Repair suggestions:
  1. Persistence of session, so that everything would reappear after the restart. (this would also make updating less disruptive)
  2. Do not heat the machine like crazy; speed up fans or slow down processes. (problematic Lenovo Thinkpad W500 fan on low speed right up to the fiery end)
  3. Put the computer to suspend when it's too hot.

  (The problem has remained the same from at least Ubuntu 11.10 through
  14.04)

  ProblemType: Bug
  DistroRelease: Ubuntu 14.04
  Package: linux-image-3.13.0-62-generic 3.13.0-62.102
  ProcVersionSignature: Ubuntu 3.13.0-62.102-generic 3.13.11-ckt24
  Uname: Linux 3.13.0-62-generic x86_64
  ApportVersion: 2.14.1-0ubuntu3.12
  Architecture: amd64
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC0:  user       2171 F.... pulseaudio
  CurrentDesktop: Unity
  Date: Thu Sep  3 13:42:28 2015
  HibernationDevice: RESUME=UUID=991e1383-ff5b-46c1-84c4-c904e1d81256
  InstallationDate: Installed on 2013-12-29 (612 days ago)
  InstallationMedia: Ubuntu 13.10 "Saucy Salamander" - Release amd64 (20131016.1)
  MachineType: LENOVO 4063B22
  PccardctlIdent:
   Socket 0:
     no product info available
  PccardctlStatus:
   Socket 0:
     no card
  ProcFB: 0 radeondrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.13.0-62-generic root=UUID=bd426989-b545-41b3-97b8-de9410f27aa6 ro persistent quiet splash vt.handoff=7
  RelatedPackageVersions:
   linux-restricted-modules-3.13.0-62-generic N/A
   linux-backports-modules-3.13.0-62-generic  N/A
   linux-firmware                             1.127.15
  SourcePackage: linux
  UpgradeStatus: Upgraded to trusty on 2014-04-27 (494 days ago)
  dmi.bios.date: 12/14/2011
  dmi.bios.vendor: LENOVO
  dmi.bios.version: 6FET92WW (3.22 )
  dmi.board.name: 4063B22
  dmi.board.vendor: LENOVO
  dmi.board.version: Not Available
  dmi.chassis.asset.tag: No Asset Information
  dmi.chassis.type: 10
  dmi.chassis.vendor: LENOVO
  dmi.chassis.version: Not Available
  dmi.modalias: dmi:bvnLENOVO:bvr6FET92WW(3.22):bd12/14/2011:svnLENOVO:pn4063B22:pvrThinkPadW500:rvnLENOVO:rn4063B22:rvrNotAvailable:cvnLENOVO:ct10:cvrNotAvailable:
  dmi.product.name: 4063B22
  dmi.product.version: ThinkPad W500
  dmi.sys.vendor: LENOVO

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1491797/+subscriptions


References