← Back to team overview

kernel-packages team mailing list archive

[Bug 1491797] Re: Shuts down when supposed to suspend as a reaction to self-caused overheat, session lost

 

>From my understanding, I can see that thermald is detecting the passive
trip point temperature being reached at 75 degrees C and then it
attempts to move to cpufreq index position 3; so it seems to be trying
to do some passive CPU frequency switching.

Sensor hwmon :temp 76000 
update_set_point 76000,0,98000
pref 0 type 1 temp 76000 trip 99000 
Passive Trip point applicable 
Trip point applicable <  0:99000 
cdev size for this trippoint 2
cdev at index 0:Processor
Need to switch to next cdev 
cdev at index 3:cpufreq
Need to switch to next cdev 
pref 0 type 2 temp 76000 trip 95000 
Passive Trip point applicable 
Trip point applicable <  1:95000 
cdev size for this trippoint 2
cdev at index 0:Processor
Need to switch to next cdev 
cdev at index 3:cpufreq
Need to switch to next cdev 

What is really interesting is that your firmware is configured to do
passive (e.g. non-fan) cooling strategies at 75 degrees C.  In the
earlier analysis we saw that even at 20% CPU load your machine is close
to this temperature.   So, we can conclude that slightly busy CPU
already into the thermal danger zone and informing the OS to start
attempting passive cooling strategies.

Key facts we can draw from this:

At 20% utilization your machine is already moving into the zone where
the designers of the machine believe that fan control is not sufficient
and hence is triggering passive cooling strategies.

That really tells me that your machine is having some cooling issues
between the CPU and the fan.  This implies that one should check that
there is sufficient thermal paste between the CPU and the heat pipe.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1491797

Title:
  Shuts down when supposed to suspend as a reaction to self-caused
  overheat, session lost

Status in linux package in Ubuntu:
  In Progress

Bug description:
  Error:
  Kernel foolishly shuts down the computer when it overheats.
  /var/log/kern.log
  W500 kernel: [1448.648529] thermal thermal_zone1: critical temperature reached (100 C), shutting down

  Consequence:
  Shutting down destroys session in Ubuntu, Gnome, and all applications that can't remember their latest conscious state (most applications).

  Attempted repair, failed:
  Laptop has suspending ability, but I can't find the setting for the kernel to make the computer suspend instead of shutting down.

  Repair suggestions:
  1. Persistence of session, so that everything would reappear after the restart. (this would also make updating less disruptive)
  2. Do not heat the machine like crazy; speed up fans or slow down processes. (problematic Lenovo Thinkpad W500 fan on low speed right up to the fiery end)
  3. Put the computer to suspend when it's too hot.

  (The problem has remained the same from at least Ubuntu 11.10 through
  14.04)

  ProblemType: Bug
  DistroRelease: Ubuntu 14.04
  Package: linux-image-3.13.0-62-generic 3.13.0-62.102
  ProcVersionSignature: Ubuntu 3.13.0-62.102-generic 3.13.11-ckt24
  Uname: Linux 3.13.0-62-generic x86_64
  ApportVersion: 2.14.1-0ubuntu3.12
  Architecture: amd64
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC0:  user       2171 F.... pulseaudio
  CurrentDesktop: Unity
  Date: Thu Sep  3 13:42:28 2015
  HibernationDevice: RESUME=UUID=991e1383-ff5b-46c1-84c4-c904e1d81256
  InstallationDate: Installed on 2013-12-29 (612 days ago)
  InstallationMedia: Ubuntu 13.10 "Saucy Salamander" - Release amd64 (20131016.1)
  MachineType: LENOVO 4063B22
  PccardctlIdent:
   Socket 0:
     no product info available
  PccardctlStatus:
   Socket 0:
     no card
  ProcFB: 0 radeondrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.13.0-62-generic root=UUID=bd426989-b545-41b3-97b8-de9410f27aa6 ro persistent quiet splash vt.handoff=7
  RelatedPackageVersions:
   linux-restricted-modules-3.13.0-62-generic N/A
   linux-backports-modules-3.13.0-62-generic  N/A
   linux-firmware                             1.127.15
  SourcePackage: linux
  UpgradeStatus: Upgraded to trusty on 2014-04-27 (494 days ago)
  dmi.bios.date: 12/14/2011
  dmi.bios.vendor: LENOVO
  dmi.bios.version: 6FET92WW (3.22 )
  dmi.board.name: 4063B22
  dmi.board.vendor: LENOVO
  dmi.board.version: Not Available
  dmi.chassis.asset.tag: No Asset Information
  dmi.chassis.type: 10
  dmi.chassis.vendor: LENOVO
  dmi.chassis.version: Not Available
  dmi.modalias: dmi:bvnLENOVO:bvr6FET92WW(3.22):bd12/14/2011:svnLENOVO:pn4063B22:pvrThinkPadW500:rvnLENOVO:rn4063B22:rvrNotAvailable:cvnLENOVO:ct10:cvrNotAvailable:
  dmi.product.name: 4063B22
  dmi.product.version: ThinkPad W500
  dmi.sys.vendor: LENOVO

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1491797/+subscriptions


References