Hi all, I'm looking at what seems to be a nice puzzle 
A ThinkPad T14s Gen 1 that stopped working more or less a month after the warranty expired. I tried to boot both Windows 10 and Debian 12 from a USB stick. In the first case, the system just reboots after showing the Windows logo for a few seconds, while I'm able to see the Linux kernel crashing if I boot the second in failsafe mode.
Interestingly, some times (unfortunately it's difficult to replicate) I manage to boot into Debian 12 live. Moreover, the kernel doesn't seem to crash always at the same point during the boot process. But more or less between 52 and 56 seconds after the kernel initializes.
The system always reliably POSTs and starts the bootloader (e.g. GRUB). I even managed to successfully upgrade the BIOS to the latest available version using the Lenovo USB bootable image. Unfortunately, it didn't make any difference.
I ran a couple of passes of memtest86 and the system is rock solid, no errors whatsoever.
I also tried to remove both SSD and Wi-Fi card without any notable differences, and disconnected from the board fingerprint and smartcard readers, trackpad, and one of the USB A ports that lives on a separate board. Still no changes.
Taking some measurements on the board, I noticed that VCOREs range from about 1v to 2.2v, while the VDDCR_SOC (here on the schematics excerpt) it's more within 0.76v and 0.9v. VCORE-SOC is at 10.9v
Could such low voltage cause the instability I'm experiencing? The APU is a Ryzen Pro 5.
In general, I have the feeling it could be something linked to the SOC part of the APU (e.g. I/O) that causes the system to crash as soon as it gets activated during a regular boot process.
Considering the amount of dust I found in the heat sink, I suspect the APU overheated over the years. I was also considering to attempt a reflow as last try.
Any ideas?

A ThinkPad T14s Gen 1 that stopped working more or less a month after the warranty expired. I tried to boot both Windows 10 and Debian 12 from a USB stick. In the first case, the system just reboots after showing the Windows logo for a few seconds, while I'm able to see the Linux kernel crashing if I boot the second in failsafe mode.
Interestingly, some times (unfortunately it's difficult to replicate) I manage to boot into Debian 12 live. Moreover, the kernel doesn't seem to crash always at the same point during the boot process. But more or less between 52 and 56 seconds after the kernel initializes.
The system always reliably POSTs and starts the bootloader (e.g. GRUB). I even managed to successfully upgrade the BIOS to the latest available version using the Lenovo USB bootable image. Unfortunately, it didn't make any difference.
I ran a couple of passes of memtest86 and the system is rock solid, no errors whatsoever.
I also tried to remove both SSD and Wi-Fi card without any notable differences, and disconnected from the board fingerprint and smartcard readers, trackpad, and one of the USB A ports that lives on a separate board. Still no changes.
Taking some measurements on the board, I noticed that VCOREs range from about 1v to 2.2v, while the VDDCR_SOC (here on the schematics excerpt) it's more within 0.76v and 0.9v. VCORE-SOC is at 10.9v
Could such low voltage cause the instability I'm experiencing? The APU is a Ryzen Pro 5.
In general, I have the feeling it could be something linked to the SOC part of the APU (e.g. I/O) that causes the system to crash as soon as it gets activated during a regular boot process.
Considering the amount of dust I found in the heat sink, I suspect the APU overheated over the years. I was also considering to attempt a reflow as last try.
Any ideas?
Comment