![]() |
Since when did ASRock/AMD/AMI start to suck£ |
Post Reply
|
Page 12> |
| Author | ||
Xaltar
Moderator Group
Joined: 16 May 2015 Location: Europe Status: Offline Points: 33668 |
Post Options
Thanks(0)
Quote Reply
Topic: Since when did ASRock/AMD/AMI start to suck£Posted: 28 Jan 2026 at 1:59am |
|
|
Thanks for the info RokeJulianLockhart.
|
||
|
||
![]() |
||
RokeJulianLockhart
Newbie
Joined: 19 Dec 2024 Status: Offline Points: 135 |
Post Options
Thanks(2)
Quote Reply
Posted: 27 Jan 2026 at 1:42am |
|
|
Another user resolved this by replacing their GPU [^1] although other hardware can cause similar effects. [^2] If inapplicable, Framework has been dealing with this with some stock hardware, too. [^3] The conducted diagnosis may assist you.
[^1]: https://discussion.fedoraproject.org/t/hardware-error-possily-cpu-issue/162595/13 [^2]: https://community.frame.work/t/intel-wi-fi-7-be200-and-be202-compatible-with-framework-16/45786/65?u=rokejulianlockhart [^3]: https://github.com/FrameworkComputer/SoftwareFirmwareIssueTracker/issues/41#issuecomment-3795165884 |
||
|
When using a forum, remember to tag the person you are responding to, in case they are not subscribed to the thread.
|
||
![]() |
||
zoltan
Newbie
Joined: 27 Mar 2025 Status: Offline Points: 110 |
Post Options
Thanks(1)
Quote Reply
Posted: 16 May 2025 at 1:26am |
|
|
So some more progress, at least in identifying the cause of the resets:
After compiling a custom kernel with a recent upstream patch, I am now able to identify the proximate cause of the reset: [ 2.068426] x86/amd: Previous system reset reason [0x08000800]: an uncorrected error caused a data fabric sync flood event So it might still be a bad PSU, or the BIOS didn't quite set up the h/w properly. Keep in mind the resets are random, under almost no load, cool system temperature, and very little user activity. At least the Linux community cares about AMD enough to dump the last-reset-reason MSR into dmesg. Patch isn't released into mainline yet, but it probably will be next release (6.16...). It is an easy backport, and I couldn't wait. I'll have to do some more research, but it doesn't seem like this error is referring to memory...I believe the uncorrected error might be on the HT bus or something internal in the CPU...maybe a uCode patch is in order or the PSU really is flaky. And this happened with two different Ryzen 9 7900X's on two different boards (same PSU, until this one is RMA'd, which is in progress). I wonder if AMD or ASRock is hiring? I've done more to debug this then their collective lot...maybe they can take it from here. |
||
![]() |
||
zoltan
Newbie
Joined: 27 Mar 2025 Status: Offline Points: 110 |
Post Options
Thanks(0)
Quote Reply
Posted: 14 May 2025 at 1:09am |
|
|
Well, got the new memory, and still no difference. Random resets happen every 5 minutes to every two days without a reasonable explanation or even a clue. And again, never under load, mostly at idle.
I'm trying to replace the "Remarkable 10-Year Warranty"-ed ASRock SL850G PSU, so let's see how that goes. By the way, the PSU was unable to handle legacy 3.5" SATA hard-drives. When I connected the included SATA power cables, hard-drives were not detected at all. I had to connect the 4-wire Molex power cable and then use a Molex-SATA adapter, then the hard-drives were detected. There is some information on the internet that new SATA drives (SSD's mostly) use the 12V line for some kind "sleep" state that is incompatible with the legacy SATA drives (3.5"), but I couldn't find a spec. for it and of course ASRock tech-support knows even less than I do. But even with the Molex - SATA power adapter, the hard-drives were very unstable at SATA Gen3 speeds...had to limit them to SATA Gen2 to not get SATA h/w errors on the SATA bus. These drives work fine in older ASRock computer builds (in Gen3 mode even) and the drives aren't that old (2018). |
||
![]() |
||
M440
Senior Member
Joined: 12 Jul 2023 Status: Offline Points: 4610 |
Post Options
Thanks(0)
Quote Reply
Posted: 08 May 2025 at 7:29pm |
|
|
Yes, that points to the PSU.
|
||
|
asrock b650m-hdv/m.2, ryzen 7700x@85watt, arch/kde
|
||
![]() |
||
zoltan
Newbie
Joined: 27 Mar 2025 Status: Offline Points: 110 |
Post Options
Thanks(0)
Quote Reply
Posted: 08 May 2025 at 4:17am |
|
|
Thanks for the helpful tips. I've run memtest86+ for several iterations (6 hours give or take) and not a single error. But I don't think I'm going to rule out memory just yet...Patriot memory claims it needs/wants 1.35V but memtest86+ and other stress tools indicate 1.1V is what it is running at.
I've run gst and stress and don't see any issues, but one thing to remember is that I've never had a reset under load. Only time it ever resets is when it is more or less idle and thermally cool. I've tried two different x870 SL wifi boards, two different Ryzen 9 7900X cpu's, and after this weekend, I'll be trying some new memory. Only thing after that would be the power-supply, but that is an ASRock 850W P.S. so if that is problematic than ASRock still has some 'splaining to do. Anyway, I'm hoping it is the memory...I guess there's a remote chance it could still be s/w, but memory corruption or kernel issues generally result in an OOPS (linux BSOD) or a hang and not a "somebody hit the reset button" reboot. And still wouldn't explain another reset while in BIOS setup screen immediately thereafter on next reboot. So for now, it is still just one guess after another, until I can pry the location of the last-reset-reason register from ASRock/AMD/AMI. |
||
![]() |
||
M440
Senior Member
Joined: 12 Jul 2023 Status: Offline Points: 4610 |
Post Options
Thanks(1)
Quote Reply
Posted: 06 May 2025 at 1:33pm |
|
there is a couple stress test apps you can use in Linux - gst (GtkStressTest), stress-ng, prime95.. to have memtest as a bootloader entry in GRUB bootloader in Arch i needed to install 'memtest86+-efi' package and regenerate grub config (grub-mkconfig -o /boot/grub/grub.cfg) i too think it might be CPU not memory. I would stress test and would try to set a POSSITIVE curve optimizer value in the BIOS (+3~5 max), giving the cpu a bit more voltage at stock speeds. Limit the wattage a lil bit below designed TDP. ![]()
Edited by M440 - 06 May 2025 at 1:39pm |
||
|
asrock b650m-hdv/m.2, ryzen 7700x@85watt, arch/kde
|
||
![]() |
||
NDRE28
Senior Member
Joined: 08 Sep 2024 Location: Romania Status: Offline Points: 2325 |
Post Options
Thanks(0)
Quote Reply
Posted: 06 May 2025 at 10:06am |
|
|
Hi!
Please download MemTest86 (it's free), then put it on a USB flash drive. You must boot with the USB flash drive inserted (so, it doesn't matter what OS you're on). Running the tests will take 3-4 hours to complete. I hope this helps. |
||
![]() |
||
zoltan
Newbie
Joined: 27 Mar 2025 Status: Offline Points: 110 |
Post Options
Thanks(0)
Quote Reply
Posted: 06 May 2025 at 4:58am |
|
|
I have not run any overclocking, cpu, memory, nor graphics.
I do not know where to get stability tests. Even if I did, I doubt it would run on Linux, though you never know these days... I have not run memtest. Which memtest in particular (again, for Linux), or better yet, do you have a memtest.efi that I can run from the EFI shell? So, I did observe that my memory is NOT on the approved QVL for the x870 SL wifi. I have now ordered new memory (rather expensive I might add) that claims to be specifically compatible with the x870 Pro RS board _and_ x870 chipset with the AM5 Ryzen. Given that the x870 Pro RS and x870 SL wifi are same chipset (at least as far as memory is concerned), and the memory manufacturers additional claim that x870 chipset is supported (though only the x870 Pro RS board was singled-out by name), there is reason to believe that this memory should be compatible and may fix the resets. So we shall see if memory instability is causing sporadic and spontaneous resets. What would still help debug this is if ASRock and/or AMD would tell us the MMIO location (or the PCIe BDF/offset) for the last reset reason register. This should not be top-secret or even proprietary. If you don't want to hand-out the decoder sheet for the meaning, then fine, but at least I could post the value and an ASRock BIOS engineer or AMD rep. could point us in the right direction. At a minimum it would be nice to know if it was a CPU reset or a Platform reset that caused the last reset. I'm not 100% convinced that the memory may be the only culprit in these resets. The various Cx-states have also become so complicated and when intertwined with that abomination of a spec. called 'ACPI' (another demonic plot hatched by the evil that resides in Redmond) makes it so that I can't rule-out some mis-configured ACPI table trying save half-a-milliwatt over the next 20 years is accidentally and/or indirectly resetting the board. One more data point observation: with my current memory, the first reset occurs after about 3.5 - 4 days of being powered-on and running (even if idle). However, after this first spontaneous reset, the next reset occurs in about 6 - 12 hours. And it keeps getting shorter and shorter, sometimes not even being stable enough to fully boot without a reset. But, if I physically put the board in G3 state (manually unplug the dang thing), then I'm good for another 3.5 - 4 days before the next reset. I know memory and chipset maintain a few things across resets but not across full power-cycles, so maybe it is the memory. Anyway, if you could point me to the stability and memory tests, I'd be happy to run them (Linux only please, as I refuse to install viruses posing as an OS on my machines). |
||
![]() |
||
M440
Senior Member
Joined: 12 Jul 2023 Status: Offline Points: 4610 |
Post Options
Thanks(1)
Quote Reply
Posted: 30 Apr 2025 at 3:43pm |
|
|
have you set any overclocking?
have you run stability tests? have you run memtest if memory is stable? |
||
|
asrock b650m-hdv/m.2, ryzen 7700x@85watt, arch/kde
|
||
![]() |
||
Post Reply
|
Page 12> |
|
Tweet
|
| Forum Jump | Forum Permissions ![]() You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |