ASRock.com Homepage
Forum Home Forum Home > Technical Support > AMD Motherboards
  New Posts New Posts RSS Feed - Quad Chanel Memory Errors with trx40 Tiachi
  FAQ FAQ  Forum Search Search  Events   Register Register  Login Login

Quad Chanel Memory Errors with trx40 Tiachi

 Post Reply Post Reply
Author
Message
MantisMan13 View Drop Down
Newbie
Newbie


Joined: 15 May 2020
Status: Offline
Points: 5
Post Options Post Options   Thanks (0) Thanks(0)   Quote MantisMan13 Quote  Post ReplyReply Direct Link To This Post Topic: Quad Chanel Memory Errors with trx40 Tiachi
    Posted: 15 May 2020 at 11:12am
I've been having major issues that seem to be related to the A1 and A2 slots on my TRX40 Taichi with a AMD 3970x and 256G kit of CORSAIR Vengeance RGB Pro CMW256GX4M8E3200C16. Microsoft Windows 10 (10.0) Pro for Workstations 64-bit (Build 18363)

Issues started after running hard for 2 weeks running folding@home with 2 cpu clients (32 thread and 24 thread) and a gpu client using AMD rx5700xt slight overclocked. No over clocking on the CPU or Memory and Thermals all were well handled by case cooling and AIO. I first noticed the issue with F@H when it crashed over night and after that it would BSOD after tying to start folding again. This was a day after a Microsoft updated and also installing node js and vuejs development packages. I original suspected software or driver conflicts, so I made sure to update AMD chips set to amd_software_2.04.04.111. This didn't help. I then also discover running Cinebench r20 world cause BOSD as would CPU-Z bench or Stress. The BOSD's were a verity of messages. MEMORY_MANAGEMENT,IRQL_NOT_LESS_OR_EQUAL, PAGE_FAULT_IN_NONPAGED_AREA etc, but all pointed to crash address of ntoskrnl.exe+1c2390 when the minidumps were viewd with BlueScreenView. I started to suspect memory when I noticed I was running 32G low. Going into BIOS I found that A2 slot was not showing up. I also was having getting in and out of BIOS as the usb wireless keyboard was not working most of the time when the system would come back up to the post screen. I had to clear the CMOS and full poser down and back a couple time to get back into BIOS and set things up again. Sometimes all 8 slots would show fine and I could get it going again a would get back into window. At one of these I found going into the iCue software there was a Firmware update for the Ram and I ran that. After than I had a XMP profile that I could chose from in the bios that I don't think had been there before and setting that initial seems to help. But it would still crash and I would get back into BIOS and see empty slot A1, A2. I then created a USB boot for MemoryTest86 and started running tests with isolated ram. I tested only B2,A2 without issue. B1,A1 no issue. All men was testing fine and I boot up into window with B2,A2 but I think i had a crash and start to test the memory again. I spent almost 2 days running memory tests and found no memory errors. My last test was back to the full 8 chips loaded and all seemed fine. I then tried to boot up into Windows, but had issues with the BIOS freezing up on me a few times and also moving from English into kanji langue and freezing. I then flashed it to BIOS 1.6 and brought it back up and reconfigured. Raid options where now showing again (had been missing in 1.1) I have 2 raided NVME 1T drives and 8 SATA drives for 22T Raid10 using the AMD raid drivers. It seems to take a couple cycles to going in and out of the BIOS to get the raid to hook back up so that windows could boot. But along with that I stated seeing a warning flsah "Memory PMU Training Error at Socket 0 Channel 2 Dimm 0" and if I would go into the BIOS both A1 and A2 would not show up. After removing both from the system so that I have 2 Channels of Tri-channel memory on B2,C2,D2 and B1,C1,D1 I seem to be error free. All benchmark and stress test are running with out issue and I'm folding with the CPU and GPU all at 90% as I wright this.

I should also mention I had uninstalled F@H, vuejs and other recent installs to no avail. I have not reinstalled node or vue yet.

So, that is about the best recall I can give for this with out post all of my screen shots and other notes.

My feeling is either one or both of the memory chips I removed has some issues and I will put them into another system to retest them, however, I think I did enough testing where I doubt that is the issues. I should add that I rotated memory around to different slots in the various tests I did.

So what I'm getting down to is either 1: 2 weeks of running hard with 90%+ CPU & GPU has cooked the A1, A2 slots a bit and made them unstable somehow or 2: there is some sort of system compatibility issue when I get up into quad channel memory, or 3: there is a issues with the 3970x cpu with quad mem. I could use some advise on how to test this out to rule out hardware faults and where to look if it's a software/driver problem.

John Glassman
MantisMan LLC
Back to Top
MantisMan13 View Drop Down
Newbie
Newbie


Joined: 15 May 2020
Status: Offline
Points: 5
Post Options Post Options   Thanks (0) Thanks(0)   Quote MantisMan13 Quote  Post ReplyReply Direct Link To This Post Posted: 17 May 2020 at 2:28pm
I did another test today where I swapped out the memory that has been running fine in B1 and B2 slots for the Mem I removed from the A1, A2 slots and those worked just fine and I've been folding on them all day. I also then tried putting the memory from B slots back into A slots to try to get back to 256 quad memory but I could not even get to the post screen on 2 restarts. Both times I got a 0d error on the board. I did a full power down on the PSU and tried to boot and this time got to the ASUS screen, but it would not respond to the keyboard and did not try to load window boot manager. Powered down, removed the 2 chips in A's and it started up and boot right up without issue. So I really need to know is this a problem with the board, with the cpu or still some driver/bios thing. How to test?
Back to Top
MantisMan13 View Drop Down
Newbie
Newbie


Joined: 15 May 2020
Status: Offline
Points: 5
Post Options Post Options   Thanks (0) Thanks(0)   Quote MantisMan13 Quote  Post ReplyReply Direct Link To This Post Posted: 18 May 2020 at 4:35am
I believed I've ruled out Quad Memory being the issues. I ran another test where I changed from a working triple Mem config of b1,b2,c1,c2,d1,d2, moved the mem in d1,d1 to a1,a2. The system posted and booted but after 5 mins with no load it crashed with a BSOD error. IRQL_NOT_LESS_OR_EQUAL ntoskrnl.exe+1c2390.

I think this confirm there is an issue in the A channel. So how to know if it is the MOBO or the CPU. It's not like I've got 3970x cpus I can just swap in.      
Back to Top
 Post Reply Post Reply
  Share Topic   

Forum Jump Forum Permissions View Drop Down

Forum Software by Web Wiz Forums® version 12.04
Copyright ©2001-2021 Web Wiz Ltd.

This page was generated in 0.188 seconds.