purchased a new nuc7i7bnh for a small workstation roughly 2 weeks ago. upon getting windows 10 pro installed and several apps set up (dropbox, quickbooks, etc) i had my first system lock up. no blue screen or error message... just frozen. traditionally, this is from some sort of hardware issue, so i started checking with some basic tests... memtest86, intel processor diagnostic, crystalmark... memtest took the longest (almost 5 days for 16gb of ram - but that's about average) - but everything passed without issue. my gut told me that even though the memory passed all tests, it was acting like a memory issue.... anytime i would put a heavy read/write action in place (ie: first setup of dropbox, which had about 500k files worth about 160gb) i would experience a lock up.
so i had the memory exchanged under warranty. 20 minutes after install.... same thing again. so i swapped out the only other item i could in the machine, the pcie ssd - still resulted in the same thing.
then i noticed - anytime i was putting this system under load, i would hear the blower kick in heavy. so i installed hwmonitor and kept it on top for anything i was doing. sure enough... every single time that system reached 100 deg c, that system would lock up. on this very forum in several places, intel reps have stated the system is designed for 100 degrees or lower. however, inside intel processor diagnostics it very clearly states for the cpu a max temp of 90 degrees cel.
i updated bios to the latest (released just a week or two ago it appears) which made no difference.
i have turned off turbo on the processor, and cranked the blower up WAY high (what i have set it to is attached in the images below). at this point, that seems to be doing the trick, but makes it so damn loud it somewhat defeats the purpose of having a tiny machine. and i'm not 100% sure just yet... still testing. but it's definitely at least more stable at this point.
i purchased two identical nucs... the other one is not having the system lock ups like this one, but is running just about as hot. so i'm not sure what is going on - but there is obviously an issue. any ideas?
as a side note... since i am using only a single pcie ssd in this system, and had no need for the 2.5" sata connection, i went ahead and removed the 2.5" cage on a whim to see if that would help with air circulation. surprisingly, under the exact same test (intel proc diag) the cpu did drop 5 deg celsius. might be something to try if you are in the same boat. it's a minor amount of help, but when the system idles at 75 deg c, 5 degrees is a pretty big deal.
ok... lots more testing today, and some things to report:
i loaded up prime95 and started pushing the processor a bit. in the background, im running ohw which is logging all temps/stats.
when i go into bios, and select "cool" or "balanced" for the cooling settings, after just 4-5 minutes of running prime95, the cpu is reporting 102 degrees cel, and the system locks up.
when i set up my own custom fan profile (pictured in the original post)... it sounds like a drone is about to take off from my desk, but the max temp i see is 83 deg cel. i have run it for an hour without an issue. but... go back in and change those fan settings to the default, and it will crash... every.single.time.
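for anyone wanting to repeat this comparison between fan profiles, here is a minimal python sketch of how a logged run could be summarized. the csv layout, the column names `time` and `cpu_package_c`, and the sample rows are all assumptions made up for illustration, not the actual export format of any monitoring tool, so adjust them to whatever your logger writes.

```python
import csv
import io

def summarize_run(csv_text, temp_column="cpu_package_c",
                  time_column="time", limit_c=100.0):
    """Summarize one logged stress-test run.

    Column names are hypothetical; rename them to match your own log.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    peak = None
    peak_time = None
    last_time = None
    for row in reader:
        raw = (row.get(temp_column) or "").strip()
        if not raw:
            continue  # skip rows with no temperature reading
        temp = float(raw)
        last_time = row[time_column]  # last sample written before the log ends
        if peak is None or temp > peak:
            peak, peak_time = temp, row[time_column]
    return {
        "peak_c": peak,
        "peak_time": peak_time,
        "last_sample": last_time,
        "reached_limit": peak is not None and peak >= limit_c,
    }

# invented sample rows, only to show the expected shape of the input
sample = """time,cpu_package_c
10:00:00,61
10:02:00,88
10:04:00,102
"""
print(summarize_run(sample))
```

running this once per fan profile gives the peak temperature and the last sample logged before the log stops. it only shows whether a freeze lines up with a temperature peak; it does not say what caused the freeze.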
at this point, i can say beyond any shadow of a doubt - intel has a cooling problem with this machine.
Thank you for your feedback and providing us with everything you have tried so far to troubleshoot this issue.
As you can verify in https://ark.intel.com/, the maximum temperature allowed is 100°C and according to one of the pictures attached about the Intel® Processor Diagnostic Tool results, the maximum CPU temperature allowed is also 100°C. However, as you were saying, other users have reported that the fan is noisy and that it causes system instability. In this case, we would like you to try these parameters provided by N.Scott.Pearson, which other community peers have found really helpful to alleviate this issue.
so i THOUGHT it was just the heat, as the two corresponded with each other. while there IS a heat issue, it's the load on the cpu that's the problem. i went to great lengths (3d printed an external blower, which actually lowered the temps 20 deg cel) and wound up still seeing the lock ups. i swapped the drive/memory with the other nuc7i7bnh i have sitting here, which i have not experienced the lockup on yet in the 3 weeks it's been running, and the new drive and memory made no difference... it still locked up. vice-versa, the drive and memory from the bad machine have been running for 48 hours without issue on the good machine.
so... i've got a faulty nuc. it is something on the processor/motherboard. i am going to try and return it to the retailer i bought it from, but since it's now well outside their return window... they may not do it. if they don't, what do i need to do to get this thing replaced by intel (literally only 4 weeks old)?
Thank you for your reply.
In this case, we recommend replacing your Intel® NUC through the retailer. If this is not feasible, I will send you a private message to discuss personal information to continue assisting you.