I have setup a FreeNAS (11.2-U8) server at home containing 8 drives comprised of 2TB Seagate Barracuda ST2000DM006 drives in a RAID-Z2 configuration.
The rest of the system hardware is as follows:
- CPU: Intel(R) Core(TM) i3-4170 CPU @ 3.70GHz
- Power Supply: EVGA 100-W1-0500-KR
- Motherboard: ASRock C226 (ECC capable)
- RAM: NEMIX RAM 16GB (2 x 8GB) DDR3-1066MHz PC3-8500 (ECC)
This setup has been running for 2.5 years without issues after which I started to have SATA errors on a hard disk connected to port ADA7. 2.5 years being short in my opinion, I decided to replace the drive a "NAS grade" drive that has a longer waranty, so I bought a WD Red Pro WD2002FFSX.
After successfully resilvering the pool, I started to receive the same type of SATA errors with this new drive after only few days of operations.
I started suspecting the cables so I switched one end of the SATA cable from the drive on ADA6 and ADA7, connecting the problematic drive on ADA6. The errors kept being flagged, but on ADA6 now; eliminating the possibility of a bad cable or even a problem with the motherboard port.
After replugging everything on their respective ports, I then proceeded to switch the power cables of ADA6 and ADA7. The errors were still presents on ADA7 meaning the power cable is clear.
This being said, what's my next step of debugging? I am not an expert on the matter and it seems like either I am very unlucky and got two bad drives or something else is happening and I have no idea what it is.
The error I get are as follow:
Device: /dev/ada7, ATA error count increased from 2960 to 2961
Device: /dev/ada7, not capable of SMART self-check
Pool main state is DEGRADED: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected.
I must mention that all hard drives are SMART capable. Sometimes the NAS is running without problems for few days, then it starts sending me emails repeatedly reporting issues. It really is sporadic behavior.