Skip to main content

Questions tagged [ecc]

Abbreviation for Error-Correcting Code.

0 votes
2 answers
41 views

Hardware Theory: Read only Scrub of HDD/SSD drive using dd

Claim If drives are capable of hardware controller correction of data upon read, then it is possible to routinely catch and repair silent data corruption by simply reading it. Premises Normally, when ...
Sepero's user avatar
  • 1,619
0 votes
2 answers
141 views

Are there any filesystems with builtin data repairing via checksums?

I've read that ZFS/BtrFS have a checksum check, but they don't use it for data recovery, only for recovering data from a full local copy or a mirror copy. On the other hand, RAR archives support data ...
bodqhrohro's user avatar
4 votes
2 answers
245 views

Is there a filesystem that can maintain extra ECC data like raid5, but in the filesystem to make a fault-tolerant single external drive?

Normally to make a fault-tolerant or corruption-repairing filesystem, you use multiple drives and raid 5, or anything but raid 0. There are also many ways to make a fault-tolerant archive file like ...
Brian White's user avatar
0 votes
2 answers
513 views

What are the self-healing file formats?

It is known that there are self-healing file systems like e.g. ZFS, Btrfs, bcachefs and self-healing RAM, like e.g. ECC RAM or corresponding software implementations, which can correct single or ...
Alfred.37's user avatar
  • 137
-1 votes
1 answer
391 views

What software alternatives are there to ECC storage under Linux Mint and Linux Mint Debian Edition LMDE to protect against a bit flip problem?

It is known that there are other approaches besides ECC memory that can help avoid data loss due to e.g. flipping of RAM memory cells by cosmic rays (bit flip problem): What the Bit Flip Problem is: ...
Alfred.37's user avatar
  • 137
1 vote
1 answer
4k views

Hardware error from APEI Generic Hardware Error Source (ECC RAM)

[58306.633900] {1}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 1 [58306.633905] {1}[Hardware Error]: It has been corrected by h/w and requires no further action [58306....
Vlastimil Burián's user avatar
2 votes
0 answers
538 views

Identify ram module linked to ECC error di DMESG

one of my server is logging the following ECC errors: [lun set 14 00:14:16 2020] {33}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 1 [lun set 14 00:14:16 2020] {33}...
sKo's user avatar
  • 21
10 votes
1 answer
6k views

How do I enable and verify ECC RAM scrubbing in Linux?

I bought my first system with ECC RAM and trying to learn about its possibilities when it comes to alerting and maintenance in Linux. To be specific, Debian Linux on a Super Micro H8SGL motherboard ...
pipe's user avatar
  • 935
3 votes
0 answers
263 views

Mapping around ecc errors in Linux does not seem to work?

I get the following ecc error on a Linux box several times a day - May 24 18:21:04 staton-nas kernel: mce: [Hardware Error]: Machine check events logged May 24 18:21:04 staton-nas kernel: EDAC ...
statop's user avatar
  • 31
5 votes
1 answer
6k views

Remove ECC warnings in system log

How can I disable these warnings about ECC? I don't have ECC memory and so disabled it in bios also but it still prints it. [ 4.697057] EDAC amd64: Node 0: DRAM ECC disabled. [ 4.697061] EDAC ...
JoKeR's user avatar
  • 438
2 votes
0 answers
112 views

software-level error detection and correction for raw storage

If I understand data storage correctly, all storage devices are unreliable to some extent, which is why most have hardware-level abstraction layers. Hard drives use error correction. If a sector is ...
enigmaticPhysicist's user avatar
4 votes
0 answers
626 views

How to check flash memory for ECC errors from the Linux command line

Is there a way to check flash memory for ECC errors from the Linux command line? Note that I do not want to correct ECC errors. I just want to detect errors and list page addresses where they occur.
Stephen305's user avatar
10 votes
3 answers
4k views

How to get error detection and correction on a single hard drive on linux (with btrfs or other methods)

One of the cool things about btrfs on linux is that it can correct bit rot if it has redundant data because of its per-block checksumming. I can get redundant data by setting up a raid1 with two disks....
lnmaurer's user avatar
  • 253
4 votes
2 answers
13k views

Understanding "Hardware error from APEI Generic Hardware Error Source" error message

Summary: I'm trying to understand exactly what the following error message means: [17016.923750] {4}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 1 [17016.923758] {4}[...
Gabriel Southern's user avatar
19 votes
5 answers
8k views

Is it possible to add error correction codes (BCH, RS or etc.) to a single file?

As far as I know, WinRAR archives may contain ECC (error correction codes), so if the archive is slightly damaged, then it can be fixed by itself. For example, I can first encode archives.tar to ...
Kevin Dong's user avatar
  • 1,179

15 30 50 per page