Recoverable File Archiving with Free Software? 80
Viqsi asks: "Back in my Win32 days, I was a very frequent user of RAR archives. I've had them get hit by partial hardware failures and still be recoverable, so I've always liked them, but they're completely non-Free, and the mini-RMS in my brain tells me this could be a problem for long-term archival. The closest free equivalent I can find is .tar.bz2, and while bzip2 has some recovery ability, tar is (as far as I have ever been able to tell) incapable of recovering anything past the damaged point, which is unacceptable for my purposes. I've recently had to pick up a copy of RAR for Linux to dig into one of those old archives, so this question's come back up for me again, and I still haven't found anything. Does anyone know of a file archive type that can recover from this kind of damage?"
where have you been? (Score:3, Informative)
Re:where have you been? (Score:5, Informative)
Try apio (Score:5, Informative)
Par2 works great (Score:5, Informative)
cpio (Score:5, Informative)
True, tar cannot handle a single error... all files past that error are lost.
On the other hand, cpio (and clones) can handle missing/damaged data without losing the undamaged portions that follow (you only lose the archived file that contains the damage). It is the only common/free format I can think of (from the top of my head) that is capable of this.
Re:Are you sure tar is unacceptable? (Score:5, Informative)
then 2 years later you want the data back.
there's a read-error at some point within the
bunzip2 will actually be able to recover all other 900kB chunks of the original tar file, except for this missing chunk or part of it.
Tar will just choke at that point and you lost everything past the read error. bunzip2 was able to recover the data past the error, but tar can't use the data.
It's quite frustrating.
Re:Try apio (Score:5, Informative)
RAR isn't completely non-free (Score:4, Informative)
tar/gzip recovery toolkit (Score:5, Informative)
The gzip Recovery Toolkit
/tmp/tar.log 2>&1 & /tmp/tar.log
http://www.urbanophile.com/arenn/hacking/gzrt/gzrThe gzip Recovery Toolkit has a program - gzrecover - that attempts to skip over bad data in a gzip archive and a patch to GNU tar that enables that program to skip over bad data and extract whatever files might be there. This saved me from exactly the above situation. Hopefully it will help you as well.
[...]
Here's an example:
$ ls *.gz
my-corrupted-backup.tar.gz
$ gzrecover my-corrupted-backup.tar.gz
$ ls *.recovered
my-corrupted-backup.tar.recovered
$ tar --recover -xvf my-corrupted-backup.tar.recovered >
$ tail -f
Re:Par2 works great (Score:5, Informative)
good overview here: PAR2 files [slyck.com]
comparison between v1 and 2: here [quickpar.org.uk]
Re:cpio (Score:2, Informative)
ZIP also supports this (the command is "zip -F" with Info-ZIP, the standard zip/unzip program on Linux).
Rar has one of the best Recovery methods (Score:2, Informative)
rar has one of the best recovery methods, as it has mutliple of them.
during compression:
Recovery Record (-rr option)
it has Recovery Record, this is data appended to the actual
rar file that lets you recover from errors within a file. The
default RR takes 1% of the archive and lets you recover 0.6%. You
can change this behaviour to going more recoverability by
specifying -rr[N]p and telling it larger percantage for recoverability.
Recovery Volume (-rv option)
further more, rar supports PAR like volumes called REV
That can recover full missing files. For all you are concerned REV is
PAR, except its integrated to RAR utility. all you type is unrar *.rar
and rar will recover files for you, either through RR or REV. No need
to muck around twenty different utilities just to ensure proper file.
Non Solid Archiving (-s- option)
Further more, rar support non solid archiving, meaning each file is
saved using new compression statistics. You will lose some space due
to this method, however you will gain speed (you dont need to decompress
first 20 files to gain access to 21st file), as well as you will gain
partial recoverability (if file 20 is corrupt, you can still decompress
file 21)
during decompression:
Keep Broken Files (-kb option)
By default, like most archiving software, rar will not save a file
that is known that is corrupt, unless you explicitly force it to do
so.
I highly recommend checking out the command line manual to RAR,
Eugene Roshal is GOD
There is a Free Software extractor for RAR files (Score:1, Informative)
Re:cpio (Score:3, Informative)
You can, too, recover tar archives!! (see: tarx) (Score:2, Informative)
It's been possible to do that for well over a decade, using various utilities such as tarx. I've successfully recovered files after a damaged point in a tarball many times. (Sigh, I used to use an old AT&T UNIX with a #$*@# broken tar, which occasionally created corrupt tarballs).
See this post [sunmanagers.org] on the Sun Managers list circa 1993, and the venerable comp.sources.unix collection, volume 24, for the sources.