



RAID Trust Issues — Windows Or a Cheap Controller? 564
NicApicella writes "My new system has two sparklin' SATA drives which I would like to mirror. After having been burned by a not-so-cheap, dedicated RAID controller, I have been pointed to software RAID solutions. I now stand in front of two choices for setting up my RAID: a Windows 7 RC software RAID or a hardware RAID done by the cheap integrated RAID controller of my motherboard. Based on past experiences, I have decided that only my data is worth saving — that's why the RAID should mirror two disks (FAT32) that are not the boot disk (i.e. do not contain an OS or any fancy stuff). Of course, such a setup should secure my data; should a drive crash, I want the system up and running in no time. Even more importantly, I want any drive and its data to be as safe and portable as possible (that's the reason for choosing FAT32), even if the OS or the controller screw up big time. So, which should I choose? Who should I trust more, Microsoft's Windows 7 or possibly the cheapest RAID controller on the market? Are there other cheap solutions?"
Have some FUD with your RAID (Score:5, Funny)
Do you really want to trust Windows with your data?
You are asking the wrong question. (Score:5, Insightful)
RAID is only marginally valuable. In my experience, for all but the most carefully controlled environments, RAID simply adds complexity, the number of things to go wrong increases, along with the likelyhood of lost data. Do it only if you want the *experience* of running RAID, but don't count on RAID to "save your data".
I've worked as a system administrator for more than a decade, in medium-large scale deployments with good success, (think: servicing thousands of users, hundreds of domain names, tens of thousands of email addresses, etc) so I think I have some useful experience you can benefit from.
IMHO, you most likely to lose data from the following things (in order)
1) Aw sh1tz. "I didn't mean to delete that folder"... or "Whoops! I formatted the wrong drive", "I saved the wrong version of the file!", whatever. Although I *myself* don't have this happen often, it does happen. And even in my case I've lost about as much useful information this way as by drives dying. Users delete stuff all the time, and it's usually my job to bring it back, which is why I perform redundant, historical backups EVERY SINGLE DAY.
2) Malware. Don't minimize this - it's real, and it's why I reply to Parent. You are more likely to lose information from a virus/worm/malware and/or b0rked install of something that hoses your filesystem than by a hard disk crash given stable hardware.
3) Bugs. Filesystems have bugs. So do applications, utilities, anything with software. Strange, unexpected conditions, often caused by bugs in applications can cause data to "disappear", files to get corrupted, filesystems to get corrupted, folders to be incompletely written, etc. This is about as likely to cause lost data as:
4) Hardware failure. This is one of the lowest orders of lost data, although when it happens, it can be one of the most extreme.
Let me say this: RAID 1/5 only PARTIALLY protects you from the last one. Actual, bona-fide backups protect you from all of these. If you care about the data, get backups. If you care about uptimes at great expense, RAID *may* be worth it.
My advice is something most people don't want to hear: for personal use, get backups online for $5/month. Mozy/Carbonite/etc. There are zillion vendors, just Google it. In two years, it will cost you about as much as that 2nd hard drive. It protects you far better than that 2nd hard drive, and it's so automatic that you'll hardly notice it until the moment it actually matters: when you just have discovered that your data is gone.
Re:You are asking the wrong question. (Score:5, Insightful)
Your 4 points are correct. However, the reason for using RAID is NOT as a backup. RAID != Backup.
RAID is for redundancy and performance increases.
I had a drive die in my NAS a few weeks ago. It took 5 minutes to walk to the server room and plug in a new drive. There's no added complexity for the sysadmin, everything is done automagically by the RAID controller. Losing a server or data for hours while the drive is restored from tape is more expensive and complex.
Re: (Score:3, Interesting)
I learned this important lesson when setting up a RAID array for audio and video production. It increased my throughput tremendously. When you need to stream digital video or audio into an editor, or a multitrack recorder, It really helps to have more than one disk doing it. Of course, I can't use FAT32 like the author because I often have very large files to move around.
However, RAID on my regular desktop has never been much more than a headache. Now I j
Re: (Score:3, Funny)
Re: (Score:3, Interesting)
Plus, the RAID array can keep track of where the head is on each drive and choose the one that's closest to the requested sector. Linux software RAID does this, though I don't know specifically who else does.
Re:You are asking the wrong question. (Score:5, Interesting)
Heh. Linux software RAID doesn't do jack. I've looked at the source code. The mdadm RAID1 driver just alternates drives for reads whenever the requests are not contiguous. That is all. Nothing more. There's no intelligence in there. No keeping track of head positions, no attempts to discover or infer physical drive geometry. Nothing. Just a simple round-robbin. It just so happens that for MOST things that involve random access, the effective throughput is nearly doubled. More intelligence wouldn't actually buy you much in the general case, so why bother?
Also, the dmraid (fakeraid) RAID1 driver only does reads from one disk. I made the mistake of using dmraid instead of mdraid, only to discover through performance tests and iostat that there are basically two software RAID drivers that CLAIM to do identical things but in fact do not.
Re: (Score:3, Interesting)
I would expect this - modern drives (when I say modern I mean any drive made in the last 15 years) is effectively a black-box to the computer. The OS has absolutely no idea where the heads are, or even how the sectors are actually laid out on disk. Any attempt to be "clever" in ordering reads is doomed.
Re: (Score:3, Interesting)
Try areca raid cards, they do give a performance boost on raid 1, but of course a slight performance hit on writes.
Can't have everything.
Re: (Score:2, Insightful)
Or until the backup company disappears. I suspect hardware is much more stable than any company providing any online backup.
I'd feel safer by far with an outfit that picks up your physical tapes and can return them as needed. If the company is going belly up, it's a lot harder for them to "lose" a warehouseful of tapes than a bunch of files on rotating memory.
For a private person intereste
Re: (Score:3, Informative)
Are you crazy? (Score:5, Funny)
Do you work for one of these online backup places?
I would sooner trust a WD drive with my valuable data.
Re: (Score:3, Interesting)
A periodic rsync copy (with history) is much better for keeping your data, accidental deletion or overwritten partition tables happen very easily, no RIAD system will save your 'ass'.
Preferable you keep the copy on an other machine, different UPS or surge protector and not in the same machine, hanging on the same PSU. Even better is to copy it to a remote place.
With current bandwidth 'limits', it's possible a good idea to keep it somewhere else.
Re: (Score:3, Insightful)
No, I don't work for any of these. And I do maintain my own backup set, because I backup TBs of data daily. But for personal use, the online vendors are the best bet.
Re: (Score:3, Funny)
Re:Are you crazy? (Score:5, Interesting)
I'm still somewhat astonished that WD would think that it's acceptable to have external drives that work on OSes other than Win except for the power management features. Saying you're just supporting Win for a hard disk is nowhere near acceptable.
Personally, what I do at home is I use ZFS to mirror a pair of 1tb Seagate drives and that seems to work fine, it's not really the best set up, but it's hard to get such things located off site for the amount of money I have to spend.
Re: (Score:3, Insightful)
My advice is something most people don't want to hear: for personal use, get backups online for $5/month. Mozy/Carbonite/etc. There are zillion vendors, just Google it. In two years, it will cost you about as much as that 2nd hard drive. It protects you far better than that 2nd hard drive, and it's so automatic that you'll hardly notice it until the moment it actually matters: when you just have discovered that your data is gone.
And is so slow that a LS120 drive reading a 1.44MB floppy would actually be faster. Or a 1x CROM. Or a 16 year old hard drive.
Also, I have to trust that the service and my internet connection will be available when I need to restore my data.
Or I can use RAID... and tapes.
Re:You are asking the wrong question. (Score:5, Interesting)
Ive had #4 happen to me. A power supply in my computer failed (a name brand one, not a cheap no name brand) and damaged everything attached to one of the 12v rails. This included both drives of a raid1 set. (ironically all my drives that wernt part of a raid set were completely undamaged) I was later able to recover the data from both drives (both had damaged sections but different areas were damaged on the 2 drives allowing for a complete recovery between the 2 of them)however it goes to show that just having a raid array wont completely protect you from hardware failures.
Re:You are asking the wrong question. (Score:5, Interesting)
That's an interesting thing... the power supply has more potential to cause damage than anything else in the PC, but nobody ever thinks about protecting against its failure. Makes me wonder why we don't have surge protectors on the 5/12V rails as standard yet.
Re: (Score:3, Funny)
This is why instead of a simple "name brand" PS, I have one that not only employs internal surge protection features, but they actually place an insurance guarantee against all the components connected to it internally, including data recovery(up to $10,000). It's conditional on you having a properly rated UPS attached (you even have to send a copy of the UPS receipt with the waranty registration card), and if it's a true electical failure, they expect the UPS company's insurance guarantee to kick in first
Re: (Score:3, Insightful)
I had a similar case where the controller decided it wanted to die and started writing spurious data to the disks. RAID won't protect you from the controller itself dying - and that also can occur for software RAID as well, the controller can still bork your data.
Re: online backups (Score:5, Insightful)
Personally, I haven't yet encountered anyone who really got benefit from those personal Internet backup services like Mozy. In regular use, it always seems like the person exceeds their storage allotment or Internet connectivity issues prevent them from recovering what they need, when they need it.
I tend to recommend people buy an inexpensive external USB or firewire drive, leave it attached and assigned as a backup device, and have some software package run a daily backup of all the relevant folders and files they might need to save.
It's great that your data is stored offline and off-site ... but I'm just not sold on most of the implementations for "home use" being as great a solution as they first appear to be. Many of the providers have come and gone over the years, too. What happens when your offline backup company goes under?
Re: (Score:3, Informative)
Having run RAID quite a bit myself one must remember having all your drives in one box is always an invitation for trouble since hardware failures on a higher order will likely hit all the drives.
If you want to do online backup get DSL instead of cable internet for the faster upload bandwidth.
Get a backup service with versioning. That way if you or a virus delete something it just doesn't sync the deletion to your backup.
I personally use JungleDisk which uses Amazon S3 storage. You can set the versioning co
Re:You are asking the wrong question. (Score:5, Informative)
Having run RAID quite a bit myself one must remember having all your drives in one box is always an invitation for trouble since hardware failures on a higher order will likely hit all the drives.
Not to mention the temptation to use _Identical_ disks in your redundant array... I've had a RAID1 pair fail totally when both drives died within 24 hours of each other because of a firmware bug. This happens a lot more than most people think. Statistical analysis of the reliability of RAID _always_ assumes failures arrive independently of each other, but a large proportion of failures are caused not by random events but by external circumstances and therefore happen either simultaneously or nearly simultaneously.
Re:You are asking the wrong question. (Score:5, Insightful)
Re:You are asking the wrong question. (Score:4, Insightful)
Re: (Score:3, Interesting)
I'm sitting here laughing at the idea of backing up 1TB when you could just download it all from Pira^H^H^H^H torrent sites again.
Re: (Score:3, Informative)
4) Hardware failure. This is one of the lowest orders of lost data, although when it happens, it can be one of the most extreme.
I don't think this is quite right. Hardware fails all the time, it should not be underestimated, and often it is useful to augment backups with RAID. It just depends on your exact needs... what is this data? Does the world end if you can't get back up and running within an hour of a hardware failure? If 12 hours of downtime is ok, then restoring a backup, is of cours
Re: (Score:3, Insightful)
5) Theft. Break-ins do happen and no amount of RAID will protect your data from that.
Re: (Score:2, Informative)
Yeah, blame broken applications on the filesystem. Seems like a good idea.
Re: (Score:2)
Written in complete ignorance of other linux file systems that do not have these problems...
Drobo (Score:2)
"I want any drive and its data to be as safe and portable as possible [...] even if the OS or the controller screw up big time."
If the controller screws up and writes crap to your raid, your data is dead. Not sure if your expectations are realistic.
Try a drobo instead.
Comment removed (Score:5, Insightful)
Re:Seriously? (Score:5, Insightful)
The article smacks of false dichotomy. There are a number of solutions, not just Windows 7 or a hardware RAID controller.
To begin with, every NT-lineage Windows version ever produced supports software RAID out of the box. Add that to the fact that any major Linux distro today supports software RAID. And so do the *BSDs. And Mac OS X. And Solaris. And probably a bunch of other platforms I can't think of right now.
Hell, you could buy one of these one of these [linksysbycisco.com] and throw the drives in it, connect it to your network switch, and presto -- instant RAID+NAS.
I think we would all like to know why you think Windows 7 is your only option, because if that's what you think, you don't know how mistaken you are.
Re: (Score:3, Insightful)
The article smacks of false dichotomy. There are a number of solutions, not just Windows 7 or a hardware RAID controller.
Agreed.
As I see it, if you want guaranteed repairability then you basically have two options: enterprise-class hardware with a support contract (and price tag to match), or an Open Source software solution.
Put another way, either you pay someone to take responsibility for fixing it, or you take responsibility yourself. A Microsoft solution doesn't give you enough control to take full responsibility, because you can't be certain that it will be legally or technically possible to recreate your current setup
FAT??? (Score:5, Insightful)
You data is most important and you plan to use FAT? Good luck with that!
Seriously, though. No RAID solution that is not totally S/W is portable. But do you really need RAID? It sounds like what you need is a good backup solution with frequent backups. Does you data change so much that losing one day's worth of data would be a problem?
Re: (Score:2, Informative)
The big problem in this picture is the way that Windows deals with drive errors. It doesn't report them and people commonly discover that one of the drives in a mir
Re: (Score:2)
You can have a FAT toasted beyond recognition and still get it back by putting it into a Win 9X box.
Well, if you are able to find a Win95 box, and if you still remember how that works... :-)
(Oh, and it better be a Win95 box with support for FAT32, unless all your partitions are under 2GB)
But I do agree with the rest of your remarks.
Re:FAT??? (Score:5, Informative)
No. NTFS is not perfect, but to think FAT is as bad is deluded. I've honestly never seen a HD formatted with NTFS that I couldn't repair with built-in tools, unless it had physical defects, and in such a case ANY file system would have problems. But I've seen so many FAT drives get hosed by little problems, it's not even funny.
Seriously, don't trust your data to a FAT partition - not worth it.
Re: (Score:3, Informative)
The big problem in this picture is the way that Windows deals with drive errors. It doesn't report them [...]
Windows reports drive errors in the Event Log.
Re: (Score:3, Interesting)
I've messed with software RAID in Windows 2003 server for years and have never had a drive failure not reported to the Event Log.
Well, sure, but how often do you read your event log? Most users _never_ read their event log, so logging the failure like this is next to useless. This is something the user needs to know about, immediately. At the very least a notification area icon and pop-out box would be appropriate.
Re: (Score:3, Insightful)
"Most users _never_ read their event log, so logging the failure like this is next to useless."
The original complaint was that Windows doesn't report disk errors, which is not true. If this were Linux, people would point out that the information desired is right there in the logs..
Software RAID isn't (wasn't?) even available on consumer desktop versions of Windows, so you'd expect some minimum level of cluefulness on the part of the user and less handholding on the part of Microsoft.
A.
Re: (Score:2)
Agreed, RAID prevents from a drive failure, period. If your controller fails you lose data, if your mother board fails you may loose data, if your memory fails, you may loose data, etc...
As the poster said, I would go first with a proper backup strategy, it is more important in order to secure your data. Then, go with RAID if you evaluate that you still need it. Heck, as the other poster mentioned, you may find out that you do not need RAID anywhere although I always like to add RAID AFTER my strategy is ma
Re:FAT??? (Score:4, Insightful)
If all you want is multiple copies of your data, then really what you want is an automated incremental backup system, that copies your files to a second hard drive, and ideally keeps a few older copies so that if a file gets accidentally deleted or somehow corrupted, you have a chance to go back and find a usable copy. This is what I do on my system: I keep multiple incremental copies from the last few days/weeks/months. It was easy to set-up (in Linux, mind you). Do hourly syncs if necessary.
Also critical, if the poster is truly concerned about never losing data, is to get some kind of offsite backup. Two hard drives don't do you much good when the computer is stolen or your roof leaks. You need to have a way to regularly copy data offsite (ideally automated over the network, or via external hard drive if you're sufficiently disciplined).
RAID has its uses, to be sure. But if the poster is most worried about never losing important user files, then it seems like what he wants is is the multiple-redundancy of backup, not the immediate failover of RAID.
RAID != BACKUP (Score:5, Insightful)
Re: (Score:2)
With two drives you can set up a backup scenario. Put the second drive in an external enclosure and use robocopy on a regular basis. Most data loss is from user error. RAID 0 won't protect you from that. Screw FAT32 and use NTFS. If you need to access it in an emrgency put the drive in an external enclosure.
Re: (Score:2)
A serious backup solution HAS to include incremental backups where you keep ALL previous copies of a file for a given period of time.
Otherwise, you will quickly get screwed in various scenarios. One of them is your main controller failing so slowly that you won't realize that your data is slightly corrupted before it is too late (e.g. you have already overwritten the copy and you no longer possess any non-corrupted copy)
It also needs to have all components redundant. The cheapest way to achieve this is to h
Re: (Score:3, Insightful)
What about Volume Shadow Copy in Winblows or Time Machine in OSX? They seem pretty useful in the "man, that was dumb" scenarios.
Your first problem is Fat32 (Score:5, Informative)
If you want data integrity, use NTFS. Using Fat32 is like saying you want a reliable car, so you're buying a Edsel because they've been around a long time-- it doesn't make sense. Every other OS on earth can read NTFS (if not write it), so it won't affect your portability requirement.
Secondly, before you make any decision regarding Windows 7 RAID, make sure the edition of Windows 7 you want to buy ships with software RAID support before you put all your eggs in that basket-- early betas and RCs of Vista had software RAID enabled, only to have it disabled before release. I've seen no guarantees about Windows 7 software RAID support, and which editions will have it enabled. (If any.)
If you're planning to move to a server OS after Windows 7 expires, I can practically guarantee software RAID will be enabled, but that still doesn't mean you can necessarily upgrade your Windows 7 software RAID array to a Windows Server software RAID array. Do your homework.
RAID is *NOT* backup! (Score:5, Insightful)
Re:RAID is *NOT* backup! (Score:5, Funny)
Are you insane?
Mod parent up up up (Score:2)
The only thing RAID will save you from is a dead drive.
There are infinitely many ways to lose your data, and a dead drive is only one of them.
If you are going to use RAID, you might as well use RAID 0 unless you can't afford the downtime.
If you start talking about data loss, then you just lose, because you should have backups.
It's *NOT* hardware raid on your motherboard. (Score:5, Informative)
These motherboard "raids" are called fakeraids.
All that it is is that it writes the metadata on the disk in specific format so that you can see the raid volumes via BIOS. Note: Only "see" their status - in case you replace one drive, the resync is still done by software and you must boot to operating system. One clue is the fact that in Linux the dmraid package uses exactly same driver for accessing fakeraid-mirrored drives and Linux's own software-raids - device mapper just does a bit of magic at init.
However, if faced with choice of Windows-only or motherboard-raid, I'd go with the motherboard-version, because that's at least supported both by Windows and Linux so in case something goes wrong with your Windows installation you can always pop in Knoppix or some other Linux CD for recovery.
Re:It's *NOT* hardware raid on your motherboard. (Score:5, Informative)
DO NOT buy a real RAID card unless you have a pretty good budget for your system, and need the highest performance. The problem with buying a real RAID card is that you need to buy not 1, but 2 or better yet 3 of them, so that you can have spares. If your RAID card dies (and they do, more often than you'd think), the only way you'll be able to access that data, because of the proprietary on-disk storage method used by RAID card vendors, is to have an identical card (with the same firmware version, to be safe). And since hardware is constantly being obsoleted, you need to buy your replacements when you buy your card, not hope they're still available later. It's also a good idea to have spares of the same make and model hard drive, because hardware RAID controllers aren't usually that flexible in allowing you to pair up different sized drives like Linux sofware RAID.
For many purposes, software RAID using Linux is really a much better solution, because the on-disk format is open-source and standardized, so it doesn't matter what hardware you have, you can plug the disks into a different Linux system and you'll be able to read the data with no trouble. The only downside is a slight performance decrease since the CPU has to do all the work, but even then unless the system is heavily loaded, it's still faster than hardware RAID because the hardware RAID cards aren't that fast.
With the giant drives that are now common, I think the best solution, at least for home/desktop systems, is to forget about RAID5/6 altogether and just get a couple of 1-2TB SATA drives and mirror them with software RAID 1 in Linux.
Windows + NTFS + external backup (Score:2)
Cheap RAID controllers suck - at least you can trust Windows to be consistent between installations if need be. External (preferably offsite) backup is also a must! As I'm sure you'll be reminded 1000 times in this thread, RAID is not backup.
RAID is never about protection. (Score:4, Insightful)
Re:RAID1 is not fool's gold (Score:3, Informative)
Re: (Score:3, Insightful)
Tell me, how often do you back up your data? Daily? Hourly? Because when your hard disk craps out after a morning's work you'll be mighty grateful for that RAID-1 mirror.
RAID is no substitute for backups (Score:4, Informative)
RAID is no substitute for backups. RAID is very good at propagating errors and problems very quickly, be they software glitches or human errors.
For consumer class storage, weekly / daily backups might be more efficient than investing a lot of effort into live RAID. Since I'm a Mac guy, I see the best answer to this question as Time Machine to a network / USB attached drive -- hourly (configurable for more or less often) differential backups, almost transparent to the user. To my knowledge, Windows has no similar set of software to allow reinstallation to the last hourly backup -- my wife had the misfortune of having to restore a blank drive from her last backup and it was a flawless process that truly left her where she left off less than an hour before the hardware failure. The reinstall wizard just had to ask where the backup was. Casting aside MacOSX advocacy, there is truly no substitute for a good automated backup solution that is regularly tested. I think the best method would use the fewest common components, like a NAS, followed by an external drive with its own power supply. My least favored option would be an internal drive with every single component shared.
Thats not really hardware raid... (Score:2)
Off site backup and test your restores (Score:3, Insightful)
Without regard to if you use software or hardware RAID or the quality of the RAID system, RAID only protects you from a physical disk failure. If you as a user screw up (delete or change something you didn't want to) or if some software bug screws up for you, or if you have a non-disk related hardware failure (causing a data corrupting machine crash) then you have lost your data -- RAID doesn't help.
Even if you are only trying to protect against disk errors, if the RAID system fails (even expensive quality ones can), or if you don't know and follow the recovery procedures EXACTLY, you can lose all your data.
The only reliable solution is making a copy or a "backup". Backup does not mean making a copy of the data on the same machine. (Whatever took out your RAID might also take out the other non-RAID disk or directory that you put your copy on.) If you are paranoid (or just prudent) your backup should not be a mapped or mounted drive on another machine. (Viruses can write to the network as well.)
And finally... Backups only count if you have tested your restore process.
Your best bet is to buy server grade SATA drives (Score:2)
Re: (Score:2)
The best way to protect against drive failure is to buy server grade SATA drives, which are designed for 24/7/365*5 operation, and not cheap PC drives which are designed for 10 hours per day for 3-4 years. Buy server grade SATA drives
This is just pure marketing baloney. Do you have any real-world tests that actually back this claim up? I've never used "server-grade" drives, and never will. I've seen "server grade" drives fail in large quantities and "desktop" grade drives last for years running 24/7/365.
Since You're Just Mirroring ... (Score:2)
... do it with software. This CPU overhead is minimal for that.
Things like RAID 5 are where you'd be better off with a dedicated controller.
RAID is a high availability feature, not backup (Score:2, Informative)
Consumer editions of Windows only ever supported Software RAID1. I've made a few experiences with SW RAID1 on WS03, and it's pretty much crap. Linux SW RAID on the other hand worked fairly well.
RAID is not a backup. This is the most important observation. RAID is a high availibility feature. If you lose your RAID array, you shouldn't lose any data. If you do, your backup strategy sucks.
Generally, skip RAID in a consumer setup. RAID is complicated, it's a PITA and especially the low end stuff can do more har
Are sure RAID is what you want? (Score:3, Insightful)
With RAID mirroring, if you overwrite or delete an important file, it's copy on the mirror is immediately overwritten/deleted too, and the file is lost. Wouldn't you rather need a good regular backup?
And as someone pointed out already, FAT is really not a reliable file system. If you are on Windows, use NTFS. It is still portable, having read/write drivers for both Linux and Mac (see this guide [alma.ch]).
Since the files you want to keep safe appear to be regular files, not system files, any simple file copy mechanism could do. For an easy and simple system, you can use the Windows robocopy.exe tool in a batch file. For a more sophisticated system which can keep older file versions, and can easily be adapted for use over the network, you could try a Windows version of rsync like cwrsync. There are also a few rsync GUI frontends for Windows.
If you decide you really want RAID mirroring and go with the hardware solution, my understanding is that you need a replacement controller in case yours breaks. Since your controller seems to be embedded in the motherboard, you would need a replacement motherboard.
With the Windows software RAID, you are dependent on that software, and have portability only between machines with this Windows 7 software RAID (possibly even only this particular version).
Real RAID is cheap (Score:2)
You can buy a real RAID controller for $400-$500 nowadays. If your data is not worth that much...
Re:Real RAID is cheap (Score:5, Interesting)
Wrong. You need to buy at least two of these controllers, at the same time, or else when your "real" RAID card dies (and they do), you'll lose all your data unless you can find an identical card (you may even need the exact same firmware version).
Software RAID on Linux is a much better solution, as the underlying hardware doesn't matter. You can mix and match different drive models/sizes (can't do that on HW RAID), and swap the drives to a different system and still read them thanks to the standardized on-disk data format.
External eSATA enclusure (Score:2, Informative)
Re: (Score:2)
I'm with him. I have a bunch of external FW800 RAID enclosures (for both SATA and older IDE drives.) Advantages include not using/depending on the computer for RAID drivers (in my case, Mac OS X software RAID), independence from computer failures (e.g. bad power supply), cooling (less hot drives in the case). Disadvantages include performance (unless you have a good eSATA RAID case) and Size, Weight & Power (more than for drives within the computer case).
I've been happy with the enclosures I've bough
If this is not a production web server... (Score:2)
...then you probably don't need RAID. Use NTFS and set up some kind of scheduled backup to the second drive. Or, build a Linux NAS device and run BackupPC (backuppc.sourceforge.net). BackupPC works great for this sort of thing, it can do incremental and full backups of all your data, on the schedule you choose.
RAID != BACKUP (Score:2)
Re: (Score:2)
RAID 0 != mirroring.
See: RAID 1
Kudos. (Score:5, Funny)
"Safe" FAT32, cheap RAID, RAID implied as backup, Microsoft.
Nice job, you successfully trolled the /. frontpage.
Are we talking Pr0n or Tax Receipts here? (Score:3, Interesting)
As for Motherboard RAID chipsets... Keep in mind that your motherboard has a non-zero probability of frying, having it's caps go bad, being peed on by irate government agents, etc.. I once had a RAID 0 array that was hooked up to one of those things. After the Mobo died I had to do without letters K through P of my Japanese horror-comedy-porno-game-show collection until I was able to find a used computer with the same RAID chipset. (I don't know if it's changed, but at the time each different RAID chipset made RAID 0 arrays that were not compatible with anything else on this lump of rock.) If data portability rather than performance is a priority for you, my advice would be to avoid hardware RAID entirely.
NTFS + External USB disk + MS SyncToy (Score:2)
Cheep Non-RAID Controller! (Score:5, Interesting)
Fast facts:
I prefer pure software RAIDs, for a simple reason: They do not depend on available hardware. If one controller dies, switch to another one: Other brand, other type, other drivers, and the RAID still works. If you insist, you can even mix an IDE drive, a USB drive, a SATA drive and a SCSI drive into a single RAID. Try that with a hardware or host RAID. Some people even built RAIDs of floppy disks or USB sticks (not for pemanent use, of course).
My faithful old Linux home server runs two RAIDs, both in software: a RAID-1 for the OS (remember: the BIOS does not know about the RAID), and a RAID-5 for the data. The RAID-1 used to run on old SCA drives, but recently, I switched to two small IDE drives due to unrecoverable SCA cabling problems. The RAID-5 is composed of four IDE drives, connected to two IDE controllers, each disk on a single IDE cable. An external USB disk is used to back up my data, rotating through 10 days. All filesystems are ext3, all disks are monitored using SMART, all RAIDs are monitored. If anything wents wrong, I will get an e-mail from the monitoring software.
Until recently, one of the controllers was an el-cheapo non-RAID controller, and the other one was a donated, expensive, well-known brand, RAID-capable controller running in non-RAID mode. The latter one decided to randomly take some free time on the job, and either disconnected from the PCI bus or disturbed it, causing panics in the OS above. Only pure luck protected me from data loss. I ripped it out of the machine, kicked it into the trash bin, rewired the RAID to use two disks per IDE cable, and verified and reconstructed my data. Some days later, another el-cheapo non-RAID IDE controller arrived, the same brand, model and type that already sat in the next PCI slot. So I rewired the RAID again to work with one disk per cable, everything was fine again.
For a new small business or home server, I would use nearly the same setup again: Two software RAIDs, one for the OS, and one for the data. Upgrading the OS is just fun when you can
Re: (Score:3, Informative)
My faithful old Linux home server runs two RAIDs, both in software: a RAID-1 for the OS (remember: the BIOS does not know about the RAID), and a RAID-5 for the data.
Beware of RAID-5, it's dangerous.
The problem is that reconstructing the array after a disk fails is a very intense operation that touches every sector of every disk. If another disk in the array has a latent failure, the reconstruction operation will trigger it, and when you lose two disks from the array, you're hosed.
This happened to me. I had a RAID 5 array with a hot spare, one drive failed and dropped out of the array, and the process of reconstructing onto the hot spare triggered another failure.
Promise SATA RAID = Cheap and Good (Score:3, Informative)
Raid will NOT secure your data (Score:3, Informative)
raid will make your data highly available, not secure.
if you want security, you need backups, and backups are:
- off line (viruses, power surge, sabotage...)
- off site (fires, theft...)
- tested (i've got horrors stories of people that THOUGHT they had backups...)
- multiple (... and of backups that turn bad at the worst possible moment)
Raid is none of that. I know plenty of people who thought their data was safe because they had raid. It isn't, it wasn't, it ain't ever gonna be.
Forget about RAID. (Score:3, Informative)
> I have decided that only my data is worth saving
Then forget about RAID. RAID is designed to protect the integrity of the underlying volume - NOT the data that's on it.
> Of course, such a setup should secure my data; should a drive crash,
Then forget about RAID. RAID will only secure your data under some very specific cases of hardware failure of the drive. It does absolutely nothing towards preventing data loss due to (say) a corrupt file allocation table, virus, accidental deletion, or corruption.
> Even more importantly, I want any drive and its data to be as safe and portable as possible
Then use proper backups - not RAID. Preferably off-site backup. I use Carbonite which backs up to the 'cloud' at minimal cost.
By all means use RAID to protect you from hard disk failure, but don't under any circumstances assume it stops you losing your data. For backups, I always use the rule that at any given point in time, assume that the next time you walk back into your house/office, that NOTHING in that building is still there. Do you have a copy of everything you care about somewhere else?
I'm still amazed by people that carry 12 months of work around on a single floppy disk/USB stick/laptop, then cry when they go to the helpdesk asking what "sector not found reading drive A:" means, or perhaps "A USB device attached to the system is not functioning".
Get your data in as many places as possible - preferably three. A drive which is mounted one inch above the main one is *NOT* a valid second place!
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
"That's nice, but the submitter is asking about RAID 1."
I think he's asking the wrong question anyway.
"Based on past experiences, I have decided that only my data is worth saving"
See? He is asking for backup, not RAID. It has been said one thousand times but it seems it must be said again: RAID is *NOT* in order to protect your data. NOT, NOT, NOT and then NOT again.
RAID (not talking about RAID-0) is there in order to enhance your data's avaliability (as in, say, instead of being able to get to my data 99% of the time, I can get to it 99,9%) but when it's hosed, it's hosed. To protect your data you need backups, not RAID.
"Of course, such a setup should secure my data"
Of course not. Of course you will get quite a funny face when you discover it. Quite more or less the one that had the guy from this story, about six months ago, with the very enlightning title "Why Mirroring Is Not a Backup Solution": http://hardware.slashdot.org/article.pl?sid=09/01/02/1546214 [slashdot.org]
"Even more importantly, I want any drive and its data to be as safe and portable as possible"
Then, *even* if RAID could be considered for data security (which is not) you already answered your question: as a general matter, hardware RAID will only work when using exactly the same controller model, possibly up to its minor revision. You can't count to break a hardware-managed mirror, take one disk to a standard SATA controller and get any data out of it. If your controller dies and miracolously doesn't take the disks with it you can't count on buying a different RAID card (as it will most probably be in about a year for consumer-grade hardware) and get any data out of the mirror. So you should go with software RAID.
AND TAKE BACKUPS.
Re: (Score:3, Informative)
Re: (Score:3, Interesting)
Yes, Mod parent up. RAID is for providing data resiliency, not data protection. In corporations where either very large data sets would simply take too LONG to restore, or where spindle count for acheiving IOPS is critical, RAID permits a reduction in failure rate. However, is it NOT a replacement for backups, and RAID should generally only be considdered when you;re already considering a multiple disk setup (either for capacity or performance reasons).
A RAID1 setup on your home computer may increase you
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Why RAID is not a backup:
1) not fireproof.
2) not mistake proof "oops, didn;t mean to delete that"
3) not immune to file system corruption.
4) not immune to power supply failure/surge/lightning/other destructive forces
5) more expensive than a good backup
6) not protable offsite
7) does not track versionb history or old files (something that should be of critical importance to a programmer...
8) Viruses, mailware, hackers oh my!
9) bad/corrupt install
10) OS failure
I could easily go on. I worked in DR for 4 years...
Nearly all of the above have a higher frequency of occurance over a 5 year typical HDD life. Even if you continually replace drives without a data failure, you're still eventually going to have an issue RAID can not deal with.
My Qnap was a $399 device. The 4 drives in it were $90 each (and the 5th spare too). The HDDs I run the PC off on the RAID 1/0 were $40 each. I only run the RAID 1/0 for performance during video editing. I chose 1/0 vs 1 since 1 halves the reliabiltiy of the drives. Even though I do have a good recovery solution, the downtime, nor the effort involved in recovery, would be welcome, and the extra $80 to mirror the performance stripe was easily spent.
The Qnap is also my iTunes media server, my FTP server, included the price of the DR software, and runs 2 IP cameras I set up at home too (which let me tell the insurance company I have real-time video monitoring, and they knowcked an extra 5% of my homewoners policy cost, which by itself is enough to fund replacement drives as I'll need them).... Oh, yea, and it's a NAS too... It has a lot of value beyond a backup system.
I'm guessing you've not got a child yet, or a large family. You probably don;t value to pictures you take, files you have, and other stuff on your PC. That's fine, someday you likely will.
There are cheaper ways than mine to do backups. I have over a TB, and 3 (currelty, soon to add 2 Macs to the list an decom 1 old laptop leaving me with 4) computers I'm backing up, so centrally makes sense. If you have 1-2 machines, a small amount of data, and don't value most of it, then 2 external USB drives and a safety deposit box (Dad's house) usually suffice... Or, just an online backup account for $5 a month...
RAID 1 might save you from a firmware failure, or a disk going bad, but that's about it... Also, RAID 1 may be cheap, but a backup is cheaper. Also, good luck rebuilding that RAID if your MOTHERBOARD fails... RAIDs are proprietary to a particular controller. Unless your new board usues the same chipset (and firmware too in most cases) you;re screwed without a backup.
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
And what allot of people don't realize is if you build a RAID array and a drive fails can you replace the drive with the exact make and model? Raids work best when every disk in the array is the same model and revision. If you plan to build a 5 disk raid array you should also purchase a 6th drive to keep as a cold spare.
I built a RAID 5 array using three 500GB disks via mdadm under Linux. I assembled the array and formatted it. Within minutes of testing I was getting mail from mdadm telling me the array was degraded. I then began to test each disk for defects and lo and behold one disk was bad right from the start. I tried to RMA the disk but newegg had informed me those disks were now obsolete. Great. I was credited for the bad disk and purchased a new one that closely matched the other two. It was a nightmare as during some boots the disks went haywire and I would get a "Could not bd_claim sdaX" And it would hang for a while and I would have no array. It happend once in a rare while until it became a real problem. I kept my most precious data safely backed up on different disks I had spread around. It finally got so bad that I would have to constantly reboot the machine for up to ten times before the disks were synced up and the array worked. I purchased a 1TB disk and copied all the data off the array to it and used the 500gb disks in other systems. RAID is great for big fat storage arrays but it can become very sensitive and then one day POOF its all gone.
This is the reason OEM drives from Dell, Apple, HP etc. Cost four times what a retail drive would cost. The cost is no way associated with quality but rather consistency. Retail SATA drives are constantly changing: less/more platters, faster seek and read speeds and firmware revisions. Those costly OEM drives are the same disk every time right down to the inner workings and firmware. So if you buy an Apple 1TB disk on a sled and it takes a dump in three years you can be confident Apple will replace that drive with the EXACT same one. Its not a magical Apple disk of superior quality but a Maxtor/WD/Hitachi disk that is produced for Apple with no revision changes unless Apple orders it. Unlike retail drives which are changed at the manufactures whim.
So if you are building your own raid plan for failures and try to buy a spare for your array. I don't know disk shelf live but it will save you down the line. Also keep a USB or 1394 disk around for backups. Spread your most precious data around like pictures home movies and documents. If you have a few computers around the house keep a mirror of that data one those machines. Music, and downloaded video can be re downloaded but home movies and pictures cannot. Put all the silly stuff on the raid along with the precious stuff for access but keep backups of the good stuff!
Buy more drives! (Score:3, Insightful)
Simple answer: buy more drives. Put the extras in the closet. By the time you run out of spares, it will be time to move on to new drives.
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Whoa, hold the boat. I've had a lot of experience with Dell & HP/Compaq(Proliant) provided RAID systems and they are not sensitive to disks with vastly different innards. All that matters is block count and software mirroring doesn't even care about that, because you'll simply be limited to the size of the smaller disk. If you're using mirroring or RAID, try to go with different makes of the same size. This article [ssdirect.com] talks about MTBF. It turns out if 2 drives of the same exact model comes off the line and end up in your PC, there is a chance they could fail within a very close time to one another. So your mirror or RAID could fail permanently while rebuilding from the first failure. But if all your drives are of a different make, chances are they won't fail at the same time and you'll get the critical time needed to rebuild your array.
When I'm going to do mirroring or RAID on hardware that doesn't have high-end dedicated server RAID controller, I use Windows or Linux software RAID. Performance is surprisingly good and I'm not married to a specific hardware implementation. I've had _none_ of the issues you've described with Linux software RAID on several servers for several years. Mdadm has only whined after a power outage or genuine disk failure.
Re:Just remember the first rule of RAID 0 (Score:5, Insightful)
Raids work best when every disk in the array is the same model and revision. If you plan to build a 5 disk raid array you should also purchase a 6th drive to keep as a cold spare.
I hate to break it to you, but you're actually wrong.
A RAID array is most effective using completely different drives, but of the same capacity. Five hard disks from the same manufacturer, of the same model, bought at the same time means that you're highly likely to get five drives from the same batch. Let's posit that there was some defect in this batch. Now all five of your drives have a significantly higher probability of failing at the same time. Oops! RAID can only deal with one (or two) drive failures!
Using drives from different manufacturers or model lines means you spread the risk of simultaneous drive failure.
Re:Just remember the first rule of RAID 0 (Score:4, Informative)
Do you have any evidence for this claim?
A typical RAID implementation writes stripes at a time, by issuing a series of writes to each drive. If your disks have the same geometry, then each write will be at the same physical location on each drive and so complete in almost exactly the same time. If they are not, then the different disks will be moving their heads at different times. The RAID controller (hardware or software) will then be bottlenecked by the slowest drive. To make things worse, the slowest drive can be different for each write. One write may require moving the head sideways on one disk, the next may require moving the head sideways on the other. In both cases, you are limited by the worst-case performance for the disk. The same is true for reads on RAID-5, but not RAID-1, which can just use the result for whichever disk returns first.
Re: (Score:3, Interesting)
You may have noticed that some hard drives are marketed as being designed for RAID use. These work slightly differently to most consumer disks. Typically, a small region of a disk is hidden. If the disk discovers a bad sector then it will use one from the hidden region to replace it, so every write to the bad sector goes to one of the spare ones instead. This is very bad for RAID, because two drives writing to the same sector may be writing to two different physical locati
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
All modern disks remap sectors as necessary. The main difference between consumer and RAID drives is the timeout for error correction [wikipedia.org].
Re:Just remember the first rule of RAID 0 (Score:4, Informative)
Is it worth keeping a spare for the sole purpose of having the same model available in the event of a failure when you can get a newer and faster drive in the future?
I would say not, but when one drive fails you should replace all of them. For a home array, expect one drive to fail every few years. I had a disk in RAID-1 array fail last year. It was a 40GB disk which cost around £100 new. For the same price, I can buy two 500GB+ disks now.
Is the difference in performance between modern SATA drives so significant?
It's not a question of performance, it's a question of the difference between a linear access and a seek. The time for a seek is 4ms+. If a drive can read 50MB/s then a linear access is around 10 microseconds. If your one disk is doing a linear access while the other is doing a seek then you are limited by the time of the seek (for RAID-1 writes and RAID-5 reads and writes). If you have to seek after every block, your maximum throughput is 125KB/s. If you do a linear read, your throughput is 50MB/s. If your drives have different geometries, you double the number of seeks you are needing, dramatically reducing your throughput.
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Actually, it depends on the reliability. 95% reliability becomes 90.25% reliability. 50% reliability becomes 25% reliability. 1% reliability becomes 0.01% reliability.
So if your drives are very reliable, it's very slightly less than twice the failure rate. If your drives are not reliable, then it asymptotically approaches an infinitely greater risk of failure.
Statistically speaking. :)
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Re: (Score:2)
Re: (Score:3, Informative)
Fakeraid is software too.
Get a real hardware RAID controller, or don't use RAID. Windows SW RAID or a Fakeraid controller is just plain stupid.
Re:Be Careful (Score:5, Informative)
What RAID is good for:
Better to just throw a disk in an old machine and back up to it regularly.
Re: (Score:2)
FAT32? You are fucking lame!
There, fixed it for you.