NicApicella writes "My new system has two sparklin' SATA drives which I would like to mirror. After having been burned by a not-so-cheap, dedicated RAID controller, I have been pointed to software RAID solutions. I now stand in front of two choices for setting up my RAID: a Windows 7 RC software RAID or a hardware RAID done by the cheap integrated RAID controller of my motherboard. Based on past experiences, I have decided that only my data is worth saving — that's why the RAID should mirror two disks (FAT32) that are not the boot disk (i.e. do not contain an OS or any fancy stuff). Of course, such a setup should secure my data; should a drive crash, I want the system up and running in no time. Even more importantly, I want any drive and its data to be as safe and portable as possible (that's the reason for choosing FAT32), even if the OS or the controller screw up big time. So, which should I choose? Who should I trust more, Microsoft's Windows 7 or possibly the cheapest RAID controller on the market? Are there other cheap solutions?"
Have some FUD with your RAID (Score:5, Funny)
Do you really want to trust Windows with your data?
You are asking the wrong question. (Score:5, Insightful)
RAID is only marginally valuable. In my experience, for all but the most carefully controlled environments, RAID simply adds complexity, the number of things to go wrong increases, along with the likelyhood of lost data. Do it only if you want the *experience* of running RAID, but don't count on RAID to "save your data".
I've worked as a system administrator for more than a decade, in medium-large scale deployments with good success, (think: servicing thousands of users, hundreds of domain names, tens of thousands of email addresses, etc) so I think I have some useful experience you can benefit from.
IMHO, you most likely to lose data from the following things (in order)
1) Aw sh1tz. "I didn't mean to delete that folder"... or "Whoops! I formatted the wrong drive", "I saved the wrong version of the file!", whatever. Although I *myself* don't have this happen often, it does happen. And even in my case I've lost about as much useful information this way as by drives dying. Users delete stuff all the time, and it's usually my job to bring it back, which is why I perform redundant, historical backups EVERY SINGLE DAY.
2) Malware. Don't minimize this - it's real, and it's why I reply to Parent. You are more likely to lose information from a virus/worm/malware and/or b0rked install of something that hoses your filesystem than by a hard disk crash given stable hardware.
3) Bugs. Filesystems have bugs. So do applications, utilities, anything with software. Strange, unexpected conditions, often caused by bugs in applications can cause data to "disappear", files to get corrupted, filesystems to get corrupted, folders to be incompletely written, etc. This is about as likely to cause lost data as:
4) Hardware failure. This is one of the lowest orders of lost data, although when it happens, it can be one of the most extreme.
Let me say this: RAID 1/5 only PARTIALLY protects you from the last one. Actual, bona-fide backups protect you from all of these. If you care about the data, get backups. If you care about uptimes at great expense, RAID *may* be worth it.
My advice is something most people don't want to hear: for personal use, get backups online for $5/month. Mozy/Carbonite/etc. There are zillion vendors, just Google it. In two years, it will cost you about as much as that 2nd hard drive. It protects you far better than that 2nd hard drive, and it's so automatic that you'll hardly notice it until the moment it actually matters: when you just have discovered that your data is gone.
Parent
Re:You are asking the wrong question. (Score:5, Insightful)
Your 4 points are correct. However, the reason for using RAID is NOT as a backup. RAID != Backup.
RAID is for redundancy and performance increases.
I had a drive die in my NAS a few weeks ago. It took 5 minutes to walk to the server room and plug in a new drive. There's no added complexity for the sysadmin, everything is done automagically by the RAID controller. Losing a server or data for hours while the drive is restored from tape is more expensive and complex.
Parent
Re:You are asking the wrong question. (Score:5, Interesting)
Heh. Linux software RAID doesn't do jack. I've looked at the source code. The mdadm RAID1 driver just alternates drives for reads whenever the requests are not contiguous. That is all. Nothing more. There's no intelligence in there. No keeping track of head positions, no attempts to discover or infer physical drive geometry. Nothing. Just a simple round-robbin. It just so happens that for MOST things that involve random access, the effective throughput is nearly doubled. More intelligence wouldn't actually buy you much in the general case, so why bother?
Also, the dmraid (fakeraid) RAID1 driver only does reads from one disk. I made the mistake of using dmraid instead of mdraid, only to discover through performance tests and iostat that there are basically two software RAID drivers that CLAIM to do identical things but in fact do not.
Parent
Are you crazy? (Score:5, Funny)
Do you work for one of these online backup places?
I would sooner trust a WD drive with my valuable data.
Parent
Re: (Score:3, Interesting)
A periodic rsync copy (with history) is much better for keeping your data, accidental deletion or overwritten partition tables happen very easily, no RIAD system will save your 'ass'.
Preferable you keep the copy on an other machine, different UPS or surge protector and not in the same machine, hanging on the same PSU. Even better is to copy it to a remote place.
With current bandwidth 'limits', it's possible a good idea to keep it somewhere else.
Re: (Score:3, Insightful)
No, I don't work for any of these. And I do maintain my own backup set, because I backup TBs of data daily. But for personal use, the online vendors are the best bet.
Re:Are you crazy? (Score:5, Interesting)
I'm still somewhat astonished that WD would think that it's acceptable to have external drives that work on OSes other than Win except for the power management features. Saying you're just supporting Win for a hard disk is nowhere near acceptable.
Personally, what I do at home is I use ZFS to mirror a pair of 1tb Seagate drives and that seems to work fine, it's not really the best set up, but it's hard to get such things located off site for the amount of money I have to spend.
Parent
Re: (Score:3, Insightful)
My advice is something most people don't want to hear: for personal use, get backups online for $5/month. Mozy/Carbonite/etc. There are zillion vendors, just Google it. In two years, it will cost you about as much as that 2nd hard drive. It protects you far better than that 2nd hard drive, and it's so automatic that you'll hardly notice it until the moment it actually matters: when you just have discovered that your data is gone.
And is so slow that a LS120 drive reading a 1.44MB floppy would actually be faster. Or a 1x CROM. Or a 16 year old hard drive.
Also, I have to trust that the service and my internet connection will be available when I need to restore my data.
Or I can use RAID... and tapes.
Re:You are asking the wrong question. (Score:5, Interesting)
Ive had #4 happen to me. A power supply in my computer failed (a name brand one, not a cheap no name brand) and damaged everything attached to one of the 12v rails. This included both drives of a raid1 set. (ironically all my drives that wernt part of a raid set were completely undamaged) I was later able to recover the data from both drives (both had damaged sections but different areas were damaged on the 2 drives allowing for a complete recovery between the 2 of them)however it goes to show that just having a raid array wont completely protect you from hardware failures.
Parent
Re:You are asking the wrong question. (Score:5, Interesting)
That's an interesting thing... the power supply has more potential to cause damage than anything else in the PC, but nobody ever thinks about protecting against its failure. Makes me wonder why we don't have surge protectors on the 5/12V rails as standard yet.
Parent
Re: online backups (Score:5, Insightful)
Personally, I haven't yet encountered anyone who really got benefit from those personal Internet backup services like Mozy. In regular use, it always seems like the person exceeds their storage allotment or Internet connectivity issues prevent them from recovering what they need, when they need it.
I tend to recommend people buy an inexpensive external USB or firewire drive, leave it attached and assigned as a backup device, and have some software package run a daily backup of all the relevant folders and files they might need to save.
It's great that your data is stored offline and off-site ... but I'm just not sold on most of the implementations for "home use" being as great a solution as they first appear to be. Many of the providers have come and gone over the years, too. What happens when your offline backup company goes under?
Parent
Re: (Score:3, Informative)
Having run RAID quite a bit myself one must remember having all your drives in one box is always an invitation for trouble since hardware failures on a higher order will likely hit all the drives.
If you want to do online backup get DSL instead of cable internet for the faster upload bandwidth.
Get a backup service with versioning. That way if you or a virus delete something it just doesn't sync the deletion to your backup.
I personally use JungleDisk which uses Amazon S3 storage. You can set the versioning co
Re:You are asking the wrong question. (Score:5, Informative)
Having run RAID quite a bit myself one must remember having all your drives in one box is always an invitation for trouble since hardware failures on a higher order will likely hit all the drives.
Not to mention the temptation to use _Identical_ disks in your redundant array... I've had a RAID1 pair fail totally when both drives died within 24 hours of each other because of a firmware bug. This happens a lot more than most people think. Statistical analysis of the reliability of RAID _always_ assumes failures arrive independently of each other, but a large proportion of failures are caused not by random events but by external circumstances and therefore happen either simultaneously or nearly simultaneously.
Parent
Re:You are asking the wrong question. (Score:5, Insightful)
Parent
Re:You are asking the wrong question. (Score:4, Insightful)
Parent
Seriously? (Score:5, Insightful)
You're posting to Slashdot asking whether you should trust Microsoft?
Really?
Re:Seriously? (Score:5, Insightful)
The article smacks of false dichotomy. There are a number of solutions, not just Windows 7 or a hardware RAID controller.
To begin with, every NT-lineage Windows version ever produced supports software RAID out of the box. Add that to the fact that any major Linux distro today supports software RAID. And so do the *BSDs. And Mac OS X. And Solaris. And probably a bunch of other platforms I can't think of right now.
Hell, you could buy one of these one of these [linksysbycisco.com] and throw the drives in it, connect it to your network switch, and presto -- instant RAID+NAS.
I think we would all like to know why you think Windows 7 is your only option, because if that's what you think, you don't know how mistaken you are.
Parent
Re: (Score:3, Insightful)
The article smacks of false dichotomy. There are a number of solutions, not just Windows 7 or a hardware RAID controller.
Agreed.
As I see it, if you want guaranteed repairability then you basically have two options: enterprise-class hardware with a support contract (and price tag to match), or an Open Source software solution.
Put another way, either you pay someone to take responsibility for fixing it, or you take responsibility yourself. A Microsoft solution doesn't give you enough control to take full responsibility, because you can't be certain that it will be legally or technically possible to recreate your current setup
FAT??? (Score:5, Insightful)
You data is most important and you plan to use FAT? Good luck with that!
Seriously, though. No RAID solution that is not totally S/W is portable. But do you really need RAID? It sounds like what you need is a good backup solution with frequent backups. Does you data change so much that losing one day's worth of data would be a problem?
Re:FAT??? (Score:4, Insightful)
If all you want is multiple copies of your data, then really what you want is an automated incremental backup system, that copies your files to a second hard drive, and ideally keeps a few older copies so that if a file gets accidentally deleted or somehow corrupted, you have a chance to go back and find a usable copy. This is what I do on my system: I keep multiple incremental copies from the last few days/weeks/months. It was easy to set-up (in Linux, mind you). Do hourly syncs if necessary.
Also critical, if the poster is truly concerned about never losing data, is to get some kind of offsite backup. Two hard drives don't do you much good when the computer is stolen or your roof leaks. You need to have a way to regularly copy data offsite (ideally automated over the network, or via external hard drive if you're sufficiently disciplined).
RAID has its uses, to be sure. But if the poster is most worried about never losing important user files, then it seems like what he wants is is the multiple-redundancy of backup, not the immediate failover of RAID.
Parent
Re:FAT??? (Score:5, Informative)
No. NTFS is not perfect, but to think FAT is as bad is deluded. I've honestly never seen a HD formatted with NTFS that I couldn't repair with built-in tools, unless it had physical defects, and in such a case ANY file system would have problems. But I've seen so many FAT drives get hosed by little problems, it's not even funny.
Seriously, don't trust your data to a FAT partition - not worth it.
Parent
RAID != BACKUP (Score:5, Insightful)
Your first problem is Fat32 (Score:5, Informative)
If you want data integrity, use NTFS. Using Fat32 is like saying you want a reliable car, so you're buying a Edsel because they've been around a long time-- it doesn't make sense. Every other OS on earth can read NTFS (if not write it), so it won't affect your portability requirement.
Secondly, before you make any decision regarding Windows 7 RAID, make sure the edition of Windows 7 you want to buy ships with software RAID support before you put all your eggs in that basket-- early betas and RCs of Vista had software RAID enabled, only to have it disabled before release. I've seen no guarantees about Windows 7 software RAID support, and which editions will have it enabled. (If any.)
If you're planning to move to a server OS after Windows 7 expires, I can practically guarantee software RAID will be enabled, but that still doesn't mean you can necessarily upgrade your Windows 7 software RAID array to a Windows Server software RAID array. Do your homework.
RAID is *NOT* backup! (Score:5, Insightful)
Re:RAID is *NOT* backup! (Score:5, Funny)
Are you insane?
Parent
It's *NOT* hardware raid on your motherboard. (Score:5, Informative)
These motherboard "raids" are called fakeraids.
All that it is is that it writes the metadata on the disk in specific format so that you can see the raid volumes via BIOS. Note: Only "see" their status - in case you replace one drive, the resync is still done by software and you must boot to operating system. One clue is the fact that in Linux the dmraid package uses exactly same driver for accessing fakeraid-mirrored drives and Linux's own software-raids - device mapper just does a bit of magic at init.
However, if faced with choice of Windows-only or motherboard-raid, I'd go with the motherboard-version, because that's at least supported both by Windows and Linux so in case something goes wrong with your Windows installation you can always pop in Knoppix or some other Linux CD for recovery.
Re:It's *NOT* hardware raid on your motherboard. (Score:5, Informative)
DO NOT buy a real RAID card unless you have a pretty good budget for your system, and need the highest performance. The problem with buying a real RAID card is that you need to buy not 1, but 2 or better yet 3 of them, so that you can have spares. If your RAID card dies (and they do, more often than you'd think), the only way you'll be able to access that data, because of the proprietary on-disk storage method used by RAID card vendors, is to have an identical card (with the same firmware version, to be safe). And since hardware is constantly being obsoleted, you need to buy your replacements when you buy your card, not hope they're still available later. It's also a good idea to have spares of the same make and model hard drive, because hardware RAID controllers aren't usually that flexible in allowing you to pair up different sized drives like Linux sofware RAID.
For many purposes, software RAID using Linux is really a much better solution, because the on-disk format is open-source and standardized, so it doesn't matter what hardware you have, you can plug the disks into a different Linux system and you'll be able to read the data with no trouble. The only downside is a slight performance decrease since the CPU has to do all the work, but even then unless the system is heavily loaded, it's still faster than hardware RAID because the hardware RAID cards aren't that fast.
With the giant drives that are now common, I think the best solution, at least for home/desktop systems, is to forget about RAID5/6 altogether and just get a couple of 1-2TB SATA drives and mirror them with software RAID 1 in Linux.
Parent
RAID is never about protection. (Score:4, Insightful)
Re:RAID1 is not fool's gold (Score:3, Informative)
RAID is no substitute for backups (Score:4, Informative)
RAID is no substitute for backups. RAID is very good at propagating errors and problems very quickly, be they software glitches or human errors.
For consumer class storage, weekly / daily backups might be more efficient than investing a lot of effort into live RAID. Since I'm a Mac guy, I see the best answer to this question as Time Machine to a network / USB attached drive -- hourly (configurable for more or less often) differential backups, almost transparent to the user. To my knowledge, Windows has no similar set of software to allow reinstallation to the last hourly backup -- my wife had the misfortune of having to restore a blank drive from her last backup and it was a flawless process that truly left her where she left off less than an hour before the hardware failure. The reinstall wizard just had to ask where the backup was. Casting aside MacOSX advocacy, there is truly no substitute for a good automated backup solution that is regularly tested. I think the best method would use the fewest common components, like a NAS, followed by an external drive with its own power supply. My least favored option would be an internal drive with every single component shared.
Off site backup and test your restores (Score:3, Insightful)
Without regard to if you use software or hardware RAID or the quality of the RAID system, RAID only protects you from a physical disk failure. If you as a user screw up (delete or change something you didn't want to) or if some software bug screws up for you, or if you have a non-disk related hardware failure (causing a data corrupting machine crash) then you have lost your data -- RAID doesn't help.
Even if you are only trying to protect against disk errors, if the RAID system fails (even expensive quality ones can), or if you don't know and follow the recovery procedures EXACTLY, you can lose all your data.
The only reliable solution is making a copy or a "backup". Backup does not mean making a copy of the data on the same machine. (Whatever took out your RAID might also take out the other non-RAID disk or directory that you put your copy on.) If you are paranoid (or just prudent) your backup should not be a mapped or mounted drive on another machine. (Viruses can write to the network as well.)
And finally... Backups only count if you have tested your restore process.
Are sure RAID is what you want? (Score:3, Insightful)
With RAID mirroring, if you overwrite or delete an important file, it's copy on the mirror is immediately overwritten/deleted too, and the file is lost. Wouldn't you rather need a good regular backup?
And as someone pointed out already, FAT is really not a reliable file system. If you are on Windows, use NTFS. It is still portable, having read/write drivers for both Linux and Mac (see this guide [alma.ch]).
Since the files you want to keep safe appear to be regular files, not system files, any simple file copy mechanism could do. For an easy and simple system, you can use the Windows robocopy.exe tool in a batch file. For a more sophisticated system which can keep older file versions, and can easily be adapted for use over the network, you could try a Windows version of rsync like cwrsync. There are also a few rsync GUI frontends for Windows.
If you decide you really want RAID mirroring and go with the hardware solution, my understanding is that you need a replacement controller in case yours breaks. Since your controller seems to be embedded in the motherboard, you would need a replacement motherboard.
With the Windows software RAID, you are dependent on that software, and have portability only between machines with this Windows 7 software RAID (possibly even only this particular version).
Kudos. (Score:5, Funny)
"Safe" FAT32, cheap RAID, RAID implied as backup, Microsoft.
Nice job, you successfully trolled the /. frontpage.
Are we talking Pr0n or Tax Receipts here? (Score:3, Interesting)
As for Motherboard RAID chipsets... Keep in mind that your motherboard has a non-zero probability of frying, having it's caps go bad, being peed on by irate government agents, etc.. I once had a RAID 0 array that was hooked up to one of those things. After the Mobo died I had to do without letters K through P of my Japanese horror-comedy-porno-game-show collection until I was able to find a used computer with the same RAID chipset. (I don't know if it's changed, but at the time each different RAID chipset made RAID 0 arrays that were not compatible with anything else on this lump of rock.) If data portability rather than performance is a priority for you, my advice would be to avoid hardware RAID entirely.
Cheep Non-RAID Controller! (Score:5, Interesting)
Fast facts:
I prefer pure software RAIDs, for a simple reason: They do not depend on available hardware. If one controller dies, switch to another one: Other brand, other type, other drivers, and the RAID still works. If you insist, you can even mix an IDE drive, a USB drive, a SATA drive and a SCSI drive into a single RAID. Try that with a hardware or host RAID. Some people even built RAIDs of floppy disks or USB sticks (not for pemanent use, of course).
My faithful old Linux home server runs two RAIDs, both in software: a RAID-1 for the OS (remember: the BIOS does not know about the RAID), and a RAID-5 for the data. The RAID-1 used to run on old SCA drives, but recently, I switched to two small IDE drives due to unrecoverable SCA cabling problems. The RAID-5 is composed of four IDE drives, connected to two IDE controllers, each disk on a single IDE cable. An external USB disk is used to back up my data, rotating through 10 days. All filesystems are ext3, all disks are monitored using SMART, all RAIDs are monitored. If anything wents wrong, I will get an e-mail from the monitoring software.
Until recently, one of the controllers was an el-cheapo non-RAID controller, and the other one was a donated, expensive, well-known brand, RAID-capable controller running in non-RAID mode. The latter one decided to randomly take some free time on the job, and either disconnected from the PCI bus or disturbed it, causing panics in the OS above. Only pure luck protected me from data loss. I ripped it out of the machine, kicked it into the trash bin, rewired the RAID to use two disks per IDE cable, and verified and reconstructed my data. Some days later, another el-cheapo non-RAID IDE controller arrived, the same brand, model and type that already sat in the next PCI slot. So I rewired the RAID again to work with one disk per cable, everything was fine again.
For a new small business or home server, I would use nearly the same setup again: Two software RAIDs, one for the OS, and one for the data. Upgrading the OS is just fun when you can
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Parent
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
"That's nice, but the submitter is asking about RAID 1."
I think he's asking the wrong question anyway.
"Based on past experiences, I have decided that only my data is worth saving"
See? He is asking for backup, not RAID. It has been said one thousand times but it seems it must be said again: RAID is *NOT* in order to protect your data. NOT, NOT, NOT and then NOT again.
RAID (not talking about RAID-0) is there in order to enhance your data's avaliability (as in, say, instead of being able to get to my data 99% of the time, I can get to it 99,9%) but when it's hosed, it's hosed. To protect your data you need backups, not RAID.
"Of course, such a setup should secure my data"
Of course not. Of course you will get quite a funny face when you discover it. Quite more or less the one that had the guy from this story, about six months ago, with the very enlightning title "Why Mirroring Is Not a Backup Solution": http://hardware.slashdot.org/article.pl?sid=09/01/02/1546214 [slashdot.org]
"Even more importantly, I want any drive and its data to be as safe and portable as possible"
Then, *even* if RAID could be considered for data security (which is not) you already answered your question: as a general matter, hardware RAID will only work when using exactly the same controller model, possibly up to its minor revision. You can't count to break a hardware-managed mirror, take one disk to a standard SATA controller and get any data out of it. If your controller dies and miracolously doesn't take the disks with it you can't count on buying a different RAID card (as it will most probably be in about a year for consumer-grade hardware) and get any data out of the mirror. So you should go with software RAID.
AND TAKE BACKUPS.
Parent
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Why RAID is not a backup:
1) not fireproof.
2) not mistake proof "oops, didn;t mean to delete that"
3) not immune to file system corruption.
4) not immune to power supply failure/surge/lightning/other destructive forces
5) more expensive than a good backup
6) not protable offsite
7) does not track versionb history or old files (something that should be of critical importance to a programmer...
8) Viruses, mailware, hackers oh my!
9) bad/corrupt install
10) OS failure
I could easily go on. I worked in DR for 4 years...
Nearly all of the above have a higher frequency of occurance over a 5 year typical HDD life. Even if you continually replace drives without a data failure, you're still eventually going to have an issue RAID can not deal with.
My Qnap was a $399 device. The 4 drives in it were $90 each (and the 5th spare too). The HDDs I run the PC off on the RAID 1/0 were $40 each. I only run the RAID 1/0 for performance during video editing. I chose 1/0 vs 1 since 1 halves the reliabiltiy of the drives. Even though I do have a good recovery solution, the downtime, nor the effort involved in recovery, would be welcome, and the extra $80 to mirror the performance stripe was easily spent.
The Qnap is also my iTunes media server, my FTP server, included the price of the DR software, and runs 2 IP cameras I set up at home too (which let me tell the insurance company I have real-time video monitoring, and they knowcked an extra 5% of my homewoners policy cost, which by itself is enough to fund replacement drives as I'll need them).... Oh, yea, and it's a NAS too... It has a lot of value beyond a backup system.
I'm guessing you've not got a child yet, or a large family. You probably don;t value to pictures you take, files you have, and other stuff on your PC. That's fine, someday you likely will.
There are cheaper ways than mine to do backups. I have over a TB, and 3 (currelty, soon to add 2 Macs to the list an decom 1 old laptop leaving me with 4) computers I'm backing up, so centrally makes sense. If you have 1-2 machines, a small amount of data, and don't value most of it, then 2 external USB drives and a safety deposit box (Dad's house) usually suffice... Or, just an online backup account for $5 a month...
RAID 1 might save you from a firmware failure, or a disk going bad, but that's about it... Also, RAID 1 may be cheap, but a backup is cheaper. Also, good luck rebuilding that RAID if your MOTHERBOARD fails... RAIDs are proprietary to a particular controller. Unless your new board usues the same chipset (and firmware too in most cases) you;re screwed without a backup.
Parent
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
And what allot of people don't realize is if you build a RAID array and a drive fails can you replace the drive with the exact make and model? Raids work best when every disk in the array is the same model and revision. If you plan to build a 5 disk raid array you should also purchase a 6th drive to keep as a cold spare.
I built a RAID 5 array using three 500GB disks via mdadm under Linux. I assembled the array and formatted it. Within minutes of testing I was getting mail from mdadm telling me the array was degraded. I then began to test each disk for defects and lo and behold one disk was bad right from the start. I tried to RMA the disk but newegg had informed me those disks were now obsolete. Great. I was credited for the bad disk and purchased a new one that closely matched the other two. It was a nightmare as during some boots the disks went haywire and I would get a "Could not bd_claim sdaX" And it would hang for a while and I would have no array. It happend once in a rare while until it became a real problem. I kept my most precious data safely backed up on different disks I had spread around. It finally got so bad that I would have to constantly reboot the machine for up to ten times before the disks were synced up and the array worked. I purchased a 1TB disk and copied all the data off the array to it and used the 500gb disks in other systems. RAID is great for big fat storage arrays but it can become very sensitive and then one day POOF its all gone.
This is the reason OEM drives from Dell, Apple, HP etc. Cost four times what a retail drive would cost. The cost is no way associated with quality but rather consistency. Retail SATA drives are constantly changing: less/more platters, faster seek and read speeds and firmware revisions. Those costly OEM drives are the same disk every time right down to the inner workings and firmware. So if you buy an Apple 1TB disk on a sled and it takes a dump in three years you can be confident Apple will replace that drive with the EXACT same one. Its not a magical Apple disk of superior quality but a Maxtor/WD/Hitachi disk that is produced for Apple with no revision changes unless Apple orders it. Unlike retail drives which are changed at the manufactures whim.
So if you are building your own raid plan for failures and try to buy a spare for your array. I don't know disk shelf live but it will save you down the line. Also keep a USB or 1394 disk around for backups. Spread your most precious data around like pictures home movies and documents. If you have a few computers around the house keep a mirror of that data one those machines. Music, and downloaded video can be re downloaded but home movies and pictures cannot. Put all the silly stuff on the raid along with the precious stuff for access but keep backups of the good stuff!
Parent
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Whoa, hold the boat. I've had a lot of experience with Dell & HP/Compaq(Proliant) provided RAID systems and they are not sensitive to disks with vastly different innards. All that matters is block count and software mirroring doesn't even care about that, because you'll simply be limited to the size of the smaller disk. If you're using mirroring or RAID, try to go with different makes of the same size. This article [ssdirect.com] talks about MTBF. It turns out if 2 drives of the same exact model comes off the line and end up in your PC, there is a chance they could fail within a very close time to one another. So your mirror or RAID could fail permanently while rebuilding from the first failure. But if all your drives are of a different make, chances are they won't fail at the same time and you'll get the critical time needed to rebuild your array.
When I'm going to do mirroring or RAID on hardware that doesn't have high-end dedicated server RAID controller, I use Windows or Linux software RAID. Performance is surprisingly good and I'm not married to a specific hardware implementation. I've had _none_ of the issues you've described with Linux software RAID on several servers for several years. Mdadm has only whined after a power outage or genuine disk failure.
Parent
Re:Just remember the first rule of RAID 0 (Score:5, Insightful)
Raids work best when every disk in the array is the same model and revision. If you plan to build a 5 disk raid array you should also purchase a 6th drive to keep as a cold spare.
I hate to break it to you, but you're actually wrong.
A RAID array is most effective using completely different drives, but of the same capacity. Five hard disks from the same manufacturer, of the same model, bought at the same time means that you're highly likely to get five drives from the same batch. Let's posit that there was some defect in this batch. Now all five of your drives have a significantly higher probability of failing at the same time. Oops! RAID can only deal with one (or two) drive failures!
Using drives from different manufacturers or model lines means you spread the risk of simultaneous drive failure.
Parent
Re:Just remember the first rule of RAID 0 (Score:4, Informative)
Do you have any evidence for this claim?
A typical RAID implementation writes stripes at a time, by issuing a series of writes to each drive. If your disks have the same geometry, then each write will be at the same physical location on each drive and so complete in almost exactly the same time. If they are not, then the different disks will be moving their heads at different times. The RAID controller (hardware or software) will then be bottlenecked by the slowest drive. To make things worse, the slowest drive can be different for each write. One write may require moving the head sideways on one disk, the next may require moving the head sideways on the other. In both cases, you are limited by the worst-case performance for the disk. The same is true for reads on RAID-5, but not RAID-1, which can just use the result for whichever disk returns first.
Parent
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
All modern disks remap sectors as necessary. The main difference between consumer and RAID drives is the timeout for error correction [wikipedia.org].
Parent
Re:Just remember the first rule of RAID 0 (Score:4, Informative)
Is it worth keeping a spare for the sole purpose of having the same model available in the event of a failure when you can get a newer and faster drive in the future?
I would say not, but when one drive fails you should replace all of them. For a home array, expect one drive to fail every few years. I had a disk in RAID-1 array fail last year. It was a 40GB disk which cost around £100 new. For the same price, I can buy two 500GB+ disks now.
Is the difference in performance between modern SATA drives so significant?
It's not a question of performance, it's a question of the difference between a linear access and a seek. The time for a seek is 4ms+. If a drive can read 50MB/s then a linear access is around 10 microseconds. If your one disk is doing a linear access while the other is doing a seek then you are limited by the time of the seek (for RAID-1 writes and RAID-5 reads and writes). If you have to seek after every block, your maximum throughput is 125KB/s. If you do a linear read, your throughput is 50MB/s. If your drives have different geometries, you double the number of seeks you are needing, dramatically reducing your throughput.
Parent
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Actually, it depends on the reliability. 95% reliability becomes 90.25% reliability. 50% reliability becomes 25% reliability. 1% reliability becomes 0.01% reliability.
So if your drives are very reliable, it's very slightly less than twice the failure rate. If your drives are not reliable, then it asymptotically approaches an infinitely greater risk of failure.
Statistically speaking. :)
Parent
Re:Just remember the first rule of RAID 0 (Score:5, Informative)
Parent
Re:Be Careful (Score:5, Informative)
What RAID is good for:
Better to just throw a disk in an old machine and back up to it regularly.
Parent
Re: (Score:3, Informative)
Fakeraid is software too.
Get a real hardware RAID controller, or don't use RAID. Windows SW RAID or a Fakeraid controller is just plain stupid.
Re:Real RAID is cheap (Score:5, Interesting)
Wrong. You need to buy at least two of these controllers, at the same time, or else when your "real" RAID card dies (and they do), you'll lose all your data unless you can find an identical card (you may even need the exact same firmware version).
Software RAID on Linux is a much better solution, as the underlying hardware doesn't matter. You can mix and match different drive models/sizes (can't do that on HW RAID), and swap the drives to a different system and still read them thanks to the standardized on-disk data format.
Parent