Which RAID Settings Should I Use For My NAS?

RAID 1 (Mirror)

A NAS (Network Attached Storage) puts storage onto your network, where it can be accessed by many computers. They often have more than one hard drive which can allow you to have automated copies of your data, which is known as RAID. A common type of RAID found on NAS devices is RAID 1, which will make two hard drives into a mirror copy of one another. Some manufacturers call RAID 1 Safe mode. If you have a NAS with two 1TB hard drives and set them to RAID 1 mirroring, instead of 2TB of storage (1TB x2) you only get 1TB. Everything you store to the NAS gets saved to both drives automatically. The theory is that if one of the drives fail, you can access all of the data from the other one. In practice that is not always the case. More on that later.

RAID 1 Mirror Animation

RAID 1 Mirror Animation

RAID 0 (Stripe)

Another common NAS option is RAID 0. The “R” in RAID stands for redundant, however there is no redundancy in RAID 0, so it’s not a real RAID type. If you setup the same two 1TB disks as RAID 0, you will get a 2TB volume to store your data on. The problem is that every single file you write to the NAS will be split into tiny pieces and distributed across both drives. If one drive fails, you not only lose the data from that failed drive, but also from the non-failed drive as it only contains half the pieces of each file. RAID 0 should never be used for long term storage, but can be fast so is often used for video editing.

RAID 0 Stripe

RAID 0 Stripe

So that’s the hardware taken care of. What other things should you look out for when choosing a NAS?

Another problem with most NAS devices is the non-standard filesystems they use to store the data on the disks. If the NAS itself fails, you cannot usually read the disks by attaching them to a standard PC. So even in RAID 1 mirror mode, you could end up with no usable copies of your data. Most NAS drives run a simplified version of Linux, but only some of them use standard Linux filesystems like ext2/3/4.

Backup my backup?

Some NAS drives have a USB port to allow you to backup the data to an external hard drive. This is great, as long as you can access the backup data on a regular PC, and it doesn’t need to go through the NAS. You can imagine why that would be a problem.

To summarise, NAS drives can be a great way to upgrade your home or small office storage. They can allow collaboration and sharing of files between users, and should simplify your backup process. Just remember that a NAS is a small server that needs to be backed up as a matter of urgency. As long as you have that covered then a NAS can be a smart addition to your network.

Format Your Drive For A Mac

Plenty of shops will sell you a “Mac Hard Drive” but there is no reason why you cannot use a windows format drive on a Mac. You just need to format it first. There was once a time when a drive was specially formatted by Apple to use on their Macs, but these days Apple use the same hard drives as everyone else. To use with the latest versions of OS X I would recommend following the steps below.

NOTE: Formatting your drive will destroy all the data. Make sure there’s nothing on there you need.

1. First attach the drive to your Mac. The Mac will notify you with a small finder window to initialise the drive. See below.

Format Your Drive For A Mac

Initialize Your Drive For A Mac

2. Once you have clicked initialize you will see the Disk Utility Application window. See below.

Format Your Drive For A Mac

Mac Disk Utility

3. You need to select the drive you want to format in the left hand window of the utility as highlighted in blue. Internal drives show as grey and external drives show as yellow. At this point make sure you choose the correct drive, the utility will not allow you to format the internal boot drive. See below.

Format Your Drive For A Mac

Drive selected

4. Now choose the Partition Tab. See below.

Format Your Drive For A Mac

Partition Tab

5. Now click on the Partition Layout drop down bar and choose the first option “1 Partition”. Also to the right under Partition Information give your drive a name and below that choose the partition type you want which will be Mac OS Extended ( journaled ). We are nearly there. You now need to click on the options tab in the bottom left of the utility window and choose GUID Partition Table and click okay. As you will read in the text information, this allows the drive to be used with all current OS X Macs. See Below.

Format Your Drive For A Mac

Options Tab

7. Now all you need to do is click the apply button as shown in red below.

Format Your Drive For A Mac

Apply

8. Another window will appear asking for confirmation to partition the drive. Click partition. See below.

Format Your Drive For A Mac

Partition Drive

9. A formatting window with a progress bar will now appear and then disappear when done. You will now see your named drive in the left window, which means that your drive is now formatted. Close the disk utility and the hard drive is ready to use. See below.

Format Your Drive For a Mac

Hard Drive Formatted

 

How a Data Recovery Engineer Sees RAID

As somebody recovering data from RAID arrays, my view on them is a little different to the norm. In most cases I would say avoid RAID wherever possible. Simplicity is key.

Below are my answers to some real questions I have received from clients about RAIDs.

Why did this RAID disk fail?

Hard drive failure is not unusual and is often not avoidable. The truth is that all hard drives fail eventually, whether they are used in a RAID or not. Even though a RAID system can provide some fault tolerance from physical drive failure, they do have limits. A RAID5 on three disks for example can only handle a single drive failure at any one time. It is common for a second disk to fail whilst the other disk is being replaced. This is when RAID recovery is required; to first access the failed drives, and then rebuild the RAID. The best protection against RAID failure is to make backups. Backups in as many formats, in as many different physical locations as possible.

Why did the server fail so badly? Isn’t RAID meant to prevent this?

A 3-disk RAID5 can only cope with one bad disk. This doesn’t help when two drives fail at the same time. Although a RAID array can provide some leeway when it comes to disk failures, it doesn’t always help when you have multiple failures in quick succession. Adding more disks to the RAID can provide more redundancy, however this costs more money, and also adds complexity when things go wrong. Also you could be in a similar position if three disks happen to fail next time. A live system could fail at any time so prepare for the worst. Backups are cheap, and take a relatively short amount of time. RAID recovery can be expensive and cause unnecessary downtime.

Why couldn’t our IT support recover this?

We are a specialist data recovery company, with access to tools and resources which are not available to IT Support staff. We have spent the last fifteen years perfecting the process of extracting data from failed & failing hard drives and RAID arrays. For the best chance of recovery, we like to get the drives as soon after failure as possible. If more work gets carried out on the drives, things can be made much worse.

How can we avoid this happening again in the future?

To avoid similar problems in the future, the best way forward is some form of regular backup. The backups should be verified and then tested / restored as often as possible. This is where disaster recovery comes in, which can involve simulating certain types of failure and making sure you can get up and running again from your backups. At the very least, it wouldn’t hurt to put the really crucial business files onto an external hard drive every few weeks and store it in your company safe. It’s low-tech but at least you could plug it in to any PC and access the important business data if required as a last resort.

I’m not against RAIDs. They do have their place, but cannot be relied upon as a replacement for regular backups.

We have more articles about RAID here.

Why RAID Can Be Bad For Business

RAID is often touted as the silver bullet in data storage. Increased storage capacity, resistance from hardware failures and improved performance. While these are all valid upsides to a RAID setup, there are also a few downsides which need to be addressed.

1. Extra Storage.

RAID can allow for a huge pool of storage, but with that storage comes great responsibility. You should factor in at least enough capacity to backup the RAID data somewhere else. If you can only afford 8TB of storage then you should only use 4TB for data and the other 4TB to back it up; Preferably on another machine / standalone system.

2. Redundancy.

The first letter in RAID stands for redundancy. This means you can afford to lose a certain number of disks without losing access to your data.  This also means that if you have a disk failure you need to get it replaced immediately, otherwise you’re running without redundancy.

3. Downtime.

Nobody likes downtime. If your 16TB RAID array goes offline without a backup then you have a couple of options. One option is to attempt to get the RAID back online by replacing disks, rebuilding the array etc, but this is risky. If this is your only copy of the data then rebuilding / reformatting the RAID could corrupt the data beyond recovery. Don’t do this if you don’t have a backup to fall back on.

The second and preferable option is to get the RAID professionally recovered. When we receive a RAID, the first thing we do is make images of all disks. This allows us to work on the RAID without risk. Then we use a read-only process to extract the data onto another form of storage. This is where downtime comes in. Unless you go for an emergency process, you could have to make do without the data for a number of days.

So What’s The Way Forward?

It’s one word. Redundancy.

Whatever you do, make sure your data is replicated across as many types of storage as possible. In an ideal world you would have a duplicate system running alongside the live system, which can take over if anything goes wrong. Then have the data on another type of storage, which you can access from somewhere else. Imagine if the RAID controller failed, and you could only access the data from that one machine.

It doesn’t matter how many backups you have if they all require the same system to access them.

I’ve only just scratched the surface here, but you should always look to make extra copies of your data. It may seem redundant now, but when your server fails containing all your data, all your accounts, all your client details and your website, you’ll be glad you kept that extra copy.

ST373454LC Unusual RAID drives

ST373454LC Unusual RAID Drive

ST373454LC Unusual RAID Drive

We have recently recovered a RAID 5 array which consisted of three of these ST373454LC SCSI hard drives. These are solid, weighty drives, which don’t give off a great deal of vibration, despite spinning at 15,000 rpm; 3 times faster than most laptop hard drives!

Upon opening one of the drives for cleanroom rework we discovered why these drives  spin so quietly. In the picture below you can see that although the drives are standard 3.5″ form factor, they actually have 2.5″ disk platters. These smaller disks create less drag, and therefore can spin faster without stability problems.

Inside a ST373454LC Hard Drive

Inside a ST373454LC Hard Drive

These drives are not alone in mixing up the form factors. The popular WD Raptor drives also use a similar design.

Downside?

Of course the biggest downside to using smaller disks is the lower storage capacity. Typically SCSI hard drives are much lower capacity than their SATA counterparts, so this trade-off is acceptable for the speed and reliability increases. The relatively low capacity is further mitigated when the drives are used in RAID arrays.

Archiving Old Data

Data Recovery Software - Do Not Erase!

Data Recovery Software - Do Not Erase!

I recently read a brilliant article about the guy that wrote the original Prince of Persia game for the Apple II in the 80′s. He had long since lost the original source code, until an old box of floppy disks was uncovered in his father’s apartment.

Sensibly, he enlisted some experts to help with the data extraction, and after a day of collaboration was able to release the source code online.

This got me thinking. Although I am meticulous with my backups of current data, I still have boxes of old software on floppy disks, which are happily degrading as we speak. John even found an old 212.6MB hard drive with some vintage data recovery software on it. Now this stuff isn’t always useful, but occasionally a really old drive comes down to us, and it is only this old software that can do the job.

As a result, John and I have started a project to get all of our old data recovery software from floppy disks and hard drives, and back it up to our file server. The 212.6 MB hard drive in the picture had 128MB of old DOS recovery software, which would easily fit on my mobile phone. Who knows when we might need it, but we now have it available when the need arises.

Bigger Hard Drives of the Future

The Register has today posted two articles about the ongoing battle to expand hard drive capacities.

First is an actual device for sale, a 2TB Western Digital portable drive. This drive has a fancy new case and USB3 connection. It contains backup software and also the option to encrypt the data with a password. I wonder if it encrypts the data by default like some of their previous portables. (A bad thing!)

Second is a futuristic announcement from Seagate about their new HAMR technology. This new tech uses a laser to heat part of the disk before magnetising it. This apparently allows for much higher densities, theoretically paving the way for 60TB hard drives. There doesn’t appear to be any products using this technology at the moment.

60TB drives will be fantastic for backups, but horrible to backup without a new, faster form of connection. These would take almost forever (exaggeration) to fill up by SATA.

This news helps prove that hard drives are far from dead. It will take a long time until SSDs can cope with such massive capacities, at a similar cost to these beasts.

4K Sectors & RAID

ZDNet has an interesting article about 4K sectors (Advanced Format), however what was more interesting was the bit about large SATA drives.

 Today’s large SATA drives shouldn’t be used in 4 drive RAID 5 arrays due to the high likelihood of a read error after a drive failure, which will abort the RAID rebuild.

It is a common misconception that if you run a RAID system then you can avoid keeping backups. Although fault tolerant to a point, there are plenty of issues with RAIDs that can at best cause lengthy downtime and at worst prevent any recovery at all.

More info on RAID Recovery