Hard Drive Reliability Review for 2015

February 16th, 2016

Backblaze Hard Drive Stats
By the end of 2015, the Backblaze datacenter had 56,224 spinning hard drives containing customer data. These hard drives reside in 1,249 Backblaze Storage Pods. By comparison 2015 began with 39,690 drives running in 882 Storage Pods. We added 65 Petabytes of storage in 2015 give or take a Petabyte or two. Not only was 2015 a year of growth, it was also a year of drive upgrades and replacements. Let’s start with the current state of the hard drives in our datacenter as of the end of 2015 and then dig into the rest later on.

Hard Drive Statistics for 2015

The table below contains the statistics for the 18 different models of hard drives in our datacenter as of 31 December 2015. These are the hard drives used in our Storage Pods to store customer data. The Failure Rates and Confidence Intervals are cumulative from Q2 2013 through Q4 2015. The Drive Count is the number of drives reporting as operational on 31 December 2015.
Hard Drive Relability 2015

During 2015, five drive models were retired and removed from service. These are listed below. The Cumulative Failure Rate is based on data from Q2 2013 through the date when the last drive was removed from service.

Hard Drives Removed 2015

Note that drives retired and removed from service are not marked as “failed” they just stop accumulating drive-hours when they are removed.

Computing Drive Failure Rates

This is a good point to review how we compute our drive failure rates. There are two different ways to do this, either works.

For the first way, there are four things required:

  1. A defined group to observe, in our case a group of drives, usually by model,
  2. a period of observation, typically a year,
  3. the number of drive failures in the defined group over the period of observation, and
  4. the number of hours a group of drives are in operation over the period of observation.

Let’s use the example of 100 drives which over the course of 2015 accumulated a total of 750,000 Power-on-hours based on their SMART 9 RAW values. During 2015, our period of observation, five (5) drives failed. We use the following formula to compute the failure rate:

(100*drive-failures)/(drive-hours/24/365)

(100*5)/(750,000/24/365)

This gives us a 5.84% annual failure rate for 2015.

For the second method, the only change is how we count the time a group of drives is in service. For a given drive we simply count the number of days that drive shows up in our daily log files. Each day a drive is listed it counts as one drive-day for that hard drive. When a drive fails it is removed from the list and its final count is used to compute the total number of drive-days for all the drives being observed.

Drives by Manufacturer

The drives in the datacenter come from 4 manufacturers. The following chart shows the cumulative hard drive failure rates by manufacturer for all drives:
Hard Drive Failures by Manufacturer
Let’s take a look at the “make up” of the drives in our datacenter.

Hard Drive Count 2015 2015-drive-days-piechart

The chart on the left is the total number of drives and the chart on the right is the total number of drive hours for all the data drives by each manufacturer. Notice there are more Seagate drives but the HGST drives have more hours. The HGST drives are older, as such they have more drives hours, but most of our recent drive purchases have been Seagate drives. Case in point, nearly all of the 16,000+ drives purchased in 2015 have been Seagate drives. Of the Seagate drives purchased in 2015, over 85% were 4TB Seagate drives.

Hard Drive Reliability by Drive Size

1TB Hard Drives

We removed the last of our 1TB drives during Q4 and ended the quarter with zero installed. This was done to increase the amount of storage in a 1TB filled Storage Pod as we replaced the 1TB drives with 4TB drives (and sometimes 6TB drives). Now in the same Storage Pod we get four times as much data. The 1TB Western Digital drives performed well with many of the drives exceeding 6 years in service and a handful reaching 7 years before we replaced them. The cumulative annual failure rate was 5.74% in our environment, a solid performance.

We actually didn’t retire these 1TB WD drives – they just changed jobs. We now use many of them to “burn-in” Storage Pods once they are done being assembled. The 1TB size means the process runs quickly, but is still thorough. The burn-in process pounds the drives with reads and writes to exercise all the components of the system. In many ways this is much more taxing on the drives then life in an operational Storage Pod. Once the “burn-in” process is complete, the WD 1TB drives are removed and we put 4- or 6TB drives in the pods for the cushy job of storing customer data. On the other hand, the workhorse 1TB WD drives are returned to the shelf where they dutifully await the next “burn-in” session.

2TB Hard Drives

The Seagate 2TB drives were also removed from service in 2015. While their cumulative failure rate was slightly high at 10.1%, they were removed from service because we only had 225 of those drives and it was easier to upgrade the Storage Pods to 4TB drives than to buy and stock the 2TB Seagate drives.

On the other hand, we still have over 4,500 HGST 2TB drives in operation. Their average age is nearly 5 years (58.6 months) and their cumulative failure rate is a meager 1.55%. At some point we will want to upgrade the 100 Storage Pods they occupy to 4- or 6TB drives, but for now the 2TB HSGT drives are performing very well.

3TB Hard Drives

The last of the 3TB Seagate drives were removed from service in the datacenter during 2015. Below is a chart of all of our 3TB drives that were in our datacenter anytime from April 2013 through the end of Q4 2015.

3TB Drive Review

4TB Hard Drives

As of the end of 2015, 75% of the hard drives in use in our datacenter were 4TB in size. That represents 42,301 drives broken down as follows by manufacturer:

4TB Drive Stats

The cumulative failure rates of the 4TB drives to date are shown below:

4TB Hard Drive Reliability

All of the 4TB drives have acceptable failure rates, but we’ve purchased primarily Seagate drives. Why? The HGST 4TB drives, while showing exceptionally low failure rates, are no longer available having been replaced with higher priced, higher performing models. The readily available and highly competitive price of the Seagate 4TB drives, along with their solid performance and respectable failure rates, have made them our drive of choice.

A relevant observation from our Operations team on the Seagate drives is that they generally signal their impending failure via their SMART stats. Since we monitor several SMART stats, we are often warned of trouble before a pending failure and can take appropriate action. Drive failures from the other manufacturers appear to be less predictable via SMART stats.

6TB Hard Drives

We continued to add 6TB drives over the course of 2015, bringing the total to nearly 2,400 drives (1,882 Seagate, 485 Western Digital.) Below are the Q4 and Cumulative Failure Rates for each of these drives.

6TB Drive Stats

The Seagate 6TB drives are performing very well, even better than the 4TB Seagate drives. So why do we continue to buy 4TB drives when quality 6TB drives are available? Three reasons:

  1. Based on current street prices, the cost per TB of the 4TB drives (0.028/GB) is less that of the 6TB drives (0.044/GB).
  2. The channels we purchase from are not flush with 6TB drives, often limiting sales to 50 or 100 units. There was a time during our drive farming days when we would order 50 drives and be happy, but in 2015 we purchased over 16,000 new drives. The time and effort of purchasing small lots of drives doesn’t make sense when we can purchase 5,000 4TB Seagate drives in one transaction.
  3. The 6TB drives like electricity. The “Average Operating Power” is 9.0W for a Seagate 6TB drive. That is 60% more than the 5.6W used by the 4TB Seagate drives we use. When you have a fixed amount of power per rack, this can be a problem. The easy answer would seem to be to add more electric to the rack, but as anyone who designs datacenters knows it’s not that simple. Today, we mix 6TB filled Storage Pods and 4TB filled Storage Pods in the same rack to optimize both power consumption and the storage space per square foot.

5TB and 8TB Hard Drives

We continue to only have 45 of each of the 5TB Toshiba and 8TB HGST Helium drives. One 8TB HGST drive failed during Q4 of 2015. Over their lifetime the 5TB Toshiba drives have a 2.70% annual failure rate with 1 drive failure and the 8TB HGST drives have a 4.90% annual failure rate with 2 drive failures. In either case, there is not enough data to reach any conclusions about failure rates of these drives in our environment.

Drive Stats Data

Each day we record and store the drive statistics on every drive in our datacenter. This includes the operational status of the drive, along with the SMART statistics reported by each drive. We use the data collected to produce our Drive Stats reviews. We also take this raw data and make it freely available by publishing it on our website. We’ll be uploading the Q4 2015 data in the next few days then you will be able to download the data so you can reproduce our results or you can dig in and see what other observations you can find. Let us know if you find anything interesting.

Andy Klein

Andy Klein

Andy has 20+ years experience in technology marketing. He has shared his expertise in computer security and data backup at the Federal Trade Commission, Rootstech, RSA and over 100 other events. His current passion is to get everyone to back up their data before it's too late.
Andy Klein

Latest posts by Andy Klein (see all)

  • Emmanuel Goldstein

    Always fun to look at the stats, but it’s always from drives that you can’t find anywhere.

  • Andy Klein

    Here’s a link to the 4TB Seagates drives on Amazon: http://www.amazon.com/Seagate-SATA-3-5-Inch-Desktop-ST4000DM000/dp/B00B99JU4S

    I agree some of the older HGST ones in particular are hard to find.

    • Emmanuel Goldstein

      Even when the original Q4 post came out, trying to find any of the HGST models was a pain since they’ve all been EOL’d for a while.

    • http://www.oortcloudcomputing.com/ Tim Wessels

      Well, if the link to AWS is accurate, these Seagate 4TB HDDs spin at 5900 RPM, which may make them more reliable over time. Wear and tear matters and a HDD that spins at a lower number or RPMs may enjoy a longer life. And in this particular application, performance is probably not as important as reliability. And of course, the price is right.

  • Dc120

    Thank you so much for sharing this data!

  • http://www.elite-strategies.com/blog Patrick Coombe

    Definitely going to look into some of these HGST drives. Seems like extremely low failure rate and lots of great stats.

  • Bill McKenzie

    Problem with the 2tb A330 HGST drives on eBay and Amazon is they are all (AFAIK) used drives with thousands of hours on them already. I went through 2 and both died within a month.

    • Andy Klein

      HGST has been moving towards the enterprise space, so they haven’t made the type of drives we use in a while. Sometimes we can still buy some new ones in the channel, but they are hard to find.
      Appreciate you letting folks know about Used Drives, folks should pay attention to that when they buy. We don’t buy used drives, so I can’t shed any light on the reliability of used drives.

  • http://www.srigi.sk/ Igor Hlina

    Can you show us photo of the server – how HDDs are mounted to the rack?

  • noyb

    WD really surprises me, their platter drives were always my goto for reliability but that is truly abysmal reliability for SSD

    • Lastb0isct

      Where are you seeing SSD listed? I’m pretty sure these are all HDD, platter drives.

  • pooloo

    How are these stats pertinent to a typical consumer? Data centers typically have their drives running 99.99% of the time, while the average consumer would not, which is why there is a distinction between consumer and enterprise product. How is this a fair depiction of consumer based product reliability? Shouldn’t you be using drives designed for these particular services, would that not save you money in the long term?

    • Andy Klein

      In our experience there is little difference between Enterprise and Consumer drives for our purpose. In truth all drives fail and we’ve designed our own hardware and software in recognition of this reality – it doesn’t care if the drive costs $150 or $1,500.
      People can decide if this information in valuable to them. They are free to add it other sources of information such as personal experience with hard drives, drive tear-downs and reviews, and manufacturer specs and marketing.

    • Ray Tuholski

      4 9’s is not terribly reliable for enterprise needs…. that would get you fired from certain IT shops. If you read through the BackBlaze blog, they lay out their philosophy regarding their server builds pretty clearly. That philosophy says “Why buy a $400 drive to do the same work that a $100 drive can do?” Nothing like watching your bonus vanish and your CAPEX budget go down the drain buying a bunch of unneeded, expensive hard drives.

  • Mark Scudder

    Andy,

    Thank you for sharing this information as you always do. However, I hate to say it, but it’s not enough. You or somebody else at your company has to focus your efforts elsewhere.

    I am a former Backblaze customer. I was almost entirely happy with your service, but I had to get rid of you and go to CrashPlan, which is a wretched company with terrible software and people who don’t care.

    Why did I go from being a happy Backblaze customer to an unhappy CrashPlan user? Long story short, I had to put a Linux server in my basement. I’m actually pretty poor, not like you guys in San Fransisco where even the homeless get $500 on an ATM card every month. So sometimes if I need something badly enough, I have to assemble it from junk parts and throw Linux on it so it’s at least party reliable. Your software supports Mac and Windows, and does so very well, as I recall. But the information on my Linux server is at least as important as the information on my other computers. I’m sure you’ve heard the old advice that your data should exist in at least three places. As badly as it is implemented, CrashPlan does this.

    The other reason I had to switch is because you couldn’t adapt your online backup software to do the same thing, but to my local computer. Part of the role of that Linux server in my basement is so CrashPlan can back up my computer and my wife’s computer locally, so I can get files back quickly in the event of a system failure, or accidental deletion (we both work with big files, and I’d rather spend $140 on an extra hard drive for my server that $140 every time you need to ship me a hard drive; again, the assumption people have the money to just do that is insulting). I also didn’t think it was necessary, since you write such nice software, to burden is with that web interface, which is good for restoring one or two files quickly, but not a large number in an unattended manner, say for instance my house burns down, at which point paying you for hard drives seems a bad financial idea. As bad a company as CrashPlan is, their software supports all major OSs reliably, and if my internet can handle it, I can restore gigabytes of files from their software and just let it run – no maximum ZIP file sizes, no manually moving files into place. Windows and Linux don’t have “Time Machine,” and it’s short-sighted to assume all your users can afford Macs and “Time Capsules” for backup over wi-fi.

    And the other reason: CrashPlan has a “family plan” for $13 and change that allows me to back up 10+ computers, and if I recall correctly, with you it was $5 per machine per month, no alternatives. As clunky as CrashPlan’s software is, I can also protect my dad and my wife’s parents – in essence protecting myself from more free tech support visits – from my CrashPlan account for free. (They also backup to my Linux server, another thing CrashPlan does.)

    You’ll probably likely just ignore this email, or worse, reply and say that’s not your business focus. Well it should be somebody’s. CrashPlan is the only backup service provider who does all this in one relatively easy to use app. But they don’t have any competition. You know their app is written in freaking Java? It runs like crap and often resends terabytes of files it should already have to its cloud servers, tying up my internet for weeks. They don’t have a local deduplication system like you do, meaning if I move a couple hundred gigs of files to another machine, CrashPlan resends them all (and, I should add, “deduplicates” them on the remote end and calls that a “feature”)

    Please. I’m begging you. Now that you are managing your failure rate, please give CrashPlan some competition, so I can come back and be happy and secure with my backups. Maybe you guys are into monopolies out there, but it hurts the little guy. Before the little guy has no voice left, will you hear me?

    • Andy Klein

      Thanks for the post. I’m glad you found a backup solution that fits your needs. Many people find our solution straight forward and easy to use and are happy to pay us for that.
      Let me correct one inaccuracy in your description. If you have a large amount of data to restore, you can order a hard drive from us with your data on it for $189. If you return the hard drive to us within 30 days, we will refund your $189. We do this as many people have monthly bandwidth caps (or restricted bandwidth) and downloading even a couple-a-hundred GBs of data or more is not realistic for them.

    • Matthew

      Amazing comment! I too am in the same boat. Backblaze needs to get on it and develop a native linux client. CrashPlan’s java app is a pain but gets the job done and can do so with my own encryption key. Come on Backblaze, it’s 2016 now; support all the OSes!

  • SomeGuy2398067

    Can you guys explain why you have so few of the 4TB, 5TB and 6TB Toshiba drives? They are quality drives from what I hear and based on my own research, they are priced on par with the Seagate drives. Many people like me use these stats to make decisions on which drives to buy and I was hoping you would be using more Toshiba drives.

    • Andy Klein

      Good question. It has to do with what is available for us to buy at what price. We purchase our drives from the reseller channel. We ask for quotes on say 5,000, 4TB drives, to be available on “X date”. To date, Toshiba resellers have not met those criteria. We have been offered lower (much lower) quantities of Toshiba drives at a higher unit price but that doesn’t make much business sense. We have nothing against Toshiba drives, we like to have multiple vendors, but we just have not been able to get the quantity and price we need.

  • linkdude31

    No WD Enterprise (RD or SE) or Black drives tested…what?

    • solomonshv

      they don’t use enterprise drives. those cost too much. it’s cheaper to keep replacing consumer drives that keep dying.

  • JY

    Oddly enough, I’ve been using WD drives for years and never had trouble with them. I generally had them installed in daily gaming desktops as well as linux servers I put together myself. I’ve also used HGST (Previously Hitachi) drives for years. I actually have a Hitachi and WD 1 TB drives are still going strong. I picked up a new WD 3 GB drive not long ago.

  • Tech1ceTech2ce

    What is the cost difference between the HGST and Seagate Drives?

  • genar0

    Thank you for sharing. Here in our province, they only sell Seagate or WD HDs. I guess I have to go for Seagate now.

  • Randy Morris

    Toshiba is my go to platter drive now. Quiet, fast, reliable, cheap. Just lost 3TB from a HGST, so I’m not so gung ho about reliability there. WD black has served well. Seagate has failed me EVERY time.