Facebook Study Into SSDs Finds ‘Several Distinct Failure Periods’

A joint study by Facebook engineers and Carnegie Mellon University experts into SSD failure rates has found that SSDs go through “several distinct failure periods” corresponding to the amount of data written to flash chips.

For the paper, which its authors said is the “first comprehensive study of flash-based SSD reliability trends”, six SSD platforms used by Facebook were examined for failure causes.

Although the SSDs studied were not disclosed, the paper said that the components were “similar to those” used in server hardware available from firms such as Fusion-io, Hitachi, Intel, OCZ, Seagate and Virident.

‘Several distinct failure periods’

“We observe that SSDs go through several distinct failure periods – early detection, early failure, usable life, and wearout – during their lifecycle, corresponding to the amount of data written to flash chips,” was the authors’ first conclusion.

The researchers advised that additional error correction at the start of an SSD’s life would go some way towards reducing failure rates during the early detection period.

Another observation from the researchers was that SSDs that do not use throttling techniques to manage temperature are more likely to fail.

“Higher temperatures lead to increased failure rates, but do so most noticeably for SSDs that do not employ throttling techniques,” said the study. “In general, we find techniques like throttling, which may be employed to reduce SSD temperature, to be effective at reducing the failure rate of SSDs.”

The most interesting finding from the study, which examined the SSDs over a four-year period, was that read disturbance errors are “not prevalent in the field”. The researchers said that SSDs that have read the most data do not show a statistically significant increase in failure rates.

“We find that the effect of read disturbance errors is not a predominant source of errors in the SSDs we examine,” said the two Facebook and two Carnegie Mellon researchers.

“While prior work has shown that such errors can occur under certain access patterns in controlled environments… we do not observe this effect across the SSDs we examine.”

Ben Sullivan

Ben covers web and technology giants such as Google, Amazon, and Microsoft and their impact on the cloud computing industry, whilst also writing about data centre players and their increasing importance in Europe. He also covers future technologies such as drones, aerospace, science, and the effect of technology on the environment.
