The Great Disk Drive in the Sky: How Web giants store huge amounts of data - LGF Pages

Sign In • Register • Forgot password?

The Great Disk Drive in the Sky: How Web giants store huge amounts of data

Cutting edge distributed file system technology

By Charles Johnson

Technology • January 2012 • Views: 1,226

arstechnica.com

Consider the tech it takes to back the search box on Google’s home page: behind the algorithms, the cached search terms, and the other features that spring to life as you type in a query sits a data store that essentially contains a full-text snapshot of most of the Web. While you and thousands of other people are simultaneously submitting searches, that snapshot is constantly being updated with a firehose of changes. At the same time, the data is being processed by thousands of individual server processes, each doing everything from figuring out which contextual ads you will be served to determining in what order to cough up search results.

The storage system backing Google’s search engine has to be able to serve millions of data reads and writes daily from thousands of individual processes running on thousands of servers, can almost never be down for a backup or maintenance, and has to perpetually grow to accommodate the ever-expanding number of pages added by Google’s Web-crawling robots. In total, Google processes over 20 petabytes of data per day.

That’s not something that Google could pull off with an off-the-shelf storage architecture. And the same goes for other Web and cloud computing giants running hyper-scale data centers, such as Amazon and Facebook. While most data centers have addressed scaling up storage by adding more disk capacity on a storage area network, more storage servers, and often more database servers, these approaches fail to scale because of performance constraints in a cloud environment. In the cloud, there can be potentially thousands of active users of data at any moment, and the data being read and written at any given moment reaches into the thousands of terabytes.

10

Google Facebook Distributed File System Search Engine Cloud Computing Amazon

Recent Pages by Charles (Charles Johnson):
The Conservative Plan to Take Over the Country (You Need to Know About This) Techdirt: Hey Elon: The ADL Convincing Advertisers to Run Away From Your Site Is Part of the Free Speech You Pretend to Support Elon Musk Commandeers '@Music' Handle From User With Half a Million Followers ChatGPT Is a Data Privacy Nightmare, and We Ought to Be Concerned Ex-Lawmaker Who Served Time for Jan. 6 Riot Seeks House Seat

2 comments

1

ShaunP Jan 27, 2012 12:22:38pm

I didn’t see it in the pages, but I heard this on NPR this week and it kind of blew me away:

YouTube video uploads grow rapidly

Kai Ryssdal: Listeners of a certain background might have heard the phrase “a New York minute” — how fast something can get done. Herewith, a suggestion for a new time-reference to add to the popular lexicon: How about ‘a YouTube second’?

The popular website for all things video has announced its latest figures for how much material is being uploaded. YouTube is now taking in one hour of video every second of the day.

A resounding success for its basic premise. But even with that, parent company Google is still struggling to make it pay. Here’s our senior business correspondent Bob Moon.

Bob Moon: Think of it this way: If you set out to watch every single video posted to YouTube just in the past week and a half, it would take you 100 years. You heard that right.

Matt McLernon is a spokesman for YouTube.

Matt McLernon: A century of video is uploaded every 10 days.

…

Even though YouTube figures people watch four billion videos every day, it’s been introducing ads slowly to avoid a backlash from viewers. So far, it says it’s making money on just three billion videos a week, only a tiny fraction of its viewership.

I’m Bob Moon for Marketplace.

2

KernelPanic Jan 27, 2012 2:21:13pm

Big Data is part of my day job and this article was very interesting; we really only hear what Google is doing internally a few years after they have invented something better.

For cloud storage I’m entirely an Amazon S3 zealot. When I’ve got money to spend on local storage I’m generally a fan of the scale-out NAS stuff from Isilon which can seamlessly expand from 100TB beyond 10 Petabytes — great for customers who know they need large storage but don’t know when and how soon. The Isilon stuff just works.

And when money is a huge issue the DIY route comes to mind. We built a backblaze pod clone about 6 months ago and managed to get 100 terabytes into a file server for a rough cost of about $12,000 USD. Not bad but sort of an edge case given the downsides that come with the backblaze pod design.

Auto

This page has been archived.
Comments are closed.

Create a PageThis is the LGF Pages posting bookmarklet. To use it, drag this button to your browser's bookmark bar, and title it 'LGF Pages' (or whatever you like). Then browse to a site you want to post, select some text on the page to use for a quote, click the bookmarklet, and the Pages posting window will appear with the title, text, and any embedded video or audio files already filled in, ready to go.
Or... you can just click this button to open the Pages posting window right away.
Last updated: 2023-04-04 11:11 am PDT LGF User's Guide RSS Feeds

Help support Little Green Footballs!

Featured PagesClick to refresh: The Good Liars at the Schnecksville Trump Rally [VIDEO] New theories, great tunes and SHOCKING breaking news. SUPPORT US: http://Herohero.co/thegoodliars SEE THE GOOD LIARS LIVE!WASHINGTON D.C. MAY 23RD: https://www.unionstage.com/shows/good-liars-fix-america/NASHVILLE, TN JUNE 6TH: https://www.etix.com/ticket/p/42992972/the-good-liars-nashville-the-lab-at-zaniesSAN FRANCISCO, CA JUNE 25TH: https://www.livenation.com/event/G5vYZbavGvggG/the-good-liars SUBSCRIBE TO OUR AUDIO PODCAST:Apple Podcasts: https://podcasts.apple.com/us/podcast/the-good-liars-tell-the-truth/id1731178442Spotify: https://open.spotify.com/show/7mgfiwzr32907N4y68eFOCJoin this channel ...
teleskiguy
4 days ago
Views: 378 • Comments: 1 • Rating: 0; Trump’s “Stolen Election” Lie Based on Evidence From Pervy Bathroom Cam-Spy OK, this really takes the cake. If you have relatives that still cling to the “election was stolen, dadgum, I jes’ KNOW IT … This should be a slight remedy to the stubborn madness Thanks to online anonymity, the ...
Khal Wimpo (free internal organs upon request!)
6 days ago
Views: 294 • Comments: 0 • Rating: 3; Best of April 2024 Nothing new here but these are a look back at the a few good images from the past month. Despite the weather, I was quite pleased with several of them. These were taken with older lenses (made from the ...
William Lewis
2 weeks ago
Views: 286 • Comments: 2 • Rating: 6; Gateway Pundit, Sued by Election Workers, Declares BankruptcyA onetime favorite, now just pathetic figure around these parts, Jim Hoft aka SMOTI ("Stupidest Man On The Internet"), has filed for Chapter 11 bankruptcy in response to the defamation lawsuits filed against him to the same election workers that ...
Khal Wimpo (free internal organs upon request!)
2 weeks ago
Views: 336 • Comments: 1 • Rating: 3; The Pandemic Cost 7 Million Lives, but Talks to Prevent a Repeat Stall In late 2021, as the world reeled from the arrival of the highly contagious omicron variant of the coronavirus, representatives of almost 200 countries met - some online, some in-person in Geneva - hoping to forestall a future worldwide ...
Cheechako
3 weeks ago
Views: 1,087 • Comments: 0 • Rating: 2; Once Praised, the Settlement to Help Sickened BP Oil Spill Workers Leaves Most With Nearly Nothing When a deadly explosion destroyed BP’s Deepwater Horizon drilling rig in the Gulf of Mexico, 134 million gallons of crude erupted into the sea over the next three months — and tens of thousands of ordinary people were hired ...
Cheechako
3 weeks ago
Views: 1,041 • Comments: 0 • Rating: 2

Recent PagesClick to refresh: Texas County at Center of Border Fight Is Overwhelmed by Migrant Deaths EAGLE PASS, Tex. - The undertaker lighted a cigarette and held it between his latex-gloved fingers as he stood over the bloated body bag lying in the bed of his battered pickup truck. The woman had been fished out ...
Cheechako
4 weeks ago
Views: 463 • Comments: 0 • Rating: 1

► LGF Headlines

Loading...

► Top 10 Comments

Loading...

► Bottom Comments

Loading...

► Recent Comments

Loading...

► Tools/Info

► Tag Cloud