The amount of data Facebook collects from its nearly one billion users is astounding.
The highlights from GigaOM's "Facebook is collecting your data - 500 terabytes a day":
- 2.5 billion content items shared per day (status updates + wall posts + photos + videos + comments)
- 2.7 billion Likes per day
- 300 million photos uploaded per day
- 100+ petabytes of disk space in one of Facebook’s largest Hadoop (HDFS) clusters
- 105 terabytes of data scanned via Hive, Facebook’s Hadoop query language, every 30 minutes
- 70,000 queries executed on these databases per day
- 500+ terabytes of new data ingested into the databases every day
Facebook is a single location for 950 million users, and it's a public company. So this type of data can be tracked, providing perspectives that have never been achieved for the Internet at large.
Interesting stuff, and scary, given that one company controls it all.
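
For a rough sense of scale, the daily figures above can be turned into per-user and per-second averages. This is only a back-of-the-envelope sketch in Python. It assumes the 950 million user count, decimal units (1 TB = 1,000,000 MB), and activity spread evenly across users and across the day, which it certainly is not. The function names are made up for illustration.

```python
# Back-of-the-envelope averages from the figures quoted above.
# Assumptions (not from the source): even distribution across users and
# across the day, and decimal units (1 TB = 1,000,000 MB).

SECONDS_PER_DAY = 24 * 60 * 60
USERS = 950_000_000

DAILY_COUNTS = {
    "content_items": 2_500_000_000,
    "likes": 2_700_000_000,
    "photos": 300_000_000,
}

NEW_DATA_TB_PER_DAY = 500       # "500+ terabytes" treated as exactly 500
HIVE_SCAN_TB_PER_30_MIN = 105


def per_user(daily_count):
    """Average number of events per user per day."""
    return daily_count / USERS


def per_second(daily_count):
    """Average number of events per second over a full day."""
    return daily_count / SECONDS_PER_DAY


def hive_scan_tb_per_day():
    """Daily Hive scan volume if the 30-minute rate held all day."""
    windows_per_day = (24 * 60) // 30
    return HIVE_SCAN_TB_PER_30_MIN * windows_per_day


def new_data_mb_per_user_per_day():
    """New data ingested per user per day, in megabytes."""
    return NEW_DATA_TB_PER_DAY * 1_000_000 / USERS


if __name__ == "__main__":
    for name, count in DAILY_COUNTS.items():
        print(f"{name}: {per_user(count):.2f} per user per day, "
              f"{per_second(count):,.0f} per second")
    print(f"Hive scans: {hive_scan_tb_per_day():,} TB per day")
    print(f"New data: {new_data_mb_per_user_per_day():.2f} MB per user per day")
```

Under those assumptions, that works out to fewer than three Likes per user per day and roughly half a megabyte of new data per user per day. The totals are huge mostly because the user base is.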