Today was the kick-off meeting of NOSQL Summer here in Graz, Austria. A NOSQL Summer is a network of local reading groups, that will decipher & discuss NOSQL-related articles, from late June to early September 2010. Each group sets its own meeting pace (usually once a week or once every two weeks) and select which papers [...]
I have just found this very interesting video dealing with the implementation of HBase and Pig in combination with Hadoop at Twitter: Related articles by Zemanta HBase Digest, March 2010 (sematext.com) Cassandra+Hadoop (slideshare.net) Hadoop, Hbase and Hive- Bay area Hadoop User Group (slideshare.net)
Kevin Weil, Analytics Lead at Twitter recently gave a presentation on Twitter’s use of Cassandra, Pig and HBase. Specially interesting is how Twitter uses Hadoop and Pig in their data analysis process. (via @kevinweil) Another great presentation from Tobias Ivarsson gives an overview on NoSQL: (via @thobe)
Tags:
business,
Cassandra,
Data Analysis,
Data mining,
Database,
Distributed computing,
Hadoop,
HBase,
Kevin Weil,
Pig,
twitter
Here is a great video explaining how Facebook uses Hadoop and Hive to build Data Warehouses and analyse their huge ammount of data. Every day Facebook gets around 4TB+ of compressed data, which is amazing and also difficult to be analysed. (via Cloudera)
Tags:
analysis,
Data,
Data mining,
Data warehouse,
Data Warehousing,
Database,
Extraction and Transformation,
Facebook,
Hadoop,
Hive,
MapReduce,
Metadata