Tag Archives: Hadoop

NoSQL Databases: Why, what and when – An Overview on NoSQL Datastores

6 Mar

For everybody interested in NoSQL datastores the following presentation by Lorenzo Alberton is worth reading. It gives a good overview of the currently used technologies in the area of NoSQL and discusses the ground filling basics every NoSQL enthusiast should know. NoSQL Databases: Why, what and when (via quipo)   Related Articles NoSQL: Node.js + [...]

[video] – How is Hadoop used at Twitter?

10 Sep

In the following video Dmitriy Ryaboy, a Twitter Analytics Engineer and a former Cloudera Intern, explains how Twitter uses Hadoop and Pig. Enjoy the video and have a good weekend! Related articles How GE uses Hadoop to analyze big data (news.cnet.com) How GE uses Hadoop to analyze big data (news.cnet.com) Hadoop There it Was–Hadoop Summit [...]

NOSQL Summer in Graz Kick-Off Meeting

21 Jul

Today was the kick-off  meeting of NOSQL Summer here in Graz, Austria. A NOSQL Summer is a network of local reading groups, that will decipher & discuss NOSQL-related articles, from late June to early September 2010. Each group sets its own meeting pace (usually once a week or once every two weeks) and select which papers [...]

[video] – HBase and Pig: The Hadoop Ecosystem at Twitter

25 Jun

I have just found this very interesting video dealing with the implementation of HBase and Pig in combination with Hadoop at Twitter: Related articles by Zemanta HBase Digest, March 2010 (sematext.com) Cassandra+Hadoop (slideshare.net) Hadoop, Hbase and Hive- Bay area Hadoop User Group (slideshare.net)

Twitter’s use of Cassandra, Hadoop, Pig and HBase for highly distributed Data Processing and Analysis

6 May

Kevin Weil, Analytics Lead at Twitter recently gave a presentation on Twitter’s use of Cassandra, Pig and HBase. Specially interesting is how Twitter uses Hadoop and Pig in their data analysis process. (via @kevinweil) Another great presentation from Tobias Ivarsson gives an overview on NoSQL: (via @thobe)