Tag Archives: twitter

[video] – The Filter Bubble – From Human to Algorithmic Gatekeepers

6 May

Web personalization and personalized recommendations have recently been attracting more and more interest. Companies like Amazon, Google, Netflix, The New York Times, Facebook, and Twitter already personalize their products in different ways. Take Google’s search results as an example: have you ever noticed that a friend of yours gets different search results than you [...]

[video] – How is Hadoop used at Twitter?

10 Sep

In the following video, Dmitriy Ryaboy, a Twitter Analytics Engineer and a former Cloudera Intern, explains how Twitter uses Hadoop and Pig. Enjoy the video and have a good weekend! [...]

[video] – HBase and Pig: The Hadoop Ecosystem at Twitter

25 Jun

I have just found this very interesting video about how HBase and Pig are used in combination with Hadoop at Twitter.

Grabeeter – Grab and Search your Tweets offline and online

19 Jun

Grabeeter (@grabeeter) has just been launched, and I think it is a very useful tool. Grabeeter lets you grab your tweets, which means you can store them on your local hard drive in a structured format (XML at the moment). Using the Grabeeter Client, you can also perform searches [...]

Twitter’s use of Cassandra, Hadoop, Pig and HBase for highly distributed Data Processing and Analysis

6 May

Kevin Weil, Analytics Lead at Twitter, recently gave a presentation on Twitter’s use of Cassandra, Pig and HBase. Especially interesting is how Twitter uses Hadoop and Pig in its data analysis process. (via @kevinweil) Another great presentation, from Tobias Ivarsson, gives an overview of NoSQL. (via @thobe)