February 2011
4 posts
http://www.toao.com/posts/finding-similar-items-key... →
The article introduces minhashing and proves that the Jaccard similarity of two sets (the size of their intersection divided by the size of their union) equals the probability that their minhashes match. So you can compute several minhashes per set and use the fraction that match to estimate how similar two sets are, without having to compare each and every element.
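To make the idea concrete, here is a minimal sketch of minhashing in Python. It is not taken from the linked article: the function names (`minhash_signature`, `estimate_jaccard`, `exact_jaccard`), the choice of seeded SHA-1 as the hash family, and the default of 256 hash functions are all assumptions made for illustration. It also assumes the sets are non-empty.

```python
import hashlib


def _seeded_hash(seed: int, item: str) -> int:
    """Hash an item under a given seed (assumed hash family: seeded SHA-1)."""
    digest = hashlib.sha1(f"{seed}:{item}".encode("utf-8")).digest()
    # Keep the first 8 bytes as a 64-bit integer.
    return int.from_bytes(digest[:8], "big")


def minhash_signature(items, num_hashes=256):
    """Return one minhash per seed: the smallest hash value over the set."""
    return [min(_seeded_hash(seed, x) for x in items) for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    """Estimate similarity as the fraction of positions where the minhashes agree."""
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


def exact_jaccard(a, b):
    """Exact Jaccard similarity, for comparison: |A & B| / |A | B|."""
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    a = {f"word{i}" for i in range(0, 100)}
    b = {f"word{i}" for i in range(50, 150)}
    sig_a = minhash_signature(a)
    sig_b = minhash_signature(b)
    print("exact:    ", exact_jaccard(a, b))
    print("estimated:", estimate_jaccard(sig_a, sig_b))
```

The signatures have a fixed length regardless of how large the sets are, which is what makes the comparison cheap. More hash functions give a tighter estimate at the cost of more work per set.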
http://bartoszmilewski.wordpress.com/2010/09/11/bey... →
Bartosz Milewski writes a great high-level article on how software transactional memory (STM) is implemented.
http://developer.yahoo.com/blogs/hadoop/posts/2011/... →
A proposed redesign of Hadoop by the Yahoo! Hadoop team. In short, HDFS stays the same, but MapReduce becomes an application-level library, so the existing JobTracker and TaskTrackers are replaced by a more generic ResourceManager and NodeManagers.
If your ideas are not being rejected at least 50% of the time, you are playing...
– Summation: Dealing with rejection is a core competency