Dear wiki user, You have subscribed to a wiki page "Hadoop Wiki" for change notification.
The page PoweredBy has been reverted to revision 246 by Niels Basjes. The comment on this change is: Spam. http://wiki.apache.org/hadoop/PoweredBy?action=diff&rev1=247&rev2=248 -------------------------------------------------- * [[http://www.gruter.com|Gruter. Corp.]] * 30 machine cluster (4 cores, 1TB~2TB/machine storage) * storage for blog data and web documents - * used for data indexing [[http://watch-movie-online.org/watch-the-eagle-online/|by]] MapReduce + * used for data indexing by MapReduce * link analyzing and Machine Learning by MapReduce * [[http://gumgum.com|GumGum]] @@ -215, +215 @@ * [[http://www.hostinghabitat.com/|Hosting Habitat]] * We use a customised version of Hadoop and Nutch in a currently experimental 6 node/Dual Core cluster environment. - * What we crawl are our clients Websites and from the information we gather. We fingerprint old and non updated software packages in that shared hosting environment. We can then inform [[http://watch-movie-online.org/watch-the-roommate-online/|our]] clients that they have old and non updated software running after matching a signature to a Database. With that information we know which sites would require patching as a free and courtesy service to protect the majority of users. Without the technologies of Nutch and Hadoop this would be a far harder to accomplish task. + * What we crawl are our clients Websites and from the information we gather. We fingerprint old and non updated software packages in that shared hosting environment. We can then inform our clients that they have old and non updated software running after matching a signature to a Database. With that information we know which sites would require patching as a free and courtesy service to protect the majority of users. Without the technologies of Nutch and Hadoop this would be a far harder to accomplish task. = I = * [[http://www.ibm.com|IBM]]
