Hadoop User Group (UK)

2008-04-25 Thread Johan Oskarsson
d Hadoop Martin Dittus and Johan Oskarsson (Last.fm) - Hadoop usage at Last.fm More details, presenters and venue announced at a later date. Keep an eye on the upcoming event page.

Re: Hadoop User Group (UK)

2008-04-29 Thread Johan Oskarsson
gards, Lukas On Fri, Apr 25, 2008 at 5:29 PM, Johan Oskarsson <[EMAIL PROTECTED]> wrote: August 19th brings the first of many Hadoop User Group meetups in the UK. It will be hosted somewhere in London and we'll have presentations from both developers and users of Apache Hadoop. The event is

Re: can you refer me to a User with Hadoop in production

2008-07-10 Thread Johan Oskarsson
There's a number of companies using hadoop in production, listed here: http://wiki.apache.org/hadoop/PoweredBy Bill Boas wrote: Please? Bill Boas VP, Business Development System Fabric Works 510-375-8840 [EMAIL PROTECTED] www.systemfabricworks.com

Hadoop User Group UK

2008-07-14 Thread Johan Oskarsson
artfrog and Hadoop 12.15 -> 13.15: Free lunch! (Sandwich, fruit, drink and crisps. Meat and veggie options available) 13.15 -> 14.00: Martin Dittus and Johan Oskarsson (Last.fm) - Hadoop usage at Last.fm 14.00 -> 15.00: Lightning talks (5-10 minutes each) 15.00 -> 16.00: Panel discus

Re: Hadoop User Group (Bay Area) Oct 15th

2008-10-15 Thread Johan Oskarsson
Since I'm not based in San Francisco I would love to see the slides from this meetup uploaded somewhere. Especially the database join techniques talk sounds very interesting to me. /Johan Ajay Anand wrote: > The next Bay Area User Group meeting is scheduled for October 15th at > Yahoo! 2821 M

Re: Practical limits on number of blocks per datanode.

2008-11-21 Thread Johan Oskarsson
Hi Rick, unfortunately 4,800,000 blocks per node is going to be too much. Ideally you'd want to merge your files into as few as possible; even 1 MB per file is quite small for Hadoop. Would it be possible to merge them into files of hundreds of MB or, preferably, gigabytes? In newer Hadoop versions the
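A rough back-of-envelope sketch of why merging helps, not a statement about any particular cluster. It assumes that every non-empty file occupies at least one block, a 64 MB block size, and (purely for illustration) that the 4,800,000 blocks come from 4,800,000 files of 1 MB each. Replication is ignored, and the function names are hypothetical.

```python
MB = 1024 * 1024
GB = 1024 * MB


def blocks_for_file(file_size_bytes, block_size_bytes):
    """Number of blocks one file occupies (ceiling division; 0 for an empty file)."""
    return -(-file_size_bytes // block_size_bytes)


def blocks_after_merge(num_files, file_size_bytes, merged_size_bytes, block_size_bytes):
    """Total blocks if equally sized small files are concatenated into larger files."""
    total_bytes = num_files * file_size_bytes
    full_files, remainder = divmod(total_bytes, merged_size_bytes)
    blocks = full_files * blocks_for_file(merged_size_bytes, block_size_bytes)
    if remainder:
        # The last merged file is smaller than the others.
        blocks += blocks_for_file(remainder, block_size_bytes)
    return blocks


# Assumed figures: 4,800,000 files of 1 MB each, 64 MB blocks.
before = 4_800_000 * blocks_for_file(1 * MB, 64 * MB)
after = blocks_after_merge(4_800_000, 1 * MB, 1 * GB, 64 * MB)
print(before, after)
```

Under those assumptions the block count drops by a factor of 64, because each block is then filled instead of holding a single 1 MB file.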

Hadoop User Group UK Meetup - April 14th

2009-02-02 Thread Johan Oskarsson
I've started organizing the next Hadoop meetup in London, UK. The date is April 14th and the presentations so far include: Michael Stack (Powerset): Apache HBase Isabel Drost (Neofonie): Introducing Apache Mahout Iadh Ounis and Craig Macdonald (University of Glasgow): Terrier Paolo Castagna (HP):

Re: Hadoop User Group UK Meetup - April 14th

2009-02-18 Thread Johan Oskarsson
Sun beer) 17.00 – 00.00: Discussion continues at a nearby pub The event is hosted by Sun in London, near Monument station. For more details see the event page or the blog: http://huguk.org/ /Johan Johan Oskarsson wrote: > I've started organizing the next Hadoop meetup in London, UK. The d

Splittable lzo files

2009-03-03 Thread Johan Oskarsson
Hi, thought I'd pass on this blog post I just wrote about how we compress our raw log data in Hadoop using Lzo at Last.fm. The essence of the post is that we're able to make them splittable by indexing where each compressed chunk starts in the file, similar to the gzip input format being wor
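The indexing idea can be illustrated with a small sketch. This is not the Last.fm implementation or the hadoop-gpl-compression code: it uses zlib from the Python standard library as a stand-in for LZO, and the container layout (a 4-byte length prefix before each compressed chunk) and all function names are assumptions made for the example. The point it shows is that once the start offset of every chunk is recorded, split boundaries can be placed on chunk starts so each split decompresses independently.

```python
import struct
import zlib


def write_chunked(records, chunk_records=2):
    """Compress records in independent chunks; return (data, chunk start offsets)."""
    out = bytearray()
    index = []
    for i in range(0, len(records), chunk_records):
        text = "\n".join(records[i:i + chunk_records])
        payload = zlib.compress(text.encode("utf-8"))
        index.append(len(out))  # where this chunk starts in the file
        out += struct.pack(">I", len(payload)) + payload
    return bytes(out), index


def make_splits(index, file_length, target_split_size):
    """Group chunks into (start, end) splits whose boundaries are chunk starts."""
    splits = []
    if not index:
        return splits
    start = index[0]
    for offset in index[1:]:
        if offset - start >= target_split_size:
            splits.append((start, offset))
            start = offset
    splits.append((start, file_length))
    return splits


def read_split(data, start, end):
    """Decompress only the chunks that lie inside one split."""
    records = []
    pos = start
    while pos < end:
        (length,) = struct.unpack_from(">I", data, pos)
        pos += 4
        text = zlib.decompress(data[pos:pos + length]).decode("utf-8")
        records.extend(text.split("\n"))
        pos += length
    return records


records = ["line %d" % i for i in range(10)]
data, index = write_chunked(records, chunk_records=2)
splits = make_splits(index, len(data), target_split_size=1)
recovered = [r for s, e in splits for r in read_split(data, s, e)]
```

Each split could be handed to a separate task; without the index, a reader starting in the middle of the file would have no way to find the next chunk boundary.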

Re: Splittable lzo files

2009-03-03 Thread Johan Oskarsson
'm looking at a 100+ GB gzipped file ...) Miles 2009/3/3 Johan Oskarsson: Hi, thought I'd pass on this blog post I just wrote about how we compress our raw log data in Hadoop using Lzo at Last.fm. The essence of the post is that we're able to make them splittable by indexing where e

Re: Fastlz coming?

2009-06-04 Thread Johan Oskarsson
We're using Lzo still, works great for those big log files: http://code.google.com/p/hadoop-gpl-compression/ /Johan Kris Jirapinyo wrote: > Hi all, >In the remove lzo JIRA ticket > https://issues.apache.org/jira/browse/HADOOP-4874 Tatu mentioned he was > going to port fastlz from C to Java an