Re: Can I post pig questions on this forum?

2009-06-26 Thread Christophe Bisciglia
pig-u...@hadoop.apache.org

On Fri, Jun 26, 2009 at 4:34 PM, pmg wrote:
>
>
> --
> View this message in context: 
> http://www.nabble.com/Can-I-post-pig-questions-on-this-forum--tp24228728p24228728.html
> Sent from the Hadoop lucene-users mailing list archive at Nabble.com.
>
>



-- 
get hadoop: cloudera.com/hadoop
online training: cloudera.com/hadoop-training
blog: cloudera.com/blog
twitter: twitter.com/cloudera


Next Bay Area Hadoop User Group - Focus on Hadoop 0.20 and Core Project Split

2009-06-24 Thread Christophe Bisciglia
Bay Area Hadoop Fans,

We're excited to hold our first Hadoop User Group at Cloudera's office
in Burlingame (just south of SFO). We pushed the start time back 30
minutes to allow a little extra time to drive further north, and we
hope the mid-way location brings more users from San Francisco.

Since meetup.com seems to be the norm for HUGs around the country, we
created a meetup group for the bay area
(http://www.meetup.com/Bay-Area-Hadoop-User-Group-HUG). Join this
group to stay up to date with additional meetings and locations -
we're hoping to move the location around, potentially alternating
between north bay and south bay.

We've scheduled the next meetup for July 15th at 6:30 PM. Our office
isn't huge, but we do have room for 40 friendly people:
http://www.meetup.com/Bay-Area-Hadoop-User-Group-HUG/calendar/10728923/

We'll focus this meeting on Hadoop 0.20 and the split of "core" into
mapreduce, hdfs and common projects. Specifically, we'll go over new
features, API changes, upgrade experiences and more. If you'd like to
present about your experience, please let me know. If you'd like to
present about something else altogether, also let me know, and we'll
see what we can do at this or a later meetup.

We'll provide beer, drinks and snacks, and if there are any board game
fans in the house, we won't kick you out afterwards :-) On a more
serious note, after the meetup is a great opportunity to meet
Cloudera's engineering team and get advice about any headaches you
might be having.

We'll post the agenda to the meetup group as soon as we hear from
potential presenters and nail things down.

Christophe



Re: Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread Christophe Bisciglia
This is correct - thanks for the note, Jason. You can see the current
patch list for Cloudera's Distribution (based on 0.18.3) at:
http://www.cloudera.com/hadoop-manifest

In addition to Bzip2, we have patched in: DBInputFormat, the fair
scheduler, job level task limiting, "soft" fd leak fix, a fix for HDFS
under-replication, shuffle improvements, EC2/S3 improvements, and
Sqoop - database import for Hadoop.

You can download RPMs and Ubuntu packages as well as preconfigured EC2
images from: http://www.cloudera.com/hadoop
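
Before comparing job output, it can help to rule out the input itself by
confirming that a .bz2 log decompresses to exactly the same lines as the
uncompressed copy. A minimal sketch in Python (this is a local check, not
Hadoop code; the sample log lines, file names and helper names are made up
for illustration):

```python
import bz2
import os
import tempfile


def lines_match(plain_path, bz2_path):
    """True if the .bz2 file decompresses to the same lines as the plain file."""
    with open(plain_path, "rb") as plain, bz2.open(bz2_path, "rb") as packed:
        return plain.readlines() == packed.readlines()


def write_sample_logs(directory):
    """Write a made-up log twice: once as plain text, once bzip2-compressed."""
    data = b"GET /index.html 200\nGET /missing 404\n"
    plain_path = os.path.join(directory, "access.log")
    bz2_path = plain_path + ".bz2"
    with open(plain_path, "wb") as f:
        f.write(data)
    with bz2.open(bz2_path, "wb") as f:
        f.write(data)
    return plain_path, bz2_path


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        plain_path, bz2_path = write_sample_logs(tmp)
        print(lines_match(plain_path, bz2_path))
```

If the lines match, the archive is fine, and the difference probably comes
from how the job reads the file rather than from the data.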

Cheers,
Christophe

On Wed, Jun 24, 2009 at 6:47 AM, jason hadoop wrote:
> I believe the cloudera 18.3 supports bzip2
>
> On Wed, Jun 24, 2009 at 3:45 AM, Usman Waheed  wrote:
>
>> Hi All,
>>
>> Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3?
>> I tried but interestingly the output was not what i expected versus what i
>> got when my data was in uncompressed format.
>>
>> Thanks,
>> Usman
>>
>
>
>
> --
> Pro Hadoop, a book to guide you from beginner to hadoop mastery,
> http://www.amazon.com/dp/1430219424?tag=jewlerymall
> www.prohadoopbook.com a community for Hadoop Professionals
>





Announcing Sqoop: Database import for Hadoop

2009-06-01 Thread Christophe Bisciglia
Hadoop Fans,

I'm happy to announce a new tool from the Cloudera team.

We often found our customers wanting to import data from RDBMSs so
they could conduct deeper analysis. To facilitate this, we built a
command line tool that allows you to extract data from any JDBC source
and build database-specific extensions to increase performance (we
ship with an improved MySQL extension that leverages mysqldump and
look forward to developing additional extensions with the community).

We affectionately refer to this tool as Sqoop: SQL to Hadoop. Sqoop is
available with the most recent update to Cloudera's Distribution for
Hadoop (http://www.cloudera.com/hadoop) and has been contributed to
Apache as well.

You can use Sqoop to dump tables or entire databases to Hadoop. By
default, it uses DBInputFormat, generates all of the necessary Java
classes to work with your records, and also allows you to import data
directly into Hive.
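
To make the idea of a table dump concrete, here is a minimal sketch in
Python. It is not Sqoop and does not use JDBC or DBInputFormat; it uses the
standard sqlite3 module, a made-up employees table and a hypothetical
dump_table helper to show rows becoming delimited text records, the kind
of flat records a Hadoop job can read.

```python
import sqlite3


def dump_table(conn, table, delimiter=","):
    """Return each row of `table` as one delimited text line.

    `table` is interpolated into the SQL, so pass only trusted names.
    Rows are ordered by the first column to keep the output stable.
    """
    cursor = conn.execute(f"SELECT * FROM {table} ORDER BY 1")
    return [delimiter.join(str(value) for value in row) for row in cursor]


def make_sample_db():
    """Build an in-memory database with a made-up employees table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE employees (id INTEGER, name TEXT, dept TEXT)")
    conn.executemany(
        "INSERT INTO employees VALUES (?, ?, ?)",
        [(1, "Ada", "eng"), (2, "Grace", "ops")],
    )
    return conn


if __name__ == "__main__":
    for line in dump_table(make_sample_db(), "employees"):
        print(line)
```

A real import also has to deal with delimiters inside values, NULLs and
splitting the work across tasks, which this sketch leaves out.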

You can get more details and see a video of Aaron Kimball's
presentation at last month's Hadoop User Group meeting at Y!:
http://www.cloudera.com/blog/2009/06/01/introducing-sqoop/

Also, our upcoming intermediate training session in Washington DC will
cover Sqoop usage in detail: http://www.eventbrite.com/event/351945679

Cheers,
Christophe and the Cloudera Team



Announcement: RFP Open - Hadoop Summit East - Oct 2, 2009 - NYC

2009-05-27 Thread Christophe Bisciglia
Hadoop Fans,

Lately, we've been spending a lot of time on the East Coast, and one thing
is clear: Hadoop is everywhere.

Hadoop usage on the East Coast tends to be slightly different. There are
still web companies with armies of tech gurus, but there are also many
"regular" industries and enterprises using and exploring Hadoop. It's time
to get together and learn a thing or two from one another.

The Hadoop Summit East will take place on October 2nd, 2009 and focus on
two areas of interest to enterprise users. We've opened requests for
proposals at:
http://hadoop-summit-east-rfp.eventbrite.com/

Development and Administration:
* Core Hadoop: Areas for Development, Major Upcoming Contributions,
Functional Deep Dives
* Administration: Large Cluster Overviews, Performance Tips, Resource
Management, High Availability
* Developer: IDEs, QA Best Practices, Sharing Code / Data / Clusters, Higher
Level Abstractions

Hadoop Applications:
* Lessons from the Web: What can traditional industries learn from companies
with web scale data?
* Industry Case Studies: Finance, Telecom / Utilities, Retail, Biotech, etc.
* Integration with Existing Systems: Databases, BI Tools, Message Buses and
other Infrastructure
* New Ideas / Applications: Big Ideas for Hadoop
* Hosted Solutions: Hadoop and the Cloud

We'll close the RFP on July 31st and announce the schedule soon thereafter.
More details including discounted / early registration and sponsorship
info available at: http://www.cloudera.com/hadoop-summit-east

Cheers,
Christophe and the Cloudera Team



Announcement: Cloudera Hadoop Training in Washington DC (June 22-23)

2009-05-26 Thread Christophe Bisciglia
Hadoop Fans, just a quick note that we are hosting two days of Hadoop
training in Washington DC area (Alexandria, VA) on June 22 and 23.

We cover Hadoop, Hive, Pig and more with a focus on hands-on work.
Please share this with your friends who might not be on the user list
yet.

Both are listed under "Live Sessions" at http://www.cloudera.com/hadoop-training

Registration is discounted for the next few days, so if you are
interested, feel free to take advantage of the savings ($200) by
registering early.

We also have room for one more private session later in the week.
Please contact me directly if you are interested in a customized,
private session for your organization.

Christophe



Free Training at 2009 Hadoop Summit

2009-05-05 Thread Christophe Bisciglia
Just wanted to follow up on this and let everyone know that Cloudera and Y!
are teaming up to offer two day-long training sessions for free on the day
after the summit (June 11th).

We'll cover Hadoop basics, Pig, Hive and some new tools Cloudera is
releasing for importing data to Hadoop from existing databases.

http://hadoopsummit09-training.eventbrite.com

Each of these sessions normally runs about $1000, but we're taking advantage
of having so much of the Hadoop community in one place and offering this for
free at the 2009 Hadoop Summit.

Basic training is appropriate for people just getting started with Hadoop,
and the advanced training will focus on augmenting your existing
infrastructure with Hadoop and taking advantage of Hadoop's advanced
features and related projects.

Space is limited, so sign up before time runs out.

Hope to see you there!

Christophe and the Cloudera Team

On Wed, May 6, 2009 at 6:10 AM, Ajay Anand  wrote:
> This year’s Hadoop Summit
> (http://developer.yahoo.com/events/hadoopsummit09/) is confirmed for June
> 10th at the Santa Clara Marriott, and is now open for registration.
>
>
>
> We have a packed agenda, with three tracks – for developers, administrators,
> and one focused on new and innovative applications using Hadoop. The
> presentations include talks from Amazon, IBM, Sun, Cloudera, Facebook, HP,
> Microsoft, and the Yahoo! team, as well as leading universities including UC
> Berkeley, CMU, Cornell, U of Maryland, U of Nebraska and SUNY.
>
>
>
> From our experience last year with the rush for seats, I would encourage
> people to register early at http://hadoopsummit09.eventbrite.com/
>
>
>
> Looking forward to seeing you at the summit!
>
>
>
> Ajay





Hadoop Training, May 15th: SF Bay Area with Online Participation Available

2009-04-27 Thread Christophe Bisciglia
OK, last announcement from me today :-)

We're hosting a training session in the SF bay area (at the Cloudera
office) on Friday, May 15th.

We're doing two things differently:
1) We've allocated a chunk of discounted "early bird" registrations -
first come, first served until May 1st, at which point only regular
registration is available.
2) We're enabling people from outside the bay area to attend through
some pretty impressive web based video remote presence software we've
been piloting - all you need is a browser with flash. If you have a
webcam and mic, all the better. We're working with a startup on this,
and we're really impressed with the technology. Since this is new for
us, we've discounted web based participation significantly for this
session.

registration: http://cloudera.eventbrite.com/

Cheers,
Christophe



Debian support for Cloudera's Distribution

2009-04-27 Thread Christophe Bisciglia
Hey Hadoop fans, just wanted to drop a quick note to let you know that
we now have Debian packages for our distribution in addition to RPMs.
We will continue to support both platforms going forward.

Todd Lipcon put in many late nights for this, so next time you see
him, buy him a beer :-)

http://www.cloudera.com/hadoop-deb

Cheers,
Christophe



Re: RPM spec file for 0.19.1

2009-04-02 Thread Christophe Bisciglia
Hey Ian, we are totally fine with this - the only reason we didn't
contribute the SPEC file is that it is the output of our internal
build system, and we don't have the bandwidth to properly maintain
multiple RPMs.

That said, we chatted about this a bit today, and were wondering if
the community would like us to host RPMs for all releases in our
"devel" repository. We can't stand behind these from a reliability
angle the same way we can with our "blessed" RPMs, but it's a
manageable amount of additional work to have our build system spit
those out as well.

If you'd like us to do this, please add a "me too" to this page:
http://www.getsatisfaction.com/cloudera/topics/should_we_release_host_rpms_for_all_releases

We could even skip the branding on the "devel" releases :-)

Cheers,
Christophe

On Thu, Apr 2, 2009 at 12:46 PM, Ian Soboroff  wrote:
>
> I created a JIRA (https://issues.apache.org/jira/browse/HADOOP-5615)
> with a spec file for building a 0.19.1 RPM.
>
> I like the idea of Cloudera's RPM file very much.  In particular, it has
> nifty /etc/init.d scripts and RPM is nice for managing updates.
> However, it's for an older, patched version of Hadoop.
>
> This spec file is actually just Cloudera's, with suitable edits.  The
> spec file does not contain an explicit license... if Cloudera have
> strong feelings about it, let me know and I'll pull the JIRA attachment.
>
> The JIRA includes instructions on how to roll the RPMs yourself.  I
> would have attached the SRPM but they're too big for JIRA.  I can offer
> noarch RPMs build with this spec file if someone wants to host them.
>
> Ian
>
>


Re: Cloudera's Distribution for Hadoop

2009-03-16 Thread Christophe Bisciglia
Mark, this is great feedback.
To everyone else, let me get a little more explicit about our community
support. We use Get Satisfaction: http://www.getsatisfaction.com/cloudera

You'll notice a topic for "Ubuntu support" - this is essentially asking for
.debs - http://www.getsatisfaction.com/cloudera/topics/ubuntu_support

If you want this, get on there and say "me too" - if you want
another platform, say that too.

Really - we'll listen. We're building this for you. We want it to be as easy
as possible for developers to get up and running so when your managers
realize how cool Hadoop is, they can consider the value of paying for
additional support.

Christophe

On Mon, Mar 16, 2009 at 7:21 PM, Mark Kerzner  wrote:

> Christophe,
>
> if you do .deb, I will be the first one to try. As it is, I am second :)
>
> Mark
>
> On Mon, Mar 16, 2009 at 7:42 PM, Christophe Bisciglia <
> christo...@cloudera.com> wrote:
>
> > Hey Hadoop Fans,
> >
> > It's been a crazy week here at Cloudera. Today we launched our
> > Distribution for Hadoop. This is targeted at Hadoop users who want to
> > use the most recent stable version of Hadoop and take advantage of
> > standard packaging and deployment tools like RPMs and YUM. We also
> > provide an AJAXy wizard to help you configure your cluster. We'll
> > include more options for deployment (.debs, solaris packages, etc) as
> > you ask for them, so please don't be shy - hit up our community
> > support page.
> >
> > The high level features for our first release include:
> > * RPM Deployment and a public YUM repository
> > * Client RPMs for Hive and Pig (what else should we include? Tell us
> > on community support! Link below.)
> > * Standard Linux Service Management
> > * Local Documentation and Man Pages
> >
> > We'll be going over some details and walk through deployment at the
> > Bay Area Hadoop Users Group at Y! this Wednesday, but if you're from
> > out of town, or want a head start, here are some links:
> > * Blog post announcement:
> > http://www.cloudera.com/blog/2009/03/15/cloudera-distribution-for-hadoop/
> > * Cloudera's Distribution for Hadoop Home Page:
> > http://www.cloudera.com/hadoop
> > * Community Support: http://www.cloudera.com/community-support
> >
> > Also, we're turning into Twitter junkies, so if you've been infected too,
> > follow @cloudera for updates.
> >
> > See you Wednesday!
> >
> > Cheers,
> > Christophe
> >
>


Cloudera's Distribution for Hadoop

2009-03-16 Thread Christophe Bisciglia
Hey Hadoop Fans,

It's been a crazy week here at Cloudera. Today we launched our
Distribution for Hadoop. This is targeted at Hadoop users who want to
use the most recent stable version of Hadoop and take advantage of
standard packaging and deployment tools like RPMs and YUM. We also
provide an AJAXy wizard to help you configure your cluster. We'll
include more options for deployment (.debs, solaris packages, etc) as
you ask for them, so please don't be shy - hit up our community
support page.

The high level features for our first release include:
* RPM Deployment and a public YUM repository
* Client RPMs for Hive and Pig (what else should we include? Tell us
on community support! Link below.)
* Standard Linux Service Management
* Local Documentation and Man Pages

We'll be going over some details and walk through deployment at the
Bay Area Hadoop Users Group at Y! this Wednesday, but if you're from
out of town, or want a head start, here are some links:
* Blog post announcement:
http://www.cloudera.com/blog/2009/03/15/cloudera-distribution-for-hadoop/
* Cloudera's Distribution for Hadoop Home Page: http://www.cloudera.com/hadoop
* Community Support: http://www.cloudera.com/community-support

Also, we're turning into Twitter junkies, so if you've been infected too,
follow @cloudera for updates.

See you Wednesday!

Cheers,
Christophe


Re: Cloudera Hadoop and Hive training now free online

2009-03-13 Thread Christophe Bisciglia
Hey Lukas, we love hearing about what you'd like to see in training.
If you make a note on get satisfaction, we'll track it and keep you
apprised of updates:
http://getsatisfaction.com/cloudera/products/cloudera_hadoop_training

Christophe

On Fri, Mar 13, 2009 at 2:27 PM, Lukáš Vlček  wrote:
> Hi,
> This is excellent!
>
> Does any of these presentations deal specifically with processing tree and
> graph data structures? I know that some basics can be found in the fifth
> MapReduce lecture here (http://www.youtube.com/watch?v=BT-piFBP4fE)
> presented by Aaron Kimball or here (
> http://video.google.com/videoplay?docid=741403180270990805) by Barry Brumit
> but something more detailed and comparing different approaches would be
> really helpful.
>
> Tree is often used in many algorithms (not only it can express hierarchy but
> can be used to compress data and many other fancy things...). I think there
> should be some knowledge about what works well and what does not with
> connection to MapReduce and trees (or graphs). I am looking for this
> information.
>
> Regards,
> Lukas
>
> On Fri, Mar 13, 2009 at 9:42 PM, Christophe Bisciglia <
> christo...@cloudera.com> wrote:
>
>> Hey there, today we released our basic Hadoop and Hive training
>> online. Access is free, and we address questions through Get
>> Satisfaction.
>>
>> Many on this list are surely pros, but when you have friends trying to
>> get up to speed, feel free to send this along. We provide a VM so new
>> users can start doing the exercises right away.
>>
>> http://www.cloudera.com/hadoop-training-basic
>>
>> Cheers,
>> Christophe
>>
>


Cloudera Hadoop and Hive training now free online

2009-03-13 Thread Christophe Bisciglia
Hey there, today we released our basic Hadoop and Hive training
online. Access is free, and we address questions through Get
Satisfaction.

Many on this list are surely pros, but when you have friends trying to
get up to speed, feel free to send this along. We provide a VM so new
users can start doing the exercises right away.

http://www.cloudera.com/hadoop-training-basic

Cheers,
Christophe


where should Cloudera host their next public training session?

2009-03-11 Thread Christophe Bisciglia
Hey there, we're trying to decide where to host our next public
training session, so I'd like to simply ask - where is it needed? Use
this form or just drop me a note:
http://spreadsheets.google.com/viewform?formkey=cHZfNzNoLUlkU0dJY0VhUVUwVlpnUUE6MA

We'll do this over two days, with one day being basic training, and
the second being advanced. Both sessions cover Hive as well, with the
advanced session getting pretty deep and addressing database
input/output as well.

Anyway, we'll post more details on content, but what we really want to
know is, where would you like us to host the event? We've done a few
in the bay area, and have been meaning to come to the east coast - we
just want to make sure there's enough interest.

Also, if you want training in Europe, we'll be at ApacheCon on March
23rd: http://eu.apachecon.com/c/aceu2009/sessions/230

Cheers,
Christophe


seeking beta testers for Hadoop RPMs

2009-02-24 Thread Christophe Bisciglia
Hadoop Fans, we've been hard at work for the last few weeks, and we're
ready to get some feedback from the community.

We're working on RPM-based deployment for Hadoop, and we're ready for
some beta testers before releasing to the public.

We're specifically looking for folks with some sysadmin experience,
who are already familiar with deploying RPMs on Red Hat variants.

If you have time to install Hadoop 0.18.3 on a dev cluster this week (or
just a few machines you can borrow), and are interested in helping
out, please let me know the following:

What operating system are you using?
How many nodes can you test?
Can you provide feedback by this Monday?

If you can help, we'll provide you with a link to download the RPMs.
We'd like you to try and install them, and run a job on the resulting
cluster. We'd like to hear about any part of that which isn't smooth.
We expect this to be smooth, but need some feedback from people that
weren't in the room during development.

We'll also let the first 10 people who provide feedback "beta test"
our schwag. Might be a thumb drive, might be a tshirt, might be a
coffee mug. We'll let you know what we have when we're done :-)

If you're interested, please respond to Matt (cc'd) and myself. Please
spare the list :-)

Christophe, Matt and the Cloudera Team


Hadoop Workshop for College Teaching Faculty

2009-02-09 Thread Christophe Bisciglia
Hey Hadoop Fans, I wanted to call your attention to an event we're
putting on next month that would be great for your academic contacts.
Please take a moment and forward this to any faculty you think might
be interested.

http://www.cloudera.com/sigcse-2009-disc-workshop

One of the big challenges to Hadoop adoption is that it requires
thinking about data and computation in new ways. One of the best
things we can do as a community, long term, is help educators prepare
their students to work with big data using Hadoop. This is a chance to
help faculty impart skills that will continue to drive Hadoop adoption
for years to come.

Once a year, Computer Science educators from around the world gather
at the ACM's Special Interest Group on Computer Science Education
(SIGCSE).

This year, Cloudera is hosting a day-long workshop at SIGCSE to
introduce faculty to the MapReduce programming model, demonstrate how
to integrate material into various types of courses, and go over some
great sample projects for Hadoop. We'll also go over technical
logistics around spinning up clusters on EC2 and getting free credits
from Amazon for classroom use. A lot of this material is based on past
work we have done with the National Science Foundation.

That link again: http://www.cloudera.com/sigcse-2009-disc-workshop

There is no charge for this event, and we'd love to see all your
favorite computer science teachers there.

Cheers,
Christophe


Re: [ANNOUNCE] Registration for ApacheCon Europe 2009 is now open!

2009-01-28 Thread Christophe Bisciglia
I wanted to provide two additional notes about my talk on this list.

First, you're really coming to see Aaron Kimball and Tom White - I'm
working on getting that fixed on the conference pages.

Second, "my" talk is actually a full day of intermediate/advanced
Hadoop training on Monday. It will be similar in style to what we
offer locally (http://www.cloudera.com/hadoop-training), but will be
specifically targeted at more advanced users (we will not be building
an inverted index, and I have heard rumors the star wars kid may be
involved). Here's the conference page:
http://eu.apachecon.com/c/aceu2009/sessions/230 - you can register
using the conference registration system (even if you don't attend the
conference itself).

If your boss won't fly you to Amsterdam for the conference, calling it
"training" has been known to help ;-)

We'll keep the cluster up and running for the whole week, and provide
support / advice for training participants throughout.

Cheers,
Christophe

On Tue, Jan 27, 2009 at 8:35 AM, Owen O'Malley  wrote:
> All,
>   I'm broadcasting this to all of the Hadoop dev and users lists, however,
> in the future I'll only send cross-subproject announcements to
> gene...@hadoop.apache.org. Please subscribe over there too! It is very low
> traffic.
>  Anyways, ApacheCon Europe is coming up in March. There are a range of
> Hadoop talks being given:
>
> Introduction to Hadoop by Owen O'Malley
> Hadoop Map/Reduce: Tuning and Debugging by Arun Murthy
> Pig - Making Hadoop Easy by Olga Natkovich
> Running Hadoop in the Cloud by Tom White
> Architectures for the Cloud by Steve Loughran
> Configuring Hadoop for Grid Services by Allen Wittenauer
> Dynamic Hadoop Clusters by Steve Loughran
> HBasics: An Introduction to Hadoop's Big Data Database by Michael Stack
> Hadoop Tools and Tricks for Data Pipelines by Christophe Bisciglia
> Introducing Mahout: Apache Machine Learning by Grant Ingersoll
>
> -- Owen
>
> Begin forwarded message:
>
>> From: Shane Curcuru 
>> Date: January 27, 2009 6:15:25 AM PST
>> Subject: [ANN] Registration for ApacheCon Europe 2009 is now open!
>>
>> PMC moderators - please forward the below to any appropriate dev@ or
>> users@ lists so your larger community can hear about ApacheCon Europe.
>> Remember, ACEU09 has scheduled sessions spanning the breadth of the ASF's
>> projects, subprojects, and podlings, including at least: ActiveMQ,
>> ServiceMix, CXF, Axis2, Hadoop, Felix, Sling, Maven, Struts, Roller,
>> Shindig, Geronimo, Lucene, Solr, BSF, Mina, Directory, Tomcat, httpd,
>> Mahout, Bayeux, CouchDB, AntUnit, Jackrabbit, Archiva, Wicket, POI, Pig,
>> Synapse, Droids, Continuum.
>>
>>
>> ApacheCon EU 2009 registration is now open!
>> 23-27 March -- Mövenpick Hotel, Amsterdam, Netherlands
>> http://www.eu.apachecon.com/
>> 
>>
>> Registration for ApacheCon Europe 2009 is now open - act before early
>> bird prices expire 6 February.  Remember to book a room at the Mövenpick
>> and use the Registration Code: Special package attendees for the
>> conference registration, and get 150 Euros off your full conference
>> registration.
>>
>> Lower Costs - Thanks to new VAT tax laws, our prices this year are 19%
>> lower than last year in Europe!  We've also negotiated a Mövenpick rate
>> of a maximum of 155 Euros per night for attendees in our room block.
>>
>> Quick Links:
>>
>>  http://xrl.us/aceu09sp  See the schedule
>>  http://xrl.us/aceu09hp  Get your hotel room
>>  http://xrl.us/aceu09rp  Register for the conference
>>
>> Other important notes:
>>
>> - Geeks for Geeks is a new mini-track where we can feature advanced
>> technical content from project committers.  And our Hackathon on Monday
>> and Tuesday is open to all attendees - be sure to check it off in your
>> registration.
>>
>> - The Call for Papers for ApacheCon US 2009, held 2-6 November
>> 2009 in Oakland, CA, is open through 28 February, so get your
>> submissions in now.  This ApacheCon will feature special events with
>> some of the ASF's original founders in celebration of the 10th
>> anniversary of The Apache Software Foundation.
>>
>>  http://www.us.apachecon.com/c/acus2009/
>>
>> - Interested in sponsoring the ApacheCon conferences?  There are plenty
>> of sponsor packages available - please contact Delia Frees at
>> de...@apachecon.com for further information.
>>
>> ==
>> ApacheCon EU 2008: A week of Open Source at its best!
>>
>> Hackathon - open to all! | Geeks for Geeks | Lunchtime Sessions
>> In-Depth Trainings | Multi-Track Sessions | BOFs | Business Panel
>> Lightning Talks | Receptions | Fast Feather Track | Expo... and more!
>>
>> - Shane Curcuru, on behalf of
>>  Noirin Shirley, Conference Lead,
>>  and the whole ApacheCon Europe 2009 Team
>>  http://www.eu.apachecon.com/  23-27 March -- Amsterdam, Netherlands
>>
>>
>
>


Re: hadoop consulting?

2009-01-23 Thread Christophe Bisciglia
Thanks Mark. I'll be getting in touch early next week.

Others, I see replies default straight to the list. Please feel free to
email just me (christo...@cloudera.com), unless, well, you're in the
mood to share your bio with everyone :-)

Cheers,
Christophe

On Fri, Jan 23, 2009 at 2:31 PM, Mark Kerzner - SHMSoft
 wrote:
> Christophe,
>
> I am writing my first Hadoop project now, and I have 20 years of consulting,
> and I am in Houston. Here is my resume, http://markkerzner.googlepages.com.
> I have used EC2.
>
> Sincerely,
> Mark
>
>
> On Fri, Jan 23, 2009 at 4:04 PM, Christophe Bisciglia <
> christo...@cloudera.com> wrote:
>
>> Hey all, I wanted to reach out to the user / development community to
>> start identifying those of you who are interested in consulting /
>> contract work for new Hadoop deployments.
>>
>> A number of our larger customers are asking for more extensive on-site
>> help than would normally happen under a support contract, especially
>> to get them started. We're looking for some outside help to staff
>> these projects. This list is where the right people hang out.
>>
>> If you're interested, drop me a note with a bit of background, and
>> we'll figure it out from there.
>>
>> Cheers,
>> Christophe and the Cloudera Team
>>
>


hadoop consulting?

2009-01-23 Thread Christophe Bisciglia
Hey all, I wanted to reach out to the user / development community to
start identifying those of you who are interested in consulting /
contract work for new Hadoop deployments.

A number of our larger customers are asking for more extensive on-site
help than would normally happen under a support contract, especially
to get them started. We're looking for some outside help to staff
these projects. This list is where the right people hang out.

If you're interested, drop me a note with a bit of background, and
we'll figure it out from there.

Cheers,
Christophe and the Cloudera Team


Announcing Cloudera's One Day Hadoop Training

2008-12-12 Thread Christophe Bisciglia
Hadoop Fans,

I'm happy to announce that Cloudera, in addition to providing
commercial support for Hadoop, is now offering a one-day, professional
training course for Hadoop. It's open to anyone in the community, and
is focused on helping you get the most out of Hadoop and related
tools. Come spend a day working with us and other users facing similar
challenges to your own.

For full details, see our website: http://www.cloudera.com/hadoop-training

We primarily focus on the following themes:
   * What must our organization do differently to capture and
effectively use very large-scale data?
   * What tools help us analyze large-scale data and extract
meaningful results, and how do we use them?
   * How can we reorient our data generation and collection processes
to enable more powerful analysis later?

The morning is instructional, the afternoon is hands on. We provide a
cluster with interesting data, and you are free to load your own as
well.

Cheers,
Christophe