Re: bulk data transfer to HDFS remotely (e.g. via wan)

2010-03-02 Thread Mattmann, Chris A (388J)
Hi All, I co-authored a paper about this that was published at the NASA/IEEE Mass Storage conference in 2006 [1]. Also, my Ph.D. Dissertation [2] contains information about making these types of data movement selections when needed. Thought I'd throw it out there in case it helps. HTH, Chris

Re: HDF5 and Hadoop

2010-05-03 Thread Mattmann, Chris A (388J)
Hi Andrew, There has been some work in the Tika [1] project recently on looking at NetCDF4 [2] and HDF4/5 [3] and extracting metadata/text content from them. Though this doesn't directly apply to your question below, it might be worth perhaps looking at how to marry Tika and Hadoop in that rega

Re: REST web service on top of Hadoop

2010-07-28 Thread Mattmann, Chris A (388J)
Hi Alex, I had one of my students in my Search Engines class at USC prepare this very project. I will work on cleaning it up and trying to get it patch-ready... Cheers, Chris On 7/28/10 1:03 PM, "Alex Kozlov" wrote: Since noone answered: AFAIK there is no REST interface to Hadoop/HDFS. Would

Re: Similar frameworks like hadoop and taxonomy of distributed computing

2012-01-11 Thread Mattmann, Chris A (388J)
Also check out my paper on The Anatomy and Physiology of the Grid Revisited just Google for it where we also tried to look at this very issue. Cheers, Chris Sent from my iPhone On Jan 11, 2012, at 3:55 PM, "Brian Bockelman" wrote: > > On Jan 11, 2012, at 10:15 AM, George Kousiouris wrote: >

Re: Similar frameworks like hadoop and taxonomy of distributed computing

2012-01-11 Thread Mattmann, Chris A (388J)
Here's some links to it: Long Version: http://csse.usc.edu/csse/TECHRPTS/2008/usc-csse-2008-820/usc-csse-2008-820.pdf Shorter Version (published in WICSA): http://wwwp.dnsalias.org/w/images/3/3f/AnatomyPhysiologyGridRevisited66.pdf Cheers, Chris On Jan 11, 2012, at 4:02 PM, Mattmann, Ch

Re: [blog post] Accumulo, Nutch, and Gora

2012-02-28 Thread Mattmann, Chris A (388J)
UMMM wow! That's awesome Jason! Thanks so much! Cheers, Chris On Feb 28, 2012, at 5:41 PM, Jason Trost wrote: > Blog post for anyone who's interested. I cover a basic howto for > getting Nutch to use Apache Gora to store web crawl data in Accumulo. > > Let me know if you have any questions. >

Fwd: May 21 talk at Pasadena JUG

2012-05-04 Thread Mattmann, Chris A (388J)
(apologies for cross posting) Hey Folks in the SoCal area -- if you're around on May 21st, I'll be speaking at the Pasadena JUG on Apache OODT, Big Data and likely Apache Hadoop (in prep for my Hadoop Summit coming talk). Info is below thanks to David Noble for setting this up! Cheers, Chris B

Re: AWS Hadoop 20.2 AMIs

2010-11-18 Thread Mattmann, Chris A (388J)
Hey Mike, Do you have time to submit a patch? You could probably create a jira issue here [1] and then attach a diff of your update... Cheers, Chris [1] http://issues.apache.org/jira/browse/HADOOP On Nov 17, 2010, at 11:26 AM, Gangl, Michael E (388K) wrote: > FYI, I commented out the Kernal v

[Call for Papers] ICSE Software Engineering for Cloud Computing (SECLOUD) Workshop

2011-01-03 Thread Mattmann, Chris A (388J)
(apologies for the cross posting) Please consider submitting a paper to the ICSE 2011 Software Engineering for Cloud Computing (SECLOUD) Workshop to be held Sunday, May 22, 2011, at the Hilton Hawaiian Village Resort in Waikiki, Honolulu, HI. This workshop focuses on identifying the grand chall

[Call for Papers] ICSE Software Engineering for Cloud Computing (SECLOUD) Workshop

2011-01-20 Thread Mattmann, Chris A (388J)
(apologies for the cross posting) *** PLEASE NOTE - the deadline for submitting papers has been extended by 1 week to 1/28/2011! *** Please consider submitting a paper to the ICSE 2011 Software Engineering for Cloud Computing (SECLOUD) Workshop to be held Sunday, May 22, 2011, at the Hilton Ha