Re: Image indexing/searching with Hadoop and MPI

2009-06-03 Thread tog
On Wed, Jun 3, 2009 at 5:17 PM, Edward J. Yoon wrote:

> > This is a kind of newbie question (at least as far as Hadoop is
> concerned).
> > I was wondering if they were any Hadoop based project around dealing with
> > Image indexing and searching ? We are working is this area and might be
> > interesting to have a look in such a project.
>
> There is a text-search engine library, called lucene. See also the
> nutch project. Otherwise, Did you mean something like content-based
> image indexing and searching usig image attributes, such as, color,
> texture, and etc., not the text of image tag?


Yes this is exactly what I mean, I am looking at a project doing
content-based image indexing using for example GIST, BOF, ...
Does such a project exist ?


>
>
> I think the MPI programming isn't suitable for the concept of
> distributed hdfs and map/reduce programming system, since MPI requires
> the heavy communication among the nodes.


Ok I can understand your point - but I am sure that some people have been
trying to use map-reduce programming model to do CFD, or any other
scientific computing.
Any experience in this area from the list ?

Cheers
Guillaume


Image indexing/searching with Hadoop and MPI

2009-06-03 Thread tog
Hi there,

This is a kind of newbie question (at least as far as Hadoop is concerned).
I was wondering if they were any Hadoop based project around dealing with
Image indexing and searching ? We are working is this area and might be
interesting to have a look in such a project.
Second question is dealing with scientific computing with Haddop. Does
anyone has try to use Hadoop to parallelize a scientific application ? I
know there is Hama but it does not seem very active these days (I might be
wrong ;) )
Some time ago, I heard of an attempt of implementing some MPI implementation
on top of Hadoop , was it really the plan, is there any update ?
Anyway, I would be interested in any paper/fedeback on the performance of
scientific application running on large clusters using Hadoop.

Best Regards
Guillaume