Near Duplicate Documents

2007-11-16 Thread Rishabh Joshi
Hi,

I am evaluating "Solr 1.2" for my project and wanted to know if it can
return near duplicate documents (near dups) and how do i go about it? I am
not sure, but is "MoreLikeThisHandler" the implementation for near dups?

Rishabh


client api for UpdateRichDocuments

2007-11-16 Thread Dwarak R
Hey All

Is there any client api  for UpdateRichDocuments ?.


Regards

Dwarak R

This message is for the designated recipient only and may contain privileged, 
proprietary, or otherwise private information. If you have received it in 
error, please notify the sender&[EMAIL PROTECTED]  immediately and delete the 
original. Any other use of the email by you is prohibited.


Re: Problems with Basic Install (newbie question)

2007-11-16 Thread Daniel Naber
On Donnerstag, 15. November 2007, Paul21 wrote:

> I never did install Tomcat. Maybe that's the problem?

Are you sure you have installed the JDK, not just the JRE?

Regards
 Daniel

-- 
http://www.danielnaber.de


Re: PriceJunkie.com using solr!

2007-11-16 Thread William Silva
Hi Nick,
For example, searching for tv , the first document is "*Samsung SlimFit HDTV
TX-T2782 27" TV*" in 2 Sellers, Amazon and Circuit City. The tvs name are:
Amazon => Slimfit Digital Tv 27"
Circuit City => Samsung 27" SlimFit Tube TV
How do you know that they are the same Television ?

Thanks,
William.



On 11/14/07, Nick Jenkin <[EMAIL PROTECTED]> wrote:
>
> Hi
> This is faceting, http://wiki.apache.org/solr/SolrFacetingOverview
> -Nick
>
> On Nov 14, 2007 1:21 AM, William Silva <[EMAIL PROTECTED]> wrote:
> > Hi Mike,
> > I´m checking out www.pricejunkie.com and Í would like to know how do
> > you group the products and find the price range. Is it a batch process
> > ? Are you using MoreLikeThis to do it ?
> > Thanks,
> > William.
> >
> >
> >
> > >ahait is wonderful.
> >
> > >2007/5/24, Mike Austin <[EMAIL PROTECTED]>:
> >
> > >Just one.
> >
> > >-Original Message-
> > >From: James liu [EMAIL PROTECTED]
> > >Sent: Wednesday, May 16, 2007 10:30 PM
> > >To: solr-user@lucene.apache.org
> > >Subject: Re: PriceJunkie.com using solr!
> >
> >
> > >how many solr instance?
> >
> >
> > >2007/5/17, Yonik Seeley <[EMAIL PROTECTED]>:
> > >
> > > Congrats, very nice job!
> > > It's fast too.
> > >
> > > -Yonik
> > >
> > > On 5/16/07, Mike Austin <[EMAIL PROTECTED]> wrote:
> > > > I just wanted to say thanks to everyone for the creation of
> > solr.  I've
> > > been
> > > > using it for a while now and I have recently brought one of my side
> > > projects
> > > > online.  I have several other projects that will be using solr for
> > it's
> > > > search and facets.
> > > >
> > > > Please check out www.pricejunkie.com and let us know what you
> think..
> > > You
> > > > can give feedback and/or sign up on the mailing list for future
> > updates.
> > > > The site is very basic right now and many new and useful features
> plus
> > > > merchants and product categories will be coming soon!  I thought it
> > > would be
> > > > a good idea to at least have a few people use it to get some
> feedback
> > > early
> > > > and often.
> > > >
> > > > Some of the nice things behind the scenes that we did with solr:
> > > > - created custom request handlers that have category to facet to
> > > attribute
> > > > caching built in
> > > > - category to facet management
> > > > - ability to manage facet groups (attributes within a set
> > facet)
> > > and assign
> > > > them to categories
> > > > - ability to create any category structure and share facet
> > > groups
> > > >
> > > > - facet inheritance for any category (a facet group can be defined
> on
> > a
> > > > parent category and pushed down to all children)
> > > > - ability to create sub-categories as facets instead of normal sub
> > > > categories
> > > > - simple xml configuration for the final outputted category
> > > configuration
> > > > file
> > > >
> > > >
> > > > I'm sure there are more cool things but that is all for now.  Join
> the
> > > > mailing list to see more improvements in the future.
> > > >
> > > > Also.. how do I get added to the Using Solr wiki page?
> > > >
> > > >
> > > > Thanks,
> > > > Mike Austin
> > >
> >
>


RE: snappuller and UTF-8

2007-11-16 Thread patrik
I've been using SOLR for a while now and came across an issue today with a
master/slave server setup using snappuller. When I post a document with
Russian, or any non-ASCII, text into my master server, everything is fine. A
quick query through the admin panel shows me the correct Russian text.

 

However, once that document gets rsync'd to the slave boxes, the same query
shows me results where the Russian text is obviously mangled. It only shows
up as question marks. 

 

Anyone else seen this kind of behavior before? Am I missing some rsync
params? Obviously SOLR isn't the problem, but this can't be the first time
it's been faced.

 

 

patrik

 



Re: snappuller and UTF-8

2007-11-16 Thread Mike Klaas

On 16-Nov-07, at 6:44 PM, patrik wrote:

I've been using SOLR for a while now and came across an issue today  
with a
master/slave server setup using snappuller. When I post a document  
with
Russian, or any non-ASCII, text into my master server, everything  
is fine. A

quick query through the admin panel shows me the correct Russian text.



However, once that document gets rsync'd to the slave boxes, the  
same query
shows me results where the Russian text is obviously mangled. It  
only shows

up as question marks.



Anyone else seen this kind of behavior before? Am I missing some rsync
params? Obviously SOLR isn't the problem, but this can't be the  
first time

it's been faced.


That is quite strange.  Have you tried inserting the document into  
the slave index directly to eliminate the rsync from the possible  
causes?


It sounds to me like your slave web container is configured  
differently from the master.


-Mike


RE: snappuller and UTF-8

2007-11-16 Thread patrik
Now I feel kind of stupid for even mentioning it. Turns out ehcache, which I
had in front of the slave servers was the problem.

pb

-Original Message-
From: Mike Klaas [mailto:[EMAIL PROTECTED] 
Sent: Friday, November 16, 2007 7:01 PM
To: solr-user@lucene.apache.org
Subject: Re: snappuller and UTF-8

On 16-Nov-07, at 6:44 PM, patrik wrote:

> I've been using SOLR for a while now and came across an issue today  
> with a
> master/slave server setup using snappuller. When I post a document  
> with
> Russian, or any non-ASCII, text into my master server, everything  
> is fine. A
> quick query through the admin panel shows me the correct Russian text.
>
>
>
> However, once that document gets rsync'd to the slave boxes, the  
> same query
> shows me results where the Russian text is obviously mangled. It  
> only shows
> up as question marks.
>
>
>
> Anyone else seen this kind of behavior before? Am I missing some rsync
> params? Obviously SOLR isn't the problem, but this can't be the  
> first time
> it's been faced.

That is quite strange.  Have you tried inserting the document into  
the slave index directly to eliminate the rsync from the possible  
causes?

It sounds to me like your slave web container is configured  
differently from the master.

-Mike



Re: PriceJunkie.com using solr!

2007-11-16 Thread Nick Jenkin
I would guess by model number
-Nick
On 11/17/07, William Silva <[EMAIL PROTECTED]> wrote:
> Hi Nick,
> For example, searching for tv , the first document is "*Samsung SlimFit HDTV
> TX-T2782 27" TV*" in 2 Sellers, Amazon and Circuit City. The tvs name are:
> Amazon => Slimfit Digital Tv 27"
> Circuit City => Samsung 27" SlimFit Tube TV
> How do you know that they are the same Television ?
>
> Thanks,
> William.
>
>
>
> On 11/14/07, Nick Jenkin <[EMAIL PROTECTED]> wrote:
> >
> > Hi
> > This is faceting, http://wiki.apache.org/solr/SolrFacetingOverview
> > -Nick
> >
> > On Nov 14, 2007 1:21 AM, William Silva <[EMAIL PROTECTED]> wrote:
> > > Hi Mike,
> > > I´m checking out www.pricejunkie.com and Í would like to know how do
> > > you group the products and find the price range. Is it a batch process
> > > ? Are you using MoreLikeThis to do it ?
> > > Thanks,
> > > William.
> > >
> > >
> > >
> > > >ahait is wonderful.
> > >
> > > >2007/5/24, Mike Austin <[EMAIL PROTECTED]>:
> > >
> > > >Just one.
> > >
> > > >-Original Message-
> > > >From: James liu [EMAIL PROTECTED]
> > > >Sent: Wednesday, May 16, 2007 10:30 PM
> > > >To: solr-user@lucene.apache.org
> > > >Subject: Re: PriceJunkie.com using solr!
> > >
> > >
> > > >how many solr instance?
> > >
> > >
> > > >2007/5/17, Yonik Seeley <[EMAIL PROTECTED]>:
> > > >
> > > > Congrats, very nice job!
> > > > It's fast too.
> > > >
> > > > -Yonik
> > > >
> > > > On 5/16/07, Mike Austin <[EMAIL PROTECTED]> wrote:
> > > > > I just wanted to say thanks to everyone for the creation of
> > > solr.  I've
> > > > been
> > > > > using it for a while now and I have recently brought one of my side
> > > > projects
> > > > > online.  I have several other projects that will be using solr for
> > > it's
> > > > > search and facets.
> > > > >
> > > > > Please check out www.pricejunkie.com and let us know what you
> > think..
> > > > You
> > > > > can give feedback and/or sign up on the mailing list for future
> > > updates.
> > > > > The site is very basic right now and many new and useful features
> > plus
> > > > > merchants and product categories will be coming soon!  I thought it
> > > > would be
> > > > > a good idea to at least have a few people use it to get some
> > feedback
> > > > early
> > > > > and often.
> > > > >
> > > > > Some of the nice things behind the scenes that we did with solr:
> > > > > - created custom request handlers that have category to facet to
> > > > attribute
> > > > > caching built in
> > > > > - category to facet management
> > > > > - ability to manage facet groups (attributes within a set
> > > facet)
> > > > and assign
> > > > > them to categories
> > > > > - ability to create any category structure and share facet
> > > > groups
> > > > >
> > > > > - facet inheritance for any category (a facet group can be defined
> > on
> > > a
> > > > > parent category and pushed down to all children)
> > > > > - ability to create sub-categories as facets instead of normal sub
> > > > > categories
> > > > > - simple xml configuration for the final outputted category
> > > > configuration
> > > > > file
> > > > >
> > > > >
> > > > > I'm sure there are more cool things but that is all for now.  Join
> > the
> > > > > mailing list to see more improvements in the future.
> > > > >
> > > > > Also.. how do I get added to the Using Solr wiki page?
> > > > >
> > > > >
> > > > > Thanks,
> > > > > Mike Austin
> > > >
> > >
> >
>