RE: MoreLikeThis - Odd results - what am I doing wrong?

2013-04-03 Thread DC tech
2013 1:02 PM
>To: solr-user@lucene.apache.org
>Subject: Re: MoreLikeThis - Odd results - what am I doing wrong?
>
>OK - so I have my SOLR instance running on AWS.
>Any suggestions on how to safely share the link? Right now, the whole SOLR
>instance is totally open.

RE: MoreLikeThis - Odd results - what am I doing wrong?

2013-04-02 Thread David Parks
if you're not using HTTPS, best lock the thing down behind a firewall).
Dave
-Original Message-
From: DC tech [mailto:dctech1...@gmail.com]
Sent: Tuesday, April 02, 2013 1:02 PM
To: solr-user@lucene.apache.org
Subject: Re: MoreLikeThis - Odd results - what am I doing wrong?
OK

Re: MoreLikeThis - Odd results - what am I doing wrong?

2013-04-01 Thread DC tech
OK - so I have my SOLR instance running on AWS. Any suggestions on how to safely share the link? Right now, the whole SOLR instance is totally open.
Gagandeep singh wrote:
>say &debugQuery=true&mlt=true and see the scores for the MLT query, not a
>sample query. You can use Amazon ec2 to br

Re: MoreLikeThis - Odd results - what am I doing wrong?

2013-03-31 Thread Gagandeep singh
say &debugQuery=true&mlt=true and see the scores for the MLT query, not a sample query. You can use Amazon EC2 to bring up your Solr; you should be able to get a micro instance on the free trial.
On Mon, Apr 1, 2013 at 5:10 AM, dc tech wrote:
> I did try the raw query against the *simi* field and
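A minimal sketch of the request suggested above, assuming a Solr instance at a hypothetical local URL and a hypothetical document id; only the `simi` field name and the `debugQuery=true` and `mlt=true` parameters come from the thread. It only builds the URL, so the score explanation would still have to be read from Solr's debug output after fetching it.

```python
from urllib.parse import urlencode


def build_mlt_debug_url(base_url, query, similarity_field):
    """Build a Solr select URL with MoreLikeThis and debug output enabled."""
    params = {
        "q": query,                  # document to find similar items for
        "mlt": "true",               # enable the MoreLikeThis component
        "mlt.fl": similarity_field,  # field used to compute similarity
        "debugQuery": "true",        # include score explanations in the response
        "wt": "json",
    }
    return base_url + "?" + urlencode(params)


# Hypothetical host and document id, for illustration only.
url = build_mlt_debug_url("http://localhost:8983/solr/select", "id:acura-mdx", "simi")
print(url)
```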

Re: MoreLikeThis - Odd results - what am I doing wrong?

2013-03-31 Thread dc tech
I did try the raw query against the *simi* field and those seem to return results in the order expected. For instance, Acura MDX has (large, SUV, 4WD Luxury) in the simi field. Running a query with those words against the simi field returns the expected models (X5, Audi Q5, etc.) and then the su

Re: MoreLikeThis - Odd results - what am I doing wrong?

2013-03-31 Thread Gagandeep singh
If you can bring up your Solr setup on a public machine then I'm sure a lot of debugging can be done. Without that, I think what you should look at is the tf-idf scores of terms like "camry" etc. Usually idf is the deciding factor in which results show at the top (tf should be 1 for your data)
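To illustrate why idf tends to dominate when tf is 1, here is a rough sketch using the classic Lucene TF-IDF formulas (tf = sqrt(freq), idf = 1 + ln(numDocs / (docFreq + 1))). The document counts are invented for illustration and are not taken from the poster's index; the similarity actually configured in a given Solr setup may differ.

```python
import math


def classic_tf(freq):
    """Classic Lucene term-frequency factor: square root of the raw count."""
    return math.sqrt(freq)


def classic_idf(doc_freq, num_docs):
    """Classic Lucene inverse document frequency."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Hypothetical index of 1000 models: a rare term versus a common one.
num_docs = 1000
rare_idf = classic_idf(doc_freq=2, num_docs=num_docs)      # e.g. a model name
common_idf = classic_idf(doc_freq=400, num_docs=num_docs)  # e.g. a body style

# With each term appearing once per document, tf is 1 and idf decides the weight.
print(classic_tf(1), rare_idf, common_idf)
```

A term that appears in only a couple of documents gets a much larger weight than one shared by hundreds, which is one way a rare model name can outrank the broader attributes.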

MoreLikeThis - Odd results - what am I doing wrong?

2013-03-31 Thread dc tech
I am running some experiments on MoreLikeThis and the results seem rather odd - I am doing something wrong but just cannot figure out what. Basically, the similarity results are decent - but not great. *Issue 1 = Quality* Toyota Camry : finds Altima (good) but then next one is Camry Hybrid wher