Re: Custom Analyzer

2010-12-30 Thread nitishgarg

Thanks for replying Erick!
I executed this query jar -tf jarfile.jar and my classes were indeed there.
Regarding the package I had this line written in my java file:
package org.apache.lucene.analysis.mr
I removed this line but still it was not working.
Then I tried creating a small project (marathi.jar, containing the class
files) and place it in the lib folder, I even set the CLASSPATH to the jar
file, but still Solr is throwing me a ClassNotFoundException.
Please help me out!
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Custom-Analyzer-tp2162710p2166185.html
Sent from the Solr - User mailing list archive at Nabble.com.


Inner index stock big BLOB or separate in a DB ?

2010-12-30 Thread Hong-Thai Nguyen
Hi,

With big BLOB objects transcripted in Base64, anyone tried a performance test 
in query with a huge data (~M documents, ~500Gb) to compare 2 methods:

-  store directly theses BLOB-Base64 objects in Index by String field

-  separate these BLOB in a DB, then a URL index field point to 
associate record to mapping in result search

 

Could you suggest which one is better?

 

Thanks,

 

Hong-Thai NGUYEN

 



Re: Inner index stock big BLOB or separate in a DB ?

2010-12-30 Thread Grijesh.singh

Second option is better. Storing Big BLOB data in index will increase index
size and it will create performance issues

-
Grijesh
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Inner-index-stock-big-BLOB-or-separate-in-a-DB-tp2166964p2167129.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Start solr unsuccessfully on Geronimo

2010-12-30 Thread Bac Hoang
Hi Pankaj and to whom it may concern - The crux and cure have been 
identified. This is because of missing jasper deployer in Geronimo 
2.1.6, which are available in earlier Geronimo version


Just download the following components, manually copy them to relevant 
folders. Done :)


http://repo1.maven.org/maven2/org/apache/geronimo/modules/geronimo-jasper-builder/2.1.6/geronimo-jasper-builder-2.1.6.jar
http://repo1.maven.org/maven2/org/apache/geronimo/configs/jasper-deployer/2.1.6/jasper-deployer-2.1.6.car

Cheers and happy new year all
Bac Hoang

On 12/28/2010 5:38 PM, pankaj bhatt wrote:

hi Bac,
this is an Geronimo specific problem and it come due to the 
following reason:
it may be that aur app server uses a particular attribute named 
'org.apache.InstanceManager' when will destroy the JSP..
Can you try it by creating a simple war file of a simple jsp and try 
with it.. i presume the same error will come.


   it might be possible, you are not using Geronimo full distribution.
Please refer to this:
http://www.mail-archive.com/user@geronimo.apache.org/msg11891.html
/ Pankaj Bhatt.


On Tue, Dec 28, 2010 at 3:26 PM, Bac Hoang bac.ho...@axonactive.vn 
mailto:bac.ho...@axonactive.vn wrote:


Hello all - Could any one please shed a light to the hassle issue
below when *start * Solr on Geronimo.

===
org.apache.jasper.JasperException:
*java.lang.IllegalStateException: No org.apache.InstanceManager
set in ServletContext*

org.apache.jasper.servlet.JspServletWrapper.getServlet(JspServletWrapper.java:151)


org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:324)

org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:313)
org.apache.jasper.servlet.JspServlet.service(JspServlet.java:260)
javax.servlet.http.HttpServlet.service(HttpServlet.java:806)

org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:286)
===

Additional info.:
- deployed successfully but error when start. It starts well on
Tomcat and Jetty
- use the solr 1.4.1.
- Geronimo 2.1.6
- OS: Unix and Win 7 (both got the same error)
- other applications (war) run well on this Geronimo
- jasper plugins are installed (jasper and jasper deployer)

Thanks much!




Carrot2 clustering tool Beginner

2010-12-30 Thread Isha Garg

Hi,
   I am new to carrot2 clustering tool. Can anyone Guide me related to 
this tool and how it  can integrate  with solr or lucene.


Thanks!
Seeking for your guidance.


Highlighter problem when using WordDelimiterFilter and term vectors

2010-12-30 Thread Oliver Messner
Hi,

when using WordDelimiterFilterFactory in the fieldType definition and
setting termVectors=true termPositions=true termOffsets=true on
the field, Solr gives me the following response for the query request
?q=warmwasserspeicherversion=2.2indent=onhl=true

lst name=highlighting
  lst name=id-1
arr name=content
  strsome text WarmemWarmWasserSpeicher/em here/str
/arr
  /lst
/lst

As you can see, the highlighter does not work like expected (at least
for me). If the term vectors are not stored into the index, I get the
expected result strsome text emWarmWasserSpeicher/em here/str.

I'm using Solr version 1.4.1
BTW, this problem does not occur when using the FastVectorHighlighter
(after applying patches https://issues.apache.org/jira/browse/SOLR-1268)


Any ideas?


Uploaded document:
add
  doc
field name=idid-1/field
field name=contentsome text WarmWasserSpeicher here/field
  /doc
/add


Field type definition:
fieldType name=text class=solr.TextField positionIncrementGap=100
  analyzer type=index
tokenizer class=solr.WhitespaceTokenizerFactory/
filter class=solr.WordDelimiterFilterFactory
generateWordParts=1 generateNumberParts=1 catenateWords=1
catenateNumbers=0 catenateAll=0 splitOnCaseChange=1/
filter class=solr.LowerCaseFilterFactory/
  /analyzer
  analyzer type=query
tokenizer class=solr.WhitespaceTokenizerFactory/
filter class=solr.WordDelimiterFilterFactory
generateWordParts=1 generateNumberParts=1 catenateWords=0
catenateNumbers=0 catenateAll=0 splitOnCaseChange=0/
filter class=solr.LowerCaseFilterFactory/
  /analyzer
/fieldType


Field definition:
fields
  ...
  field name=content type=text indexed=true stored=true
termVectors=true termPositions=true termOffsets=true/
/fields


solrconf.xml:
requestHandler name=dismax class=solr.SearchHandler default=true
  lst name=defaults
bool name=tvtrue/bool
str name=defTypedismax/str
str name=qfcontent/str
str name=mm1/str
str name=hltrue/str
str name=flscore/str
  /lst
  arr name=last-components
strtvComponent/str
  /arr
/requestHandler
...
searchComponent name=tvComponent
class=org.apache.solr.handler.component.TermVectorComponent/


Thanks,
Oliver


Re: Carrot2 clustering tool Beginner

2010-12-30 Thread Markus Jelsma
How about reading the wiki:
http://wiki.apache.org/solr/ClusteringComponent

On Thursday 30 December 2010 13:21:19 Isha Garg wrote:
 Hi,
 I am new to carrot2 clustering tool. Can anyone Guide me related to
 this tool and how it  can integrate  with solr or lucene.
 
 Thanks!
 Seeking for your guidance.

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350


Re: UPDATE query in deltaquery

2010-12-30 Thread Juan Manuel Alvarez
Hi Erick!

Here is my DIH configuration:

dataConfig
dataSource name=jdbc driver=org.postgresql.Driver

url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.request.dbPort}/${dataimporter.request.dbName}
user=${dataimporter.request.dbUser}
password=${dataimporter.request.dbPassword} autoCommit=false
transactionIsolation=TRANSACTION_READ_UNCOMMITTED
holdability=CLOSE_CURSORS_AT_COMMIT/
document
entity name='myEntity' dataSource='jdbc' pk='id'
query='  . '
  deltaImportQuery='  . '
deltaQuery=' . '

/entity
/document
/dataConfig

I have tried two options for the deltaQuery:
UPDATE Global.Projects SET prj_lastSync = now() WHERE prj_id =
'2';  Throws a null pointer exception as described in the
previous email

The second option is a DB function that I am calling this way:
SELECT get_deltaimport_items AS id FROM
project.get_deltaimport_items(2, #39;project#39;);

The function inside executes the UPDATE query shown above and a SELECT
query for the ids.
The ids are returned ok, but the UPDATE has no effect on the database.

Cheers!
Juan M.


On Thu, Dec 30, 2010 at 1:32 AM, Erick Erickson erickerick...@gmail.com wrote:
 Well, let's see the queries you're sending, and your DIH configuration.

 Otherwise, we're just guessing...

 Best
 Erick

 On Wed, Dec 29, 2010 at 9:58 PM, Juan Manuel Alvarez 
 naici...@gmail.comwrote:

 Hi! I would like to ask you a question about using a deltaQuery in DIH.
 I am syncing with a PostgreSQL database.

 At first I was calling a function that made two queries: an UPDATE and a
 SELECT.
 The select result was properly returned, but the UPDATE query did not
 made any changes,
 so I tried calling the same function from a PostgreSQL client and
 everything went OK.

 So I tried calling a simple UPDATE query directly in the deltaQuery
 and I receive a
 NullPointerException that I traced to the line 251 of the
 JdbcDataSource.java
 colNames = readFieldNames(resultSet.getMetaData());

 The question is: is there a way I can make the update query work in
 the deltaQuery
 or am I doing something wrong?

 Happy new year
 Cheers!
 Juan M.




Re: Custom Analyzer

2010-12-30 Thread Erick Erickson
What does jar -tf your jar file here show you the actual classes in your
jar are?

You're still saying it doesn't work, without providing details that let us
help.
Imagine we're asking you for help. Does your message give enough info to
suggest much?

Best
Erick

On Wed, Dec 29, 2010 at 11:51 PM, nitishgarg nitishgarg1...@gmail.comwrote:


 Thanks for replying Erick!
 I executed this query jar -tf jarfile.jar and my classes were indeed there.
 Regarding the package I had this line written in my java file:
 package org.apache.lucene.analysis.mr
 I removed this line but still it was not working.
 Then I tried creating a small project (marathi.jar, containing the class
 files) and place it in the lib folder, I even set the CLASSPATH to the jar
 file, but still Solr is throwing me a ClassNotFoundException.
 Please help me out!
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Custom-Analyzer-tp2162710p2166185.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Re: UPDATE query in deltaquery

2010-12-30 Thread Erick Erickson
WARNING: DIH isn't my strong suit, I generally prefer doing things
in SolrJ. Mostly I asked for clarification so someone #else# who
actually knows DIH details could chime in...

That said, I'm a bit confused. As I understand it, you shouldn't
be UPDATEing anything in DIH, it's a select where documents
then get added to Solr by magic. Your post leads me to believe
that you're trying to change the database via DIH, is that at
all true?

This is based in part on
The ids are returned ok, but the UPDATE has no effect on the database
Or do you mean effect on the index? If the latter, then the select
would only have a chance of updating the IDs of the Solr documents...

At least I think that's close to reality...

Best
Erick

On Thu, Dec 30, 2010 at 7:52 AM, Juan Manuel Alvarez naici...@gmail.comwrote:

 Hi Erick!

 Here is my DIH configuration:

 dataConfig
dataSource name=jdbc driver=org.postgresql.Driver

  
 url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.request.dbPort}/${dataimporter.request.dbName}
user=${dataimporter.request.dbUser}
 password=${dataimporter.request.dbPassword} autoCommit=false
transactionIsolation=TRANSACTION_READ_UNCOMMITTED
 holdability=CLOSE_CURSORS_AT_COMMIT/
document
entity name='myEntity' dataSource='jdbc' pk='id'
query='  . '
  deltaImportQuery='  . '
deltaQuery=' . '

/entity
/document
 /dataConfig

 I have tried two options for the deltaQuery:
 UPDATE Global.Projects SET prj_lastSync = now() WHERE prj_id =
 '2';  Throws a null pointer exception as described in the
 previous email

 The second option is a DB function that I am calling this way:
 SELECT get_deltaimport_items AS id FROM
 project.get_deltaimport_items(2, #39;project#39;);

 The function inside executes the UPDATE query shown above and a SELECT
 query for the ids.
 The ids are returned ok, but the UPDATE has no effect on the database.

 Cheers!
 Juan M.


 On Thu, Dec 30, 2010 at 1:32 AM, Erick Erickson erickerick...@gmail.com
 wrote:
  Well, let's see the queries you're sending, and your DIH configuration.
 
  Otherwise, we're just guessing...
 
  Best
  Erick
 
  On Wed, Dec 29, 2010 at 9:58 PM, Juan Manuel Alvarez naici...@gmail.com
 wrote:
 
  Hi! I would like to ask you a question about using a deltaQuery in DIH.
  I am syncing with a PostgreSQL database.
 
  At first I was calling a function that made two queries: an UPDATE and a
  SELECT.
  The select result was properly returned, but the UPDATE query did not
  made any changes,
  so I tried calling the same function from a PostgreSQL client and
  everything went OK.
 
  So I tried calling a simple UPDATE query directly in the deltaQuery
  and I receive a
  NullPointerException that I traced to the line 251 of the
  JdbcDataSource.java
  colNames = readFieldNames(resultSet.getMetaData());
 
  The question is: is there a way I can make the update query work in
  the deltaQuery
  or am I doing something wrong?
 
  Happy new year
  Cheers!
  Juan M.
 
 



Re: UPDATE query in deltaquery

2010-12-30 Thread Juan Manuel Alvarez
Erick:

Thanks for the quick response.

I can't use the timestamp for doing DIH, so I need to use a custom
field that I need to update one for each delta-import, so that is why
I need to execute an UPDATE on the deltaQuery.

Cheers!
Juan M.

On Thu, Dec 30, 2010 at 10:07 AM, Erick Erickson
erickerick...@gmail.com wrote:
 WARNING: DIH isn't my strong suit, I generally prefer doing things
 in SolrJ. Mostly I asked for clarification so someone #else# who
 actually knows DIH details could chime in...

 That said, I'm a bit confused. As I understand it, you shouldn't
 be UPDATEing anything in DIH, it's a select where documents
 then get added to Solr by magic. Your post leads me to believe
 that you're trying to change the database via DIH, is that at
 all true?

 This is based in part on
 The ids are returned ok, but the UPDATE has no effect on the database
 Or do you mean effect on the index? If the latter, then the select
 would only have a chance of updating the IDs of the Solr documents...

 At least I think that's close to reality...

 Best
 Erick

 On Thu, Dec 30, 2010 at 7:52 AM, Juan Manuel Alvarez 
 naici...@gmail.comwrote:

 Hi Erick!

 Here is my DIH configuration:

 dataConfig
    dataSource name=jdbc driver=org.postgresql.Driver

  url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.request.dbPort}/${dataimporter.request.dbName}
        user=${dataimporter.request.dbUser}
 password=${dataimporter.request.dbPassword} autoCommit=false
        transactionIsolation=TRANSACTION_READ_UNCOMMITTED
 holdability=CLOSE_CURSORS_AT_COMMIT/
    document
        entity name='myEntity' dataSource='jdbc' pk='id'
                query='  . '
          deltaImportQuery='  . '
                deltaQuery=' . '
        
        /entity
    /document
 /dataConfig

 I have tried two options for the deltaQuery:
 UPDATE Global.Projects SET prj_lastSync = now() WHERE prj_id =
 '2';  Throws a null pointer exception as described in the
 previous email

 The second option is a DB function that I am calling this way:
 SELECT get_deltaimport_items AS id FROM
 project.get_deltaimport_items(2, #39;project#39;);

 The function inside executes the UPDATE query shown above and a SELECT
 query for the ids.
 The ids are returned ok, but the UPDATE has no effect on the database.

 Cheers!
 Juan M.


 On Thu, Dec 30, 2010 at 1:32 AM, Erick Erickson erickerick...@gmail.com
 wrote:
  Well, let's see the queries you're sending, and your DIH configuration.
 
  Otherwise, we're just guessing...
 
  Best
  Erick
 
  On Wed, Dec 29, 2010 at 9:58 PM, Juan Manuel Alvarez naici...@gmail.com
 wrote:
 
  Hi! I would like to ask you a question about using a deltaQuery in DIH.
  I am syncing with a PostgreSQL database.
 
  At first I was calling a function that made two queries: an UPDATE and a
  SELECT.
  The select result was properly returned, but the UPDATE query did not
  made any changes,
  so I tried calling the same function from a PostgreSQL client and
  everything went OK.
 
  So I tried calling a simple UPDATE query directly in the deltaQuery
  and I receive a
  NullPointerException that I traced to the line 251 of the
  JdbcDataSource.java
  colNames = readFieldNames(resultSet.getMetaData());
 
  The question is: is there a way I can make the update query work in
  the deltaQuery
  or am I doing something wrong?
 
  Happy new year
  Cheers!
  Juan M.
 
 




Sort Facet Query

2010-12-30 Thread Em

Hi List,

I got a little issue with sorting a FacetQuery.

Currently I am doing something like that in SolrJ:

SolrQuery q = new SolrQuery(myQuery);
q.setFacetQuery(names:thomas);//want to see the count of thomas's
documents.
q.setFacetPrefix(short, th);

I don't know any better example, but the result from all those facets should
be returned in lexicographic order, not by count - so i can ensure that
every constraint is returned at the same place.

Any ideas?

Thank you!

-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Sort-Facet-Query-tp2167635p2167635.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: old index files not deleted on slave

2010-12-30 Thread sakunthalakishan

We have tried all locktypes simple, single, native.  But nothing worked.  I
have upgraded to Solr 1.4, and when i used the replication system of 1.4. 
It is working fine.  Not sure why the scripts are not able to replicate the
index on Linux, but java based replication is working.

Thanks for the info.
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/old-index-files-not-deleted-on-slave-tp2113493p2167789.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Sort Facet Query

2010-12-30 Thread Stephen Duncan Jr
Set facet.limit to -1 (globally or for that field).  That will return all
the facets, in lexicographical order.

Stephen Duncan Jr
www.stephenduncanjr.com


On Thu, Dec 30, 2010 at 9:04 AM, Em mailformailingli...@yahoo.de wrote:


 Hi List,

 I got a little issue with sorting a FacetQuery.

 Currently I am doing something like that in SolrJ:

 SolrQuery q = new SolrQuery(myQuery);
 q.setFacetQuery(names:thomas);//want to see the count of thomas's
 documents.
 q.setFacetPrefix(short, th);

 I don't know any better example, but the result from all those facets
 should
 be returned in lexicographic order, not by count - so i can ensure that
 every constraint is returned at the same place.

 Any ideas?

 Thank you!

 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Sort-Facet-Query-tp2167635p2167635.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Re: Sort Facet Query

2010-12-30 Thread Markus Jelsma
No
http://wiki.apache.org/solr/SimpleFacetParameters#facet.sort


On Thursday 30 December 2010 15:42:14 Stephen Duncan Jr wrote:
 Set facet.limit to -1 (globally or for that field).  That will return all
 the facets, in lexicographical order.
 
 Stephen Duncan Jr
 www.stephenduncanjr.com
 
 On Thu, Dec 30, 2010 at 9:04 AM, Em mailformailingli...@yahoo.de wrote:
  Hi List,
  
  I got a little issue with sorting a FacetQuery.
  
  Currently I am doing something like that in SolrJ:
  
  SolrQuery q = new SolrQuery(myQuery);
  q.setFacetQuery(names:thomas);//want to see the count of thomas's
  documents.
  q.setFacetPrefix(short, th);
  
  I don't know any better example, but the result from all those facets
  should
  be returned in lexicographic order, not by count - so i can ensure that
  every constraint is returned at the same place.
  
  Any ideas?
  
  Thank you!
  
  --
  View this message in context:
  http://lucene.472066.n3.nabble.com/Sort-Facet-Query-tp2167635p2167635.htm
  l Sent from the Solr - User mailing list archive at Nabble.com.

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350


Re: Sort Facet Query

2010-12-30 Thread Em

Markus is right, it will return them by count.

I think my question could me more general:

How can I set limit, sort etc. for a Facet Query?
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Sort-Facet-Query-tp2167635p2167965.html
Sent from the Solr - User mailing list archive at Nabble.com.


Saravanan Chinnadurai/Actionimages is out of the office.

2010-12-30 Thread Saravanan . Chinnadurai
I will be out of the office starting  30/12/2010 and will not return until
03/01/2011.

Please email to itsta...@actionimages.com  for any urgent issues.


Action Images is a division of Reuters Limited and your data will therefore be 
protected
in accordance with the Reuters Group Privacy / Data Protection notice which is 
available
in the privacy footer at www.reuters.com
Registered in England No. 145516   VAT REG: 397000555


RE: UPDATE query in deltaquery

2010-12-30 Thread Ephraim Ofir
This may sound silly, but are you sure the user you're using has
permissions to do the updates you want? Not sure about postgres but I
think some jdbc's require that the connection be defined as rw, maybe
you should try adding readOnly=false to your jdbc definition.

Ephraim Ofir

-Original Message-
From: Juan Manuel Alvarez [mailto:naici...@gmail.com] 
Sent: Thursday, December 30, 2010 2:52 PM
To: solr-user@lucene.apache.org
Subject: Re: UPDATE query in deltaquery

Hi Erick!

Here is my DIH configuration:

dataConfig
dataSource name=jdbc driver=org.postgresql.Driver
 
url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.req
uest.dbPort}/${dataimporter.request.dbName}
user=${dataimporter.request.dbUser}
password=${dataimporter.request.dbPassword} autoCommit=false
transactionIsolation=TRANSACTION_READ_UNCOMMITTED
holdability=CLOSE_CURSORS_AT_COMMIT/
document
entity name='myEntity' dataSource='jdbc' pk='id'
query='  . '
  deltaImportQuery='  . '
deltaQuery=' . '

/entity
/document
/dataConfig

I have tried two options for the deltaQuery:
UPDATE Global.Projects SET prj_lastSync = now() WHERE prj_id =
'2';  Throws a null pointer exception as described in the
previous email

The second option is a DB function that I am calling this way:
SELECT get_deltaimport_items AS id FROM
project.get_deltaimport_items(2, #39;project#39;);

The function inside executes the UPDATE query shown above and a SELECT
query for the ids.
The ids are returned ok, but the UPDATE has no effect on the database.

Cheers!
Juan M.


On Thu, Dec 30, 2010 at 1:32 AM, Erick Erickson
erickerick...@gmail.com wrote:
 Well, let's see the queries you're sending, and your DIH
configuration.

 Otherwise, we're just guessing...

 Best
 Erick

 On Wed, Dec 29, 2010 at 9:58 PM, Juan Manuel Alvarez
naici...@gmail.comwrote:

 Hi! I would like to ask you a question about using a deltaQuery in
DIH.
 I am syncing with a PostgreSQL database.

 At first I was calling a function that made two queries: an UPDATE
and a
 SELECT.
 The select result was properly returned, but the UPDATE query did not
 made any changes,
 so I tried calling the same function from a PostgreSQL client and
 everything went OK.

 So I tried calling a simple UPDATE query directly in the deltaQuery
 and I receive a
 NullPointerException that I traced to the line 251 of the
 JdbcDataSource.java
 colNames = readFieldNames(resultSet.getMetaData());

 The question is: is there a way I can make the update query work in
 the deltaQuery
 or am I doing something wrong?

 Happy new year
 Cheers!
 Juan M.




RE: Issue using filter query with spellCheck component

2010-12-30 Thread Dyer, James
The SpellCheckComponent in v1.4 does not use fq.  All it does is take the 
keywords out of the q (or spellcheck.q) parameter and check them against the 
entire dictionary.  If any keyword is not in the dictionary, it gives you a 
list of alternatives.  The collate function then takes the query and 
substitutes the top alternative for each keyword with suggestions.  There is no 
filtering.  It doesn't even assure you the suggestions or the collation will be 
any better than the original query.  If you were using fq to filter out 
secret information, spell checking might reveal the secrets to your user anyway.

In 3.x, there is an additional feature that will check the collation against 
your index, including the fq parameters you sent.  This will assure you that 
the Collation is going to return some hits if you re-query for it.  This is 
probably closer to what you want.  This does not affect the individual word 
suggestions, however, as it only works for the collation query.

See http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.collate for more 
information.

This added 3.x functionality is also available in 1.4.1 as a patch.  See 
https://issues.apache.org/jira/browse/SOLR-2010

James Dyer
E-Commerce Systems
Ingram Content Group
(615) 213-4311


-Original Message-
From: taimurAQ [mailto:taimur_qure...@hotmail.com] 
Sent: Thursday, December 30, 2010 12:20 AM
To: solr-user@lucene.apache.org
Subject: Issue using filter query with spellCheck component


Hi,

I am trying to use a filter query along with the spellcheck component but
the resultant rows are completely ignoring the filter query parameter.

I am using solrJ API to communicate with solr1.4. Here is a code snippet of
how i am passing the query parameters:

//start code snippet
paramsForSpellCheck.set(q, *:*);
paramsForSpellCheck.set(spellcheck.q, query);
paramsForSpellCheck.set(rows, 0);
paramsForSpellCheck.set(indent, on);
paramsForSpellCheck.set(spellcheck, on);

paramsForSpellCheck.set(spellcheck.build, true);
paramsForSpellCheck.set(spellcheck.extendedResults, true);
paramsForSpellCheck.set(spellcheck.dictionary, a_spell);
paramsForSpellCheck.set(spellcheck.collate, true);
paramsForSpellCheck.set(spellcheck.count, 3);
paramsForSpellCheck.set(spellcheck.onlyMorePopular, false);
paramsForSpellCheck.set(spellcheck.reload, true);
paramsForSpellCheck.set(qt, standard);
paramsForSpellCheck.set(fq, catalog_id: + catalogs);

//end code snippet

The issue that i am facing now is that the fq query parameter is not
having any affect on the spellcheck query. Are there any known issues while
using filter query with the spellcheck component? If not, what am i doing
wrong?

Thanks in advance.

Regards,
Taimur


-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Issue-using-filter-query-with-spellCheck-component-tp2166322p2166322.html
Sent from the Solr - User mailing list archive at Nabble.com.


StreamingUpdateSolrServer - connection refused

2010-12-30 Thread Christopher Gross
Hi all.

  I have designed a synchronizer that goes out to various databases,
extracts some data, does some processing, and then uses the
StreamingUpdateSolrServer to send the records to a Solr index.  When
everything is up, it works just fine.

  Now I'm trying to account for problems, like if the Solr index is down for
some reason.  I'm trying to get it so that I can keep track of items that
were missed for synchronization, so that they can get replayed when the
index comes back up.  Before I was using the StreamingUpdateSolrServer, I'd
get a message back immediately that there was a problem.  Now that it
streams, the error comes outside of the thread that I'm using to run the
updates, so I can't catch the exception, so my thread isn't aware of the
error, and just chugs along.

  Has anyone run into this problem?  Is there anything built into the server
so that I can know which records it was unable to send, and then pull them
out in order to try running them again later?  Any insight that anyone has
would be greatly appreciated.

  Thanks!

-- Christopher Gross


Re: Sort Facet Query

2010-12-30 Thread Erick Erickson
At the end of Marcus' link is facet.sort=false which will return in
lexigraphical order (sometimes called index order).

Best
Erick

On Thu, Dec 30, 2010 at 10:26 AM, Em mailformailingli...@yahoo.de wrote:


 Markus is right, it will return them by count.

 I think my question could me more general:

 How can I set limit, sort etc. for a Facet Query?
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Sort-Facet-Query-tp2167635p2167965.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Re: UPDATE query in deltaquery

2010-12-30 Thread Juan Manuel Alvarez
Hi Ephraim! Thanks for the answer!

Actually the user has permissions to make UPDATE queries.

I changed the configuration to
dataSource name=jdbc driver=org.postgresql.Driver

url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.request.dbPort}/${dataimporter.request.dbName}
user=${dataimporter.request.dbUser}
password=${dataimporter.request.dbPassword}
transactionIsolation=TRANSACTION_SERIALIZABLE
readOnly=false/

and I still get the same results.

Cheers!
Juan M.

On Thu, Dec 30, 2010 at 12:40 PM, Ephraim Ofir ephra...@icq.com wrote:
 This may sound silly, but are you sure the user you're using has
 permissions to do the updates you want? Not sure about postgres but I
 think some jdbc's require that the connection be defined as rw, maybe
 you should try adding readOnly=false to your jdbc definition.

 Ephraim Ofir

 -Original Message-
 From: Juan Manuel Alvarez [mailto:naici...@gmail.com]
 Sent: Thursday, December 30, 2010 2:52 PM
 To: solr-user@lucene.apache.org
 Subject: Re: UPDATE query in deltaquery

 Hi Erick!

 Here is my DIH configuration:

 dataConfig
    dataSource name=jdbc driver=org.postgresql.Driver

 url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.req
 uest.dbPort}/${dataimporter.request.dbName}
        user=${dataimporter.request.dbUser}
 password=${dataimporter.request.dbPassword} autoCommit=false
        transactionIsolation=TRANSACTION_READ_UNCOMMITTED
 holdability=CLOSE_CURSORS_AT_COMMIT/
    document
        entity name='myEntity' dataSource='jdbc' pk='id'
                query='  . '
          deltaImportQuery='  . '
                deltaQuery=' . '
        
        /entity
    /document
 /dataConfig

 I have tried two options for the deltaQuery:
 UPDATE Global.Projects SET prj_lastSync = now() WHERE prj_id =
 '2';  Throws a null pointer exception as described in the
 previous email

 The second option is a DB function that I am calling this way:
 SELECT get_deltaimport_items AS id FROM
 project.get_deltaimport_items(2, #39;project#39;);

 The function inside executes the UPDATE query shown above and a SELECT
 query for the ids.
 The ids are returned ok, but the UPDATE has no effect on the database.

 Cheers!
 Juan M.


 On Thu, Dec 30, 2010 at 1:32 AM, Erick Erickson
 erickerick...@gmail.com wrote:
 Well, let's see the queries you're sending, and your DIH
 configuration.

 Otherwise, we're just guessing...

 Best
 Erick

 On Wed, Dec 29, 2010 at 9:58 PM, Juan Manuel Alvarez
 naici...@gmail.comwrote:

 Hi! I would like to ask you a question about using a deltaQuery in
 DIH.
 I am syncing with a PostgreSQL database.

 At first I was calling a function that made two queries: an UPDATE
 and a
 SELECT.
 The select result was properly returned, but the UPDATE query did not
 made any changes,
 so I tried calling the same function from a PostgreSQL client and
 everything went OK.

 So I tried calling a simple UPDATE query directly in the deltaQuery
 and I receive a
 NullPointerException that I traced to the line 251 of the
 JdbcDataSource.java
 colNames = readFieldNames(resultSet.getMetaData());

 The question is: is there a way I can make the update query work in
 the deltaQuery
 or am I doing something wrong?

 Happy new year
 Cheers!
 Juan M.





Re: UPDATE query in deltaquery

2010-12-30 Thread Juan Manuel Alvarez
Hi Travis!

I am executing a function in de DB that has two queries: an UPDATE and
a SELECT, and I am getting the select results ok, but the update has
no effects, so it seems that that approach is not working.

Cheers!
Juan M.

On Thu, Dec 30, 2010 at 11:26 AM, Travis Low t...@4centurion.com wrote:
 If you are getting a null pointer exception here:

  colNames = readFieldNames(resultSet.getMetaData());

 Then that implies the DIH code is written to expect a select statement.  You
 might be able to fool it with some SQL injection:

  update blah set foo=bar where id=1234; select id from blah

 But if that doesn't work then you may be out of luck.

 cheers,

 Travis

 On Thu, Dec 30, 2010 at 8:26 AM, Juan Manuel Alvarez 
 naici...@gmail.comwrote:

 Erick:

 Thanks for the quick response.

 I can't use the timestamp for doing DIH, so I need to use a custom
 field that I need to update one for each delta-import, so that is why
 I need to execute an UPDATE on the deltaQuery.

 Cheers!
 Juan M.

 On Thu, Dec 30, 2010 at 10:07 AM, Erick Erickson
 erickerick...@gmail.com wrote:
  WARNING: DIH isn't my strong suit, I generally prefer doing things
  in SolrJ. Mostly I asked for clarification so someone #else# who
  actually knows DIH details could chime in...
 
  That said, I'm a bit confused. As I understand it, you shouldn't
  be UPDATEing anything in DIH, it's a select where documents
  then get added to Solr by magic. Your post leads me to believe
  that you're trying to change the database via DIH, is that at
  all true?
 
  This is based in part on
  The ids are returned ok, but the UPDATE has no effect on the database
  Or do you mean effect on the index? If the latter, then the select
  would only have a chance of updating the IDs of the Solr documents...
 
  At least I think that's close to reality...
 
  Best
  Erick
 
  On Thu, Dec 30, 2010 at 7:52 AM, Juan Manuel Alvarez naici...@gmail.com
 wrote:
 
  Hi Erick!
 
  Here is my DIH configuration:
 
  dataConfig
     dataSource name=jdbc driver=org.postgresql.Driver
 
 
  url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.request.dbPort}/${dataimporter.request.dbName}
         user=${dataimporter.request.dbUser}
  password=${dataimporter.request.dbPassword} autoCommit=false
         transactionIsolation=TRANSACTION_READ_UNCOMMITTED
  holdability=CLOSE_CURSORS_AT_COMMIT/
     document
         entity name='myEntity' dataSource='jdbc' pk='id'
                 query='  . '
           deltaImportQuery='  . '
                 deltaQuery=' . '
         
         /entity
     /document
  /dataConfig
 
  I have tried two options for the deltaQuery:
  UPDATE Global.Projects SET prj_lastSync = now() WHERE prj_id =
  '2';  Throws a null pointer exception as described in the
  previous email
 
  The second option is a DB function that I am calling this way:
  SELECT get_deltaimport_items AS id FROM
  project.get_deltaimport_items(2, #39;project#39;);
 
  The function inside executes the UPDATE query shown above and a SELECT
  query for the ids.
  The ids are returned ok, but the UPDATE has no effect on the database.
 
  Cheers!
  Juan M.
 
 
  On Thu, Dec 30, 2010 at 1:32 AM, Erick Erickson 
 erickerick...@gmail.com
  wrote:
   Well, let's see the queries you're sending, and your DIH
 configuration.
  
   Otherwise, we're just guessing...
  
   Best
   Erick
  
   On Wed, Dec 29, 2010 at 9:58 PM, Juan Manuel Alvarez 
 naici...@gmail.com
  wrote:
  
   Hi! I would like to ask you a question about using a deltaQuery in
 DIH.
   I am syncing with a PostgreSQL database.
  
   At first I was calling a function that made two queries: an UPDATE
 and a
   SELECT.
   The select result was properly returned, but the UPDATE query did not
   made any changes,
   so I tried calling the same function from a PostgreSQL client and
   everything went OK.
  
   So I tried calling a simple UPDATE query directly in the deltaQuery
   and I receive a
   NullPointerException that I traced to the line 251 of the
   JdbcDataSource.java
   colNames = readFieldNames(resultSet.getMetaData());
  
   The question is: is there a way I can make the update query work in
   the deltaQuery
   or am I doing something wrong?
  
   Happy new year
   Cheers!
   Juan M.
  
  
 
 




 --

 **

 *Travis Low, Director of Development*


 ** t...@4centurion.com* *

 *Centurion Research Solutions, LLC*

 *14048 ParkEast Circle *•* Suite 100 *•* Chantilly, VA 20151*

 *703-956-6276 *•* 703-378-4474 (fax)*

 *http://www.centurionresearch.com* http://www.centurionresearch.com

 **The information contained in this email message is confidential and
 protected from disclosure.  If you are not the intended recipient, any use
 or dissemination of this communication, including attachments, is strictly
 prohibited.  If you received this email message in error, please delete it
 and immediately notify the sender.

 This email message and any attachments have been scanned and are 

RE: UPDATE query in deltaquery

2010-12-30 Thread Ephraim Ofir
Does your function get_deltaimport_items perform the update first and then the 
select? Does it make a difference if you change the order? Did you try omitting 
the TRANSACTION_SERIALIZABLE part?

Ephraim Ofir

-Original Message-
From: Juan Manuel Alvarez [mailto:naici...@gmail.com] 
Sent: Thursday, December 30, 2010 7:04 PM
To: solr-user@lucene.apache.org
Subject: Re: UPDATE query in deltaquery

Hi Ephraim! Thanks for the answer!

Actually the user has permissions to make UPDATE queries.

I changed the configuration to
dataSource name=jdbc driver=org.postgresql.Driver

url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.request.dbPort}/${dataimporter.request.dbName}
user=${dataimporter.request.dbUser}
password=${dataimporter.request.dbPassword}
transactionIsolation=TRANSACTION_SERIALIZABLE
readOnly=false/

and I still get the same results.

Cheers!
Juan M.

On Thu, Dec 30, 2010 at 12:40 PM, Ephraim Ofir ephra...@icq.com wrote:
 This may sound silly, but are you sure the user you're using has
 permissions to do the updates you want? Not sure about postgres but I
 think some jdbc's require that the connection be defined as rw, maybe
 you should try adding readOnly=false to your jdbc definition.

 Ephraim Ofir

 -Original Message-
 From: Juan Manuel Alvarez [mailto:naici...@gmail.com]
 Sent: Thursday, December 30, 2010 2:52 PM
 To: solr-user@lucene.apache.org
 Subject: Re: UPDATE query in deltaquery

 Hi Erick!

 Here is my DIH configuration:

 dataConfig
    dataSource name=jdbc driver=org.postgresql.Driver

 url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.req
 uest.dbPort}/${dataimporter.request.dbName}
        user=${dataimporter.request.dbUser}
 password=${dataimporter.request.dbPassword} autoCommit=false
        transactionIsolation=TRANSACTION_READ_UNCOMMITTED
 holdability=CLOSE_CURSORS_AT_COMMIT/
    document
        entity name='myEntity' dataSource='jdbc' pk='id'
                query='  . '
          deltaImportQuery='  . '
                deltaQuery=' . '
        
        /entity
    /document
 /dataConfig

 I have tried two options for the deltaQuery:
 UPDATE Global.Projects SET prj_lastSync = now() WHERE prj_id =
 '2';  Throws a null pointer exception as described in the
 previous email

 The second option is a DB function that I am calling this way:
 SELECT get_deltaimport_items AS id FROM
 project.get_deltaimport_items(2, #39;project#39;);

 The function inside executes the UPDATE query shown above and a SELECT
 query for the ids.
 The ids are returned ok, but the UPDATE has no effect on the database.

 Cheers!
 Juan M.


 On Thu, Dec 30, 2010 at 1:32 AM, Erick Erickson
 erickerick...@gmail.com wrote:
 Well, let's see the queries you're sending, and your DIH
 configuration.

 Otherwise, we're just guessing...

 Best
 Erick

 On Wed, Dec 29, 2010 at 9:58 PM, Juan Manuel Alvarez
 naici...@gmail.comwrote:

 Hi! I would like to ask you a question about using a deltaQuery in
 DIH.
 I am syncing with a PostgreSQL database.

 At first I was calling a function that made two queries: an UPDATE
 and a
 SELECT.
 The select result was properly returned, but the UPDATE query did not
 made any changes,
 so I tried calling the same function from a PostgreSQL client and
 everything went OK.

 So I tried calling a simple UPDATE query directly in the deltaQuery
 and I receive a
 NullPointerException that I traced to the line 251 of the
 JdbcDataSource.java
 colNames = readFieldNames(resultSet.getMetaData());

 The question is: is there a way I can make the update query work in
 the deltaQuery
 or am I doing something wrong?

 Happy new year
 Cheers!
 Juan M.





Re: UPDATE query in deltaquery

2010-12-30 Thread Juan Manuel Alvarez
Hi Ephraim! Thanks again for taking the time to help me. Really appreciated =o)

The UPDATE was before the SELECT, but putting it after leads to the
same result, with or without the TRANSACTION_SERIALIZABLE.

Cheers!
Juan M.

2010/12/30 Ephraim Ofir ephra...@icq.com:
 Does your function get_deltaimport_items perform the update first and then 
 the select? Does it make a difference if you change the order? Did you try 
 omitting the TRANSACTION_SERIALIZABLE part?

 Ephraim Ofir

 -Original Message-
 From: Juan Manuel Alvarez [mailto:naici...@gmail.com]
 Sent: Thursday, December 30, 2010 7:04 PM
 To: solr-user@lucene.apache.org
 Subject: Re: UPDATE query in deltaquery

 Hi Ephraim! Thanks for the answer!

 Actually the user has permissions to make UPDATE queries.

 I changed the configuration to
    dataSource name=jdbc driver=org.postgresql.Driver
        
 url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.request.dbPort}/${dataimporter.request.dbName}
        user=${dataimporter.request.dbUser}
 password=${dataimporter.request.dbPassword}
        transactionIsolation=TRANSACTION_SERIALIZABLE
        readOnly=false/

 and I still get the same results.

 Cheers!
 Juan M.

 On Thu, Dec 30, 2010 at 12:40 PM, Ephraim Ofir ephra...@icq.com wrote:
 This may sound silly, but are you sure the user you're using has
 permissions to do the updates you want? Not sure about postgres but I
 think some jdbc's require that the connection be defined as rw, maybe
 you should try adding readOnly=false to your jdbc definition.

 Ephraim Ofir

 -Original Message-
 From: Juan Manuel Alvarez [mailto:naici...@gmail.com]
 Sent: Thursday, December 30, 2010 2:52 PM
 To: solr-user@lucene.apache.org
 Subject: Re: UPDATE query in deltaquery

 Hi Erick!

 Here is my DIH configuration:

 dataConfig
    dataSource name=jdbc driver=org.postgresql.Driver

 url=jdbc:postgresql://${dataimporter.request.dbHost}:${dataimporter.req
 uest.dbPort}/${dataimporter.request.dbName}
        user=${dataimporter.request.dbUser}
 password=${dataimporter.request.dbPassword} autoCommit=false
        transactionIsolation=TRANSACTION_READ_UNCOMMITTED
 holdability=CLOSE_CURSORS_AT_COMMIT/
    document
        entity name='myEntity' dataSource='jdbc' pk='id'
                query='  . '
          deltaImportQuery='  . '
                deltaQuery=' . '
        
        /entity
    /document
 /dataConfig

 I have tried two options for the deltaQuery:
 UPDATE Global.Projects SET prj_lastSync = now() WHERE prj_id =
 '2';  Throws a null pointer exception as described in the
 previous email

 The second option is a DB function that I am calling this way:
 SELECT get_deltaimport_items AS id FROM
 project.get_deltaimport_items(2, #39;project#39;);

 The function inside executes the UPDATE query shown above and a SELECT
 query for the ids.
 The ids are returned ok, but the UPDATE has no effect on the database.

 Cheers!
 Juan M.


 On Thu, Dec 30, 2010 at 1:32 AM, Erick Erickson
 erickerick...@gmail.com wrote:
 Well, let's see the queries you're sending, and your DIH
 configuration.

 Otherwise, we're just guessing...

 Best
 Erick

 On Wed, Dec 29, 2010 at 9:58 PM, Juan Manuel Alvarez
 naici...@gmail.comwrote:

 Hi! I would like to ask you a question about using a deltaQuery in
 DIH.
 I am syncing with a PostgreSQL database.

 At first I was calling a function that made two queries: an UPDATE
 and a
 SELECT.
 The select result was properly returned, but the UPDATE query did not
 made any changes,
 so I tried calling the same function from a PostgreSQL client and
 everything went OK.

 So I tried calling a simple UPDATE query directly in the deltaQuery
 and I receive a
 NullPointerException that I traced to the line 251 of the
 JdbcDataSource.java
 colNames = readFieldNames(resultSet.getMetaData());

 The question is: is there a way I can make the update query work in
 the deltaQuery
 or am I doing something wrong?

 Happy new year
 Cheers!
 Juan M.






Re: Sort Facet Query

2010-12-30 Thread Em

Yes, I understood.

But what if I DON'T want to return ALL facet fields in index-order, but only
2 of 5?

When faceting on fields I could just specify it, but how can I do so with a
FacetQuery without making *all* Facets sorted in the same way?
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Sort-Facet-Query-tp2167635p2168762.html
Sent from the Solr - User mailing list archive at Nabble.com.


Different behavior for q=goo.com vs q=@goo.com in queries?

2010-12-30 Thread mrw

Using Lucid's Solr 1.4 distribution, if I index my email inbox and then
search it by passing in different email expressions, I notice that I get
different results based on whether the '@' character is included, even
though the character is present in every email address in the field I'm
searching.

For example, q=goo.com returns multiple items, as expected.

However, q...@goo.com return no results.  Since every address containing
goo.com also contains @goo.com, I would expect the same number of
results.

I get this from both the Solr admin console and from my application, which
URL-encodes the query.

I Googled, searched the Wiki, and grepped the Pugh and Lucid books, but
don't see anything about this.  


Ideas?

Thanks!
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Different-behavior-for-q-goo-com-vs-q-goo-com-in-queries-tp2168935p2168935.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: StreamingUpdateSolrServer - connection refused

2010-12-30 Thread Christopher Gross
My current solution is to use the ping() function -- which doesn't run in a
thread -- to test the connection before trying to send the data to the Solr
index.  It isn't elegant, but it works.

If anyone has a better idea, I'd like to hear it.

-- Chris


On Thu, Dec 30, 2010 at 11:10 AM, Christopher Gross cogr...@gmail.comwrote:

 Hi all.

   I have designed a synchronizer that goes out to various databases,
 extracts some data, does some processing, and then uses the
 StreamingUpdateSolrServer to send the records to a Solr index.  When
 everything is up, it works just fine.

   Now I'm trying to account for problems, like if the Solr index is down
 for some reason.  I'm trying to get it so that I can keep track of items
 that were missed for synchronization, so that they can get replayed when the
 index comes back up.  Before I was using the StreamingUpdateSolrServer, I'd
 get a message back immediately that there was a problem.  Now that it
 streams, the error comes outside of the thread that I'm using to run the
 updates, so I can't catch the exception, so my thread isn't aware of the
 error, and just chugs along.

   Has anyone run into this problem?  Is there anything built into the
 server so that I can know which records it was unable to send, and then pull
 them out in order to try running them again later?  Any insight that anyone
 has would be greatly appreciated.

   Thanks!

 -- Christopher Gross



Re: Different behavior for q=goo.com vs q=@goo.com in queries?

2010-12-30 Thread Erick Erickson
What steps have you taken to figure out whether the
contents of your index are what you think? I suspect
that the fields you're indexing aren't being
analyzed/tokenized quite the way you expect either at
query time or index time (or maybe both!).

Take a look at the admin/analysis page for the field you're indexing
the data into. If that doesn't shed any light on the problem,
please paste in the fieldType definition for the field in question,
maybe another set of eyes can see the issue.

Best
Erick

On Thu, Dec 30, 2010 at 2:11 PM, mrw mikerobertsw...@gmail.com wrote:


 Using Lucid's Solr 1.4 distribution, if I index my email inbox and then
 search it by passing in different email expressions, I notice that I get
 different results based on whether the '@' character is included, even
 though the character is present in every email address in the field I'm
 searching.

 For example, q=goo.com returns multiple items, as expected.

 However, q...@goo.com return no results.  Since every address containing
 goo.com also contains @goo.com, I would expect the same number of
 results.

 I get this from both the Solr admin console and from my application, which
 URL-encodes the query.

 I Googled, searched the Wiki, and grepped the Pugh and Lucid books, but
 don't see anything about this.


 Ideas?

 Thanks!
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Different-behavior-for-q-goo-com-vs-q-goo-com-in-queries-tp2168935p2168935.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Re: Different behavior for q=goo.com vs q=@goo.com in queries?

2010-12-30 Thread mrw


Basically, just what you've suggested.  I did the field/query analysis piece
with verbose output.  Not entirely sure how to interpret the results, of
course.  Currently reading anything I can find on that.


Thanks


Erick Erickson wrote:
 
 What steps have you taken to figure out whether the
 contents of your index are what you think? I suspect
 that the fields you're indexing aren't being
 analyzed/tokenized quite the way you expect either at
 query time or index time (or maybe both!).
 
 Take a look at the admin/analysis page for the field you're indexing
 the data into. If that doesn't shed any light on the problem,
 please paste in the fieldType definition for the field in question,
 maybe another set of eyes can see the issue.
 
 Best
 Erick
 
 
 
 

-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Different-behavior-for-q-goo-com-vs-q-goo-com-in-queries-tp2168935p2169478.html
Sent from the Solr - User mailing list archive at Nabble.com.


post search using solrj

2010-12-30 Thread Don Hill
Hi. I am using solrj and it has been working fine. I now have a requirement
to add more parameters. So many that I get a max URI exceeded error. Is
there anyway using SolrQuery todo a http post so I don't have these issues?

don


Re: post search using solrj

2010-12-30 Thread Sascha SZOTT

Hi Don,

you could give the HTTP method to be used as a second argument to the 
QueryRequest constructor:


[http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/request/QueryRequest.html#QueryRequest(org.apache.solr.common.params.SolrParams,%20org.apache.solr.client.solrj.SolrRequest.METHOD)]

-Sascha


Don Hill wrote:

Hi. I am using solrj and it has been working fine. I now have a requirement
to add more parameters. So many that I get a max URI exceeded error. Is
there anyway using SolrQuery todo a http post so I don't have these issues?

don



Re: DIH for taxonomy faceting in Lucid webcast

2010-12-30 Thread Chris Hostetter

:  : NonFic/Science, how do I turn that into 0/NonFic
:   
:  : 1/NonFic/Science using the DIH?
:  
:  I don't have any specific suggestions for you -- i've never
...
: Thanks Chris.
: 
: What did you use to generate those encodings if not DIH?

I've used this gereral approach several times -- in most of them i had 
other people to write the code for me once i spec'ed it out, and i don't 
know how they actually implemented it.

IN two specific cases i do know of: one used custom loader code that was 
part of the publishing system for the authoritative data store 
(essentially building the individual terms in SolrInputDocuments and 
sending them to solr via SolrJ) and the other case i was doing a really 
super simple ingestor that batch processed directories of XML files, and 
used XSLT to transform them into Solr's XML format (i think the category 
breadcrumb fields relied heavily on using xsl:for-each/ in conjunction 
with the position() function)


-Hoss


Re: dynamic fields revisited

2010-12-30 Thread Lance Norskog
solr/admin/analysis.jsp uses the Luke handler. You can browse facets and fields.

On Wed, Dec 29, 2010 at 7:46 PM, Ahmet Arslan iori...@yahoo.com wrote:
 If I understand you correctly, for an INT dynamic field
 called *_int2
 filled with field callled my_number_int2 during data
 import
 in a query, I will search in the index on the field
 called:
   my_number_int2

 correct?


 Exactly.

 Using http://wiki.apache.org/solr/LukeRequestHandler you can retrieve real 
 field names under *_int2, if thats help.







-- 
Lance Norskog
goks...@gmail.com


Re: post search using solrj

2010-12-30 Thread Lance Norskog
Another way is to create a requestHandler entry point in
solrconfig.xml that includes lots of parameters in the defaults
section. This way your URLs only have things that change.

On Thu, Dec 30, 2010 at 3:12 PM, Sascha SZOTT sz...@gmx.de wrote:
 Hi Don,

 you could give the HTTP method to be used as a second argument to the
 QueryRequest constructor:

 [http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/request/QueryRequest.html#QueryRequest(org.apache.solr.common.params.SolrParams,%20org.apache.solr.client.solrj.SolrRequest.METHOD)]

 -Sascha


 Don Hill wrote:

 Hi. I am using solrj and it has been working fine. I now have a
 requirement
 to add more parameters. So many that I get a max URI exceeded error. Is
 there anyway using SolrQuery todo a http post so I don't have these
 issues?

 don





-- 
Lance Norskog
goks...@gmail.com


exception obtaining write lock on startup

2010-12-30 Thread Tri Nguyen
Hi,
 
I'm getting this exception when I have 2 cores as masters.  Seems like one of 
the cores obtains a lock (file) and then the other tries to obtain the same 
one.   However, the first one is not deleted.
 
How do I fix this?
 
Dec 30, 2010 4:34:48 PM org.apache.solr.handler.ReplicationHandler inform
WARNING: Unable to get IndexCommit on startup
org.apache.lucene.store.LockObtainFailedException: Lock obtain timed out: Native
fsl...@..\webapps\solr\tnsolr\data\index\lucene-fe3fc928a4bbfeb55082e49b32a70c10
-write.lock
    at org.apache.lucene.store.Lock.obtain(Lock.java:85)
    at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:1565)
    at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:1421)
    at org.apache.solr.update.SolrIndexWriter.init(SolrIndexWriter.java:19
1)
    at org.apache.solr.update.UpdateHandler.createMainIndexWriter(UpdateHand
ler.java:98)
    at org.apache.solr.update.DirectUpdateHandler2.openWriter(DirectUpdateHa
ndler2.java:173)
    at org.apache.solr.update.DirectUpdateHandler2.forceOpenWriter(DirectUpd
ateHandler2.java:376)
    at org.apache.solr.handler.ReplicationHandler.inform(ReplicationHandler.
 
 
Tri

Re: dynamic fields revisited

2010-12-30 Thread Dennis Gearon
When my Solr guru gets back, we'll redo the schema and see what happens, thanks!

 Dennis Gearon


Signature Warning

It is always a good idea to learn from your own mistakes. It is usually a 
better 
idea to learn from others’ mistakes, so you do not have to make them yourself. 
from 'http://blogs.techrepublic.com.com/security/?p=4501tag=nl.e036'


EARTH has a Right To Life,
otherwise we all die.



- Original Message 
From: Lance Norskog goks...@gmail.com
To: solr-user@lucene.apache.org
Sent: Thu, December 30, 2010 4:26:58 PM
Subject: Re: dynamic fields revisited

solr/admin/analysis.jsp uses the Luke handler. You can browse facets and fields.

On Wed, Dec 29, 2010 at 7:46 PM, Ahmet Arslan iori...@yahoo.com wrote:
 If I understand you correctly, for an INT dynamic field
 called *_int2
 filled with field callled my_number_int2 during data
 import
 in a query, I will search in the index on the field
 called:
   my_number_int2

 correct?


 Exactly.

 Using http://wiki.apache.org/solr/LukeRequestHandler you can retrieve real 
field names under *_int2, if thats help.







-- 
Lance Norskog
goks...@gmail.com



Re: exception obtaining write lock on startup

2010-12-30 Thread Lance Norskog
This will not work. At all.

You can only have one Solr core instance changing an index.

On Thu, Dec 30, 2010 at 4:38 PM, Tri Nguyen tringuye...@yahoo.com wrote:
 Hi,

 I'm getting this exception when I have 2 cores as masters.  Seems like one of 
 the cores obtains a lock (file) and then the other tries to obtain the same 
 one.   However, the first one is not deleted.

 How do I fix this?

 Dec 30, 2010 4:34:48 PM org.apache.solr.handler.ReplicationHandler inform
 WARNING: Unable to get IndexCommit on startup
 org.apache.lucene.store.LockObtainFailedException: Lock obtain timed out: 
 Native
 fsl...@..\webapps\solr\tnsolr\data\index\lucene-fe3fc928a4bbfeb55082e49b32a70c10
 -write.lock
     at org.apache.lucene.store.Lock.obtain(Lock.java:85)
     at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:1565)
     at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:1421)
     at 
 org.apache.solr.update.SolrIndexWriter.init(SolrIndexWriter.java:19
 1)
     at 
 org.apache.solr.update.UpdateHandler.createMainIndexWriter(UpdateHand
 ler.java:98)
     at 
 org.apache.solr.update.DirectUpdateHandler2.openWriter(DirectUpdateHa
 ndler2.java:173)
     at 
 org.apache.solr.update.DirectUpdateHandler2.forceOpenWriter(DirectUpd
 ateHandler2.java:376)
     at 
 org.apache.solr.handler.ReplicationHandler.inform(ReplicationHandler.


 Tri



-- 
Lance Norskog
goks...@gmail.com


Advice on Exact Matching?

2010-12-30 Thread Scott Gonyea
Hi,

I am trying to make sure that when I search for text—regardless of
what that text is—that I get an exact match.  I'm *still* getting some
issues, and this last mile is becoming very painful.  The solr field,
for which I'm setting this up on, is pasted below my explanation.  I
appreciate any help.

Explanation:

I'm crawling websites with Nutch.  I'm performing some
mechanical-turk-like filtering and term matching.  The problem is,
there's some very gnarly behavior in Solr due to any number of
gotchas.

If I want to find *all* Solr documents that match
[id]somejunk\hi[/id] then life is instantly hell.

Likewise, lots of whitespace in between words throws it off  john
says hello,  how are you?  I would love to be able to search for
these exact phrases.  If that's just not practical (I'm more than
willing to live with a bloated search index), what would some other
strategies be?

There's no MapReduce in Solr; I could attempt to do Hadoop-streaming,
but that's not very ideal for a variety of reasons.


Solr Schema.xml, fieldType text (no, this is not used everywhere;
only on 2 fields):


fieldType name=textclass=solr.TextField
positionIncrementGap=100
  analyzer
tokenizer class=solr.WhitespaceTokenizerFactory/
filter class=solr.WordDelimiterFilterFactory
generateWordParts=1 generateNumberParts=1 catenateWords=1
catenateNumbers=1 catenateAll=1 splitOnCaseChange=1/
filter class=solr.EnglishPorterFilterFactory
protected=protwords.txt/
filter class=solr.SynonymFilterFactory
synonyms=synonyms.txt expand=true ignoreCase=true/
filter class=solr.LowerCaseFilterFactory/
  /analyzer
/fieldType


Thank you,
Scott Gonyea


Re: Advice on Exact Matching?

2010-12-30 Thread Erick Erickson
You can gain a lot of insight into this kind of thing with the
admin/analysis page. Often the issue is that your tokenizing/
filtering isn't doing quite what you think. Try turning on the
debug checkboxes on that page and seeing what tokens are
generated at index and analysis page.

In particular, WordDelimiterFactory is often a surprise in how
it splits and recombines tokens. Including synonyms is another
potential issue. Not to mention the EnglishPorterFilterFactory.

If that's not helpful, could you paste some examples that you expect
to match that don't?

Best
Erick

On Thu, Dec 30, 2010 at 8:04 PM, Scott Gonyea sc...@aitrus.org wrote:

 Hi,

 I am trying to make sure that when I search for text—regardless of
 what that text is—that I get an exact match.  I'm *still* getting some
 issues, and this last mile is becoming very painful.  The solr field,
 for which I'm setting this up on, is pasted below my explanation.  I
 appreciate any help.

 Explanation:

 I'm crawling websites with Nutch.  I'm performing some
 mechanical-turk-like filtering and term matching.  The problem is,
 there's some very gnarly behavior in Solr due to any number of
 gotchas.

 If I want to find *all* Solr documents that match
 [id]somejunk\hi[/id] then life is instantly hell.

 Likewise, lots of whitespace in between words throws it off  john
 says hello,  how are you?  I would love to be able to search for
 these exact phrases.  If that's just not practical (I'm more than
 willing to live with a bloated search index), what would some other
 strategies be?

 There's no MapReduce in Solr; I could attempt to do Hadoop-streaming,
 but that's not very ideal for a variety of reasons.


 Solr Schema.xml, fieldType text (no, this is not used everywhere;
 only on 2 fields):


fieldType name=textclass=solr.TextField
 positionIncrementGap=100
  analyzer
tokenizer class=solr.WhitespaceTokenizerFactory/
filter class=solr.WordDelimiterFilterFactory
 generateWordParts=1 generateNumberParts=1 catenateWords=1
 catenateNumbers=1 catenateAll=1 splitOnCaseChange=1/
filter class=solr.EnglishPorterFilterFactory
 protected=protwords.txt/
filter class=solr.SynonymFilterFactory
 synonyms=synonyms.txt expand=true ignoreCase=true/
filter class=solr.LowerCaseFilterFactory/
  /analyzer
/fieldType


 Thank you,
 Scott Gonyea



Question on long delta import

2010-12-30 Thread Mark
When using DIH my delta imports appear to finish quickly.. ie it says 
Indexing completed. Added/Updated: 95491 documents. Deleted 11148 
documents. in a relatively short amount of time (~30mins).


However the importMessage says A command is still running... for a 
really long time (~60mins). What is happening during this phase and how 
could I speed this up?


Thanks!