Re: Configuring RequestHandler in solrconfig.xml OR in the Servlet code using SolrJ

2010-06-21 Thread Abdelhamid ABID
Why would someone port the solr config into servlet code  ?
IMO the first option would be the best choice, one obvious reason is that,
when alter the solr config you only need to restart the server, whereas
changing in the source drive you to redeploy your app and restart the
server.



On 6/21/10, Saïd Radhouani  wrote:
>
> Hello,
>
> I'm developing a Web application that communicate with Solr using SolrJ. I
> have three search interfaces, and I'm facing two options:
>
> 1- Configuring one SearchHandler per search interface in solrconfig.xml
>
> Or
>
> 2- Write the configuration in the java servlet code that is using SolrJ
>
> It there any significant difference between these two options ? If yes,
> what's the best choice?
>
> Thanks,
>
> -Saïd




-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: how to patch?

2010-06-13 Thread Abdelhamid ABID
check this
http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201003.mbox/%3c495b6571003260900p2d1e7a01ib5aa211f84379...@mail.gmail.com%3e

2010/6/13 Li Li 

> I want to use fast highlighter in solr1.4 and find a issue in
> https://issues.apache.org/jira/browse/SOLR-1268
>  File Name   Date Attached  ↑
> Attached By   Size
>   SOLR-1268.patch   2010-02-05 10:32 PM
> Koji Sekiguchi4 kB
>   SOLR-1268-0_fragsize.patch   2010-02-04 10:43 PM
> Koji Sekiguchi2 kB
>   SOLR-1268-0_fragsize.patch   2010-01-29 11:51 PM
> Koji Sekiguchi1 kB
>   SOLR-1268.patch  2010-01-03 11:27 PM
> Koji Sekiguchi   48 kB
>   SOLR-1268.patch  2010-01-02 10:42 PM
> Koji Sekiguchi   35 kB
>
> I am not familiar with patch. There are 3 patch named SOLR-1268, which
> one shoud I use? I guess I should the newest one but it's so small.
> SOLR-1268-0_fragsize also have 2 files
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Not able to access Solr Admin

2010-06-02 Thread Abdelhamid ABID
When you access from another machine what message error do you get ?

Check your remote access with Telnet to see if the server respond

On Wed, Jun 2, 2010 at 10:26 PM, Bondiga, Murali <
murali.krishna.bond...@hmhpub.com> wrote:

> Thank you so much for the reply.
>
> I am using Jetty which comes with Solr installation.
>
> http://localhost:8983/solr/
>
> The above URL works fine.
>
> The below URL does not work:
>
> http://177.44.9.119:8983/solr/
>
>
> -----Original Message-
> From: Abdelhamid ABID [mailto:aeh.a...@gmail.com]
> Sent: Wednesday, June 02, 2010 5:07 PM
> To: solr-user@lucene.apache.org
> Subject: Re: Not able to access Solr Admin
>
> details... detailseverybody let's say details !
>
> Which app server are you using ?
> What is the error message that you get when trying to access solr admin
> from
> another machine  ?
>
>
>
> On Wed, Jun 2, 2010 at 9:39 PM, Bondiga, Murali <
> murali.krishna.bond...@hmhpub.com> wrote:
>
> > Hi,
> >
> > I installed Solr Server on my machine and able to access with localhost.
> I
> > tried accessing from a different machine with IP Address but not able to
> > access it. What do I need to do to be able to access the Solr instance
> from
> > any machine within the network?
> >
> > Thanks,
> > Murali
> >
>
>
>
> --
> Abdelhamid ABID
> Software Engineer- J2EE / WEB
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Not able to access Solr Admin

2010-06-02 Thread Abdelhamid ABID
details... detailseverybody let's say details !

Which app server are you using ?
What is the error message that you get when trying to access solr admin from
another machine  ?



On Wed, Jun 2, 2010 at 9:39 PM, Bondiga, Murali <
murali.krishna.bond...@hmhpub.com> wrote:

> Hi,
>
> I installed Solr Server on my machine and able to access with localhost. I
> tried accessing from a different machine with IP Address but not able to
> access it. What do I need to do to be able to access the Solr instance from
> any machine within the network?
>
> Thanks,
> Murali
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Solr 1.4 query fails against all fields, but succeed if field is specified.

2010-05-31 Thread Abdelhamid ABID
Check your request handler setting, what do you have in the query field (qf)
entry ?

On 5/31/10, olivier sallou  wrote:
>
> Hi,
> I have created in index with several fields.
> If I query my index in the admin section of solr (or via http request), I
> get results for my search if I specify the requested field:
> Query:   note:Aspergillus  (look for "Aspergillus" in field "note")
> However, if I query the same word against all fields  ("Aspergillus" or
> "all:Aspergillus") , I have no match in response from Solr.
>
> Do you have any idea of what can be wrong with my index?
>
> Regards
>
>
> Olivier
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Solr on Tomcat, how to use an external data directory?

2010-05-30 Thread Abdelhamid ABID
.. and to unset dataDir just leave it blank
${solr.data.dir:}

On Sun, May 30, 2010 at 12:15 AM, Chris Hostetter
wrote:

>
> : Most likely you have missed to point data directory in solrconf.xml,
> : this should help :
> : http://wiki.apache.org/solr/SolrConfigXml#dataDir_parameter
>
> right .. double check what the dataDir setting looks like ... if it's
> unset it uses "data" in your solr instance directory, but if it is set,
> it's (unfortunately) evaluated relative to the "current working directory"
> of your servlet container and some versions of solr had "./data" listed in
> the example solrconfig.xml
>
>
>
> -Hoss
>
>


-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Solr on Tomcat, how to use an external data directory?

2010-05-29 Thread Abdelhamid ABID
Most likely you have missed to point data directory in solrconf.xml,
this should help :
http://wiki.apache.org/solr/SolrConfigXml#dataDir_parameter

On Sat, May 29, 2010 at 9:48 PM, jlist9  wrote:

> My solr index works fine with the embedded Jetty. I'm trying to move the
> index to Tomcat. Following the wiki page
> http://wiki.apache.org/solr/SolrTomcat,
> I put this line in setenv.bat:
>
> set JAVA_OPTS=%JAVA_OPTS% "-Dfile.encoding=UTF-8"
> "-Dsolr.solr.home=D:\opt\solr\example"
>
> Tomcat seems to be picking it up. If I point it to a non-existent
> directory or an empty
> directory, Tomcat complains about it in console log.
>
> However, the /solr/admin/stats.jsp page still shows that solr is using a
> default
> directory, which is unsurprisingly located in tomcat bin dir:
>
> readerDir : org.apache.lucene.store.simplefsdirect...@d
> :\Java\apache-tomcat-6.0.20\bin\solr\data\index
>
> Any idea what else I need to do to use an external solr index
> directory in Tomcat?
>
> Thanks,
> Jack
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Non-English query via Solr Example Admin corrupts text

2010-05-20 Thread Abdelhamid ABID
I had had the same issue  within tomcat, further to what Ahmet wrote I
recommend to plug a filter in your solr context that forces responses and
requests to be encodded in UTF8

On Thu, May 20, 2010 at 5:11 PM, Ahmet Arslan  wrote:

> In my SolrJ using application, I have a
> test case which queries for “numéro” and
> succeeds if I use Embedded and fails if I use CommonsHttpSolrServer… I
> don’t want to use embedded for a number of reasons including that its not
> recommended (http://wiki.apache.org/solr/EmbeddedSolr)
>
>
>
> I am sorry if you’d dealt with this issue in the past,
> I’ve spent a few hours googling for solr
> utf-8 query and glassfishv3 utf-8
> uri  plus other permutations/combinations but there were
> seemingly endless amounts of chaff that I couldn’t find anything useful
> after scouring it for a few hours.  I can’t
> decide whether it’s a glassfish issue or not so I am not sure where to
> direct my energy.  Any tips or advice are appreciated!
>
>   I have never used glassfish but I am pretty sure it is a glassfish issue.
> The same thing happens in Tomcat if you don't set URIEncoing="UTF-8".
>
> http://wiki.apache.org/solr/SolrTomcat#URI_Charset_Confighttp://forums.java.net/jive/thread.jspa?threadID=38020http://wiki.glassfish.java.net/Wiki.jsp?page=FaqHttpRequestParameterEncoding
>
>
>
>
>
>
>
>
>
>
>




-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Any way to get top 'n' queries searched from Solr?

2010-04-30 Thread Abdelhamid ABID
Hi,
Why you don't just create a filter in the solr context, by this way you can
grasp user q param and persist it.

On 4/30/10, pk  wrote:
>
>
> Peter,
> It seems that your solution (SOLR-1872) requires authentication too (and be
> tracked via ur uuid), but my users will be general public using browsers,
> and i can't force any such auth restrictions. Also you didn't mention if
> you
> are already persisting the audit data.. Or i may need to extend it to work
> for my problem..
>
> My requirement is simple: to know top n query strings with their
> frequencies
> etc..
> Thanks though.
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Any-way-to-get-top-n-queries-searched-from-Solr-tp767165p767482.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Solr Spellcheck on Large index size

2010-04-27 Thread Abdelhamid ABID
Hi,
With the spellcheck.build=true, IMO solr will build the spellcheck
disctionnary at each request, so with the 29m documents solr can popup from
the server with some error like  "I quit"  :)
I would build the dictionnary once after data index creation, you may set
this option to the spell request handler in the solrconfig

true



On 4/27/10, Kyle J G  wrote:
>
>
> I am trying to create a spell checker for my companies website.
>
> Currently there are approx 29million documents in the index.
>
> When trying to create the spelling index it just seems to skip over the
> command.
>
> My fields in schema.xml look like the following:
>
> 
>  required="true" />
>  required="true" />
>  stored="true"
> required="true" />
>  stored="true" required="true" />
>  required="true"
> />
>  required="true" />
>  required="true" />
>  />
>  required="true" />
>  required="true" />
>  required="true" />
>  required="true"
> />
>  required="true" />
>  required="true" />
>  required="true" />
>  required="true" />
>  required="true" />
>
> And copying fields as such:
>
>
>
>
>
> 
>
> My spell checker config looks like the following:
>
> 
>
> 
>
> 
>   default
>   spell
>   true
>   true
>   
> name="spellcheckIndexDir">C:\Users\kyleg\apache-solr-1.4.0\productGroups\solr\data\spellchecker
> 
>
> 
>
> 
> 
>   solr.FileBasedSpellChecker
>   file
>   spellings.txt
>   UTF-8
>   ./spellcheckerFile
> 
>   
>
>
> The command that I am sending to try to build looks like the following:
>
> http://localhost:8983/solr/spell/?q=ACORA&version=2.2&start=0&rows=10&indent=on&spellcheck=true&spellcheck.dictionary=default&spellcheck.build=true&spellcheck.collate=true&spellcheck.limit=5
>
>
> I have also tried to reduce the size of the index to around 10,000
> documents
> and still no luck.
>
> Any help would be appreciated.
>
> Thank you,
> Kyle
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Solr-Spellcheck-on-Large-index-size-tp760416p760416.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: "Solr 1.4 Enterprise Search Server" book examples

2010-04-26 Thread Abdelhamid ABID
Hi,
I'm also interested to get those examples, would someone to share them ?

On 4/26/10, markus.rietz...@rzf.fin-nrw.de 
wrote:
>
> i have send you a private mail.
>
> markus
>
> > -Ursprüngliche Nachricht-
> > Von: Johan Cwiklinski [mailto:johan.cwiklin...@ajlsm.com]
> > Gesendet: Montag, 26. April 2010 10:58
> > An: solr-user@lucene.apache.org
> > Betreff: "Solr 1.4 Enterprise Search Server" book examples
> >
> > Hello,
> >
> > We've recently acquired the "Solr 1.4 Enterprise Search Server" book.
> >
> > I've tried to download the example ZIP file from the editor's website,
> > but the file is actually corrupted, and I cannot unzip it :(
> >
> > Could someone tell me if I can get these examples from
> > another location?
> >
> > I've send a message last week to the editor reporting the issue, but
> > that is not yet fixed ; and I'd really like to take a look at the
> > example code and make some tests.
> >
> > Regards,
> > --
> > Johan Cwiklinski
> >
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: mac osx server + tomcat + solr

2010-04-21 Thread Abdelhamid ABID
This is my last shot !   your solr webapp doesn't know where to locate solr
home, solr home must have at least this two folders:
  conf/
 solrconfig.xml
 schema.xml
  data/
   (after indexing) index/
 spellchecker/

try to ping your solr using  http://localhost:../solr/admin/ping.
As your solr webapp run without error as you mentioned before, it's most
likely you are having a miss configured solr home.



On 4/21/10, Jan Kammer  wrote:
>
> Is also correct.
>
> Abdelhamid ABID schrieb:
>
>  Well, check your solr config file if the folder "Data" is correctly
>> defined,
>> you may set it like this: ${solr.data.dir:}
>>
>> On 4/21/10, Jan Kammer  wrote:
>>
>>
>>> Exactly that i got in the file but it doesn't work :-(
>>>
>>> Abdelhamid ABID schrieb:
>>>
>>>  The path paramater defines your context path, "/solr" for example, what
>>>
>>>
>>>> you
>>>> are missing, I think, is the environment  variable 'solr/home' which
>>>> points
>>>> to solr home (again) and not the solr web-app. solr/home may be set in
>>>> your
>>>> xml-file  like this
>>>> 
>>>>  >>> value="PATH_TO_SOLR_HOME" override="false"/>
>>>> 
>>>>
>>>> PATH_TO_SOLR_HOME must contains "conf" and "data" folders
>>>>
>>>> On 4/21/10, Jan Kammer  wrote:
>>>>
>>>>
>>>>
>>>>
>>>>> hi,
>>>>> thanks for your answer.
>>>>> I defined the path in an xml-file in Tomcat/conf/Catalina/localhost/ I
>>>>> read
>>>>> that this must be the right place. Could there be anywhere an other
>>>>> mistake?
>>>>>
>>>>> greetz, Jan
>>>>>
>>>>> Abdelhamid ABID schrieb:
>>>>>
>>>>>  You are missing the  "solr/home" context paramater , which points to
>>>>> the
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>> folder (solr) holding "data" and "conf"
>>>>>>
>>>>>> On 4/21/10, Jan Kammer  wrote:
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>> Hi there,
>>>>>>>
>>>>>>> i got a problem with the combination solr + tomcat under mac osx
>>>>>>> server
>>>>>>> (snow leopard).
>>>>>>> Versions: tomcat 6.0 and newest solr.
>>>>>>>
>>>>>>> Solr works fine with jetty. But with the same solr i get the error
>>>>>>> "HTTP
>>>>>>> Status 404 - missing core name in path". I see the first site, but
>>>>>>> when
>>>>>>> i
>>>>>>> click on "Solr Admin", the described error appears.
>>>>>>> I asked google,  but didn't find the right answer to solve my
>>>>>>> problem.
>>>>>>>
>>>>>>> thanks in advance.
>>>>>>>
>>>>>>> Greetz, Jan
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>
>>
>>
>>
>
>


-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: mac osx server + tomcat + solr

2010-04-21 Thread Abdelhamid ABID
Well, check your solr config file if the folder "Data" is correctly defined,
you may set it like this: ${solr.data.dir:}

On 4/21/10, Jan Kammer  wrote:
>
> Exactly that i got in the file but it doesn't work :-(
>
> Abdelhamid ABID schrieb:
>
>  The path paramater defines your context path, "/solr" for example, what
>> you
>> are missing, I think, is the environment  variable 'solr/home' which
>> points
>> to solr home (again) and not the solr web-app. solr/home may be set in
>> your
>> xml-file  like this
>> 
>>  > value="PATH_TO_SOLR_HOME" override="false"/>
>> 
>>
>> PATH_TO_SOLR_HOME must contains "conf" and "data" folders
>>
>> On 4/21/10, Jan Kammer  wrote:
>>
>>
>>> hi,
>>> thanks for your answer.
>>> I defined the path in an xml-file in Tomcat/conf/Catalina/localhost/ I
>>> read
>>> that this must be the right place. Could there be anywhere an other
>>> mistake?
>>>
>>> greetz, Jan
>>>
>>> Abdelhamid ABID schrieb:
>>>
>>>  You are missing the  "solr/home" context paramater , which points to the
>>>
>>>
>>>> folder (solr) holding "data" and "conf"
>>>>
>>>> On 4/21/10, Jan Kammer  wrote:
>>>>
>>>>
>>>>
>>>>
>>>>> Hi there,
>>>>>
>>>>> i got a problem with the combination solr + tomcat under mac osx server
>>>>> (snow leopard).
>>>>> Versions: tomcat 6.0 and newest solr.
>>>>>
>>>>> Solr works fine with jetty. But with the same solr i get the error
>>>>> "HTTP
>>>>> Status 404 - missing core name in path". I see the first site, but when
>>>>> i
>>>>> click on "Solr Admin", the described error appears.
>>>>> I asked google,  but didn't find the right answer to solve my problem.
>>>>>
>>>>> thanks in advance.
>>>>>
>>>>> Greetz, Jan
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>
>>
>>
>>
>
>


-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: mac osx server + tomcat + solr

2010-04-21 Thread Abdelhamid ABID
The path paramater defines your context path, "/solr" for example, what you
are missing, I think, is the environment  variable 'solr/home' which points
to solr home (again) and not the solr web-app. solr/home may be set in your
xml-file  like this

  


PATH_TO_SOLR_HOME must contains "conf" and "data" folders

On 4/21/10, Jan Kammer  wrote:
>
> hi,
> thanks for your answer.
> I defined the path in an xml-file in Tomcat/conf/Catalina/localhost/ I read
> that this must be the right place. Could there be anywhere an other mistake?
>
> greetz, Jan
>
> Abdelhamid ABID schrieb:
>
>  You are missing the  "solr/home" context paramater , which points to the
>> folder (solr) holding "data" and "conf"
>>
>> On 4/21/10, Jan Kammer  wrote:
>>
>>
>>> Hi there,
>>>
>>> i got a problem with the combination solr + tomcat under mac osx server
>>> (snow leopard).
>>> Versions: tomcat 6.0 and newest solr.
>>>
>>> Solr works fine with jetty. But with the same solr i get the error "HTTP
>>> Status 404 - missing core name in path". I see the first site, but when i
>>> click on "Solr Admin", the described error appears.
>>> I asked google,  but didn't find the right answer to solve my problem.
>>>
>>> thanks in advance.
>>>
>>> Greetz, Jan
>>>
>>>
>>>
>>
>>
>>
>>
>>
>
>


-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: mac osx server + tomcat + solr

2010-04-21 Thread Abdelhamid ABID
You are missing the  "solr/home" context paramater , which points to the
folder (solr) holding "data" and "conf"

On 4/21/10, Jan Kammer  wrote:
>
> Hi there,
>
> i got a problem with the combination solr + tomcat under mac osx server
> (snow leopard).
> Versions: tomcat 6.0 and newest solr.
>
> Solr works fine with jetty. But with the same solr i get the error "HTTP
> Status 404 - missing core name in path". I see the first site, but when i
> click on "Solr Admin", the described error appears.
> I asked google,  but didn't find the right answer to solve my problem.
>
> thanks in advance.
>
> Greetz, Jan
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Jetty, Tomcat or JBoss?

2010-04-20 Thread Abdelhamid ABID
Which are JEE Web components, aren't they?

On 4/20/10, Bill Au  wrote:
>
> Solr only uses Servlet and JSP.
>
>
> Bill
>
>
> On Sat, Apr 17, 2010 at 9:11 AM, Abdelhamid ABID 
> wrote:
>
> > Solr does use JEE WEB components
> >
> > On 4/17/10, Lukáš Vlček  wrote:
> > >
> > > Hi,
> > >
> > > may be you should be aware that JBoss AS is using Tomcat for web
> > container
> > > (with modified classloader), so if your web application is running
> inside
> > > JBoss AS then it is in fact running in Tomcat.
> > > I don't think Solr uses JEE technologies provided by JEE Application
> > server
> > > (JMS, Transaction services, pooling services, clustered EJB... etc...).
> > All
> > > it requires is web container AFAIK. This being said it will always take
> > > longer for application server to start and it will require more
> resources
> > > as
> > > opposed to lightweight web container.
> > >
> > > Regards,
> > > Lukas
> > >
> > >
> > > On Sat, Apr 17, 2010 at 11:08 AM, Andrea Gazzarini <
> > > andrea.gazzar...@atcult.it> wrote:
> > >
> > > > Hi all,
> > > > I have a web application which is basically a (user) search interface
> > > > towards SOLR.
> > > > My index is something like 7GB and has a lot of records so apart
> other
> > > > things like optiming SOLR schema, config ,clustering etc... I'd like
> to
> > > keep
> > > > SOLR installation as light as possible.
> > > > At the moment my SOLR instance is running under JBoss but I saw that
> > > > running under the bundled Jetty it takes a very little amount of
> memory
> > > (at
> > > > least at startup and after one hour of usage)
> > > >
> > > > So my questions is: since SOLR is using JEE web components what are
> the
> > > > drawback of using the following architecture?
> > > >
> > > > -My Application (Full JEE application with web components and EJB) on
> > > > JBoss;
> > > > - SOLR on Jetty or Tomcat
> > > >
> > > > Having said that and supposing that the idea is good, what are the
> main
> > > > differences / advantages / disadvamtages (from this point of view)
> > > between
> > > > Tomcat and Jetty?
> > > >
> > > > Best Regards,
> > > > Andrea
> > > >
> > > >
> > >
> >
> >
> >
> > --
> > Abdelhamid ABID
> >
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB


Re: Jetty, Tomcat or JBoss?

2010-04-17 Thread Abdelhamid ABID
Solr does use JEE WEB components

On 4/17/10, Lukáš Vlček  wrote:
>
> Hi,
>
> may be you should be aware that JBoss AS is using Tomcat for web container
> (with modified classloader), so if your web application is running inside
> JBoss AS then it is in fact running in Tomcat.
> I don't think Solr uses JEE technologies provided by JEE Application server
> (JMS, Transaction services, pooling services, clustered EJB... etc...). All
> it requires is web container AFAIK. This being said it will always take
> longer for application server to start and it will require more resources
> as
> opposed to lightweight web container.
>
> Regards,
> Lukas
>
>
> On Sat, Apr 17, 2010 at 11:08 AM, Andrea Gazzarini <
> andrea.gazzar...@atcult.it> wrote:
>
> > Hi all,
> > I have a web application which is basically a (user) search interface
> > towards SOLR.
> > My index is something like 7GB and has a lot of records so apart other
> > things like optiming SOLR schema, config ,clustering etc... I'd like to
> keep
> > SOLR installation as light as possible.
> > At the moment my SOLR instance is running under JBoss but I saw that
> > running under the bundled Jetty it takes a very little amount of memory
> (at
> > least at startup and after one hour of usage)
> >
> > So my questions is: since SOLR is using JEE web components what are the
> > drawback of using the following architecture?
> >
> > -My Application (Full JEE application with web components and EJB) on
> > JBoss;
> > - SOLR on Jetty or Tomcat
> >
> > Having said that and supposing that the idea is good, what are the main
> > differences / advantages / disadvamtages (from this point of view)
> between
> > Tomcat and Jetty?
> >
> > Best Regards,
> > Andrea
> >
> >
>



-- 
Abdelhamid ABID


Re: closest terms, sentence, boosting 'business' keywords instead of field ?

2010-04-13 Thread Abdelhamid ABID
>
> Do you have an example of what you are trying to do?
>>>
>>>


For instance a request like: "tomcat servlet" should return document which
>>>>>> have "tomcat is a servlet container" rather than a document that 
>>>>>> have"tomcat offers the last specification implementaion of the servlet
>>>>>> technology", at  least this last should not come before the first in
>>>>>> results.
>>>>>
>>>>>


> Boost in the query or during indexing?
>>>
>>>
I think boosting in query is more relevant.
>>>>>
>>>>>
Thanks




On 4/13/10, Grant Ingersoll  wrote:
>
>
> On Apr 12, 2010, at 7:57 PM, Abdelhamid ABID wrote:
>
> > Hi,
> > - I'm bit confused on how analyzer apply filters on query, I know that
> they
> > are applied in order on which they are declared, but still, does the
> search
> > result
> > include only the final digest of the filters chain or at each filter step
> > solr add the matched term to results set.
>
>
> It is the final output.
>
>
> >
> > - Does Dismax request handler support quoted keywords ? if not, how can I
> > search for an exact sentence using dismax.
>
>
> It does.
>
>
> > - How to match a request with the documents that only have keywords that
> > appears in the closest positions.
>
>
> Do you have an example of what you are trying to do?
>
>
> > - How can I boost a set of keywords instead of fields? this would be
> useful
> > in case where a document with one single searchable field, which is of
> type
> > text and where "boosting field" has no sense.
>
>
> Boost in the query or during indexing?
>
>
> --
> Grant Ingersoll
> http://www.lucidimagination.com/
>
> Search the Lucene ecosystem using Solr/Lucene:
> http://www.lucidimagination.com/search
>
>


-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


closest terms, sentence, boosting 'business' keywords instead of field ?

2010-04-12 Thread Abdelhamid ABID
Hi,
- I'm bit confused on how analyzer apply filters on query, I know that they
are applied in order on which they are declared, but still, does the search
result
include only the final digest of the filters chain or at each filter step
solr add the matched term to results set.

- Does Dismax request handler support quoted keywords ? if not, how can I
search for an exact sentence using dismax.
- How to match a request with the documents that only have keywords that
appears in the closest positions.
- How can I boost a set of keywords instead of fields? this would be useful
in case where a document with one single searchable field, which is of type
text and where "boosting field" has no sense.



-- 
Elsadek
Software Engineer- J2EE / WEB


Re: Trouble compiling SOLR

2010-03-26 Thread Abdelhamid ABID
Is there a trouble in downloading Ant 1.7 and setting the new ANT_HOME ??

On 3/26/10, Shawn Heisey  wrote:
>
> I have checked out trunk from the new svn location that has the merged
> lucene/solr.  When I try to build it, it gets a certain distance through the
> build and spits out this:
>
> BUILD FAILED
> /opt/ncindex/src/solr/trunk/solr/common-build.xml:178: The  type
> doesn't support the nested "resources" element.
>
> The only useful thing I have found relating to this suggests that I need
> ant 1.7.  Is this the case?  I really hope it isn't, because CentOS 5.4,
> being the latest version of that OS, has 1.6.5.
>
>
> http://www.coderanch.com/t/108791/Ant-Maven-Other-Build-Tools/copy-folders-with-ant
>
> Thanks,
> Shawn
>
>


-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: how to apply patch SOLR-1316

2010-03-26 Thread Abdelhamid ABID
Well, The patch itself is a plain text by which a patch tool use to update a
project source, it's editable with any text editor.
You understand by this that you have to build the project yoursef after
applying the patch.

In general you need to:
- Download an IDE with svn client, Eclipse is a great one.
- Create a project from SVN by connecting to the Solr svn repo at
http://svn.apache.org/repos/asf/lucene/solr and checkout the version that
fulfill your requirements. Solr1.4 is under 'tag' folder.

- Download the patch you want to be applied.
- In your IDE, right click (not left click :) ) on the src folder, and
select 'team' or 'versioning' or what ever to apply the patch.
- Build using Eclipse or directly with Ant from the project folder in the
command line.


After successful build you can take your new brand Solr from dist folder
located in the project home.

I hope that this very very quick guide will be of help to you. I suggest
that you read further on  Java development life-cycle and tools.


On 3/26/10, nabil rabhi  wrote:
>
> thanks for the reply Abdelhamid, but could you give me more details please?
>
> 2010/3/26 Abdelhamid ABID 
>
>
> > To apply the patch you need the source, you may have it from Solr svn,
> > using
> > an ide  will greatly make things friendly,
> > Applying the patch is a matter of left click on the project and .. "apply
> > patch" !
> >
> > On 3/26/10, nabil rabhi  wrote:
> > >
> > > I want to apply this patch
> > http://issues.apache.org/jira/browse/SOLR-1316
> > > to solr 1.4 so I can implement the autocomplete feature
> > > ca anyone help?
> > >
> >
> >
> >
> > --
> > Abdelhamid ABID
> > Software Engineer- J2EE / WEB / ESB MULE
> >
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: how to apply patch SOLR-1316

2010-03-26 Thread Abdelhamid ABID
To apply the patch you need the source, you may have it from Solr svn, using
an ide  will greatly make things friendly,
Applying the patch is a matter of left click on the project and .. "apply
patch" !

On 3/26/10, nabil rabhi  wrote:
>
> I want to apply this patch http://issues.apache.org/jira/browse/SOLR-1316
> to solr 1.4 so I can implement the autocomplete feature
> ca anyone help?
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: Solrj doesn't tell if PDF was actually parsed by Tika

2010-03-26 Thread Abdelhamid ABID
Well done : https://issues.apache.org/jira/browse/SOLR-1847

meanwhile, is there any workaround ?

On 3/26/10, Lance Norskog  wrote:
>
> Please file a bug for this on the JIRA.
>
> https://issues.apache.org/jira/secure/Dashboard.jspa
>
>
> On Thu, Mar 25, 2010 at 7:21 AM, Abdelhamid  ABID 
> wrote:
> > Hi,
> > When posting pdf files using solrj the only response we get from Solr is
> > only server response status, but never know whether
> > pdf was actually parsed or not, checking the log I found that some Tika
> > wasn't able
> > to succeed with some pdf files because of content nature (texts in images
> > only) or are corrupted:
> >
> > 25 mars 2010 14:54:07 org.apache.pdfbox.util.PDFStreamEngine
> > processOperator
> > INFO: unsupported/disabled operation: EI
> >
> > 25 mars 2010 14:54:02 org.apache.pdfbox.filter.FlateFilter decode
> > GRAVE: Stop reading corrupt stream
> >
> >
> > The question is how can I catch these kinds of exceptions through Solrj ?
> >
> > --
> > Elsadek
> >
>
>
>
>
> --
> Lance Norskog
> goks...@gmail.com
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Solrj doesn't tell if PDF was actually parsed by Tika

2010-03-25 Thread Abdelhamid ABID
Hi,
When posting pdf files using solrj the only response we get from Solr is
only server response status, but never know whether
pdf was actually parsed or not, checking the log I found that some Tika
wasn't able
to succeed with some pdf files because of content nature (texts in images
only) or are corrupted:

 25 mars 2010 14:54:07 org.apache.pdfbox.util.PDFStreamEngine
processOperator
 INFO: unsupported/disabled operation: EI

 25 mars 2010 14:54:02 org.apache.pdfbox.filter.FlateFilter decode
 GRAVE: Stop reading corrupt stream


The question is how can I catch these kinds of exceptions through Solrj ?

-- 
Elsadek


Re: How to compose a query from multiple HTTP URL parameters?

2010-03-25 Thread Abdelhamid ABID
Hi,
You may implement this alternative by using "URL rewrite" mechanism either
by coding your own filter or by pulling back your servlet
engine behind the Apache httpd in order to benefit from mod_rewrite.


On 3/25/10, Conal Tuohy  wrote:
>
> I would like to be able to specify a query over multiple fields using just
> an HTML form with different parameters for the different fields.
>
> Is it possible to configure Solr to accept a URL of this form:
>
> select?Species=Pseudonaja+textilis&Hospital=Griffith+Base+Hospital
>
> ... instead of:
>
> q=Species:'Pseudonaja+textilis'+Hospital:'Griffith+Base+Hospital'
>
> I have read some information about parameter indirection or parameter
> dereferencing, but I haven't been able to get it to work.
>
>
>


-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: Query interface

2010-03-22 Thread Abdelhamid ABID
Hi,
I think there isn't better than using XSLT as a mean to query solr and
render results.
Within an xslt file you would combine search form with search results in one
place, by this way you free the server from the heavy duty tasks of xslt
transformation and let the client -which is in the most cases a browser- do
the work.

On 3/22/10, Gora Mohanty  wrote:
>
> On Mon, 22 Mar 2010 15:26:41 +0100
> Sebastian Funk  wrote:
>
> > hey there,
> >
> > i've been using solr for some time now and set everything up the
> > way it's supposed to..
> > now for the user interface: simply writing a javascript (or
> > something else) website that passes the query-URL to solr and
> > interprets the XML given as a result. is that the easiest way?
> > i've noticed some problems with umlauts etc.. when using jetty or
> > tomcat as a server..
> >
> > is there another way to query solr and retrieve the results?
>
> [...]
>
> Many modern frameworks (I certainly know of Ruby on Rails, and
> Django), have Solr integrated via an application. I really like
> Django Haystack for how it offers an easy way to get started with
> various search back-ends, with a very Django-ish feel to the
> interface: http://haystacksearch.org/
>
> Regards,
>
> Gora
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: Question about query

2010-03-22 Thread Abdelhamid ABID
Well, here what I figure out !

(mm=1<50% , qf=topic , q="1" "0" ) ==> q=topic:0 or topic:1


On 3/22/10, Armando Ota  wrote:
>
> Hi
>
> I need a little help with query for my problem (if it can be solved)
>
> I have a field in a document called topic
>
> this field contains some values, 0 (for no topic) or  1 (topic 1), 2, 3,
> etc ...
>
> It can contain many values like 1, 10, 50, etc (for 1 doc)
>
> So now to the problem:
> I would like to get documents that have 0 for topic value and documents
> that only have for example 1 for topic value inserted
>
> articles for example:
> article 1topics: 1, 5, 10, 20, 24
> article 2 topics: 0
> article 3 topics: 1
> article 4 topic: 5, 10, 20
> article 5 topic: 1, 13, 19
>
> So I need search query to return me only article 2 and 3 not other articles
> with 1 for topic value
>
> Can that be done ? Any help appreciated
>
> Kind regards
>
> Armando
>
>


-- 
Elsadek
Software Engineer- J2EE / WEB / ESB MULE


Re: PDF extraction leads to reversed words

2010-03-16 Thread Abdelhamid ABID
Hi again ,
I just came from trying the version 1.5-dev from Solr trunk.
After applying the patch you provided, and adding icu4j-3_8_1 in classpath,
results are pretty good different then before.
Now words and texts are not reversed and are displayed correctly except some
pdf files's text parts that Solr display in a strange
manner, specially when arabic and latin are in the same paragraph, I 'll
check again for this.



On Tue, Mar 9, 2010 at 4:13 PM, Robert Muir  wrote:

> On Tue, Mar 9, 2010 at 10:10 AM, Abdelhamid  ABID 
> wrote:
> > nor 3.8 version does change anythings !
> >
>
> the patch (https://issues.apache.org/jira/browse/SOLR-1813) can only
> work on Solr trunk. It will not work with Solr 1.4.
>
>
> Solr 1.4 uses pdfbox-0.7.3.jar, which does not support Arabic.
> Solr trunk uses pdfbox-0.8.0-incubating.jar, which does support
> Arabic, if you also put ICU in the classpath.
>
> --
> Robert Muir
> rcm...@gmail.com
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: PDF extraction leads to reversed words

2010-03-09 Thread Abdelhamid ABID
I tried couples of times to get this patch, but downloads fail, filesize
missmach or someting like error poped up
is there another link

On 3/9/10, Dominique Bejean  wrote:
>
> Hi,
>
> The problem comes form PDFBox (
> http://brutus.apache.org/jira/browse/PDFBOX-377) and is fixed now. However
> Tika doesn't yet use this version of PDFBox.
> So for PDF text extraction, I doesn't use Tika but pdftotext.
>
> Dominique
>
>
> Le 09/03/10 06:00, Robert Muir a écrit :
>
>  it is an optional dependency of PDFBox. If ICU is available, then it
>> is capable of processing Arabic PDF files.
>>
>> The problem is that Arabic "text" in PDF files is really glyphs
>> (encoded in visual order) and needs to be 'unshaped' with some stuff
>> that isn't in the JDK.
>>
>> If the size of the default ICU jar file is the issue here, we can
>> consider an alternative: The default ICU jar is very large as it
>> includes everything, yet it can be customized to only include what is
>> needed: http://apps.icu-project.org/datacustom/
>>
>> We did this in lucene for the collation contrib, to shrink the jar
>> about 2MB: http://issues.apache.org/jira/browse/LUCENE-1867
>>
>> For this use-case, it could be even smaller, as most of the huge size
>> of ICU comes from large CJK collation tables (needed for collation,
>> but not for this Arabic PDF extraction).
>>
>> In reality I don't really like doing this as it might confuse users
>> (e.g. people that want collation, too), and ICU is useful for other
>> things, but if thats what we have to do, we should do it so that
>> Arabic PDF files will work.
>>
>> On Mon, Mar 8, 2010 at 11:53 PM, Lance Norskog  wrote:
>>
>>
>>> Is this a mistake in the Tika library collection in the Solr trunk?
>>>
>>> On Mon, Mar 8, 2010 at 5:15 PM, Robert Muir  wrote:
>>>
>>>
>>>> I think the problem is that Solr does not include the ICU4J jar, so it
>>>> won't work with Arabic PDF files.
>>>>
>>>> Try putting ICU4J 3.8 (http://site.icu-project.org/download) in your
>>>> classpath.
>>>>
>>>> On Mon, Mar 8, 2010 at 6:30 PM, Abdelhamid  ABID
>>>>  wrote:
>>>>
>>>>
>>>>> Hi,
>>>>> Posting arabic pdf files to Solr using a web form (to
>>>>> solr/update/extract)
>>>>> get extracted texts and each words displayed in reverse
>>>>> direction(instead of
>>>>> right to left).
>>>>> When perform search against these texts with -always- reversed
>>>>> key-words I
>>>>> get results but reversed.
>>>>> This problem doesn't occur when posting MsWord document.
>>>>> I think the problem come from Tika !
>>>>>
>>>>> Any clue ?
>>>>>
>>>>> --
>>>>> elsadek
>>>>> Software Engineer- J2EE / WEB / ESB MULE
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> Robert Muir
>>>> rcm...@gmail.com
>>>>
>>>>
>>>>
>>>
>>>
>>> --
>>> Lance Norskog
>>> goks...@gmail.com
>>>
>>>
>>>
>>
>>
>>
>>
>


-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: PDF extraction leads to reversed words

2010-03-09 Thread Abdelhamid ABID
I put ICU4J 4.2 in the lib of Solr, nothing changed, I'm trying now with
ICU4J 3.8

On 3/9/10, Robert Muir  wrote:
>
> I think the problem is that Solr does not include the ICU4J jar, so it
> won't work with Arabic PDF files.
>
> Try putting ICU4J 3.8 (http://site.icu-project.org/download) in your
> classpath.
>
>
> On Mon, Mar 8, 2010 at 6:30 PM, Abdelhamid  ABID 
> wrote:
> > Hi,
> > Posting arabic pdf files to Solr using a web form (to
> solr/update/extract)
> > get extracted texts and each words displayed in reverse direction(instead
> of
> > right to left).
> > When perform search against these texts with -always- reversed key-words
> I
> > get results but reversed.
> > This problem doesn't occur when posting MsWord document.
> > I think the problem come from Tika !
> >
> > Any clue ?
> >
> > --
> > elsadek
> > Software Engineer- J2EE / WEB / ESB MULE
> >
>
>
>
>
> --
> Robert Muir
> rcm...@gmail.com
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: PDF extraction leads to reversed words

2010-03-09 Thread Abdelhamid ABID
nor 3.8 version does change anythings !

On 3/9/10, Robert Muir  wrote:
>
> I think the problem is that Solr does not include the ICU4J jar, so it
> won't work with Arabic PDF files.
>
> Try putting ICU4J 3.8 (http://site.icu-project.org/download) in your
> classpath.
>
>
> On Mon, Mar 8, 2010 at 6:30 PM, Abdelhamid  ABID 
> wrote:
> > Hi,
> > Posting arabic pdf files to Solr using a web form (to
> solr/update/extract)
> > get extracted texts and each words displayed in reverse direction(instead
> of
> > right to left).
> > When perform search against these texts with -always- reversed key-words
> I
> > get results but reversed.
> > This problem doesn't occur when posting MsWord document.
> > I think the problem come from Tika !
> >
> > Any clue ?
> >
> > --
> > elsadek
> > Software Engineer- J2EE / WEB / ESB MULE
> >
>
>
>
>
> --
> Robert Muir
> rcm...@gmail.com
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: PDF extraction leads to reversed words

2010-03-09 Thread Abdelhamid ABID
I doen't know about pdftotext, is it pluggable with Solr, or do we need
hard-code the step of extraction before Solr turn.

On 3/9/10, Dominique Bejean  wrote:
>
> Hi,
>
> The problem comes form PDFBox (
> http://brutus.apache.org/jira/browse/PDFBOX-377) and is fixed now. However
> Tika doesn't yet use this version of PDFBox.
> So for PDF text extraction, I doesn't use Tika but pdftotext.
>
> Dominique
>
>
> Le 09/03/10 06:00, Robert Muir a écrit :
>
>  it is an optional dependency of PDFBox. If ICU is available, then it
>> is capable of processing Arabic PDF files.
>>
>> The problem is that Arabic "text" in PDF files is really glyphs
>> (encoded in visual order) and needs to be 'unshaped' with some stuff
>> that isn't in the JDK.
>>
>> If the size of the default ICU jar file is the issue here, we can
>> consider an alternative: The default ICU jar is very large as it
>> includes everything, yet it can be customized to only include what is
>> needed: http://apps.icu-project.org/datacustom/
>>
>> We did this in lucene for the collation contrib, to shrink the jar
>> about 2MB: http://issues.apache.org/jira/browse/LUCENE-1867
>>
>> For this use-case, it could be even smaller, as most of the huge size
>> of ICU comes from large CJK collation tables (needed for collation,
>> but not for this Arabic PDF extraction).
>>
>> In reality I don't really like doing this as it might confuse users
>> (e.g. people that want collation, too), and ICU is useful for other
>> things, but if thats what we have to do, we should do it so that
>> Arabic PDF files will work.
>>
>> On Mon, Mar 8, 2010 at 11:53 PM, Lance Norskog  wrote:
>>
>>
>>> Is this a mistake in the Tika library collection in the Solr trunk?
>>>
>>> On Mon, Mar 8, 2010 at 5:15 PM, Robert Muir  wrote:
>>>
>>>
>>>> I think the problem is that Solr does not include the ICU4J jar, so it
>>>> won't work with Arabic PDF files.
>>>>
>>>> Try putting ICU4J 3.8 (http://site.icu-project.org/download) in your
>>>> classpath.
>>>>
>>>> On Mon, Mar 8, 2010 at 6:30 PM, Abdelhamid  ABID
>>>>  wrote:
>>>>
>>>>
>>>>> Hi,
>>>>> Posting arabic pdf files to Solr using a web form (to
>>>>> solr/update/extract)
>>>>> get extracted texts and each words displayed in reverse
>>>>> direction(instead of
>>>>> right to left).
>>>>> When perform search against these texts with -always- reversed
>>>>> key-words I
>>>>> get results but reversed.
>>>>> This problem doesn't occur when posting MsWord document.
>>>>> I think the problem come from Tika !
>>>>>
>>>>> Any clue ?
>>>>>
>>>>> --
>>>>> elsadek
>>>>> Software Engineer- J2EE / WEB / ESB MULE
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> Robert Muir
>>>> rcm...@gmail.com
>>>>
>>>>
>>>>
>>>
>>>
>>> --
>>> Lance Norskog
>>> goks...@gmail.com
>>>
>>>
>>>
>>
>>
>>
>>
>


-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: PDF extraction leads to reversed words

2010-03-09 Thread Abdelhamid ABID
I'm using 1.4 version of Solr

On 3/9/10, Robert Muir  wrote:
>
> On Tue, Mar 9, 2010 at 9:44 AM, Abdelhamid  ABID 
> wrote:
> > I put ICU4J 4.2 in the lib of Solr, nothing changed, I'm trying now with
> > ICU4J 3.8
> >
>
>
> Hello, what version of Solr are you using? I think you will need to
> use the trunk version.
>
> I created a patch for this issue that you can apply to trunk (with all
> necessary resources)
> here: https://issues.apache.org/jira/browse/SOLR-1657
>
> The included testcase fails without adding icu4j to the lib directory
> (as the arabic text
> is reversed), and passes with it.
>
>
> --
>
> Robert Muir
> rcm...@gmail.com
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


Re: PDF extraction leads to reversed words

2010-03-09 Thread Abdelhamid ABID
nor 3.8 version does change anythings !

On 3/9/10, Robert Muir  wrote:
>
> I think the problem is that Solr does not include the ICU4J jar, so it
> won't work with Arabic PDF files.
>
> Try putting ICU4J 3.8 (http://site.icu-project.org/download) in your
> classpath.
>
>
> On Mon, Mar 8, 2010 at 6:30 PM, Abdelhamid  ABID 
> wrote:
> > Hi,
> > Posting arabic pdf files to Solr using a web form (to
> solr/update/extract)
> > get extracted texts and each words displayed in reverse direction(instead
> of
> > right to left).
> > When perform search against these texts with -always- reversed key-words
> I
> > get results but reversed.
> > This problem doesn't occur when posting MsWord document.
> > I think the problem come from Tika !
> >
> > Any clue ?
> >
> > --
> > elsadek
> > Software Engineer- J2EE / WEB / ESB MULE
> >
>
>
>
>
> --
> Robert Muir
> rcm...@gmail.com
>



-- 
Abdelhamid ABID
Software Engineer- J2EE / WEB / ESB MULE


PDF extraction leads to reversed words

2010-03-08 Thread Abdelhamid ABID
Hi,
Posting arabic pdf files to Solr using a web form (to solr/update/extract)
get extracted texts and each words displayed in reverse direction(instead of
right to left).
When perform search against these texts with -always- reversed key-words I
get results but reversed.
This problem doesn't occur when posting MsWord document.
I think the problem come from Tika !

Any clue ?

-- 
elsadek
Software Engineer- J2EE / WEB / ESB MULE