Re: using extract handler: data not extracted

2014-01-12 Thread Andrea Gazzarini
Wait, don't confuse things... these should be three different issues: 1. With curl, indexing happens but leaves the content field empty, so something probably goes wrong at the Tika level during text extraction. That's why I told you about the Tika logging. 2. With SolrJ, indexing doesn't happen

Re: using extract handler: data not extracted

2014-01-12 Thread sweety
Yes, all 3 points are right. Let me solve the first one: there is some error in Tika-level indexing, so I need to debug at the Tika level, right? But how do I do that? The Solr admin does not show package-wise logging.

Problem with starting solr

2014-01-12 Thread lukaszzenko
Hi. I'm a total beginner to Solr. When I try to start it I get the following errors: lukasz@lukasz-VirtualBox:~/PycharmProjects/solr/solr-4.6.0/example$ java -jar start.jar 0 [main] INFO org.eclipse.jetty.server.Server – jetty-8.1.10.v20130312 104 [main] INFO

Re: Problem with starting solr

2014-01-12 Thread Rafał Kuć
Hello! If you want to use Morfologik you need to include its jar files yourself: they ship in the standard distribution but are not loaded by default. Look at the downloaded Solr distribution, you should find the contrib/analysis-extras/lib directory there. Get those libraries, put them in a directory (i.e.:

Re: using extract handler: data not extracted

2014-01-12 Thread Andrea Gazzarini
A premise: as Erik explained, this issue most probably has nothing to do with SOLR. So these are the options that, in my mind, you have. OPTION #1: Using Tika as a command-line tool. a) Download Tika. Make sure it is the same version as the one in your SOLR. b) Read here:

Re: using extract handler: data not extracted

2014-01-12 Thread sweety
Through the command line (java -jar tika-app-1.4.jar -v C:\Cloud.docx), Apache Tika is able to parse .docx files, so can I use this tika-app-1.4.jar in Solr? How do I do that?

Re: using extract handler: data not extracted

2014-01-12 Thread Andrea Gazzarini
Please stay on (or clarify) your issue: in the first example you told us the problem is with the Coding.pdf file. What is that Cloud.docx? Why don't you try with Coding.pdf? And what is the result of the extraction from the command line with Coding.pdf and the same Tika version that is in your SOLR? I

Correct to use to store urls (unicode)

2014-01-12 Thread Hakim Benoudjit
Hi, what's the correct type to use to store URLs, which can contain some encoded Unicode characters in the form '%some-digits'? The string type returns an error when I try to store these URLs. Btw, I'm using a Python client which gives me this error: 'ascii' codec can't decode byte 0xc3.

Re: Correct to use to store urls (unicode)

2014-01-12 Thread Gora Mohanty
On 12 January 2014 19:45, Hakim Benoudjit h.benoud...@gmail.com wrote: Hi, what's the correct type to use to store URLs, which can contain some encoded Unicode characters in the form '%some-digits'? The string type returns an error when I try to store these URLs. Please provide more

Re: Correct to use to store urls (unicode)

2014-01-12 Thread Hakim Benoudjit
I can add this link using sunburnt (solr python client) so it can not be related to solr. I think you're right it might be a python issue. Thanks. 2014/1/12 Gora Mohanty g...@mimirtech.com On 12 January 2014 19:45, Hakim Benoudjit h.benoud...@gmail.com wrote: hi, what's the correct type

Re: Correct to use to store urls (unicode)

2014-01-12 Thread Hakim Benoudjit
I just forgot the u'' next to a unicode string :\, 2014/1/12 Hakim Benoudjit h.benoud...@gmail.com I can add this link using sunburnt (solr python client) so it can not be related to solr. I think you're right it might be a python issue. Thanks. 2014/1/12 Gora Mohanty

Re: Correct to use to store urls (unicode)

2014-01-12 Thread Gora Mohanty
On 12 January 2014 20:07, Hakim Benoudjit h.benoud...@gmail.com wrote: I just forgot the u'' next to a unicode string :\, Heh! Handling of Unicode in Python 2.x is annoying. 3.x is better, but only a little. Off-topic on this list, so I will shut up now. Regards, Gora
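
The error quoted in this thread ('ascii' codec can't decode byte 0xc3) can be reproduced in a few lines. The following is a minimal sketch, not the poster's code: it assumes Python 3 (the thread itself concerns Python 2, where the u'' prefix matters), and the URL is a hypothetical example. 0xc3 is the first byte of many UTF-8 encoded accented characters, so decoding such bytes as ASCII fails, while percent-encoding yields the '%some-digits' form described above.

```python
from urllib.parse import quote, unquote

# Hypothetical URL containing a non-ASCII character (not from the thread).
url = "http://example.com/café"

# UTF-8 encodes "é" as the two bytes 0xc3 0xa9.
raw = url.encode("utf-8")

# Decoding those bytes as ASCII raises the error reported in the thread.
try:
    raw.decode("ascii")
    error = None
except UnicodeDecodeError as exc:
    error = str(exc)

# Percent-encode the non-ASCII bytes while leaving ":" and "/" untouched.
encoded = quote(url, safe=":/")

# Decoding the percent-encoded form gives back the original text.
decoded = unquote(encoded)

print(error)
print(encoded)
print(decoded)
```

Once the URL is percent-encoded it is plain ASCII, so storing it in a string field should not depend on how the client handles Unicode.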

Re: using extract handler: data not extracted

2014-01-12 Thread sweety
Sorry for the mistake. I'm using Solr 4.2, which has tika-1.3. So now, java -jar tika-app-1.3.jar -v C:\Coding.pdf parses the PDF document without any error or message. Also, java -jar tika-app-1.4.jar -t C:\Cloud.docx shows the entire document. Which means there is no problem in Tika, right?

Re: using extract handler: data not extracted

2014-01-12 Thread sweety
Sorry for the mistake. I'm using Solr 4.2, which has tika-1.3. So now, java -jar tika-app-1.3.jar -v C:\Coding.pdf parses the PDF document without any error or message. Also, java -jar tika-app-1.3.jar -t C:\Coding.pdf shows the entire document. Which means there is no problem in Tika, right?
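
The command-line check described above can be scripted so that it is easy to repeat against the same file. This is only a sketch under assumptions: the helper names `tika_command` and `extract_text` are hypothetical, the jar is assumed to sit in the current directory, and `java` is assumed to be on the PATH. Only the `-t` (plain text) and `-v` (verbose) flags mentioned in the thread are used.

```python
import os
import subprocess


def tika_command(jar, document, flag="-t"):
    """Build the tika-app command line from the thread.

    -t prints the extracted plain text; -v prints verbose messages.
    """
    return ["java", "-jar", jar, flag, document]


def extract_text(jar, document):
    """Run tika-app and return the extracted text.

    Returns None when the jar is not present, so the sketch can be
    read and run without a local Tika download.
    """
    if not os.path.exists(jar):
        return None
    result = subprocess.run(
        tika_command(jar, document),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


# File names taken from the thread; adjust the paths to your own setup.
cmd = tika_command("tika-app-1.3.jar", r"C:\Coding.pdf")
text = extract_text("tika-app-1.3.jar", r"C:\Coding.pdf")

if text is None:
    print("tika-app jar not found; would run:", " ".join(cmd))
elif not text.strip():
    print("Tika returned no text, so the problem is likely in extraction.")
else:
    print("Tika extracted", len(text), "characters.")
```

If Tika returns text here but the content field in Solr stays empty, the next place to look is probably the Solr side (handler configuration and logs) rather than Tika itself.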

Re: Correct to use to store urls (unicode)

2014-01-12 Thread Hakim Benoudjit
Yep, sure. But is it OK for me to store a link (http://...) in a Solr string field, knowing that this field isn't indexed, only stored? 2014/1/12 Gora Mohanty g...@mimirtech.com On 12 January 2014 20:07, Hakim Benoudjit h.benoud...@gmail.com wrote: I have just forget the u'' next to a unicode

Re: using extract handler: data not extracted

2014-01-12 Thread Andrea Gazzarini
Not really sure... the issue seems related to text extraction, so the first suspect is Tika; SOLR is playing a secondary role here. If Tika is extracting correctly, there should be an error or a warning on the SOLR side (an exception, a "content field too long" warning or something like that). What about

Re: How to index data in muliValue field with key

2014-01-12 Thread rachun
Thank you, Mr. Steve. Now I understand; I worked out that I should split the field into title_th and title_en, and it worked ;) -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-index-data-in-muliValue-field-with-key-tp4110653p4110981.html Sent from the Solr - User mailing list

How Solr join query works?

2014-01-12 Thread solr2020
Hi all, can anyone please explain how the Solr join query works in Solr 4.2? We have 2 different documents. Both are in the same index. document1 contains the columns: docdate: 01-12-2012, previousmonthdate: 01-11-2012, price: 15 and some more fields. document2 contains: docdate: 01-11-2012
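
As a rough illustration of what is being asked: Solr's `{!join from=A to=B}` query parser runs the inner query, collects the values of field A from the matching documents, and returns the documents whose field B holds one of those values. The sketch below mimics that in plain Python; it is not Solr's implementation. The `id` field and the inner query are assumptions added for illustration, and document2 carries only the one field visible in the truncated post.

```python
from urllib.parse import urlencode, parse_qs


def join(docs, from_field, to_field, inner):
    """Mimic {!join from=from_field to=to_field}inner on a list of dicts."""
    # Step 1: values of from_field on documents matching the inner query.
    keys = {d[from_field] for d in docs if from_field in d and inner(d)}
    # Step 2: documents whose to_field holds one of those values.
    return [d for d in docs if d.get(to_field) in keys]


# Field values from the post; "id" is a hypothetical field for this sketch.
docs = [
    {"id": "document1", "docdate": "01-12-2012",
     "previousmonthdate": "01-11-2012", "price": 15},
    {"id": "document2", "docdate": "01-11-2012"},
]

# The inner query matches document2, whose docdate is 01-11-2012, so the
# join returns documents whose previousmonthdate is 01-11-2012.
matched = join(docs, "docdate", "previousmonthdate",
               lambda d: d["id"] == "document2")

# The same idea written as a Solr request parameter (URL-encoded).
params = urlencode(
    {"q": "{!join from=docdate to=previousmonthdate}id:document2"}
)

print([d["id"] for d in matched])
print(params)
```

Note that only the documents on the "to" side are returned; fields from the "from" side documents are not merged into the results.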

Re: Correct to use to store urls (unicode)

2014-01-12 Thread Gora Mohanty
On 13 January 2014 00:30, Hakim Benoudjit h.benoud...@gmail.com wrote: Yep, sure. But is it OK for me to store a link (http://...) in a Solr string field, knowing that this field isn't indexed, only stored? Yes, there is no issue. Not sure why they are not indexed, but if that is what you want,

background merge hit exception while optimizing index (SOLR 4.4.0)

2014-01-12 Thread Ralf Matulat
Hi, I am currently running into merge issues while optimizing an index. To give you some information: we are running 4 SOLR servers with identical OS, VM hardware, RAM etc. Only one server is having issues so far; the others are fine. We are running SOLR 4.4.0 with Tomcat 6.0. It was running

Re: using extract handler: data not extracted

2014-01-12 Thread sweety
I am working on Windows 7 -- View this message in context: http://lucene.472066.n3.nabble.com/using-extract-handler-data-not-extracted-tp4110850p4110993.html Sent from the Solr - User mailing list archive at Nabble.com.