Wait, don't confuse things...they should be three different issues:
1. with curl indexing happens but leaves the content field empty, so
probably something occurs at tika level during the text extraction. That's
the reason why I told you about the tika logging
2. with solrj ineexing doesn'happen
ya right all 3 points are right.
Let me solve the 1 first, there is some errror in tika level indexing, for
that i need to debug at tika level right??
but how to do that?? Solr admin does not show package wise logging.
--
View this message in context:
Hi.
I'm total beginner to Solr.
When I'm trying to start it I get following errors:
lukasz@lukasz-VirtualBox:~/PycharmProjects/solr/solr-4.6.0/example$ java
-jar start.jar
0[main] INFO org.eclipse.jetty.server.Server – jetty-8.1.10.v20130312
104 [main] INFO
Hello!
If you want to use Morfologik you need to include it jar files,
because they are in included in the standard distribution. Look at the
downloaded Solr distribution, you should find the
contrib/analysis-extras/lib directory there. Get those libraries, put
it in a directory (ie:
A premise: as Erik explained, most probably this issue has nothing to do
with SOLR.
So, these are the options that, in my mind, you have
*OPTION #1 : Using Tika as command line tool*a) Download Tika. Make sure
the same version of your SOLR
b) Read here:
through command line(java -jar tika-app-1.4.jar -v C:Cloud.docx) apache
tika is able to parse .docx files, so can i use this tika-app-1.4.jar in
solr?? how to do that??
--
View this message in context:
Please stay on (or clarify) your issue: in the first example you told us
the problem is with Coding.pdf file. What is that Cloud.docx? Why don't
you try with Coding.pdf? And what is the result of the extraction from
command line with Coding.pdf and the same tika version that is in your SOLR?
I
hi,
what's the correct type used to store urls, which can contain some
encoded unicode caracters in the form '%some-digits'. Because, the
string type returns an error when I try to store these urls.
Btw, I'm using a python client which gives me this error: 'ascii'
codec can't decode byte 0xc3.
On 12 January 2014 19:45, Hakim Benoudjit h.benoud...@gmail.com wrote:
hi,
what's the correct type used to store urls, which can contain some
encoded unicode caracters in the form '%some-digits'. Because, the
string type returns an error when I try to store these urls.
Please provide more
I can add this link using sunburnt (solr python client) so it can not be
related to solr.
I think you're right it might be a python issue.
Thanks.
2014/1/12 Gora Mohanty g...@mimirtech.com
On 12 January 2014 19:45, Hakim Benoudjit h.benoud...@gmail.com wrote:
hi,
what's the correct type
I have just forget the u'' next to a unicode string :\,
2014/1/12 Hakim Benoudjit h.benoud...@gmail.com
I can add this link using sunburnt (solr python client) so it can not be
related to solr.
I think you're right it might be a python issue.
Thanks.
2014/1/12 Gora Mohanty
On 12 January 2014 20:07, Hakim Benoudjit h.benoud...@gmail.com wrote:
I have just forget the u'' next to a unicode string :\,
Heh! Handling of Unicode in Python 2.x is annoying. 3.x
is better, but only a little. Off-topic on this list, so I will
shut up now.
Regards,
Gora
Sorry for the mistake.
im using solr 4.2, it has tika-1.3.
So now, java -jar tika-app-1.3.jar -v C:Coding.pdf , parses pdf document
without error or msg.
Also, java -jar tika-app-1.4.jar* -t *C:Cloud.docx, shows the entire
document.
Which means there is no problem in tika right??
--
View this
Sorry for the mistake.
im using solr 4.2, it has tika-1.3.
So now, java -jar tika-app-1.3.jar -v C:\Coding.pdf , parses pdf document
without error or msg.
Also, java -jar tika-app-1.3.jar -t C:\Coding.pdf, shows the entire
document.
Which means there is no problem in tika right??
--
View
Sorry for the mistake.
im using solr 4.2, it has tika-1.3.
So now, java -jar tika-app-1.3.jar -v C:\Coding.pdf , parses pdf document
without error or msg.
Also, java -jar tika-app-1.3.jar -t C:\Coding.pdf, shows the entire
document.
Which means there is no problem in tika right??
--
View
Yep sure. But is it good for me to store a link(http://...) in a solr
string field? knowing that this field isnt indexed, only stored.
2014/1/12 Gora Mohanty g...@mimirtech.com
On 12 January 2014 20:07, Hakim Benoudjit h.benoud...@gmail.com wrote:
I have just forget the u'' next to a unicode
Not really sure...the issue seems related to text extraction so the first
suspect is tika...SOLR is playing a secondary role here. If Tika is doing
extraction good there should be an error, a warning on solr side (an
exception, a content field too long warning or something like that)
What about
thank you Mr. Steve. Now I understood and i figured out to separate field to
title_th and title_en and it worked ;)
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-to-index-data-in-muliValue-field-with-key-tp4110653p4110981.html
Sent from the Solr - User mailing list
Hi All,
Can anyone please explain how solr join query works in solr4.2.
we have 2 different documents.Both are in the same index.
document1 contains the columns:
docdate: 01-12-2012
previousmonthdate :01-11-2012
price:15
and some more fields.
document2 contains:
docdate :01-11-2012
On 13 January 2014 00:30, Hakim Benoudjit h.benoud...@gmail.com wrote:
Yep sure. But is it good for me to store a link(http://...) in a solr
string field? knowing that this field isnt indexed, only stored.
Yes, there is no issue. Not sure why they are not indexed, but if
that is what you want,
Hi,
I am currently running into merge-issues while optimizing an index.
To give you some informations:
We are running 4 SOLR Servers with identical OS, VM-Hardware, RAM etc.
Only one Server by now is having issues, the others are fine.
We are running SOLR 4.4.0 with Tomcat 6.0
It was running
I am working on Windows 7
--
View this message in context:
http://lucene.472066.n3.nabble.com/using-extract-handler-data-not-extracted-tp4110850p4110993.html
Sent from the Solr - User mailing list archive at Nabble.com.
22 matches
Mail list logo