Re: Using Spring standalone jar as Custom Transformer

2014-09-26 Thread keeblerh
All, I have done what I really didn't want to do, and it works for now: I unpacked solr.war and added all my libs to WEB-INF/lib. If there is a better solution using the recommended configurations, please let me know. Thanks. -- View this message in context: http://lucene.4720
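
For reference, the stock way to add extra jars without repacking solr.war is a `<lib>` directive in solrconfig.xml. The fragment below is only a sketch: the directory is a hypothetical location, and nothing in this thread confirms that it avoids the Spring parsing error, since jars loaded this way sit in Solr's plugin classloader rather than in the webapp's WEB-INF/lib.

```xml
<!-- solrconfig.xml: a sketch, not a verified fix for the problem in this thread -->
<config>
  <!-- Hypothetical directory: point it at wherever the transformer jar and its
       Spring dependencies live. Relative paths resolve against the core's instanceDir. -->
  <lib dir="../../custom-lib" regex=".*\.jar" />
</config>
```

If the Spring context still fails to load with this layout, the classloader difference noted above is one plausible place to look.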

Using Spring standalone jar as Custom Transformer

2014-09-26 Thread keeblerh
I'm trying to use a standalone Spring jar for a custom transformer and have banged my way through most of the issues so far, but I can't seem to get past this one. When the transformer executes, it loads the Spring applicationContext.xml file and throws the following error while trying to parse it. org.

Re: How to exclude a mimetype in tika?

2014-09-18 Thread keeblerh
eShard wrote
> Good afternoon,
> I'm using solr 4.0 Final
> I need movies "hidden" in zip files that need to be excluded from the
> index.
> I can't filter movies on the crawler because then I would have to exclude
> all zip files.
> I was told I can have tika skip the movies.
> the details are esc

Re: solr 4.8 Tika stripping out all xml tags

2014-09-17 Thread keeblerh
Sorry, adding more information. Note that it does wrap my data in HTML, but only after it strips all my XML tags out. So the data I am interested in parsing, which would be something something 12345,12345,0, ends up like /n something /t/n something /n 12345,12345,0 etc. -- View this mess

solr 4.8 Tika stripping out all xml tags

2014-09-17 Thread keeblerh
I'm processing a zip file containing an XML file. The TikaEntityProcessor opens the zip and reads the file, but it strips the XML tags even though I have supplied the htmlMapper="identity" attribute. It preserves any HTML contained in a CDATA section but seems to strip the other XML tags. Is
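
For context, here is a minimal sketch of the kind of DataImportHandler entity described above. The file path, entity name, and field names are hypothetical and are not taken from the poster's actual config. `format` and `htmlMapper` are TikaEntityProcessor attributes, but this is not offered as a fix: as far as I know, Tika emits an XHTML rendering of the extracted text, so the original tags of an embedded XML file may be dropped whatever mapper is chosen.

```xml
<!-- data-config.xml: illustrative only; path and field names are assumptions -->
<dataConfig>
  <dataSource type="BinFileDataSource" />
  <document>
    <entity name="tika" processor="TikaEntityProcessor"
            url="/path/to/archive.zip"
            format="xml"
            htmlMapper="identity">
      <!-- "text" is the column that carries the extracted body -->
      <field column="text" name="content" />
    </entity>
  </document>
</dataConfig>
```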

Re: ExtractingRequestHandler indexing zip files

2014-09-11 Thread keeblerh
Working now. FYI, posting to "update/extract" works for extracting from a kmz (zip), but I am still having trouble with the dataimport. I'll move to another thread for that. THANKS all. -- View this message in context: http://lucene.472066.n3.nabble.com/ExtractingRequestHandler-indexing-zip

Re: ExtractingRequestHandler indexing zip files

2014-09-10 Thread keeblerh
Thanks for the info Sergio. I updated my 4.8.1 version with that patch and SOLR 4216 (which was really the same thing). It took a day to get it to compile on my network and it still doesn't work. Did my config file look correct? I'm wondering if I need another param somewhere. "Patch has to be

Re: ExtractingRequestHandler indexing zip files

2014-09-09 Thread keeblerh
I am also having the issue where my zip contents (or kmz contents) are not being processed; only the file names are processed. It seems to recognize the kmz extension and open the file, but it doesn't recurse into the contents. The patch you mention has been around for a while. I am ru