Re: Java Heap Error

2019-03-18 Thread wclarke
I was mistaken xms and xmx are equal. I was looking in the wrong spot. I will look at the rest. Thanks! -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Java Heap Error

2019-03-18 Thread wclarke
Hi, I am doing a recursive search on a local hard drive through DIH with Tika. It starts slow and then it dies at Thread-14 every time with a Java Heap error. My Xmx is at 10g, my Xms is at 512m (which might be the problem). It takes hours to fail after it has fetched over 300,000 items and

Re: Help with a DIH config file

2019-03-15 Thread wclarke
One last question. I have everything running as it should finally. However, when I pull out of testing to do the entire directory it's just cycling through. The directory is full of folders that have the documents in them. Do I need an html or other file sitting in there randomly to get it to

Re: Help with a DIH config file

2019-03-15 Thread wclarke
Thanks! that fixed it. -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Help with a DIH config file

2019-03-15 Thread wclarke
Thank you so much. You helped a great deal. I am running into one last issue where the Tika DIH is stopping at a specific language and fails there (Malayalam). Do you know of a work around? -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Tika Error work around?

2019-03-15 Thread wclarke
I am getting an error that stops Tika fetching/processing/and committing when it reaches a specific language (Malayalam). Is there a work around? -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Is this a bug or just something I am doing wrong?

2019-03-13 Thread wclarke
I have been running a data import of local files using Tika. It runs fine until it gets to a language it doesn't recognoze and then it stops processing and won't auto-commit. It is happening in tika-app-1.17 through 1.20 at least. It threw errors in 1.3, but kept running and did the commit. I

Re: Help with a DIH config file

2019-03-13 Thread wclarke
Got each one working individually, but not multiples. Is it possible? Please see attached files. Thanks!!! tika-data-config-2.xml solrconfig.xml --

Re: Help with a DIH config file

2019-03-13 Thread wclarke
I didn't know I could do an updateProcessorChain and call it in the config file. I tried doing it in the solrconfig, but it just wouldn't take. I will try this though! Thanks The value is the file path in id/url. -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Help with a DIH config file

2019-03-13 Thread wclarke
Absolutely! I attached it to the original message, But I can post here too. I am VERY new to Solr and am winging it and while the documentation has been a little helpful, I just need more complex examples. tika-data-config-2.xml

Help with a DIH config file

2019-03-12 Thread wclarke
I have a previous post that looks like this: I am pulling a large amount of data from a local source D:\foo\resource\. I am using tika through a DIH to index the multiple file formats with text and metadata. I have almost all the information being pulled that I want, however, I am having a

Embedding XPathEntityProcessor inside TikaEntityProcessor

2019-03-12 Thread wclarke
I am pulling a large amount of data from a local source D:\foo\resource\. I am using tika through a DIH to index the multiple file formats with text and metadata. I have almost all the information being pulled that I want, however, I am having a couple of issues: 1. I need to run a regex