Re: Pushing a whole set of pdf-files to solr

2013-04-24 Thread sdspieg
I am still struggling with this. I have solr 4.2.1.2013.03.26.08.26.55 installed. So are you telling me that I should somehow install the older version of that tool that comes with Solr 3.x? Because with the newer version I get the errors I already mentioned. Now I suppose I may be an untypical

Re: Pushing a whole set of pdf-files to solr

2013-04-24 Thread sdspieg
(Just documenting my experiences). I stopped and restarted solr in the tomcat web application manager. Everything seems fine http://lucene.472066.n3.nabble.com/file/n4058786/4-25-2013_2-38-43_AM.png And yet I still get that same error message. -- View this message in context:

Re: Pushing a whole set of pdf-files to solr

2013-04-24 Thread Jack Krupansky
Your errors may simply have been improperly encoded documents. Or some encoding that is not supported. Hard to say. Start with a simple case, then build on success. I think you're just trying to do too much all at once. Do one PDF file first, then work up to a directory, and only when you've

Re: Pushing a whole set of pdf-files to solr

2013-04-10 Thread sdspieg
Jack - I apologize for my ignorance here, but when you keep emphasizing 'new' - does that mean that there is ANOTHER version of this tool than the one that is built into solr-4.2.1? And on the encoding issue - I thought pdf was platform-agnostic? Or is the problem on my windows system - i.e. that

Re: Pushing a whole set of pdf-files to solr

2013-04-10 Thread Jack Krupansky
Yes, there is the version that comes with Solr 3.x. I'm not aware of an encoding issue. -- Jack Krupansky -Original Message- From: sdspieg Sent: Wednesday, April 10, 2013 8:11 AM To: solr-user@lucene.apache.org Subject: Re: Pushing a whole set of pdf-files to solr Jack - I apologize

Re: Pushing a whole set of pdf-files to solr

2013-04-09 Thread sdspieg
If anybody could still help me out with this, I'd really appreciate it. Thanks! -- View this message in context: http://lucene.472066.n3.nabble.com/Pushing-a-whole-set-of-pdf-files-to-solr-tp4025256p4054885.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Pushing a whole set of pdf-files to solr

2013-04-09 Thread Furkan KAMACI
Apache Solr 4 Cookbok says that: curl http://localhost:8983/solr/update/extract?literal.id=1commit=true; -F myfile=@cookbook.pdf is that what you want? 2013/4/10 sdspieg sdsp...@mail.ru If anybody could still help me out with this, I'd really appreciate it. Thanks! -- View this message

Re: Pushing a whole set of pdf-files to solr

2013-04-09 Thread Jack Krupansky
The newer release of SimplePostTool with Solr 4.x makes it easy to post PDF files from a directory, including automatically adding the file name to a field. But SolrCell is the direct API that it uses as well. -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Tuesday,

Re: Pushing a whole set of pdf-files to solr

2013-04-09 Thread sdspieg
Thanks for those replies. I will look into them. But if anyone knows of a site that describes step by step how a windows user who has already installed solr (and tomcat) can easily feed a folder (and subfolders) with 100s of pdfs into solr, or would be willing to write down down those steps, I

Re: Pushing a whole set of pdf-files to solr

2013-04-09 Thread sdspieg
I am able to run the java -jar post.jar -help command which I found here: http://docs.lucidworks.com/display/solr/Running+Solr. But now how can I tell post to post all pdf files in a certain folder (preferably recursively) to a collection? Could anybody please post the exact command for that?

Re: Pushing a whole set of pdf-files to solr

2013-04-09 Thread Gora Mohanty
On 10 April 2013 07:28, sdspieg sdsp...@mail.ru wrote: I am able to run the java -jar post.jar -help command which I found here: http://docs.lucidworks.com/display/solr/Running+Solr. But now how can I tell post to post all pdf files in a certain folder (preferably recursively) to a collection?

Re: Pushing a whole set of pdf-files to solr

2013-04-09 Thread sdspieg
Another progress report. I 'flattened' all the folders which contained the pdf files with Fileboss and then moved the pdf files to the directory where I found the post.jar file (in solr-4.2.1\solr-4.2.1\example\exampledocs). I then ran java -Ddata=files -jar post.jar *.pdf and in the command

Re: Pushing a whole set of pdf-files to solr

2013-04-09 Thread Gora Mohanty
On 10 April 2013 08:11, sdspieg sdsp...@mail.ru wrote: Another progress report. I 'flattened' all the folders which contained the pdf files with Fileboss and then moved the pdf files to the directory where I found the post.jar file (in solr-4.2.1\solr-4.2.1\example\exampledocs). I then ran

Re: Pushing a whole set of pdf-files to solr

2013-04-09 Thread Jack Krupansky
The newer SimplePostTool can in fact recurse a directory of PDFs. Just get the usage for the tool. I'm sure it lists the command options. -- Jack Krupansky -Original Message- From: sdspieg Sent: Tuesday, April 09, 2013 9:48 PM To: solr-user@lucene.apache.org Subject: Re: Pushing a