Thanks Alex. I copied the solrconfig.xml over from 7.7.3 to the 8.8.0 conf folder and restarted the server. Now indexing works without erroring on sample.html. There is 1K difference between the 2 files so I'll diff them to see what was left out of the 8.8 version.
On Sat, Feb 20, 2021 at 4:27 PM Alexandre Rafalovitch <arafa...@gmail.com> wrote: > Most likely issue is that your core configuration (solrconfig.xml) > does not have the request handler for that. The same config may have > had that in 7.x, but changed since. > > More details: > https://lucene.apache.org/solr/guide/8_8/uploading-data-with-solr-cell-using-apache-tika.html > > Regards, > Alex. > > On Sat, 20 Feb 2021 at 17:59, cratervoid <cratervoi...@gmail.com> wrote: > > > > I am trying out indexing the exampledocs in the examples folder with the > > SimplePostTool on windows 10 using solr 8.8. All the documents index > > except sample.html. For that file I get the errors below. I then > > downloaded solr 7.7.3 and indexed the exampledocs folder with no errors, > > including sample.html. > > ``` > > PS C:\solr-8.8.0> java -jar -Dc=gettingstarted -Dauto > > example\exampledocs\post.jar example\exampledocs\sample.html > > SimplePostTool version 5.0.0 > > Posting files to [base] url > > http://localhost:8983/solr/gettingstarted/update... > > Entering auto mode. File endings considered are > > > xml,json,jsonl,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log > > POSTing file sample.html (text/html) to [base]/extract > > SimplePostTool: WARNING: Solr returned an error #404 (Not Found) for url: > > > http://localhost:8983/solr/gettingstarted/update/extract?resource.name=C%3A%5Csolr-8.8.0%5Cexample%5Cexampledocs%5Csample.html&literal.id=C%3A%5Csolr-8.8.0%5Cexample%5Cexampledocs%5Csample.html > > SimplePostTool: WARNING: Response: <html> > > <head> > > <meta http-equiv="Content-Type" content="text/html;charset=utf-8"/> > > <title>Error 404 Not Found</title> > > </head> > > <body><h2>HTTP ERROR 404 Not Found</h2> > > <table> > > <tr><th>URI:</th><td>/solr/gettingstarted/update/extract</td></tr> > > <tr><th>STATUS:</th><td>404</td></tr> > > <tr><th>MESSAGE:</th><td>Not Found</td></tr> > > <tr><th>SERVLET:</th><td>default</td></tr> > > </table> > > > > </body> > > </html> > > SimplePostTool: WARNING: IOException while reading response: > > java.io.FileNotFoundException: > > > http://localhost:8983/solr/gettingstarted/update/extract?resource.name=C%3A%5Csolr-8.8.0%5Cexample%5Cexampledocs%5Csample.html&literal.id=C%3A%5Csolr-8.8.0%5Cexample%5Cexampledocs%5Csample.html > > 1 files indexed. > > COMMITting Solr index changes to > > http://localhost:8983/solr/gettingstarted/update... > > Time spent: 0:00:00.086 > > ``` > > > > However the json and all other file types index with no problem. For > > example: > > ``` > > PS C:\solr-8.8.0> java -jar -Dc=gettingstarted -Dauto > > example\exampledocs\post.jar example\exampledocs\books.json > > SimplePostTool version 5.0.0 > > Posting files to [base] url > > http://localhost:8983/solr/gettingstarted/update... > > Entering auto mode. File endings considered are > > > xml,json,jsonl,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log > > POSTing file books.json (application/json) to [base]/json/docs > > 1 files indexed. > > COMMITting Solr index changes to > > http://localhost:8983/solr/gettingstarted/update... > > ``` > > Just following this tutorial:[ > > > https://lucene.apache.org/solr/guide/8_8/post-tool.html#post-tool-windows-support][1 > > ] > > > > [1]: > > > https://lucene.apache.org/solr/guide/8_8/post-tool.html#post-tool-windows-support >