Thank you Susheel, Here is what I see from /opt/solr-7.7.2/server/solr/configsets/_default/conf/solrconfig.xml:
<requestHandler name="/update/extract" startup="lazy" class="solr.extraction.ExtractingRequestHandler" > <lst name="defaults"> <str name="lowernames">true</str> <str name="fmap.meta">ignored_</str> <str name="fmap.content">_text_</str> </lst> </requestHandler> Is there anything wrong with it and how to fix it? Thank you. Pasle Choix On Mon, Sep 23, 2019 at 2:09 PM Susheel Kumar <susheel2...@gmail.com> wrote: > Not sure which configuration you are using but double check solrconfig.xml > to have entries like below and have below sr_mv_txt below in schema.xml for > storing and indexing. > > <requestHandler name="/update/extract" > startup="lazy" > class="solr.extraction.ExtractingRequestHandler" > > <lst name="defaults"> > <str name="lowernames">true</str> > <str name="fmap.meta">ignored_</str> > <str name="fmap.content">sr_mv_txt</str> > </lst> > </requestHandler> > > > Thnx > > > On Thu, Sep 19, 2019 at 11:02 PM PasLe Choix <paslecho...@gmail.com> > wrote: > >> I am on Solr 7.7, according to the official document: >> https://lucene.apache.org/solr/guide/7_7/solr-tutorial.html >> Although it is mentioned Post Tool can index a directory of files, and can >> handle HTML, PDF, Office formats like Word, however no example working >> command is given. >> >> ./bin/post -c localDocs ~/DocumentsError:<p>Problem accessing >> /solr/books/update. Reason: >> <pre> Not Found</pre></p> >> >> or if I directly upload a pdf as Document through Admin GUI, I will get >> Unsupported ContentType: application/pdf Not in: [application/xml, >> application/csv, application/json, text/json, text/csv, text/xml, >> application/javabin] >> >> Can anyone please share the correct way to index on pdf/doc/docx, etc.? >> through both Admin GUI and command line. >> >> Thank you very much. >> >> >> Pasle Choix >> >