I think that I should start a new thread for my question to help people who
searches for same situation.

2013/4/26 Furkan KAMACI <furkankam...@gmail.com>

> If you can help me it would be nice. I get that error:
>
> SimplePostTool version 1.5
> Posting files to base url http://localhost:8983/solr/update/extract..
> Entering auto mode. File endings considered are
> xml,json,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log
> POSTing file 523387.pdf (application/pdf)
> SimplePostTool: WARNING: Solr returned an error #404 Not Found
> SimplePostTool: WARNING: IOException while reading response:
> java.io.FileNotFoundException:
> http://localhost:8983/solr/update/extract/extract?resource.name=%2Fhome%2Fll%2FDesktop%2Fb%2Flucene-solr-lucene_solr_4_2_1%2Fsolr%2Fexample%2Fexampledocs%2F523387.pdf&literal.id=%2Fhome%2Fll%2FDesktop%2Fb%2Flucene-solr-lucene_solr_4_2_1%2Fsolr%2Fexample%2Fexampledocs%2F523387.pdf
> 1 files indexed.
> COMMITting Solr index changes to http://localhost:8983/solr/update/extract
> ..
> Disconnected from the target VM, address: '127.0.0.1:58385', transport:
> 'socket'
> Time spent: 0:00:00.194
>
> and there is nothing indexed. Here is my server log:
>
> Apr 26, 2013 2:55:58 PM org.apache.solr.update.DirectUpdateHandler2 commit
> INFO: start
> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
> Apr 26, 2013 2:55:58 PM org.apache.solr.core.SolrDeletionPolicy onCommit
> INFO: SolrDeletionPolicy.onCommit: commits:num=2
>  
> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/home/ll/Desktop/b/lucene-solr-lucene_solr_4_2_1/solr/example/solr/collection1/data/index
> lockFactory=org.apache.lucene.store.NativeFSLockFactory@386b8592;
> maxCacheMB=48.0
> maxMergeSizeMB=4.0),segFN=segments_c,generation=12,filenames=[segments_c]
>  
> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/home/ll/Desktop/b/lucene-solr-lucene_solr_4_2_1/solr/example/solr/collection1/data/index
> lockFactory=org.apache.lucene.store.NativeFSLockFactory@386b8592;
> maxCacheMB=48.0
> maxMergeSizeMB=4.0),segFN=segments_d,generation=13,filenames=[segments_d]
> Apr 26, 2013 2:55:58 PM org.apache.solr.core.SolrDeletionPolicy
> updateCommits
> INFO: newest commit = 13[segments_d]
> Apr 26, 2013 2:55:58 PM org.apache.solr.search.SolrIndexSearcher <init>
> INFO: Opening Searcher@37342445 main
> Apr 26, 2013 2:55:58 PM org.apache.solr.update.DirectUpdateHandler2 commit
> INFO: end_commit_flush
> Apr 26, 2013 2:55:58 PM org.apache.solr.core.QuerySenderListener
> newSearcher
> INFO: QuerySenderListener sending requests to 
> Searcher@37342445main{StandardDirectoryReader(segments_2:1:nrt)}
> Apr 26, 2013 2:55:58 PM org.apache.solr.core.QuerySenderListener
> newSearcher
> INFO: QuerySenderListener done.
> Apr 26, 2013 2:55:58 PM org.apache.solr.core.SolrCore registerSearcher
> INFO: [collection1] Registered new searcher 
> Searcher@37342445main{StandardDirectoryReader(segments_2:1:nrt)}
> Apr 26, 2013 2:55:58 PM
> org.apache.solr.update.processor.LogUpdateProcessor finish
> INFO: [collection1] webapp=/solr path=/update/extract params={commit=true}
> {commit=} 0 156
>
>
>
>
>
> 2013/4/26 Jan Høydahl <jan....@cominvent.com>
>
>> http://wiki.apache.org/solr/post.jar
>>
>> --
>> Jan Høydahl, search solution architect
>> Cominvent AS - www.cominvent.com
>> Solr Training - www.solrtraining.com
>>
>> 26. apr. 2013 kl. 13:28 skrev Furkan KAMACI <furkankam...@gmail.com>:
>>
>> > Hi Raymond;
>> >
>> > Now I get that error: SimplePostTool: WARNING: IOException while reading
>> > response: java.io.FileNotFoundException:
>> >
>> > 2013/4/26 Raymond Wiker <rwi...@gmail.com>
>> >
>> >> You could start by doing
>> >>
>> >> java post.jar -help
>> >>
>> >> --- the 7th example shows exactly what you need to do to add a
>> document id.
>> >>
>> >> On Fri, Apr 26, 2013 at 11:30 AM, Furkan KAMACI <
>> furkankam...@gmail.com
>> >>> wrote:
>> >>
>> >>> I use Solr 4.2.1 and these are my fields:
>> >>>
>> >>> <field name="id" type="string" indexed="true" stored="true"
>> >> required="true"
>> >>> multiValued="false" />
>> >>> <field name="text" type="text_general" indexed="true" stored="true"/>
>> >>>
>> >>>
>> >>> <!-- Common metadata fields, named specifically to match up with
>> >>> SolrCell metadata when parsing rich documents such as Word, PDF.
>> >>> Some fields are multiValued only because Tika currently may return
>> >>> multiple values for them. Some metadata is parsed from the documents,
>> >>> but there are some which come from the client context:
>> >>> "content_type": From the HTTP headers of incoming stream
>> >>> "resourcename": From SolrCell request param resource.name
>> >>> -->
>> >>> <field name="title" type="text_general" indexed="true" stored="true"
>> >>> multiValued="true"/>
>> >>> <field name="subject" type="text_general" indexed="true"
>> stored="true"/>
>> >>> <field name="description" type="text_general" indexed="true"
>> >>> stored="true"/>
>> >>> <field name="comments" type="text_general" indexed="true"
>> stored="true"/>
>> >>> <field name="author" type="text_general" indexed="true"
>> stored="true"/>
>> >>> <field name="keywords" type="text_general" indexed="true"
>> stored="true"/>
>> >>> <field name="category" type="text_general" indexed="true"
>> stored="true"/>
>> >>> <field name="resourcename" type="text_general" indexed="true"
>> >>> stored="true"/>
>> >>> <field name="url" type="text_general" indexed="true" stored="true"/>
>> >>> <field name="content_type" type="string" indexed="true" stored="true"
>> >>> multiValued="true"/>
>> >>> <field name="last_modified" type="date" indexed="true" stored="true"/>
>> >>> <field name="links" type="string" indexed="true" stored="true"
>> >>> multiValued="true"/>
>> >>>
>> >>> <!-- Main body of document extracted by SolrCell.
>> >>> NOTE: This field is not indexed by default, since it is also copied to
>> >>> "text"
>> >>> using copyField below. This is to save space. Use this field for
>> >> returning
>> >>> and
>> >>> highlighting document content. Use the "text" field to search the
>> >> content.
>> >>> -->
>> >>> <field name="content" type="text_general" indexed="false"
>> stored="true"
>> >>> multiValued="true"/>
>> >>>
>> >>>
>> >>> <!-- catchall field, containing all other searchable text fields
>> >>> (implemented
>> >>> via copyField further on in this schema -->
>> >>> <!--
>> >>> <field name="text" type="text_general" indexed="true" stored="false"
>> >>> multiValued="true"/>
>> >>> -->
>> >>> <!-- catchall text field that indexes tokens both normally and in
>> reverse
>> >>> for efficient
>> >>> leading wildcard queries. -->
>> >>> <field name="text_rev" type="text_general_rev" indexed="true"
>> >>> stored="false" multiValued="true"/>
>> >>>
>> >>> <!-- non-tokenized version of manufacturer to make it easier to sort
>> or
>> >>> group
>> >>> results by manufacturer. copied from "manu" via copyField -->
>> >>> <field name="manu_exact" type="string" indexed="true" stored="false"/>
>> >>>
>> >>> <field name="payloads" type="payloads" indexed="true" stored="true"/>
>> >>>
>> >>> <field name="_version_" type="long" indexed="true" stored="true"/>
>> >>>
>> >>> I run that command:
>> >>>
>> >>> java -Durl=http://localhost:8983/solr/update/extract -jar post.jar
>> >>> 523387.pdf
>> >>>
>> >>> However I get that error, any ideas?
>> >>>
>> >>> Apr 26, 2013 12:26:51 PM org.apache.solr.common.SolrException log
>> >>> SEVERE: org.apache.solr.common.SolrException: Document is missing
>> >> mandatory
>> >>> uniqueKey field: id
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.update.AddUpdateCommand.getIndexedId(AddUpdateCommand.java:88)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.update.processor.DistributedUpdateProcessor.versionAdd(DistributedUpdateProcessor.java:464)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.update.processor.DistributedUpdateProcessor.processAdd(DistributedUpdateProcessor.java:346)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.update.processor.LogUpdateProcessor.processAdd(LogUpdateProcessorFactory.java:100)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.handler.extraction.ExtractingDocumentLoader.doAdd(ExtractingDocumentLoader.java:121)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.handler.extraction.ExtractingDocumentLoader.addDoc(ExtractingDocumentLoader.java:126)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:228)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
>> >>> at org.apache.solr.core.SolrCore.execute(SolrCore.java:1817)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:639)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:345)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:141)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1307)
>> >>> at
>> >>>
>> >>
>> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:453)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:137)
>> >>> at
>> >>>
>> >>
>> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:560)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:231)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1072)
>> >>> at
>> >>>
>> org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:382)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:193)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1006)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
>> >>> at org.eclipse.jetty.server.Server.handle(Server.java:365)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:485)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(BlockingHttpConnection.java:53)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.AbstractHttpConnection.content(AbstractHttpConnection.java:937)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.content(AbstractHttpConnection.java:998)
>> >>> at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:856)
>> >>> at
>> org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:240)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpConnection.java:72)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(SocketConnector.java:264)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
>> >>> at
>> >>>
>> >>>
>> >>
>> org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
>> >>> at java.lang.Thread.run(Thread.java:722)
>> >>>
>> >>
>>
>>
>

Reply via email to