[ https://issues.apache.org/jira/browse/SOLR-11549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16219808#comment-16219808 ]
actcat edited comment on SOLR-11549 at 10/26/17 1:14 AM: --------------------------------------------------------- {color:#f79232}I has close schemaless{color} <!-- <updateRequestProcessorChain name="add-unknown-fields-to-the-schema" default="${update.autoCreateFields:true}" processor="uuid,remove-blank,field-name-mutating,parse-boolean,parse-long,parse-double,parse-date,add-schema-fields"> <processor class="solr.LogUpdateProcessorFactory"/> <processor class="solr.DistributedUpdateProcessorFactory"/> <processor class="solr.RunUpdateProcessorFactory"/> </updateRequestProcessorChain> --> {color:#f79232}and i use extract pdf function config is below:{color} <requestHandler name="/update/extract" startup="lazy" class="solr.extraction.ExtractingRequestHandler" > <lst name="defaults"> <str name="wt">xml</str> <str name="lowernames">true</str> <str name="fmap.meta">ignored_</str> <str name="fmap.content">_text_</str> </lst> </requestHandler> {color:#f79232}how to config extract pdf function, not include extra columns{color} {color:blue}<response> <lst name="responseHeader"> <int name="status">400</int> <int name="QTime">81</int> </lst> <lst name="error"> <lst name="metadata"> <str name="error-class">org.apache.solr.common.SolrException</str> <str name="root-error-class">org.apache.solr.common.SolrException</str> </lst> <str name="msg">ERROR: [doc=2N6C000A] unknown field 'date'</str> <int name="code">400</int> </lst> </response> <response> <lst name="responseHeader"> <int name="status">400</int> <int name="QTime">15</int> </lst> <lst name="error"> <lst name="metadata"> <str name="error-class">org.apache.solr.common.SolrException</str> <str name="root-error-class">org.apache.solr.common.SolrException</str> </lst> <str name="msg">ERROR: [doc=2EE0000A] unknown field 'date'</str> <int name="code">400</int> </lst> </response> <response> <lst name="responseHeader"> <int name="status">400</int> <int name="QTime">108</int> </lst> <lst name="error"> <lst name="metadata"> <str name="error-class">org.apache.solr.common.SolrException</str> <str name="root-error-class">org.apache.solr.common.SolrException</str> </lst> <str name="msg">ERROR: [doc=24ER000A] unknown field 'stream_size'</str> <int name="code">400</int> </lst> </response>{color} was (Author: actcat): {color:#f79232}I has close schemaless{color} <!-- <updateRequestProcessorChain name="add-unknown-fields-to-the-schema" default="${update.autoCreateFields:true}" processor="uuid,remove-blank,field-name-mutating,parse-boolean,parse-long,parse-double,parse-date,add-schema-fields"> <processor class="solr.LogUpdateProcessorFactory"/> <processor class="solr.DistributedUpdateProcessorFactory"/> <processor class="solr.RunUpdateProcessorFactory"/> </updateRequestProcessorChain> --> {color:#f79232}and i use extract pdf function{color} <requestHandler name="/update/extract" startup="lazy" class="solr.extraction.ExtractingRequestHandler" > <lst name="defaults"> <str name="wt">xml</str> <str name="lowernames">true</str> <str name="fmap.meta">ignored_</str> <str name="fmap.content">_text_</str> </lst> </requestHandler> {color:#f79232}how to config extract pdf function, not include extra columns{color} {color:blue}<response> <lst name="responseHeader"> <int name="status">400</int> <int name="QTime">81</int> </lst> <lst name="error"> <lst name="metadata"> <str name="error-class">org.apache.solr.common.SolrException</str> <str name="root-error-class">org.apache.solr.common.SolrException</str> </lst> <str name="msg">ERROR: [doc=2N6C000A] unknown field 'date'</str> <int name="code">400</int> </lst> </response> <response> <lst name="responseHeader"> <int name="status">400</int> <int name="QTime">15</int> </lst> <lst name="error"> <lst name="metadata"> <str name="error-class">org.apache.solr.common.SolrException</str> <str name="root-error-class">org.apache.solr.common.SolrException</str> </lst> <str name="msg">ERROR: [doc=2EE0000A] unknown field 'date'</str> <int name="code">400</int> </lst> </response> <response> <lst name="responseHeader"> <int name="status">400</int> <int name="QTime">108</int> </lst> <lst name="error"> <lst name="metadata"> <str name="error-class">org.apache.solr.common.SolrException</str> <str name="root-error-class">org.apache.solr.common.SolrException</str> </lst> <str name="msg">ERROR: [doc=24ER000A] unknown field 'stream_size'</str> <int name="code">400</int> </lst> </response>{color} > SOLR 7 Extract PDF will Add many meta columns > ---------------------------------------------- > > Key: SOLR-11549 > URL: https://issues.apache.org/jira/browse/SOLR-11549 > Project: Solr > Issue Type: Improvement > Security Level: Public(Default Security Level. Issues are Public) > Components: clients - C#, contrib - Solr Cell (Tika extraction) > Affects Versions: 7.1 > Environment: 1.SOLR 7.1 > 2.Use C# 4.6, Lib use SolrNet 0.8.1 > Reporter: actcat > Labels: extract, hasError, pdf > > in solr 6,extract pdf data to default column {color:red}content{color} > in solr 7,extract pdf data ,will {color:red}add many meta columns{color} > how to don't add many meta > columns -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org