[ 
https://issues.apache.org/jira/browse/SOLR-12278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16459412#comment-16459412
 ] 

Cao Manh Dat edited comment on SOLR-12278 at 5/1/18 3:28 AM:
-------------------------------------------------------------

[~dsmiley] some problem of that approach
 * we have to modify all other parsers,
 * each parser has its own set of parameters, which make the size of a 
SolrInputDocument quite different with the number of bytes from the input (ie: 
SOLR-6304)
 * what happens if the users have some processor in the middle which 
significantly enriches the SolrInputDocument

In short vision, IgnoreLargeDocumentProcessor might handy for users who need to 
filter large documents and work accurately in all cases.


was (Author: caomanhdat):
[~dsmiley] some problem of that approach
 * we have to modify all other parsers,
 * each parser has its own set of parameters, which make the size of a 
SolrInputDocument quite different with the number of bytes from the input (ie: 
SOLR-6304)
 * what happens if the users have some processor in the middle which 
significantly enriches the SolrInputDocument

In short vision, IgnoreLargeDocumentProcessor might handy for users who need to 
filter large documents.

> Ignore very large document on indexing
> --------------------------------------
>
>                 Key: SOLR-12278
>                 URL: https://issues.apache.org/jira/browse/SOLR-12278
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Cao Manh Dat
>            Assignee: Cao Manh Dat
>            Priority: Major
>         Attachments: SOLR-12278.patch, SOLR-12278.patch
>
>
> Solr should be able to ignore very large document, so it won't affect the 
> index as well as the tlog. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to