[ 
https://issues.apache.org/jira/browse/SOLR-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shalin Shekhar Mangar updated SOLR-1406:
----------------------------------------

    Attachment: SOLR-1406.patch

# Made FileDataSource and FileListEntityProcessor more extensible.
# I also found that biggerThan and smallerThan were two parameters in 
FileListEntityProcessor which were never being set. I've fixed that.

I have not exposed FileListEntityProcessor's getFolderFiles method because I'd 
prefer to keep the implementation private for now so that we can change it in 
future without worrying about back-compat (see SOLR-1313). However, if one 
wants to know the file names being processed, one can override 
EntityProcessorBase#getNext.

Luke, does this help in your use-case?

> Refactor FileDataSource and FileListEntityProcessor to be more extensible
> -------------------------------------------------------------------------
>
>                 Key: SOLR-1406
>                 URL: https://issues.apache.org/jira/browse/SOLR-1406
>             Project: Solr
>          Issue Type: Improvement
>          Components: contrib - DataImportHandler
>    Affects Versions: 1.4
>            Reporter: Luke Forehand
>            Assignee: Noble Paul
>         Attachments: SOLR-1406.patch, SOLR-1406.patch
>
>
> FileDataSource should make openStream method protected so we can extend 
> FileDataSource for other File types such as GZip, by controlling the 
> underlying InputStreamReader implementation being returned.
> FileListEntityProcessor needs to aggregate a list of files that were 
> processed and expose that list in an accessible way so that further 
> processing on that file list can be done in the close method.  For example, 
> deletion or archiving.
> Another improvement would be that in the event of an indexing rollback event, 
> processing of the close method either does not occur, or the close method is 
> allowed access to that event, to prevent processing within the close method 
> if necessary.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to