[ 
https://jira.duraspace.org/browse/DS-892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Van de Velde reassigned DS-892:
-------------------------------------

    Assignee: Kevin Van de Velde
    
> Performance issues in update enabling the StatisticsLoggingConsumer
> -------------------------------------------------------------------
>
>                 Key: DS-892
>                 URL: https://jira.duraspace.org/browse/DS-892
>             Project: DSpace
>          Issue Type: Bug
>          Components: Solr
>    Affects Versions: 1.6.0, 1.6.1, 1.6.2, 1.7.0, 1.7.1
>            Reporter: Andrea Bollini
>            Assignee: Kevin Van de Velde
>            Priority: Critical
>
> We have found that enabling the StatisticsLoggingConsumer to keep statistics 
> data up-to-date after item changes (metadata edit or collection 
> moving/mapping) the item update operations become slowly and the system 
> unusable.
> NOTE: the StatisticsLoggingConsumer is NOT enabled out-of-box in the 
> dspace.cfg this imply that your statistics data could be incongruous (item 
> access assigned to incorrect communities/collections)
> We noticed problems when there are large amount of statistics data (> 20M 
> records), for small repository (< 1M statistics record) the overhead is 
> acceptable.
> Finally, after the introduction of the autocommit patch, the 
> StatisticsLoggingConsumer is not more able to assure the data consistence 
> because the statistics data collected between two auto-commit are not 
> processed by the class.
> Our current idea is to discard the consumer approach in favour to implement a 
> batch tools to periodically analyze the statistics data and fix it as 
> appropriate.
> This issue is a placeholder for such feature and discussion around it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://jira.duraspace.org/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure 
contains a definitive record of customers, application performance, 
security threats, fraudulent activity, and more. Splunk takes this 
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Dspace-devel mailing list
Dspace-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-devel

Reply via email to