[ 
https://jira.duraspace.org/browse/DS-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=22588#comment-22588
 ] 

Mark Diggory commented on DS-599:
---------------------------------

Peter, what is the memory profile?  If we can update a larger number of Solr 
documents at once, this might improve performance significantly. For instance, 
if a database query could be constructed that calculates the Bundle for each 
Bitstream, followed by a retrieval from Solr of all events that match a 
Bitstream, then you could generate a single Solr update call that updates all 
of those documents at once.
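
A rough sketch of that batching idea, in Python purely for illustration. It 
assumes the events have already been fetched from Solr as plain dicts and that 
the database query has produced a bitstream-id to bundle-name mapping. The 
function name and the field names "id" and "bundleName" are hypothetical, and 
whether re-adding a document replaces the old one depends on the schema 
defining a unique key, which is not confirmed here.

```python
import xml.etree.ElementTree as ET


def build_add_message(docs, bundle_by_bitstream):
    """Build one Solr XML <add> message covering every document in docs.

    docs: list of dicts (field name -> value), e.g. events fetched from Solr.
    bundle_by_bitstream: hypothetical mapping of bitstream id -> bundle name,
    e.g. the result of a single database query.
    """
    add = ET.Element("add")
    for doc in docs:
        updated = dict(doc)  # copy, so the caller's documents stay untouched
        bundle = bundle_by_bitstream.get(doc.get("id"))
        if bundle is not None:
            # "bundleName" is an assumed field name, not taken from the schema
            updated["bundleName"] = bundle
        doc_el = ET.SubElement(add, "doc")
        for name, value in updated.items():
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(value)
    return ET.tostring(add, encoding="unicode")
```

The resulting string would be posted to the update handler in one request, 
instead of issuing one request per document.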

Alternatively, if you were to approach this as a CSV-based update, you could 
serialize the relevant Solr docs to a CSV file, add the new bundle field to 
that file, and then reimport it into Solr:

http://wiki.apache.org/solr/UpdateCSV
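
The middle step of the CSV route (adding the bundle field to the exported 
file) could look roughly like this. It is a minimal sketch in Python, assuming 
the export has a header row and an "id" column holding the bitstream id. The 
function name, the column names, and the mapping are hypothetical, and how the 
file is exported from and reimported into Solr is left out.

```python
import csv
import io


def add_bundle_column(csv_text, bundle_by_bitstream,
                      id_field="id", bundle_field="bundleName"):
    """Return csv_text with an extra bundle column filled from the mapping.

    bundle_by_bitstream: hypothetical mapping of bitstream id (as a string)
    to bundle name. Rows with no known bundle get an empty value.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    fieldnames = list(reader.fieldnames or [])
    if bundle_field not in fieldnames:
        fieldnames.append(bundle_field)

    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        row[bundle_field] = bundle_by_bitstream.get(row[id_field], "")
        writer.writerow(row)
    return out.getvalue()


exported = "id,type\n10,0\n11,0\n"
print(add_bundle_column(exported, {"10": "ORIGINAL", "11": "LICENSE"}))
```

The rewritten text is what would then be loaded back through the CSV update 
handler described at the link above.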
                
> SOLR statistics file download displays all files and not only those in the 
> Bundle Original
> ------------------------------------------------------------------------------------------
>
>                 Key: DS-599
>                 URL: https://jira.duraspace.org/browse/DS-599
>             Project: DSpace
>          Issue Type: Bug
>          Components: Solr
>    Affects Versions: 1.6.0, 1.6.1, 1.6.2
>            Reporter: Claudia Jürgen
>            Assignee: Kevin Van de Velde
>            Priority: Major
>             Fix For: 1.8.0
>
>         Attachments: DS-559--AddBundleNameToSOLR.patch, 
> DS-559--AddBundleNameToSOLR_V0_2.patch, 
> DS-559--AddBundleNameToSOLR_V0_3.patch, Original_bundle_bugfix.patch
>
>
> The file download statistic for an item displays all the bitstreams 
> regardless of the bundle they belong to.
> So licenses, extracted texts got displayed and counted.  This is a bit 
> confusing for the normal user as their existence is usually hidden from him.
> Furthermore I wonder whether views from the edit item stage should be counted 
> as "regular" views at all.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://jira.duraspace.org/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

_______________________________________________
Dspace-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-devel
