[
https://issues.apache.org/jira/browse/JENA-44?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13098319#comment-13098319
]
Paolo Castagna commented on JENA-44:
------------------------------------
Thanks Stephen, it is always good to have another pair (or more) of eyes to
look at my stuff.
I wasn't sure about 3) but you convinced me it's not necessary. I put it there
thinking it was necessary in case .cancel() is called before
SortedBindingIterator is initialized, but at that point we cannot have open
iterators or open files which need to be closed.
Pending "thumbs up" from Andy, I am ready to commit this (it has been a long
journey, but at the end I think we addressed all the Andy concerns and with
your great help we have a good quality improvement to spill stuff on disk for
better scalability.
> Support external sorting of bindings in ARQ
> -------------------------------------------
>
> Key: JENA-44
> URL: https://issues.apache.org/jira/browse/JENA-44
> Project: Jena
> Issue Type: New Feature
> Components: ARQ
> Reporter: Sam Tunnicliffe
> Assignee: Paolo Castagna
> Priority: Minor
> Attachments: JENA-44-0.patch, JENA-44-ARQ_r1165687.patch,
> JENA-44-Depends-on-JENA-99-r1157891.patch, JENA-44_ARQ_r1156212.patch,
> JENA-44_ARQ_r1165123.patch, JENA-44_ARQ_r8531.patch, JENA-44_ARQ_r8724.patch
>
>
> In QueryIterSort, the sorting of the contents of an Iterator<Binding> is done
> in memory, using Arrays.sort. This can be problematic where the set to be
> sorted is large. A possible solution could be to use an external, disk-backed
> algorithm. A hybrid approach may be better, whereby we attempt the in-memory
> sort, but when the number of bindings encountered goes over a certain number,
> resort to the disk-backed variant.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira