[
https://issues.apache.org/jira/browse/JENA-44?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13034274#comment-13034274
]
Andy Seaborne commented on JENA-44:
-----------------------------------
a few questions:
1 - what can be shared between JENA-45 (spill to disk update)? Code for
to/from disk?
JENA-45 uses SSE for tuples parsing (but what about bnodes?_
JENA-44 has BindingSerializer.
Is there anything in common?
What about the deferred buffering?
2 - Control of resources (part 1)
JENA-44 reads a symbol ; JENA-45 has a ThresholdPolicyCount. Do we need a
combination?
3 - Generally, RAM is a system-wide resource used for caching in TDB/32. Do we
need to have something adaptive?
> Support external sorting of bindings in ARQ
> -------------------------------------------
>
> Key: JENA-44
> URL: https://issues.apache.org/jira/browse/JENA-44
> Project: Jena
> Issue Type: New Feature
> Components: ARQ
> Reporter: Sam Tunnicliffe
> Assignee: Paolo Castagna
> Priority: Minor
> Attachments: JENA-44-0.patch, JENA-44_ARQ_r8531.patch,
> JENA-44_ARQ_r8724.patch
>
>
> In QueryIterSort, the sorting of the contents of an Iterator<Binding> is done
> in memory, using Arrays.sort. This can be problematic where the set to be
> sorted is large. A possible solution could be to use an external, disk-backed
> algorithm. A hybrid approach may be better, whereby we attempt the in-memory
> sort, but when the number of bindings encountered goes over a certain number,
> resort to the disk-backed variant.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira