[ 
https://issues.apache.org/jira/browse/COUCHDB-1288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Filipe Manana updated COUCHDB-1288:
-----------------------------------

    Attachment: couchdb_1288_2.patch

Second version of the patch, for _doc_ids, the optimized code patch is only 
triggered if the number of doc IDs is not greater than 100. This is too avoid 
loading too many full_doc_info records into memory, which can be big if the rev 
trees are long and/or with many branches.

> More efficient builtin filters _doc_ids and _design
> ---------------------------------------------------
>
>                 Key: COUCHDB-1288
>                 URL: https://issues.apache.org/jira/browse/COUCHDB-1288
>             Project: CouchDB
>          Issue Type: Improvement
>            Reporter: Filipe Manana
>         Attachments: couchdb_1288.patch, couchdb_1288_2.patch
>
>
> We have the _doc_ids and _design _changes filter as of CouchDB 1.1.0.
> While they meet the expectations of applications/users, they're far from 
> efficient for large databases.
> Basically the implementation folds the entire seq btree and then filters 
> values by the document's ID, causing too much IO and busting caches. This 
> makes replication by doc IDs not so efficient as it could be.
> The proposed patch avoids this by doing direct lookups in the ID btree, for 
> _doc_ids, and ranged fold for _design.
> If there are no objections, I would apply to branch 1.2.x besides 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to