[
https://issues.apache.org/jira/browse/SOLR-2613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15197981#comment-15197981
]
Shawn Heisey commented on SOLR-2613:
------------------------------------
This came up in a discussion on IRC today, talking about nested entity
situations where the inner entities have a very large number of rows, so
memory-based caches would require far more memory than the machine can hold.
The Oracle Berkeley DB implementation was specifically mentioned, which is why
I'm here instead of opening a new issue. This is licensed under the AGPL, so
we can't distribute it, but I wonder if maybe we could implement enough of an
API layer that a user could provide the jar themselves, tell Solr what class
will be needed, and be in business. Is this what the patch on this issue does?
I haven't looked deeply.
Other ideas, which might need a separate issue for disk-based caching
implementations:
I had the idea of using SQLite for caching in a single-file database. SQLite
is public domain, and there are ways to access it from Java.
Even just a simple implementation that writes little files to the disk would
work. To avoid tons of files in a single directory, perhaps this idea could
get a 32-bit hash of the key and write to a four-level directory structure
where each directory is two hex characters. df/8c/12/b5
A disk-based solution would not be as fast as the memory-based solution already
available, but as long as it was on a local physical disk, it would probably be
faster than making N+1 queries to a remote database.
> DIH Cache backed w/bdb-je
> -------------------------
>
> Key: SOLR-2613
> URL: https://issues.apache.org/jira/browse/SOLR-2613
> Project: Solr
> Issue Type: Improvement
> Components: contrib - DataImportHandler
> Affects Versions: 4.0-ALPHA
> Reporter: James Dyer
> Priority: Minor
> Attachments: SOLR-2613.patch, SOLR-2613.patch, SOLR-2613.patch,
> SOLR-2613.patch, SOLR-2613.patch, SOLR-2613.patch, SOLR-2613.patch
>
>
> This is spun out of SOLR-2382, which provides a framework for multiple
> cacheing implementations with DIH. This cache implementation is fast &
> flexible, supporting persistence and delta updates. However, it depends on
> Berkley Database Java Edition so in order to evaluate this and use it you
> must download bdb-je from Oracle and accept the license requirements.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]