Again, I would hope that solr builds a storage agnostic solution.

As long as we have a simple interface to load/store documents, it should be easy to write a JDBC/ehcache/disk/Cassandra/whatever implementation.

ryan


On Dec 4, 2008, at 10:29 AM, Noble Paul നോബിള്‍ नोब्ळ् wrote:

Cassandra does not meet our requirements.
we do not need that kind of scalability

Moreover its future is uncertain and they are trying to incubate it into Solr


On Thu, Dec 4, 2008 at 8:52 PM, Sami Siren <[EMAIL PROTECTED]> wrote:
Yet another possibility: http://wiki.apache.org/incubator/Cassandra

It at least claims to be scalable, no personal experience.

--
Sami Siren

Noble Paul ??????? ?????? wrote:

Another persistence solution is ehcache with diskstore. It even has
replication

I have never used  ehcache . So I cannot comment on it

any comments?

--Noble

On Wed, Dec 3, 2008 at 8:50 PM, Noble Paul ??????? ??????
<[EMAIL PROTECTED]> wrote:


On Wed, Dec 3, 2008 at 5:52 PM, Grant Ingersoll <[EMAIL PROTECTED] >
wrote:


On Dec 3, 2008, at 1:28 AM, Noble Paul ??????? ?????? wrote:



The code can be written against JDBC. But we need to test the DDL and
data types on al the supported DBs

But , which one would we like to ship with Solr as a default option?


Why do we need a default option? Is this something that is intended to
be
on by default? Or, do you mean just to have one for unit tests to work?


Default does not mean that it is enabled bby default. But if it is
enabled I can have defaults for stuff like driver, url , DDL etc. And
the user may not need to provide an extra jar


I don't know if it is still the case, but I often find embedded dbs to
be
quite annoying since you often can't connect to them from other clients outside of the JVM which makes debugging harder. Of course, maybe I
just
don't know the tricks to do it. Derby is one DB that you can still
connect
to even when it is embedded.


Embedded is the best bet for us because of performance reasons and
zero management.
The users can still read the data through Solr itself .


Also, whatever is chosen needs to scale to millions of documents, and I
wonder about an embedded DB doing that.  I also have a hard time
believing
that both a DB w/ millions of docs and Solr can live on the same
machine,
which is presumably what an embedded DB must do. Presumably, it also
needs
to be able to be replicated, right?


millions of docs.?
then you must configure a remote DB for storage reasons
and must manage the replication separately




H2 looks impressive. the jar (small) is just 667KB and the memory
footprint is small too
--Noble

On Wed, Dec 3, 2008 at 10:30 AM, Ryan McKinley <[EMAIL PROTECTED]>
wrote:


check http://www.h2database.com/ in my view the best embedded DB out
there.

from the maker of HSQLDB...  is second round.

However, from anything solr, I would hope it would just rely on JDBC.


On Dec 2, 2008, at 12:08 PM, Shalin Shekhar Mangar wrote:



HSQLDB has a limit of upto 8GB of data. In Solr, you might want to go
beyond
that without a commit.

On Tue, Dec 2, 2008 at 10:33 PM, Dawid Weiss
<[EMAIL PROTECTED]>wrote:



Isn't HSQLDB an option? Its performance ranges a lot depending on
the
volume of data and queries, but otherwise the license looks BSDish.

http://hsqldb.org/web/hsqlLicense.html

Dawid



--
Regards,
Shalin Shekhar Mangar.




--
--Noble Paul


--------------------------
Grant Ingersoll

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ













--
--Noble Paul











--
--Noble Paul

Reply via email to