Hi David:

Please see the new patch that I made available for JIRA:
https://issues.apache.org/jira/browse/SOLR-3816

This removes the code that Yonik had highlighted and also introduces granularity-based realtime-search, with two levels: request granularity and intra-request granularity. Request granularity means that each request may return new results, while the underlying code ensures that all the components of a request (search, highlighting, faceting, etc.) see the same view of the index. Intra-request granularity means that each component may see the changes happening to the index, so each component may return different results.
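To illustrate the difference between the two granularities, here is a minimal toy sketch. It is not the code from the patch: the class GranularityDemo, its methods, and the list-backed "index" are all hypothetical, and it only assumes that an index view can be modelled as an immutable snapshot that writers replace atomically.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.atomic.AtomicReference;

/** Toy model only (hypothetical names): the "index" is an immutable list of docs. */
public class GranularityDemo {
    // Writers publish a new immutable view; readers pick one up with get().
    private final AtomicReference<List<String>> current =
            new AtomicReference<>(Collections.<String>emptyList());

    /** Publish a new view that contains one more document. */
    public void add(String doc) {
        current.updateAndGet(old -> {
            List<String> next = new ArrayList<>(old);
            next.add(doc);
            return Collections.unmodifiableList(next);
        });
    }

    /** Request granularity: one view is pinned and shared by every component. */
    public int[] requestGranularity(Runnable betweenComponents) {
        List<String> view = current.get();
        int searchHits = view.size();   // "search" component
        betweenComponents.run();        // simulated concurrent update
        int facetCount = view.size();   // "faceting" component, same pinned view
        return new int[] {searchHits, facetCount};
    }

    /** Intra-request granularity: each component fetches the latest view. */
    public int[] intraRequestGranularity(Runnable betweenComponents) {
        int searchHits = current.get().size();  // "search" component
        betweenComponents.run();                // simulated concurrent update
        int facetCount = current.get().size();  // "faceting" sees the newer view
        return new int[] {searchHits, facetCount};
    }

    public static void main(String[] args) {
        GranularityDemo index = new GranularityDemo();
        index.add("doc1");
        int[] a = index.requestGranularity(() -> index.add("doc2"));
        int[] b = index.intraRequestGranularity(() -> index.add("doc3"));
        System.out.println("request: " + a[0] + "/" + a[1]);
        System.out.println("intra-request: " + b[0] + "/" + b[1]);
    }
}
```

In this toy model the pinned view keeps the search and faceting counts of one request in agreement even when a document is added in between, whereas the intra-request variant lets the second component observe the newer document.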

Request granularity performs better than intra-request granularity. The SolrIndexSearcher object is not closed as it was before. For a further improvement in performance, the commit or autocommit time can be set to a very high value and the transaction log disabled (for use without SolrCloud).


Regards,

Nagendra Nagarajayya
http://solr-ra.tgels.org
http://elasticsearch-ra.tgels.org
http://rankingalgorithm.tgels.org


On 1/30/2013 7:42 AM, Nagendra Nagarajayya wrote:
Hi David:

There are no NRT tricks being used. It uses the NRT-capable Reader made available by the IndexWriter
(the source is attached to the JIRA).

I would suggest that you give this a try. You can download it from here: http://solr-ra.tgels.org (either the 4.0 or the 3.6.2 version).

I have a user who is using this in realtime, having indexed close to 2 billion docs with no issues. If you do find any problems, please let me know or add them to the existing JIRA so that I can fix them.

Regards,

Nagendra Nagarajayya
http://solr-ra.tgels.org
http://elasticsearch-ra.tgels.org
http://rankingalgorithm.tgels.org


On 1/29/2013 8:23 AM, Smiley, David W. wrote:
Hi.

Speaking for myself anyway, I am leery of using it without it having
extensive concurrent tests to validate that the NRT tricks you're doing
don't produce incorrect results.  It would no doubt be very difficult to
develop this test. And it would be the kind of test that runs for
a while, and that you would stop after running it overnight or for whatever
duration makes one feel comfortable.

~ David

On 1/29/13 8:34 AM, "Nagendra Nagarajayya" <[email protected]> wrote:

Hi David:

Did you have a chance to see my comments in the JIRA?

Regards,
-NN

On 1/28/2013 11:58 AM, Smiley, David W. wrote:
Nagendra,

I'm surprised to see you're still promoting your realtime-search based
system given the critical problem that Yonik found:

https://issues.apache.org/jira/browse/SOLR-3816?focusedCommentId=13494815&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13494815

That is a serious fundamental flaw, I'm sorry to say.  To quote Yonik:
"You'll get incorrect documents back, incorrect facets back, pretty much
any number of random looking bugs because internal docids will be
changing underneath you." This won't necessarily happen all the time,
depending on the timing of the search with respect to concurrent changes,
but it can happen.

~ David Smiley

On 1/27/13 9:25 AM, "Nagendra Nagarajayya" <[email protected]> wrote:

Hi:

I am very excited to announce the availability of Apache Solr 3.6.2 with
RankingAlgorithm30 1.4.3 with realtime-search support. realtime-search is
very fast NRT and allows you not only to look up a document by id but
also to search in realtime, see http://tgels.org/realtime-nrt.jsp. The
update performance is about 10,000 docs / sec. The query performance is in
ms, and allows you to query a 10m wikipedia index (complete index) in <50 ms.

This release also includes an experimental TimedSerialMergeScheduler
<http://rankingalgorithm.1050964.n5.nabble.com/TimedSerialMergerScheduler-java-allows-merges-to-be-deferred-to-a-known-time-like-11pm-or-1am-tp5706350.html>
that allows you to postpone your merges to off-hours times like 11pm or 1am,
increasing performance.
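The idea of deferring merges to an off-hours window can be sketched as a simple time check. This is only an illustration and not the TimedSerialMergeScheduler source: the class MergeWindow and its method allows are hypothetical names, and the sketch assumes the scheduler would ask such a check before starting a merge.

```java
import java.time.LocalTime;

/** Hypothetical helper: decides whether a deferred merge may run right now. */
public class MergeWindow {
    private final LocalTime start;
    private final LocalTime end;

    public MergeWindow(LocalTime start, LocalTime end) {
        this.start = start;
        this.end = end;
    }

    /** True when now is in [start, end), including windows that wrap past midnight. */
    public boolean allows(LocalTime now) {
        if (start.equals(end)) {
            return true; // degenerate window: treat as always open
        }
        if (start.isBefore(end)) {
            // Same-day window, e.g. 01:00 to 03:00
            return !now.isBefore(start) && now.isBefore(end);
        }
        // Window wraps past midnight, e.g. 23:00 to 01:00
        return !now.isBefore(start) || now.isBefore(end);
    }

    public static void main(String[] args) {
        MergeWindow offHours = new MergeWindow(LocalTime.of(23, 0), LocalTime.of(1, 0));
        System.out.println("noon allowed: " + offHours.allows(LocalTime.NOON));
        System.out.println("23:30 allowed: " + offHours.allows(LocalTime.of(23, 30)));
    }
}
```

A scheduler built on a check like this would queue merge requests while the window is closed and run them one after another once it opens.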

RankingAlgorithm30 1.4.3 supports the entire Lucene Query Syntax, ±
and/or boolean queries.

You can get more information about realtime-search performance from here:
http://solr-ra.tgels.org/wiki/en/Near_Real_Time_Search_ver3.x

You can download Solr 3.6.2 with RankingAlgorithm30 1.4.3 from here:
http://solr-ra.tgels.org

Please download and give the new version a try.

Note:
1. Apache Solr 3.6.2 with RankingAlgorithm30 1.4.3 is an external project.
2. realtime-search has been contributed back to Apache Solr, see
https://issues.apache.org/jira/browse/SOLR-3816


Regards,

Nagendra Nagarajayya
http://solr-ra.tgels.org
http://elasticsearch-ra.tgels.org
http://rankingalgorithm.tgels.org







