efficient ?

Dorian Hoxha Fri, 20 Jan 2017 08:13:32 -0800

Hi friends,

I was thinking how scylladb architecture
<http://www.scylladb.com/technology/architecture/> works compared to
cassandra which gives them 10x+ performance and lower latency. If you were
starting lucene and solr from scratch what would you do to achieve
something similar ?


Different language (rust/c++?) for better SIMD
<http://blog-archive.griddynamics.com/2015/06/lucene-simd-codec-benchmark-and-future.html>
?
Use a GPU with a SSD for posting-list intersection ?(not out yet)
Make it in-memory and use better data structures?
Shard on cores like scylladb (so 1 shard for each core on the machine) ?
External cache (like keeping n redis-servers with big ram/network & slow
cpu/disk just for cache) ??
Use better data structures (like algolia autocomplete radix
<https://blog.algolia.com/inside-the-algolia-engine-part-2-the-indexing-challenge-of-instant-search/>
)
Distributing documents by term instead of id
<http://research.microsoft.com/en-us/um/people/trishulc/papers/Maguro.pdf> ?
Using ASIC / FPGA ?

Regards,
Dorian

How would you architect solr/lucene if you were starting from scratch for them to be 10X+ faster/efficient ?

Reply via email to