Hi friends, I was thinking how scylladb architecture <http://www.scylladb.com/technology/architecture/> works compared to cassandra which gives them 10x+ performance and lower latency. If you were starting lucene and solr from scratch what would you do to achieve something similar ?
Different language (rust/c++?) for better SIMD <http://blog-archive.griddynamics.com/2015/06/lucene-simd-codec-benchmark-and-future.html> ? Use a GPU with a SSD for posting-list intersection ?(not out yet) Make it in-memory and use better data structures? Shard on cores like scylladb (so 1 shard for each core on the machine) ? External cache (like keeping n redis-servers with big ram/network & slow cpu/disk just for cache) ?? Use better data structures (like algolia autocomplete radix <https://blog.algolia.com/inside-the-algolia-engine-part-2-the-indexing-challenge-of-instant-search/> ) Distributing documents by term instead of id <http://research.microsoft.com/en-us/um/people/trishulc/papers/Maguro.pdf> ? Using ASIC / FPGA ? Regards, Dorian