Re: Tag filtering data model

2015-09-19 Thread Artur Siekielski
I came to a similar conclusion, that is if you have more than a few tags, then the problem is no more simple "tagging" but more like regular "document search" with indexed words. There are too many word subsets to precompute matching documents, so you need to index documents individually and

Re: Tag filtering data model

2015-09-16 Thread Naresh Yadav
We also had similar usecase, after lot of trials with cassandra, we finally created solr schema doc_id(unique key), tags(indexed) in apache solr for answering search query "Get me matching docs by any given no of tags" and that solved our usecase. We had usecase of millions of docs and in tags we

Re: Tag filtering data model

2015-09-15 Thread Carlos Alonso
Really interesting question Artur. Have you gone any further? I think, based on my experience and recalling Cassandra's good practices, that full denormalisation is the Cassandra way to go. Cheers Carlos Alonso | Software Engineer | @calonso On 11 September 2015

Tag filtering data model

2015-09-11 Thread Artur Siekielski
I store documents submitted by users, with optional tags (lists of strings): CREATE TABLE doc ( user_id uuid, date text, // part of partition key, to distribute data better doc_id uuid, tags list, contents text, PRIMARY KEY((user_id, date), doc_id) ); What is the best way to