Re: Query on Data Modelling of a specific usecase

2017-04-20 Thread Naresh Yadav
suggest optimal table design keeping one table in mind with 10 million to 1 billion rows scale for the mentioned queries. Thanks, Naresh Yadav On Wed, Apr 19, 2017 at 9:26 PM, Jon Haddad wrote: > How much data do you plan to store in each table? > > I’ll be honest, this doesn’t sou

Re: Query on Data Modelling of a specific usecase

2017-04-18 Thread Naresh Yadav
Looking for cassandra expert's recommendation on above usecase, please reply. On Mon, Apr 17, 2017 at 7:37 PM, Naresh Yadav wrote: > Hi all, > > This is my existing table configured on apache-cassandra-3.0.9: > > CREATE TABLE report_id1 ( >mc_id text, >tag_id t

Query on Data Modelling of a specific usecase

2017-04-17 Thread Naresh Yadav
Hi all, This is my existing table configured on apache-cassandra-3.0.9: CREATE TABLE report_id1 ( mc_id text, tag_id text, e_date timestamp, value text, PRIMARY KEY ((mc_id, tag_id), e_date) ) I create a table dynamically for each report from the application. Need to support up to 1000 re

Re: Tag filtering data model

2015-09-16 Thread Naresh Yadav
We also had a similar use case. After a lot of trials with Cassandra, we finally created a Solr schema, doc_id (unique key) and tags (indexed), in Apache Solr for answering the search query "Get me matching docs by any given number of tags", and that solved our use case. We had a use case of millions of docs and in tags we c

Re: Help me on Cassandra Data Modelling

2014-01-28 Thread Naresh Yadav
Please share any inputs on the last email. On Tue, Jan 28, 2014 at 7:18 AM, Naresh Yadav wrote: > yes thunder you are right, i had simplified that by moving *tags > *search(partial/exact) > in separate column family tagcombination which will act as index for all > search based on tags

Re: Help me on Cassandra Data Modelling

2014-01-27 Thread Naresh Yadav
toring each partial tag combination as well as > individual tags in the partitioning key, you will be severely duplicating > your storage. You might want to just store individual keys in the > partitioning key. > > Good luck, > Thunder > > > > > On Mon, Jan 27, 2014 at

Re: Help me on Cassandra Data Modelling

2014-01-27 Thread Naresh Yadav
<http://www.datastax.com/what-we-offer/products-services/training/virtual-training> > > > On Mon, Jan 27, 2014 at 7:24 AM, Naresh Yadav wrote: > >> Hi all, >> >> Urgently need help on modelling this usecase on Cassandra. >> >> I have concept of tags and tag

Help me on Cassandra Data Modelling

2014-01-27 Thread Naresh Yadav
Hi all, Urgently need help on modelling this usecase on Cassandra. I have concept of tags and tagcombinations. For example U.S.A and Pen are two tags AND if they come together in some definition then register a tagcombination(U.S.A-Pen) for that.. *tags *(U.S.A, Pen, Pencil, India, Shampoo) *tag

Re: Best design for a usecase ??

2014-01-21 Thread Naresh Yadav
Just to add: on this table there will be lakhs of select queries to get tagcombinationid from a partial set of tags... On Tue, Jan 21, 2014 at 2:33 PM, Naresh Yadav wrote: > Hi, > > I need to design a table which will give a UUID to set of tags. > Each tag itself has

Best design for a usecase ??

2014-01-21 Thread Naresh Yadav
Hi, I need to design a table which will give a UUID to set of tags. Each tag itself has unique UUID *TagCombination* table TC1 -> India, Pen TC2 -> Shampoo, U.K TC3 -> Team1, Product1, Location1 TC4 -> Office1, India, Pen I can have *billion *of such unique combinations and there can be

Getting indexoutbound exception for a specific query on cassandra trunk

2014-01-16 Thread Naresh Yadav
I had taken latest source code of cassandra trunk to evaluate performance of indexing on collections new feature( https://issues.apache.org/jira/browse/CASSANDRA-4511) for my usecase.. IF you configure table like this with commands in given order : CREATE TABLE testcollectionindex(userid text, ti

Re: Help on Designing Cassandra table for my usecase

2014-01-10 Thread Naresh Yadav
On Fri, Jan 10, 2014 at 6:05 PM, Vivek Mishra wrote: > @Naresh > Too many indices or indices with high cardinality should be discouraged > and are always performance issues. A set will not contain duplicate values. > > -Vivek > > > On Fri, Jan 10, 2014 at 5:4

Re: Help on Designing Cassandra table for my usecase

2014-01-10 Thread Naresh Yadav
fining secondary index on it. Is there any risk of performance problems with this design, keeping in mind the huge data? Naresh On Fri, Jan 10, 2014 at 10:26 AM, Naresh Yadav wrote: > @Thunder thanks for suggesting design but my main problem is > indexing/querying dynamic Tag on each row that is mai

Probable release date for cassandra 2.1 ??

2014-01-10 Thread Naresh Yadav
Hi, I am looking for the feature (CASSANDRA-4511) which allows indexes on collections. Any idea about the release date of Cassandra 2.1? Until it is released, I am thinking of taking the 2.1 source code and building it on my machine to test the required feature.

Re: Help on Designing Cassandra table for my usecase

2014-01-09 Thread Naresh Yadav
> >> To my eye that looks something what the traditional analytics systems do. >> You can check out e.g. Acunu Analytics which uses Cassandra as a backend. >> >> Cheers, >> Hannu >> >> >> 2014/1/9 Naresh Yadav >> >>> Hi all, >>

Re: Help on Designing Cassandra table for my usecase

2014-01-09 Thread Naresh Yadav
@thunder It will be write once 80% of time but there can be cases client makes correction in data and then we need to overwrite that.. Thanks Naresh On Thu, Jan 9, 2014 at 11:49 PM, Naresh Yadav wrote: > @thunder thanks for guidance queries will be fired by application on this >

Re: Help on Designing Cassandra table for my usecase

2014-01-09 Thread Naresh Yadav
> store for Pig/Hive. It is a little finicky to get working depending on all > your versions but does work fairly well in my limited experience. > > Perhaps with a little better understanding of your workload needs others > can chime in too. Good luck. > > -Thunder > >

Help on Designing Cassandra table for my usecase

2014-01-09 Thread Naresh Yadav
Hi all, I have a use case with huge data which i am not able to design in cassandra. Table name : MetricResult Sample Data : Metric=Sales, Time=Month, Period=Jan-10, Tag=U.S.A, Tag=Pen, Value=10 Metric=Sales, Time=Month, Period=Jan-10, Tag=U.S.A, Tag=Pencil, Value=20 Metric=Sales, Time=Mo

Re: Setting up a multi-node cluster

2013-08-27 Thread Naresh Yadav
You would need to configure rpc_address also with hostname/ips on both the nodes. Naresh On Wed, Aug 28, 2013 at 10:15 AM, Dinesh wrote: > Hi, > > I am trying to setup a two node Cassandra cluster > > Able to start the first node, but not seeing the following exception while > starting the seco

Re: Cassandra HANGS after some writes

2013-08-16 Thread Naresh Yadav
Wed, Aug 14, 2013 at 10:43 PM, Robert Coli wrote: > On Tue, Aug 13, 2013 at 10:39 PM, Naresh Yadav wrote: > >> I made one single change in default cassandra.yaml, just to experiment. >> >> native_transport_min_threads: *1* >> native_transport_max_threads: *1* >>

Re: Cassandra HANGS after some writes

2013-08-13 Thread Naresh Yadav
I made one single change in default cassandra.yaml, just to experiment. native_transport_min_threads: *1* native_transport_max_threads: *1* with max one single thread for native protocol requests i noticed some improvement, earlier with default yaml most of time it was failing after * 10K* combin

Re: Cassandra HANGS after some writes

2013-08-13 Thread Naresh Yadav
Hi all, I started with Cassandra a few weeks back and I am on a development environment; it will take months to reach production as everything is in development. But I will spend time and set up one machine with Ubuntu and will check whether a similar problem occurs or not... Also i had started hands on Hadoop then linux

Re: Cassandra HANGS after some writes

2013-08-13 Thread Naresh Yadav
ed to change the data and commitlog > directories in cassandra.yaml. Also you should check the log directories. > > See the section 2.1 http://wiki.apache.org/cassandra/GettingStarted > > > On Tue, Aug 13, 2013 at 8:28 AM, Naresh Yadav wrote: > >> >> Thanks Alain,

Re: Cassandra HANGS after some writes

2013-08-13 Thread Naresh Yadav
are known as being bottlenecks / pertinent metrics ? > > Alain > > > 2013/8/13 Naresh Yadav > >> >> Hi All, >> >> I have single node cassandra using CQL using datastax java driver 1.0.1 >> and cassandra version 1.2.6. >> >> *Infrastruct

Cassandra HANGS after some writes

2013-08-13 Thread Naresh Yadav
Hi All, I have single node cassandra using CQL using datastax java driver 1.0.1 and cassandra version 1.2.6. *Infrastructure :* 16GB machine with 8GB heap given to cassandra, i7 processor.. DEFAULT cassandra.yaml no change done by me. -Xms1G^ -Xmx12G^ no other change in cassandra.bat *