Re: Example Data Modelling

2015-07-11 Thread Jérôme Mainaud
;>>>> On Tue, Jul 7, 2015 at 11:32 AM, Peer, Oded wrote: >>>>> >>>>>> The data model suggested isn’t optimal for the “end of month” query >>>>>> you want to run since you are not querying by partition key. >>>>>> >

Re: Example Data Modelling

2015-07-08 Thread Saladi Naidu
month as clustering and keep employee details as static columns so they wont be repeated  Naidu Saladi From: Srinivasa T N To: "user@cassandra.apache.org" Sent: Tuesday, July 7, 2015 3:07 AM Subject: Re: Example Data Modelling Thanks for the inputs. Now my question is how

Re: Example Data Modelling

2015-07-07 Thread John Sanda
or a specific month which might > cause hotspots on those nodes. > > > > Choose the approach that works best for you. > > > > > > *From:* Carlos Alonso [mailto:i...@mrcalonso.com > ] > *Sent:* Monday, July 06, 2015 7:04 PM > *To:* user@cassandra.apache.or

Re: Example Data Modelling

2015-07-07 Thread Carlos Alonso
;>>>> where month = 1” which requires filtering and has unpredictable >>>>> performance. >>>>> >>>>> >>>>> >>>>> For this type of query to be fast you can use the “month” column as >>>>> the pa

Re: Example Data Modelling

2015-07-07 Thread Jérôme Mainaud
This approach also has drawbacks: >>>> >>>> 1. This data model creates a wide row. Depending on the number of >>>> employees this partition might be very large. You should limit partition >>>> sizes to 25MB >>>> >>>> 2. D

Re: Example Data Modelling

2015-07-07 Thread Rory Bramwell, DevOp Services
ng data according to month means that only a small number >>> of nodes will hold all of the salary data for a specific month which might >>> cause hotspots on those nodes. >>> >>> >>> >>> Choose the approach that works best for you. >>

Re: Example Data Modelling

2015-07-07 Thread Carlos Alonso
ion >> sizes to 25MB >> >> 2. Distributing data according to month means that only a small number of >> nodes will hold all of the salary data for a specific month which might >> cause hotspots on those nodes. >> >> >> >> Choose the approach that

Re: Example Data Modelling

2015-07-07 Thread Srinivasa T N
number of > nodes will hold all of the salary data for a specific month which might > cause hotspots on those nodes. > > > > Choose the approach that works best for you. > > > > > > *From:* Carlos Alonso [mailto:i...@mrcalonso.com] > *Sent:* Monday, Ju

RE: Example Data Modelling

2015-07-06 Thread Peer, Oded
7:04 PM To: user@cassandra.apache.org Subject: Re: Example Data Modelling Hi Srinivasa, I think you're right, In Cassandra you should favor denormalisation when in RDBMS you find a relationship like this. I'd suggest a cf like this CREATE TABLE salaries ( EmpID varchar, FN var

Re: Example Data Modelling

2015-07-06 Thread Carlos Alonso
Hi Srinivasa, I think you're right, In Cassandra you should favor denormalisation when in RDBMS you find a relationship like this. I'd suggest a cf like this CREATE TABLE salaries ( EmpID varchar, FN varchar, LN varchar, Phone varchar, Address varchar, month integer, basic integer,

Example Data Modelling

2015-07-06 Thread Srinivasa T N
Hi, I have basic doubt: I have an RDBMS with the following two tables: Emp - EmpID, FN, LN, Phone, Address Sal - Month, Empid, Basic, Flexible Allowance My use case is to print the Salary slip at the end of each month and the slip contains emp name and his other details. Now, if