og (or whatever that was).
>
Absolutely nothing gets written to /var/log/cassandra/system.log (when
trying to invoke cassandra via cron).
>
> Hannu
>
>
> On 11 Jan 2017, at 16.42, Ajay Garg <ajaygargn...@gmail.com> wrote:
>
> Tried everything.
> Every other cron jo
On Wed, Jan 11, 2017 at 8:29 PM, Martin Schröder <mar...@oneiros.de> wrote:
> 2017-01-11 15:42 GMT+01:00 Ajay Garg <ajaygargn...@gmail.com>:
> > Tried everything.
>
> Then try
>     service cassandra start
> or
>     systemctl start cassandra
>
> You
t I checked, cron doesn't load the same, full environment you see when
>> you log in. Also, why put Cassandra on a cron?
>> On Mon, Jan 9, 2017 at 9:47 PM Bhuvan Rawal <bhu1ra...@gmail.com> wrote:
>>
>>> Hi Ajay,
>>>
>>> Have you had a look at cron lo
"cron" service is running.
Any ideas what might be wrong?
I am pasting the cassandra script for reference.
Thanks and Regards,
Ajay
#! /bin/sh
### BEGIN INIT INFO
# Provides:
Hi Everyone.
Kindly reply with "yes" or "no" as to whether it is possible to set up
encryption only between a particular pair of nodes.
Or is it an "all" or "none" feature, where encryption is present between
EVERY PAIR of nodes, or between NO PAIR of nodes?
Also, I am wondering what the difference is between "all" and "dc" in
"internode_encryption".
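As a rough illustration of the "all" versus "dc" question: going by the cassandra.yaml documentation, internode_encryption takes one of none, all, dc or rack, and it is a node-wide setting rather than something chosen per pair of nodes. The sketch below only models that decision in Python; the Node tuple and the is_encrypted helper are hypothetical names, not Cassandra code, and the rack names are assumptions.

```python
from collections import namedtuple

# Hypothetical description of a node: name, datacenter, rack.
Node = namedtuple("Node", ["name", "dc", "rack"])


def is_encrypted(setting, a, b):
    """Model whether traffic between nodes a and b is encrypted for a
    given internode_encryption value (none, all, dc, rack)."""
    if setting == "none":
        return False
    if setting == "all":
        return True
    if setting == "dc":
        # Only traffic that crosses datacenters is encrypted.
        return a.dc != b.dc
    if setting == "rack":
        # Traffic that crosses racks (or datacenters) is encrypted.
        return a.dc != b.dc or a.rack != b.rack
    raise ValueError("unknown internode_encryption value: %r" % (setting,))


dc1m1 = Node("DC1M1", "DC1", "RAC1")
dc1m2 = Node("DC1M2", "DC1", "RAC1")
dc2m1 = Node("DC2M1", "DC2", "RAC1")

for setting in ("none", "all", "dc", "rack"):
    print(setting,
          "same-dc:", is_encrypted(setting, dc1m1, dc1m2),
          "cross-dc:", is_encrypted(setting, dc1m1, dc2m1))
```

Under this model, "dc" leaves traffic inside a datacenter in the clear and encrypts only the cross-datacenter links, while "all" encrypts every link.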
Perhaps my answer lies in this?
On Mon, Apr 18, 2016 at 9:51 AM, Ajay Garg <ajaygargn...@gmail.com> wrote:
> Ok, trying to wake up this thread again.
>
> I we
is the backup centre, while DC1 is the
primary-centre connected directly to the application-server. We don't want
to screw things if something goes bad in DC1.
Will be grateful for pointers.
Thanks and Regards,
Ajay
On Sun, Jan 17, 2016 at 9:09 PM, Ajay Garg <ajaygargn...@gmail.com> wrote:
>
Something like ::
##
class A {
    @Id
    @Column(name = "pojo_key")
    int key;

    @Ttl(10)
    @Column(name = "pojo_temporary_guest")
    String guest;
}
##
When I persist, let's say value &
up".
Thanks and Regards,
Ajay
On Wed, Jan 6, 2016 at 4:16 PM, Ajay Garg <ajaygargn...@gmail.com> wrote:
> Thanks everyone for the reply.
>
> I actually have quite a few questions, but it will be nice if someone
> could please tell me the flow (implementation-wise), as
d on node2* in cassandra.yaml.
if it is, secure-communication is allowed.
Is my thinking right?
I
On Wed, Jan 6, 2016 at 1:55 PM, Neha Dave <nehajtriv...@gmail.com> wrote:
> Hi Ajay,
> Have a look here :
> https://docs.datastax.com/en/cassandra/1.2/cassandra/security/secureSSLNode
and restart it, will the traffic
between DC1M1/DC1M2 and DC2M2 be secure? Or will security kick in ONLY
AFTER the changes are made on all 4 machines?
Asking here, because I don't want to screw up a live cluster due to my lack
of experience.
Looking forward to some pointers.
--
Regards,
Ajay
(didn't really need that), and we have not
observed the error for about an hour or so.
Thanks Eric and Bryan for the help !!!
Thanks and Regards,
Ajay
On Wed, Nov 4, 2015 at 8:51 AM, Ajay Garg <ajaygargn...@gmail.com> wrote:
> Hmm... ok.
>
> Ideally, we require ::
>
> a
ease your internal node-to-node timeouts.
>
> On Mon, Nov 2, 2015 at 8:01 PM, Ajay Garg <ajaygargn...@gmail.com> wrote:
>
>> Hi Eric,
>>
>> I am sorry, but I don't understand.
>>
>> If there had been some issue in the configuration, then the
>> con
a "write-options with LOCAL_QUORUM
consistency-level" in all create/save statements, but I still get this
error.
Does something else need to be changed in /etc/cassandra/cassandra.yaml too?
Or maybe in some other place?
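For context on the LOCAL_QUORUM question, a quorum is normally computed as floor(RF / 2) + 1 over the replicas in scope (the local datacenter for LOCAL_QUORUM). The minimal sketch below assumes a replication factor of 2 per datacenter, which is only a guess at the setup described here; the function names are hypothetical.

```python
def quorum(replication_factor):
    """Number of replicas that must acknowledge a QUORUM-style request."""
    return replication_factor // 2 + 1


def tolerated_failures(replication_factor):
    """Replicas that may be down while a quorum is still reachable."""
    return replication_factor - quorum(replication_factor)


for rf in (1, 2, 3):
    print("RF=%d quorum=%d tolerated_failures=%d"
          % (rf, quorum(rf), tolerated_failures(rf)))
```

With RF=2 in a datacenter the quorum is 2, so under that assumption a LOCAL_QUORUM write cannot succeed if either local replica is down; RF=3 would tolerate one local failure.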
--
Regards,
Ajay
nsistency.
>
> See
> http://docs.datastax.com/en/cassandra/2.0/cassandra/dml/dml_ltwt_transaction_c.html
>
> On Mon, Nov 2, 2015 at 1:29 AM Ajay Garg <ajaygargn...@gmail.com> wrote:
>
>> Hi All.
>>
>> I have a 2*2 Network-Topology Replication setup, and I run m
only 0 acknowledged the write)*
Am I missing something?
--
Regards,
Ajay
or
CAS12 to be up (although the expectation is that the driver must work fine
if ANY of the 4 nodes is up).
Thoughts, experts !? :)
On Sat, Oct 24, 2015 at 9:40 PM, Ajay Garg <ajaygargn...@gmail.com> wrote:
> Ideas please, on what I may be doing wrong?
>
> On Sat, Oct 24, 2015 at 5:4
Bingo !!!
Using "LoadBalancingPolicy" did the trick.
Exactly what was needed !!!
Thanks and Regards,
Ajay
On Sun, Oct 25, 2015 at 5:52 PM, Ryan Svihla <r...@foundev.pro> wrote:
> Ajay,
>
> So It's the default driver behavior to pin requests to the first
Ideas please, on what I may be doing wrong?
On Sat, Oct 24, 2015 at 5:48 PM, Ajay Garg <ajaygargn...@gmail.com> wrote:
> Hi All.
>
> I have been doing extensive testing, and replication works fine, even if
> any permutation of CAS11, CAS12, CAS21, CAS22 are downed and bro
Never mind Vasileios, you have been a great help !!
Thanks a ton again !!!
Thanks and Regards,
Ajay
On Sat, Oct 24, 2015 at 10:17 PM, Vasileios Vlachos <
vasileiosvlac...@gmail.com> wrote:
> I am not sure I fully understand the question, because nodetool repair is
> one of th
maintained. If that
is the case, I think we are good.
Thanks and Regards,
Ajay
On Sat, Oct 24, 2015 at 6:03 PM, Vasileios Vlachos <
vasileiosvlac...@gmail.com> wrote:
> Hello Ajay,
>
> Here is a good link:
>
> http://docs.datastax.com/en/cassandra/2.1/cassandra/operations/opsRepairNodesMan
nly
communicate over public IP-Addresses with these nodes).
Are any further optimizations possible, in the sense that specifying
private-IP-Addresses would work?
I ask this because we need to minimize network latency, so being able to use
private IP addresses would help in this regard.
Thanks and Regards,
Ajay
If a node in the cluster goes down and comes up, the data gets synced up on
this downed node.
Is there a limit on the interval for which the node can remain down? Or will the
data be synced up even if the node remains down for weeks/months/years?
--
Regards,
Ajay
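A small sketch of how the downtime question relates to the hint window mentioned in this thread (max_hint_window_in_ms). It assumes the commonly documented default of 3 hours and treats anything longer as needing a nodetool repair; the function name and return values are hypothetical, and the real behaviour has more nuance than this two-way split.

```python
# Assumed default for max_hint_window_in_ms (3 hours); check cassandra.yaml.
DEFAULT_MAX_HINT_WINDOW_MS = 3 * 60 * 60 * 1000


def recovery_action(downtime_ms, max_hint_window_ms=DEFAULT_MAX_HINT_WINDOW_MS):
    """Return which mechanism is expected to bring a restarted node up to date."""
    if downtime_ms <= max_hint_window_ms:
        # Other nodes kept hints for the whole outage and can replay them.
        return "hints"
    # Hints stopped being stored part-way through the outage,
    # so a repair is needed to pick up the writes that were missed.
    return "repair"


one_hour = 60 * 60 * 1000
one_week = 7 * 24 * one_hour
print(recovery_action(one_hour))
print(recovery_action(one_week))
```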
and Regards,
Ajay
On Sat, Oct 24, 2015 at 4:14 PM, Vasileios Vlachos <
vasileiosvlac...@gmail.com> wrote:
> Hello Ajay,
>
> Have a look in the *max_hint_window_in_ms* :
>
>
> http://docs.datastax.com/en/cassandra/2.0/cassandra/configuration/configCassandra_yaml_r.h
rs.
Ultimately, it is the Java-driver that will be used in production, so it is
imperative that data-persistence/replication happens for any downing of any
permutation of node(s).
Thanks and Regards,
Ajay
Any ideas, please?
To repeat, we are using the exact same cassandra-version on all 4 nodes
(2.1.10).
On Fri, Oct 23, 2015 at 9:43 AM, Ajay Garg <ajaygargn...@gmail.com> wrote:
> Hi Michael.
>
> Please find below the contents of cassandra.yaml for CAS11 (the files on
> the rest
l the help guys !!!
Thanks and Regards,
Ajay
On Sat, Oct 24, 2015 at 2:06 AM, Steve Robenalt <sroben...@highwire.org>
wrote:
> Hi Ajay,
>
> Please take a look at the cassandra.yaml configuration reference regarding
> initial_token and num_tokens:
>
>
> http://docs.da
cassandra-2.1.10.jar:2.1.10]
at java.lang.Thread.run(Thread.java:745) ~[na:1.8.0_60]
Ideas?
Thanks and Regards,
Ajay
On Mon, Oct 12, 2015 at 3:46 PM, Carlos Alonso <i...@mrcalonso.com> wrote:
> Yes Ajay, in yo
ll
inter_dc_tcp_nodelay: false
What changes need to be made, so that whenever a downed server comes back
up, the missing data comes back over to it?
Thanks and Regards,
Ajay
On Fri, Oct 23, 2015 at 9:05 AM, Michael Shuler <mich...@pbandjelly.org>
wrote:
Thanks a ton Anuja for the help !!!
On Fri, Oct 9, 2015 at 12:38 PM, anuja jain <anujaja...@gmail.com> wrote:
> Hi Ajay,
>
>
> On Fri, Oct 9, 2015 at 9:00 AM, Ajay Garg <ajaygargn...@gmail.com> wrote:
>>
> In this case, it will be the responsibility of AP
On Thu, Oct 8, 2015 at 9:47 AM, Ajay Garg <ajaygargn...@gmail.com> wrote:
> Thanks Eric for the reply.
>
>
> On Thu, Oct 8, 2015 at 1:44 AM, Eric Stevens <migh...@gmail.com> wrote:
>> If you're at 1 node (N=1) and RF=1 now, and you want to go N=3 RF=3, you
>>
give huge confidence-boosts in our endeavours.
Thanks and Regards,
Ajay
the schema, to allow a replication_factor of 3.
b) (As advised by you) Duplicating the already-existing data on the other 2 nodes.
c) Thereafter, let Cassandra create a total of 3 copies for every row of
new incoming data.
Once again, thanks a ton for the help !!
Thanks and Regards,
Ajay
> You
Thanks Mam for the reply.
I guess there is manual work needed to bring all the SSTable files
into one directory, so it doesn't really serve the purpose. So,
going the "vanilla" way might be simpler :)
Thanks anyways for the help !!!
Thanks and Regards,
Ajay
On Tue, Sep 15, 201
Thanks and Regards,
Ajay
On Tue, Sep 15, 2015 at 12:04 PM, Ajay Garg <ajaygargn...@gmail.com> wrote:
> Hi All.
>
> Taking snapshots sometimes works, sometimes doesn't.
> Following is the stacktrace whenever
Hi All.
Taking snapshots sometimes works, sometimes doesn't.
Following is the stacktrace whenever the process fails ::
##
ajay@ajay-HP-15-Notebook-PC:/var/lib/cassandra/data/instamsg$ nodetool
-h localhost
::
###
ajay@comp:~$ sudo netstat -apn | grep 9042
[sudo] password for admin:
tcp6       0      0 127.0.0.1:9042          :::*                    LISTEN      10169/java
hidden problem.
I am heartily grateful to everyone for bearing with me.
Thanks and Regards,
Ajay
On Tue, Sep 15, 2015 at 10:16 AM, Ajay Garg <ajaygargn...@gmail.com> wrote:
> Hi Jared.
>
> Thanks for your help.
>
> I made the config-changes.
> Also, I changed the seed
list of addresses.
# Ex: "<ip1>,<ip2>,<ip3>"
- seeds: "our.ip.address.here"
Following is the netstat output ::
####
ajay@comp:~$ sudo netstat -
requires a dedicated step.
So, if the schema has 100 "tables", we would need 100 steps.
Is it so? If yes, can the entire data be dumped/restored in one go?
Just asking, to save time, if it could :)
Thanks and Regards,
Ajay
Testing simple content, as my previous email bounced :(
--
Regards,
Ajay
::
###
ajay@comp:~$ cqlsh
Connection error: ('Unable to connect to any servers', {'127.0.0.1':
error(None, "Tried connecting to [('127.0.0.1', 9042)]. Last error:
Try with the correct IP address as below:
cqlsh 192.248.15.219 -u sinmin -p xx
CQL documentation -
http://docs.datastax.com/en/cql/3.0/cql/cql_reference/cqlsh.html
On Sun, Jul 19, 2015 at 2:00 PM, Chamila Wijayarathna
cdwijayarat...@gmail.com wrote:
Hello all,
After starting cassandra,
Any pointers on this?
In 2.1, isn't updating a counter in an UNLOGGED batch with a timestamp as
safe as any other column update at a given consistency level? (With a
timestamp, can a counter update be idempotent?)
Thanks
Ajay
On 09-Jul-2015 11:47 am, Ajay ajay.ga...@gmail.com wrote:
Hi,
What
known issues in 2.1 using
UNLOGGED batch for counter update?
Thanks
Ajay
has, is easy in HBase, but making it as
consistent as the rest of HBase is hard)
Thanks
Ajay
On May 29, 2015, at 12:09 PM, Ajay ajay.ga...@gmail.com wrote:
Hi,
I need some info on HBase vs Cassandra as a data store (in general plus
specific to time series data).
The comparison
for Cassandra over HBase. But this
alone is not a sufficient driver. *
Cheers,
Jens
On Mon, Jun 8, 2015 at 9:59 AM, Ajay ajay.ga...@gmail.com wrote:
Hi All,
Thanks for all the input. I posted the same question in HBase forum and
got more response.
Posting the consolidated list here.
Our
Hi,
I need some info on HBase vs Cassandra as a data store (in general plus
specific to time series data).
A comparison on the following would help:
1: features
2: deployment and monitoring
3: performance
4: anything else
Thanks
Ajay
cassandra nodes restart. Is the server's prepared statement
cache persisted or in memory? If it is in memory, how do we handle
stale prepared statements in the cache?
Thanks
Ajay
On Fri, May 15, 2015 at 6:28 PM, ja jaa...@gmail.com wrote:
Hi,
Isn't it a good to have feature for the java driver
Thanks everyone.
Basically we are looking at Hive because it supports advanced queries (CQL
is limited to the data model).
Does Stratio support something similar to Hive?
Thanks
Ajay
On Thu, May 7, 2015 at 10:33 PM, Andres de la Peña adelap...@stratio.com
wrote:
You may also find interesting https
Hi,
Does Apache Cassandra (not DSE) support Hive Integration?
I found a couple of open source efforts, but nothing is available currently.
Thanks
Ajay
Hi,
What are the guidelines on when to use STCS/DTCS/LCS? The preferred way is
to test with each of them and find the best fit. But are there some
guidelines or best practices (from experience) on which one to use when?
Thanks
Ajay
Adding Java driver forum.
We would also like to know more about this.
-
Ajay
On Wed, Apr 8, 2015 at 8:15 PM, Jack Krupansky jack.krupan...@gmail.com
wrote:
Just a couple of quick comments:
1. The driver is supposed to be doing availability and load balancing
already.
2. If your cluster is lightly
Hi,
Now that 2.0.13 is out, I don't see that the nodetool cleanup issue
(https://issues.apache.org/jira/browse/CASSANDRA-8718) has been fixed yet. The
bug shows priority Minor. Is anybody else facing this issue?
Thanks
Ajay
On Thu, Mar 12, 2015 at 11:41 PM, Robert Coli rc...@eventbrite.com wrote:
On Thu, Mar 12
Yes we see https://issues.apache.org/jira/browse/CASSANDRA-8716 in our
testing
Thanks
Ajay
On Tue, Mar 17, 2015 at 3:20 PM, Marcus Eriksson krum...@gmail.com wrote:
Do you see the segfault or do you see
https://issues.apache.org/jira/browse/CASSANDRA-8716 ?
On Tue, Mar 17, 2015 at 10:34 AM
Is there a separate forum for Opscenter?
Thanks
Ajay
On 11-Mar-2015 4:16 pm, Ajay ajay.ga...@gmail.com wrote:
Hi,
While adding a Cassandra node using OpsCenter (which is recommended), the
list of Cassandra versions (DataStax Community edition) shows only 2.0.9 and not
later versions in 2.0.x
that we are deploying Cassandra cluster for the 1st time (so
likely that we add more data centers/nodes and schema changes in the
initial few months)
Thanks
Ajay
On Thu, Jan 1, 2015 at 9:49 PM, Neha Trivedi nehajtriv...@gmail.com wrote:
Use 2.0.11 for production
On Wed, Dec 31, 2014 at 11:50 PM
Thanks Mark.
-
Ajay
On 12-Mar-2015 11:08 pm, Mark Reddy mark.l.re...@gmail.com wrote:
It's always good to run nodetool describecluster after a schema change,
this will show you all the nodes in your cluster and what schema version
they have. If they have different versions you have a schema
Thanks Nick.
Does it mean that only adding a new node with 2.0.10 or later is a
problem? Can a new node that was added manually be monitored from OpsCenter?
Thanks
Ajay
On 12-Mar-2015 10:19 pm, Nick Bailey n...@datastax.com wrote:
There isn't an OpsCenter specific mailing list no.
To answer your
Hi,
While adding a Cassandra node using OpsCenter (which is recommended), the
list of Cassandra versions (DataStax Community edition) shows only 2.0.9 and not
later versions in 2.0.x. Is there a reason behind it? Is 2.0.9 recommended over
2.0.11?
Thanks
Ajay
Hi,
Are there any steps to perform (like running nodetool or restarting a node) or any
precautions to take after schema changes are made to a column family, say adding a
new column or modifying any table properties?
Thanks
Ajay
?
Thanks
Ajay
On 02-Mar-2015 12:13 pm, Ankush Goyal ank...@gmail.com wrote:
Hi Ajay,
I would suggest looking at the approximate size of individual elements in
the batch, and based on that computing the max size (chunk size).
It's not really a straightforward calculation, so I would further suggest
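The size-based chunking suggested above can be sketched as follows. This is a minimal illustration under stated assumptions: the element size is taken to be the length of an already-serialized payload, the 100-byte budget is arbitrary, and chunk_by_size is a hypothetical helper rather than anything provided by the driver.

```python
def chunk_by_size(items, max_bytes, size_of=len):
    """Group items into chunks whose estimated total size stays within
    max_bytes. An item larger than max_bytes gets a chunk of its own."""
    chunks = []
    current = []
    current_size = 0
    for item in items:
        item_size = size_of(item)
        # Start a new chunk if adding this item would exceed the budget.
        if current and current_size + item_size > max_bytes:
            chunks.append(current)
            current = []
            current_size = 0
        current.append(item)
        current_size += item_size
    if current:
        chunks.append(current)
    return chunks


# Pretend these are serialized rows of 40, 70, 30, 100 and 10 bytes.
rows = [b"a" * 40, b"b" * 70, b"c" * 30, b"d" * 100, b"e" * 10]
chunks = chunk_by_size(rows, max_bytes=100)
print([len(chunk) for chunk in chunks])  # [1, 2, 1, 1]
```

Each resulting chunk would then be sent as one batch; the right budget is something to measure rather than derive.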
Hi Ankush,
We are already using prepared statements, and our use case is time series data
as well.
Thanks
Ajay
On 02-Mar-2015 10:00 pm, Ankush Goyal ank...@gmail.com wrote:
Ajay,
First of all, I would recommend using PreparedStatements, so you only
would be sending the variable bound arguments
-6487
How do we compute (approximately, using ColumnDefinitions or ColumnMetadata)
the size of a row of a column family from the client side using the Cassandra
Java driver?
Thanks
Ajay
and Cluster instance which I
think doesn't matter)?.
Thanks
Ajay
On Sat, Feb 28, 2015 at 9:17 AM, Ajay ajay.ga...@gmail.com wrote:
Thanks Olivier.
Most of the REST query calls would come from other applications to
write/read to/from Cassandra which means most queries from an application
would
driver code, the Cluster class stores the PreparedStatements as
a weak reference (to rebuild when a node is down or a new node added).
Thanks
Ajay
The syntax suggested by Ondrej does not work in some cases in 2.0.11, and
I have logged an issue for the same.
https://issues.apache.org/jira/browse/CASSANDRA-8797
Thanks
Ajay
On Feb 12, 2015 11:01 PM, Bulat Shakirzyanov
bulat.shakirzya...@datastax.com wrote:
Fixed my Mail.app settings so you can see
with the driver.
Thanks
Ajay
On Feb 12, 2015 8:22 PM, Eric Stevens migh...@gmail.com wrote:
Your page state then needs to track the last ck1 and last ck2 you saw.
Pages 2+ will end up needing to be up to two queries if the first query
doesn't fill the page size.
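The two-query idea described above can be sketched with an in-memory stand-in for the table. This assumes both clustering columns are sorted ascending within one partition; rows are (ck1, ck2) tuples, and the helper names are hypothetical. The two query_* functions stand in for the CQL statements named in their docstrings.

```python
def query_same_ck1(rows, last_ck1, last_ck2, limit):
    """Stand-in for: WHERE pk = ? AND ck1 = ? AND ck2 > ? LIMIT ?"""
    return [r for r in rows if r[0] == last_ck1 and r[1] > last_ck2][:limit]


def query_next_ck1(rows, last_ck1, limit):
    """Stand-in for: WHERE pk = ? AND ck1 > ? LIMIT ?"""
    return [r for r in rows if r[0] > last_ck1][:limit]


def next_page(rows, page_state, page_size):
    """Return (page, new_page_state); page_state is the last (ck1, ck2) seen."""
    if page_state is None:
        page = rows[:page_size]
    else:
        last_ck1, last_ck2 = page_state
        # First query: finish the ck1 value we stopped in.
        page = query_same_ck1(rows, last_ck1, last_ck2, page_size)
        # Second query only if the first one did not fill the page.
        if len(page) < page_size:
            page += query_next_ck1(rows, last_ck1, page_size - len(page))
    new_state = page[-1] if page else None
    return page, new_state


def collect_pages(rows, page_size):
    pages, state = [], None
    while True:
        page, state = next_page(rows, state, page_size)
        if not page:
            break
        pages.append(page)
    return pages


# Rows of one partition, already in clustering order (ck1 asc, ck2 asc).
partition = [(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (3, 1)]
pages = collect_pages(partition, page_size=2)
print(pages)
```

The second page here needs both queries: one row left under ck1 = 1, then one row from the next ck1 value.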
CREATE TABLE foo (
partitionkey int
for all preceding clustering key
columns. With this I cannot get the pagination correct.
Thanks
Ajay
I can't believe that everyone reads and processes all rows at once (without
pagination).
Probably not too many people try to read all rows in a table as a single
rolling operation with a standard client
Basically I am trying different queries with your approach.
One such query is like:
Select * from mycf where [condition on partition key] order by ck1 asc, ck2 desc
where ck1 and ck2 are clustering keys in that order.
How do we achieve pagination support here?
Thanks
Ajay
On Feb 11, 2015 11:16 PM
Thanks Alex.
But is there any workaround possible? I can't believe that everyone reads and
processes all rows at once (without pagination).
Thanks
Ajay
On Feb 10, 2015 11:46 PM, Alex Popescu al...@datastax.com wrote:
On Tue, Feb 10, 2015 at 4:59 AM, Ajay ajay.ga...@gmail.com wrote:
1) Java
if the query contains ORDER BY clause.
Is there any other way to achieve the pagination support?
Thanks
Ajay
Thanks Eric. I didn't know the point about the token aware routing.
But with points 2 and 3 I didn't notice much improvement with prepared
statements. I have 2 Cassandra nodes running in virtual boxes on the same
machine, and the test client runs on the same machine.
Thanks
Ajay
Prepared
Thanks Tyler Hobbs.
We need to capture which queries were run by a user in a session and the
time each took (we don't need the query plan or the like). Is that possible? With
an Authenticator we can capture only the session creation, right?
Thanks
Ajay
On Sat, Jan 10, 2015 at 6:07 AM, Tyler Hobbs ty
KEY((prodgroup), productid, status).
Thanks
Ajay
On Sat, Jan 10, 2015 at 6:03 AM, Tyler Hobbs ty...@datastax.com wrote:
Your proposed model for the table to handle the last query looks good, so
I would stick with that.
On Mon, Jan 5, 2015 at 5:45 AM, Nagesh nageswara.r...@gmail.com wrote
?. Also is it recommended to enable system_trace in production or to
know how many sessions started by a user?
Thanks
Ajay
use the token function in the CQL.
Thanks
Ajay
for a specific Ad Id(one or more may be).
How do we support 1 and 2 with the same data model? (I thought of using Ad ID +
hour as the partition key to avoid hotspots.)
Thanks
Ajay
On Wed, Jan 7, 2015 at 6:34 PM, Sylvain Lebresne sylv...@datastax.com
wrote:
On Wed, Jan 7, 2015 at 10:18 AM, Ajay
We noticed the same issue. cassandra-cli allows upper case or mixed case
keyspace names, but cqlsh automatically converts them to lower
case.
Thanks
Ajay
On Wed, Jan 7, 2015 at 9:44 PM, Harel Gliksman harelg...@gmail.com wrote:
Hi,
We have a Cassandra cluster with Keyspaces
Neha,
This is just for a trial setup. Anyway, thanks for the suggestion (more
than 1 seed node).
I figured out the problem. Node2 had the incorrect cluster name.
The error seems to be misleading though.
Thanks
Ajay Garga
On Mon, Jan 5, 2015 at 4:21 PM, Neha Trivedi nehajtriv
the
java.lang.RuntimeException: Unable to gossip with any seeds.
What could be the reason?
Thanks
Ajay
Thanks Eric.
Happy new year 2015 for all Cassandra developers and Users :). This group
seems the most active of apache big data projects.
Will come back with more questions :)
Thanks
Ajay
On Dec 31, 2014 8:02 PM, Eric Stevens migh...@gmail.com wrote:
You can totally avoid the impact
Hi All,
For my research and learning I am using Cassandra 2.1.2. But I see a couple
of mail threads discussing issues in 2.1.2. So what is the stable or popular
build for production in the Cassandra 2.x series?
Thanks
Ajay
)?
Thanks
Ajay
On Tue, Dec 30, 2014 at 10:37 PM, Janne Jalkanen janne.jalka...@ecyrd.com
wrote:
Hi!
Yes, since all the writes for a partition (or row if you speak Thrift)
always go to the same replicas, you will need to design to avoid hotspots -
a pure day row will cause all the writes
Hi,
Is it better to use a counter for user click counts than to create a
new row per click, keyed as user id : timestamp, and count the rows?
Basically we want to track the user clicks and use the same for
hourly/daily/monthly report.
Thanks
Ajay
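To make the row-per-click option concrete, here is a minimal sketch of rolling individual click events up into hourly, daily and monthly counts. It works purely in memory; the bucket key formats and function names are assumptions chosen for illustration, not a recommended schema.

```python
from collections import Counter
from datetime import datetime, timezone


def bucket_keys(ts):
    """Return the hour, day and month bucket labels for a click timestamp."""
    return {
        "hour": ts.strftime("%Y-%m-%d %H"),
        "day": ts.strftime("%Y-%m-%d"),
        "month": ts.strftime("%Y-%m"),
    }


def count_clicks(clicks):
    """clicks is an iterable of (user_id, timestamp) pairs, one per click."""
    counts = Counter()
    for user_id, ts in clicks:
        for granularity, key in bucket_keys(ts).items():
            counts[(user_id, granularity, key)] += 1
    return counts


clicks = [
    ("u1", datetime(2014, 12, 29, 10, 5, tzinfo=timezone.utc)),
    ("u1", datetime(2014, 12, 29, 10, 45, tzinfo=timezone.utc)),
    ("u1", datetime(2014, 12, 29, 11, 1, tzinfo=timezone.utc)),
    ("u2", datetime(2014, 12, 30, 9, 0, tzinfo=timezone.utc)),
]
counts = count_clicks(clicks)
print(counts[("u1", "hour", "2014-12-29 10")])  # 2
print(counts[("u1", "day", "2014-12-29")])      # 3
print(counts[("u1", "month", "2014-12")])       # 3
```

A counter column would keep only the totals per bucket, while the row-per-click approach keeps the raw events and derives the totals as above.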
Hi,
So you mean to say counters are not accurate? (It is highly likely that
multiple parallel threads will try to increment the counter as users click
the links.)
Thanks
Ajay
On Mon, Dec 29, 2014 at 4:49 PM, Janne Jalkanen janne.jalka...@ecyrd.com
wrote:
Hi!
It’s really a tradeoff between
the data will be huge with more writes
and more rows to count for reads right?
Thanks
Ajay
On Mon, Dec 29, 2014 at 7:10 PM, Alain RODRIGUEZ arodr...@gmail.com wrote:
Hi Ajay,
Here is a good explanation you might want to read.
http://www.datastax.com/dev/blog/whats-new-in-cassandra-2-1-a-better
Thanks.
I went through some articles which mentioned that the client should pass the
timestamp for inserts and updates. Is there any way we can avoid that and have
Cassandra assume the current time of the server?
Thanks
Ajay
On Dec 26, 2014 10:50 PM, Eric Stevens migh...@gmail.com wrote:
Timestamps
Hi,
If the nodes of a Cassandra ring are in different timezones, could it affect
counter columns, as they depend on the timestamp?
Thanks
Ajay
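On the timezone part of this question: write timestamps in Cassandra are, to my knowledge, expressed as microseconds since the Unix epoch, and an epoch offset does not depend on the local timezone. The sketch below only demonstrates that property in Python; the chosen instant and the +05:30 offset are arbitrary examples, and it says nothing about clock drift between nodes, which is a separate concern.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def epoch_micros(dt):
    """Microseconds since the Unix epoch for a timezone-aware datetime."""
    return (dt - EPOCH) // timedelta(microseconds=1)


# The same instant, expressed in UTC and in a +05:30 local timezone.
utc_time = datetime(2014, 12, 26, 12, 0, 0, tzinfo=timezone.utc)
local_zone = timezone(timedelta(hours=5, minutes=30))
local_time = utc_time.astimezone(local_zone)

print(local_time.hour, local_time.minute)                    # 17 30
print(epoch_micros(utc_time) == epoch_micros(local_time))    # True
```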
the time
taken to accept input, process and respond back. If latency is higher, how
come the throughput is high?
Thanks
Ajay
Thanks Thomas for the clarification.
If I use the consistency level of QUORUM for reads and writes, the latency
would affect the throughput, right?
Thanks
Ajay
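One way to see why higher latency does not have to mean lower throughput is Little's law: throughput is roughly the number of requests in flight divided by the latency of each. The sketch below uses made-up numbers and a hypothetical function name; it is an idealized model that ignores saturation on the server side.

```python
def throughput_per_second(concurrent_requests, latency_ms):
    """Idealized throughput from Little's law:
    requests in flight divided by the time each one takes."""
    return concurrent_requests * 1000 / latency_ms


# A single-threaded client with 5 ms latency per request.
print(throughput_per_second(1, 5))    # 200.0
# Fifty concurrent requests, each twice as slow (10 ms).
print(throughput_per_second(50, 10))  # 5000.0
```

In this model, doubling the latency halves the throughput only if concurrency stays fixed, which is why a client that keeps many requests in flight can still reach high throughput.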
On Fri, Dec 26, 2014 at 11:15 AM, Job Thomas j...@suntecgroup.com wrote:
Hi,
First of all,the write latency of cassandra is not high
Hi Thomas,
I am a little confused when you say multithreaded client. Actually we don't
explicitly invoke reads on multiple servers (for replicated data) from the
client code. So how does a multithreaded client fix this?
Thanks
Ajay
On Fri, Dec 26, 2014 at 12:08 PM, Job Thomas j...@suntecgroup.com
)
|
Streaming processing (Spark Streaming or Storm. Yet to be decided.
Spark streaming is relatively new)
|
MySQL/Mongo/Real Time data
Since we are planning to build it as a service, we cannot consider a
particular data access pattern.
Thanks
Ajay
On Thu, Dec 18
also plans to support Spark SQL soon). The idea is to abstract the
business users from the storage solutions (more than one. Cassandra/HBase
Mongo).
Thanks
Ajay
On Thu, Dec 18, 2014 at 8:01 PM, Peter Lin wool...@gmail.com wrote:
by data warehouse, what kind do you mean?
is it the traditional
will
have writes and reads (but reads will be more say 40% writes and 60%
reads). We are planning to use Spark as the in memory computation engine.
Thanks
Ajay
) It takes around 0.6 seconds using Spark (either SELECT * FROM users WHERE
name='Anna' or javaFunctions(sc).cassandraTable("test", "people",
mapRowTo(Person.class)).where("name=?", "Anna")).
Please let me know if I am missing something in Spark configuration or
Cassandra-Spark Driver.
Thanks
Ajay Garga