newbie question regarding sorted data process, and sequential match of records

2015-03-16 Thread Troy X
Hi Experts, I'm trying to transform couple of thousands delimited files that is stored on HDFS using PIG. Each file is between 20 to 200 MB in size. The files have very simple column definitions like event history ; TimeStamp, Location, Source, Target, EventType,Description The logic is as foll

Cassandra-Pig integration

2015-03-16 Thread Ronald Green
Hi, I'm trying to read (and write) data with Pig 0.12.1 on an HDP 1.3 cluster (single node dev environment for now) from a Cassandra 2.0.12 cluster (on other machines). I built Cassandra 2.0.12 on my HDP 1.3 cluster and managed to use cqlsh on the cluster to connect to the Cassandra cluster, creat