Re: export data from HBase to mysql

2011-06-23 Thread Vishal Kapoor
thought it was only SQL-to-Hadoop can it also dump from HBase to SQL? how do I take care of transformations? On Thu, Jun 23, 2011 at 2:33 PM, Ted Yu wrote: > Have you checked out sqoop ? > > On Thu, Jun 23, 2011 at 11:32 AM, Vishal Kapoor > wrote: > > > Looking for tool

export data from HBase to mysql

2011-06-23 Thread Vishal Kapoor
Looking for tools to help export Data from HBase to MySQL, will need some transformations before inserting. thanks, Vishal

VMWare and Hadoop/Hbase

2011-05-09 Thread Vishal Kapoor
We were wondering if its advisable to provision hbase/hadoop nodes as VMWare instances? any suggestions? thanks, Vishal

Re: Append value to a cell?

2011-04-12 Thread Vishal Kapoor
fairly efficient. > > Dave > > > -----Original Message- > From: Vishal Kapoor [mailto:vishal.kapoor...@gmail.com] > Sent: Tuesday, April 12, 2011 8:29 AM > To: user@hbase.apache.org > Subject: Append value to a cell? > > Do we have any API which can append text value

Append value to a cell?

2011-04-12 Thread Vishal Kapoor
rebuilding the entire reverse lookup time from ground up. thanks, Vishal Kapoor

Re: does Scan guarantee min to max rows?

2011-04-12 Thread Vishal Kapoor
ving it or switching to a design that is proven to be working on this kind of problem will help me. thanks, Vishal Kapoor On Wed, Apr 6, 2011 at 12:56 PM, Stack wrote: > On Wed, Apr 6, 2011 at 5:12 AM, Vishal Kapoor > wrote: >> I am getting shuffled rows? is there a problem at my end somew

does Scan guarantee min to max rows?

2011-04-06 Thread Vishal Kapoor
I am getting shuffled rows? is there a problem at my end somewhere? we did some manual split of tables. have a scoreboard kind of code for staged processing of table based on it, which is going for a toss. thanks, Vishal

versions stored in a cell

2011-04-03 Thread Vishal Kapoor
two questions, 1) if I give number of versions for a family as 365*3 is it a bad design? how many versions are a good practice? if I have two many versions will that be a single seek when I get the row Id? if yes, will it take longer to store data? pros and cons? 2) how do I get the number of ver

Scan rows in reverse order?

2011-04-03 Thread Vishal Kapoor
I have my row id starting with a reverse time stamp so that I can get to the latest data first, but I also need to process data as it comes in, is there any way I can get scan to give me data from bottom up? thanks, Vishal

Re: HTable first and last row?

2011-03-28 Thread Vishal Kapoor
Ted, thanks for the quick response vk On Mon, Mar 28, 2011 at 6:32 PM, Ted Yu wrote: > See the 'min, max' discussion on hbase user mailing list > > On Mon, Mar 28, 2011 at 3:28 PM, Vishal Kapoor > wrote: >> >> How do I find the first and last row id

HTable first and last row?

2011-03-28 Thread Vishal Kapoor
How do I find the first and last row id of the table in Hbase? I tried Htable.getStopKeys() and Htable.getEndKeys() does not work, all I need is the absolute start/stop row id of a table at a given point in time. any direction? thanks, vk

Re: Observer/Observable MapReduce

2011-03-25 Thread Vishal Kapoor
second job startup you can read that meta data and set your scan > /input splits appropriately? > Dave > > -Original Message----- > From: Vishal Kapoor [mailto:vishal.kapoor...@gmail.com] > Sent: Friday, March 25, 2011 11:21 AM > To: user@hbase.apache.org > Subject:

Observer/Observable MapReduce

2011-03-25 Thread Vishal Kapoor
scan(start,stop) typically data comes in a few tables for us and we start crunching it and then adding some more data to man tables like info etc to get rid of table joins. a light weight framework will do better than a typical workflow management tool. thanks, Vishal Kapoor

Re: JobControl and HBase MR chaining

2011-03-22 Thread Vishal Kapoor
Can't use JobControl. > > > -Original Message- > From: Vishal Kapoor [mailto:vishal.kapoor...@gmail.com] > Sent: Tuesday, March 22, 2011 1:34 PM > To: user@hbase.apache.org > Subject: JobControl and HBase MR chaining > > with JobControl not rewritten under mapr

JobControl and HBase MR chaining

2011-03-22 Thread Vishal Kapoor
with JobControl not rewritten under mapreduce and continues to be under mapred( hadoop version 20.2) expecting subclasses of Mapper from mapred how do we use TableMapper and TableReducer to chain jobs? I am utterly confused, any ray of hope? thanks, Vishal

Re: problem bringing Hbase back up after power outage and removal of nodes

2011-03-17 Thread Vishal Kapoor
you should have more info on why dfs is in the safe mode in the logs, you can always leave safe mode hadoop dfs -safemode leave but again, thats a symptom, not a problem. Vishal On Thu, Mar 17, 2011 at 1:55 PM, Taylor, Ronald C wrote: > Folks, > > We had a power outage here, and we are trying

Scoreboard kind of functionality

2011-03-17 Thread Vishal Kapoor
If I have a bunch of MRs and I want to keep a tab on what they should process in terms of scope of row ids eg, first run : startRow1 stopRowN second run startRowN+1 stopRowM and similar for others, is there any light weight accomplish this? thanks, vk

Re: intersection of row ids

2011-03-11 Thread Vishal Kapoor
t 9:44, Amandeep Khurana wrote: > > > You can scan through one table and see if the other one has those rowids > or > > not. > > > > On Thu, Mar 10, 2011 at 8:08 PM, Vishal Kapoor > > wrote: > > > >> Friends, > >> how do I best achieve

intersection of row ids

2011-03-10 Thread Vishal Kapoor
Friends, how do I best achieve intersection of sets of row ids suppose I have two tables with similar row ids how can I get the row ids present in one and not in the other? does things get better if I have row ids as values in some qualifier/ qualifier itself? I hope the question is not too confusi

async table updates?

2011-02-23 Thread Vishal Kapoor
dvisable... thanks, Vishal Kapoor

Re: Representing Hierarchy in HBase

2011-02-21 Thread Vishal Kapoor
atisfied from this datatset? > > Thanks > Jai > > > On 2/22/11 12:16 AM, "Vishal Kapoor" wrote: > > Friends, > > I was deciding on table designs and came up with a problem where I needed > parent child relationship, a tree kind of dataset. > I did look a f

Representing Hierarchy in HBase

2011-02-21 Thread Vishal Kapoor
epresent Countries/States/Zips etc just wanted to get feedback from friends here... thanks, Vishal Kapoor

Re: Hbase inserts very slow

2011-02-17 Thread Vishal Kapoor
his > at least would get rid of the block cache churning and give us a > better picture of what's going on in the logs. > > J-D > > On Thu, Feb 17, 2011 at 9:20 AM, Vishal Kapoor > wrote: > > J-D, > > I do not see any significant improvement on combining the

Re: Hbase inserts very slow

2011-02-17 Thread Vishal Kapoor
t does (because of HBASE-3149), but given > the current situation it's hard to recommend multiple families. > > J-D > > On Wed, Feb 16, 2011 at 4:32 PM, Vishal Kapoor > wrote: > > thanks J-D. for all your help, I will combine the three families and > > re-base

Re: Hbase inserts very slow

2011-02-16 Thread Vishal Kapoor
single family approach. Vishal On Wed, Feb 16, 2011 at 6:53 PM, Jean-Daniel Cryans wrote: > I don't understand... is having the same qualifier a hard requirement? > Worst case you could have a prefix. > > J-D > > On Wed, Feb 16, 2011 at 3:29 PM, Vishal Kapoor > wrote:

Re: Hbase inserts very slow

2011-02-16 Thread Vishal Kapoor
ten there's no need for more than 1 family, I would suggest you > explore that possibility first. > > J-D > > On Wed, Feb 16, 2011 at 3:13 PM, Vishal Kapoor > wrote: > > does that mean I am only left with the choice of writing to the three > > families in three diff

Re: Hbase inserts very slow

2011-02-16 Thread Vishal Kapoor
alse) before passing it to > TableMapReduceUtil.initTableMapperJob > > Finally, you may want to give more memory to the region server. > > J-D > > On Wed, Feb 16, 2011 at 7:35 AM, Vishal Kapoor > wrote: > > Lars, > > > > I am still working on pseudo distributed. > > h

Re: Hbase inserts very slow

2011-02-16 Thread Vishal Kapoor
ACHE => 'true'}, {NAME => 'B', BLOOMFILTER => 'NONE', REPLICATION_SCOPE => '0', VERSIONS => '1', COMPRESSION => 'NONE', TTL => '2147483647', BLOCKSIZE => '65536', IN_MEMORY => 'false&#x

Hbase inserts very slow

2011-02-15 Thread Vishal Kapoor
, single=76.48 MB, multi=90.35 MB, memory=166.09 KB I haven't changed anything including the table definitions. please let me know where to look... thanks, Vishal Kapoor

Re: Get to Put

2011-01-24 Thread Vishal Kapoor
I think it boils down to how can I make a Put out of a Result object barring the rowid... thanks, Vishal On Mon, Jan 24, 2011 at 2:37 AM, Vishal Kapoor wrote: > I have table tableCombined : family 'live', family 'a', family 'b', family > 'c'

Get to Put

2011-01-23 Thread Vishal Kapoor
I have table tableCombined : family 'live', family 'a', family 'b', family 'c' and also have almost static tables below tableA with family 'a' tableB with family 'b' tableC with family 'c' what is the best way to take a Get of family 'a', from tableA and Put it to family 'a' of tableCombined whil