thought it was only SQL-to-Hadoop
can it also dump from HBase to SQL?
how do I take care of transformations?
On Thu, Jun 23, 2011 at 2:33 PM, Ted Yu wrote:
> Have you checked out sqoop ?
>
> On Thu, Jun 23, 2011 at 11:32 AM, Vishal Kapoor
> wrote:
>
> > Looking for tool
Looking for tools to help export Data from HBase to MySQL,
will need some transformations before inserting.
thanks,
Vishal
We were wondering if it's advisable to provision HBase/Hadoop nodes as VMware
instances?
any suggestions?
thanks,
Vishal
fairly efficient.
>
> Dave
>
>
> -----Original Message-----
> From: Vishal Kapoor [mailto:vishal.kapoor...@gmail.com]
> Sent: Tuesday, April 12, 2011 8:29 AM
> To: user@hbase.apache.org
> Subject: Append value to a cell?
>
> Do we have any API which can append text value
rebuilding the entire reverse lookup time
from ground up.
thanks,
Vishal Kapoor
ving it or switching to a design that is proven
to be working on this kind of problem will help me.
thanks,
Vishal Kapoor
On Wed, Apr 6, 2011 at 12:56 PM, Stack wrote:
> On Wed, Apr 6, 2011 at 5:12 AM, Vishal Kapoor
> wrote:
>> I am getting shuffled rows? is there a problem at my end somew
I am getting shuffled rows? is there a problem at my end somewhere? we
did some manual split of tables.
have a scoreboard kind of code for staged processing of table based on
it, which is going for a toss.
thanks,
Vishal
two questions,
1) if I give number of versions for a family as 365*3 is it a bad
design? how many versions are a good practice? if I have too many
versions will that be a single seek when I get the row Id? if yes,
will it take longer to store data? pros and cons?
2) how do I get the number of ver
I have my row id starting with a reverse time stamp so that I can get
to the latest data first, but I also need to process data as it comes
in, is there any way I can get scan to give me data from bottom up?
thanks,
Vishal
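
A minimal sketch of the reverse-timestamp idea above, in plain Python with no HBase client involved. It assumes the key is Java's Long.MAX_VALUE minus the event time, zero-padded so that string order matches numeric order; the helper names are hypothetical. It shows why a forward scan sees the newest rows first, and one possible workaround for arrival-order processing: read a bounded batch and reverse it on the client.

```python
MAX_LONG = 2**63 - 1  # same value as Java's Long.MAX_VALUE


def reverse_ts_key(ts_millis: int, suffix: str = "") -> str:
    """Build a row key that sorts newest-first under lexicographic ordering."""
    reversed_ts = MAX_LONG - ts_millis
    # Zero-pad to 19 digits so that string order matches numeric order.
    return f"{reversed_ts:019d}{suffix}"


def original_ts(row_key: str) -> int:
    """Recover the event time from the first 19 characters of the key."""
    return MAX_LONG - int(row_key[:19])


timestamps = [1301000000000, 1301000005000, 1301000009000]

# Sorting the keys imitates the order a forward scan would return them in.
keys = sorted(reverse_ts_key(ts) for ts in timestamps)
newest_first = [original_ts(k) for k in keys]

# Client-side reversal of a bounded batch gives arrival order.
oldest_first = list(reversed(newest_first))
```

Reversing on the client only works for batches small enough to hold in memory; for a full table it may be simpler to keep a second key layout in arrival order.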
Ted,
thanks for the quick response
vk
On Mon, Mar 28, 2011 at 6:32 PM, Ted Yu wrote:
> See the 'min, max' discussion on hbase user mailing list
>
> On Mon, Mar 28, 2011 at 3:28 PM, Vishal Kapoor
> wrote:
>>
>> How do I find the first and last row id
How do I find the first and last row id of the table in HBase?
I tried
Htable.getStopKeys() and Htable.getEndKeys()
does not work,
all I need is the absolute start/stop row id of a table at a given
point in time.
any direction?
thanks,
vk
second job startup you can read that meta data and set your scan
> /input splits appropriately?
> Dave
>
> -----Original Message-----
> From: Vishal Kapoor [mailto:vishal.kapoor...@gmail.com]
> Sent: Friday, March 25, 2011 11:21 AM
> To: user@hbase.apache.org
> Subject:
scan(start,stop)
typically data comes in a few tables for us and we start crunching it
and then adding some more data to main tables like info etc to get rid
of table joins.
a lightweight framework will do better than a typical workflow management tool.
thanks,
Vishal Kapoor
Can't use JobControl.
>
>
> -----Original Message-----
> From: Vishal Kapoor [mailto:vishal.kapoor...@gmail.com]
> Sent: Tuesday, March 22, 2011 1:34 PM
> To: user@hbase.apache.org
> Subject: JobControl and HBase MR chaining
>
> with JobControl not rewritten under mapr
with JobControl not rewritten under mapreduce and still living
under mapred (Hadoop version 0.20.2), expecting subclasses of Mapper
from mapred,
how do we use TableMapper and TableReducer to chain jobs?
I am utterly confused, any ray of hope?
thanks,
Vishal
you should have more info on why dfs is in the safe mode in the logs,
you can always leave safe mode
hadoop dfsadmin -safemode leave
but again, that's a symptom, not a problem.
Vishal
On Thu, Mar 17, 2011 at 1:55 PM, Taylor, Ronald C wrote:
> Folks,
>
> We had a power outage here, and we are trying
If I have a bunch of MRs and I want to keep a tab on what they should
process in terms of scope of row ids
eg,
first run :
startRow1
stopRowN
second run
startRowN+1
stopRowM
and similar for others,
is there any lightweight way to accomplish this?
thanks,
vk
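
One lightweight way to keep such a tab is a small checkpoint file that remembers where each job last stopped. The sketch below is plain Python with no HBase or MapReduce dependency; the class name, job name, file name and row ids are all made up for illustration. It assumes the scan treats the stop row as exclusive, as HBase's Scan(startRow, stopRow) does, so the next run can start exactly at the previous stop row instead of at N+1.

```python
import json
import tempfile
from pathlib import Path
from typing import Optional, Tuple


class RangeCheckpoint:
    """Remembers, per job name, the stop row of the last completed run."""

    def __init__(self, path: Path) -> None:
        self.path = path

    def _load(self) -> dict:
        # An absent file simply means no job has committed yet.
        return json.loads(self.path.read_text()) if self.path.exists() else {}

    def next_range(self, job: str, new_stop: str) -> Tuple[Optional[str], str]:
        """Return (start, stop) for the next run; start is None on the first run."""
        return self._load().get(job), new_stop

    def commit(self, job: str, stop: str) -> None:
        """Record the stop row; call this only after the job has succeeded."""
        state = self._load()
        state[job] = stop
        self.path.write_text(json.dumps(state))


with tempfile.TemporaryDirectory() as tmp:
    checkpoint = RangeCheckpoint(Path(tmp) / "ranges.json")

    first = checkpoint.next_range("enrich", "row-0500")   # first run: no start row
    checkpoint.commit("enrich", first[1])

    second = checkpoint.next_range("enrich", "row-0900")  # resumes at the old stop row
```

Committing only after a successful run means a failed job is simply retried over the same range. The same state could just as well live in a small HBase table instead of a file.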
t 9:44, Amandeep Khurana wrote:
>
> > You can scan through one table and see if the other one has those rowids
> or
> > not.
> >
> > On Thu, Mar 10, 2011 at 8:08 PM, Vishal Kapoor
> > wrote:
> >
> >> Friends,
> >> how do I best achieve
Friends,
how do I best achieve intersection of sets of row ids
suppose I have two tables with similar row ids
how can I get the row ids present in one and not in the other?
do things get better if I have row ids as values in some qualifier,
or as the qualifier itself?
I hope the question is not too confusi
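
Since a scan returns row ids in sorted order, one possible approach to the "present in one and not in the other" question is a single merge pass over two scans. The sketch below is plain Python over two sorted lists standing in for the scans; the function name and the sample row ids are hypothetical. It assumes ASCII row ids, so that Python string order agrees with HBase's byte order.

```python
from typing import Iterable, Iterator


def only_in_left(left: Iterable[str], right: Iterable[str]) -> Iterator[str]:
    """Yield keys present in `left` but not in `right`; both must be sorted ascending."""
    right_iter = iter(right)
    current = next(right_iter, None)
    for key in left:
        # Advance the right side until it catches up with the left key.
        while current is not None and current < key:
            current = next(right_iter, None)
        if current is None or current != key:
            yield key


table_a = ["r01", "r02", "r05", "r07"]
table_b = ["r02", "r03", "r07", "r09"]

missing_from_b = list(only_in_left(table_a, table_b))
missing_from_a = list(only_in_left(table_b, table_a))
```

Each input is read once and nothing is buffered, so the cost is one pass over each table regardless of how many row ids they share.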
dvisable...
thanks,
Vishal Kapoor
atisfied from this dataset?
>
> Thanks
> Jai
>
>
> On 2/22/11 12:16 AM, "Vishal Kapoor" wrote:
>
> Friends,
>
> I was deciding on table designs and came up with a problem where I needed
> parent child relationship, a tree kind of dataset.
> I did look a f
epresent
Countries/States/Zips etc
just wanted to get feedback from friends here...
thanks,
Vishal Kapoor
his
> at least would get rid of the block cache churning and give us a
> better picture of what's going on in the logs.
>
> J-D
>
> On Thu, Feb 17, 2011 at 9:20 AM, Vishal Kapoor
> wrote:
> > J-D,
> > I do not see any significant improvement on combining the
t does (because of HBASE-3149), but given
> the current situation it's hard to recommend multiple families.
>
> J-D
>
> On Wed, Feb 16, 2011 at 4:32 PM, Vishal Kapoor
> wrote:
> > thanks J-D. for all your help, I will combine the three families and
> > re-base
single family approach.
Vishal
On Wed, Feb 16, 2011 at 6:53 PM, Jean-Daniel Cryans wrote:
> I don't understand... is having the same qualifier a hard requirement?
> Worst case you could have a prefix.
>
> J-D
>
> On Wed, Feb 16, 2011 at 3:29 PM, Vishal Kapoor
> wrote:
ten there's no need for more than 1 family, I would suggest you
> explore that possibility first.
>
> J-D
>
> On Wed, Feb 16, 2011 at 3:13 PM, Vishal Kapoor
> wrote:
> > does that mean I am only left with the choice of writing to the three
> > families in three diff
alse) before passing it to
> TableMapReduceUtil.initTableMapperJob
>
> Finally, you may want to give more memory to the region server.
>
> J-D
>
> On Wed, Feb 16, 2011 at 7:35 AM, Vishal Kapoor
> wrote:
> > Lars,
> >
> > I am still working on pseudo distributed.
> > h
ACHE => 'true'}, {NAME => 'B',
BLOOMFILTER => 'NONE', REPLICATION_SCOPE => '0', VERSIONS => '1',
COMPRESSION => 'NONE', TTL => '2147483647', BLOCKSIZE => '65536', IN_MEMORY
=> 'false
, single=76.48 MB, multi=90.35 MB,
memory=166.09 KB
I haven't changed anything including the table definitions.
please let me know where to look...
thanks,
Vishal Kapoor
I think it boils down to how can I make a Put out of a Result object
barring the rowid...
thanks,
Vishal
On Mon, Jan 24, 2011 at 2:37 AM, Vishal Kapoor
wrote:
> I have table tableCombined : family 'live', family 'a', family 'b', family
> 'c'
I have table tableCombined : family 'live', family 'a', family 'b', family
'c'
and also have almost static tables below
tableA with family 'a'
tableB with family 'b'
tableC with family 'c'
what is the best way to take a Get of family 'a', from tableA and Put
it to family 'a' of tableCombined whil
31 matches