Re: Storing images in Hbase

2013-01-10 Thread shashwat shriparv
Hi Kavish, i have a better idea for you copy your image files to a single file on hdfs, and if new image comes append it to the existing image, and keep and update the metadata and the offset to the HBase. Because if you put bigger image in hbase it wil lead to some issue. ∞ Shashwat Shriparv

Re: Maximizing throughput

2013-01-10 Thread anil gupta
Sorry, I meant to ask about "setAutoFlush". Is setAutoFlush true or false? On Thu, Jan 10, 2013 at 8:42 PM, anil gupta wrote: > Is flushCommits true or false? > > > On Thu, Jan 10, 2013 at 8:40 PM, Anoop Sam John wrote: > >> Hi >>You mind telling the configs that you changed and set? BTW

Re: Maximizing throughput

2013-01-10 Thread anil gupta
Is flushCommits true or false? On Thu, Jan 10, 2013 at 8:40 PM, Anoop Sam John wrote: > Hi >You mind telling the configs that you changed and set? BTW which > version of HBase you are using? > > -Anoop- > > From: Bryan Keller [brya...@gmail.com] >

Re: persistence in Hbase

2013-01-10 Thread shashwat shriparv
Why do you really want to switch the system where your precious data is lying and also all 10 nodes at a time. i am not really sure of data lost, many hase gave the answer on that prospect, but what i can say there may be data corruption if you don't follow the correct handling of the cluster. :) r

RE: Maximizing throughput

2013-01-10 Thread Anoop Sam John
Hi You mind telling the configs that you changed and set? BTW which version of HBase you are using? -Anoop- From: Bryan Keller [brya...@gmail.com] Sent: Friday, January 11, 2013 10:01 AM To: user@hbase.apache.org Subject: Maximizing throughput I am

Maximizing throughput

2013-01-10 Thread Bryan Keller
I am attempting to configure HBase to maximize throughput, and have noticed some bottlenecks. In particular, with my configuration, write performance is well below theoretical throughput. I have a test program that inserts many rows into a test table. Network I/O is less than 20% of max, and dis

Re: persistence in Hbase

2013-01-10 Thread anil gupta
Never mind i got the Jira: https://issues.apache.org/jira/browse/HBASE-5954 On Thu, Jan 10, 2013 at 8:16 PM, anil gupta wrote: > Hi Lars, > > Yes, that is true. I also came to know about it few days ago that data is > present in Memory(rather than persistent storage) of 3 DataNode if the > repli

Re: persistence in Hbase

2013-01-10 Thread anil gupta
Hi Lars, Yes, that is true. I also came to know about it few days ago that data is present in Memory(rather than persistent storage) of 3 DataNode if the replication factor is 3. In case of disaster like entire data center failure there might be some data loss. But these kinds of disaster are very

Re: persistence in Hbase

2013-01-10 Thread lars hofhansl
Not entirely true, though. Data is not sync'ed to disk, but only distributed to all HDFS replicas. During a power outage event across all HDFS failure zones (such as a data center) you can lose data. -- Lars - Original Message - From: anil gupta To: user@hbase.apache.org Cc: Sent: T

Re: Storing images in Hbase

2013-01-10 Thread lars hofhansl
Interesting. That's close to a PB if my math is correct. Is there a write up about this somewhere? Something that we could link from the HBase homepage? -- Lars - Original Message - From: Jack Levin To: user@hbase.apache.org Cc: Andrew Purtell Sent: Thursday, January 10, 2013 9:24 AM

Re: persistence in Hbase

2013-01-10 Thread lars hofhansl
The answer here is actually not so simple. Durable sync was only added to HDFS with HDFS-744, and I have not gotten to make matching changes to HBase, yet. Before HDFS-744 there was only hflush, which guarantees that the data reached all replica datanodes (3 by default), but not that the data is

HDFS disk space requirements

2013-01-10 Thread Panshul Whisper
Hello, I have a 5 node hadoop cluster and a fully distributed Hbase setup on the cluster with 130 GB of HDFS space avaialble. HDFS replication is set to 5. I have a total of 115 GB of JSON files that need to be loaded into the Hbase database and then they have to processed. So is the available H

Embeded thrift server performance

2013-01-10 Thread Lukáš Drbal
Hi, works somebody on "TODO: Override more methods to short-circuit for performance" from src/main/java/org/apache/hadoop/hbase/regionserver/HRegionThriftServer.java:110 (path and line number it's from 0.94 branch)? Today i take a look on this code and when i implement thrift method GetRow, i have

Re: Storing images in Hbase

2013-01-10 Thread Michael Segel
Been there, done that... kind of an interesting problem... Someone earlier said that HBase isn't good for images. It works pretty well, again it depends on the use case. Your schema is also going to play a role and you're going to have to tune things a little differently because when you pull

Re: persistence in Hbase

2013-01-10 Thread Mohammad Tariq
Hello Anil, You are absolutely correct and I haven't faced any such situation till date. I think I have made a controversial statement via my reply :) Apologies for the mess. Thank you Warm Regards, Tariq https://mtariq.jux.com/ On Fri, Jan 11, 2013 at 4:08 AM, anil gupta wrote: > Hi M

Re: persistence in Hbase

2013-01-10 Thread anil gupta
Hi Mohammad, If the Write Ahead Log(WAL) is "turned on" then in **NO** case data should be lost. HBase is strongly-consistent. If you know of any case when WAL is turned on and data is lost then IMO that's a Critical bug in HBase. Thanks, Anil Gupta On Thu, Jan 10, 2013 at 7:37 AM, Mohit Anchlia

Re: Storing images in Hbase

2013-01-10 Thread Mohammad Tariq
Thanks Leonid. Warm Regards, Tariq https://mtariq.jux.com/ On Fri, Jan 11, 2013 at 2:15 AM, Leonid Fedotov wrote: > I'm voting for continuing here as well… > So, location is up to Jack. :) > > Thank you! > > Sincerely, > Leonid Fedotov > > On Jan 10, 2013, at 11:24 AM, Mohammad Tariq wrote: > >

Re: Storing images in Hbase

2013-01-10 Thread Leonid Fedotov
I'm voting for continuing here as well… So, location is up to Jack. :) Thank you! Sincerely, Leonid Fedotov On Jan 10, 2013, at 11:24 AM, Mohammad Tariq wrote: > Jack, Leonid, > >I request you guys to please continue the discussion > through the thread itself if possible for you both. I wo

Re: Storing images in Hbase

2013-01-10 Thread Marcos Ortiz
This is a very interesting setup to analyze. I´m working in a similar problem with HBase, so, any help is welcome. El 10/01/2013 16:39, Doug Meil escribió: +1. This question comes up enough on the dist-list it's worth getting some pointers on record. On 1/10/13 2:24 PM, "Mohammad Tariq"

Re: Storing images in Hbase

2013-01-10 Thread Doug Meil
+1. This question comes up enough on the dist-list it's worth getting some pointers on record. On 1/10/13 2:24 PM, "Mohammad Tariq" wrote: >Jack, Leonid, > >I request you guys to please continue the discussion >through the thread itself if possible for you both. I would >like to know a

Re: Storing images in Hbase

2013-01-10 Thread Mohammad Tariq
Jack, Leonid, I request you guys to please continue the discussion through the thread itself if possible for you both. I would like to know about Jack's setup. I too find it quite interesting. Many thanks. Warm Regards, Tariq https://mtariq.jux.com/ On Fri, Jan 11, 2013 at 12:50 AM, Leonid

Re: Storing images in Hbase

2013-01-10 Thread Jean-Marc Spaggiari
It might be interesting to share that here, just in case someone else is facing the same usecase? JM 2013/1/10, Leonid Fedotov : > Jack, > yes, this is very interesting to know your setup details. > Could you please provide more information? > Or we can take this off the list if you like… > > Tha

Re: Storing images in Hbase

2013-01-10 Thread Leonid Fedotov
Jack, yes, this is very interesting to know your setup details. Could you please provide more information? Or we can take this off the list if you like… Thank you! Sincerely, Leonid Fedotov On Jan 10, 2013, at 9:24 AM, Jack Levin wrote: > We stored about 1 billion images into hbase with file si

Re: Storing images in Hbase

2013-01-10 Thread Jack Levin
We stored about 1 billion images into hbase with file size up to 10MB. Its been running for close to 2 years without issues and serves delivery of images for Yfrog and ImageShack. If you have any questions about the setup, I would be glad to answer them. -Jack On Sun, Jan 6, 2013 at 1:09 PM, Mo

Re: persistence in Hbase

2013-01-10 Thread Mohit Anchlia
Data also gets written in WAL. See: http://hbase.apache.org/book/perf.writing.html On Thu, Jan 10, 2013 at 7:36 AM, ramkrishna vasudevan < ramkrishna.s.vasude...@gmail.com> wrote: > Yes definitely you will get back the data. > > Please read the HBase Book that explains things in detail. > http:/

Re: persistence in Hbase

2013-01-10 Thread ramkrishna vasudevan
Yes definitely you will get back the data. Please read the HBase Book that explains things in detail. http://hbase.apache.org/book.html. Regards Ram On Thu, Jan 10, 2013 at 8:48 PM, Panshul Gupta wrote: > Hello, > > I was wondering if it is possible that I have data stored in Hbase tables > o

Re: persistence in Hbase

2013-01-10 Thread Leonid Fedotov
Panshul, Of course you will have all your data in place. It stored in HDFS and remains in there after restarting. Thank you! Sincerely, Leonid Fedotov On Jan 10, 2013, at 7:18 AM, Panshul Gupta wrote: > Hello, > > I was wondering if it is possible that I have data stored in Hbase tables > on

Re: persistence in Hbase

2013-01-10 Thread Mohammad Tariq
Hello Pranshul, The data which you are pushing to the HBase doesn't go its final destination directly. When you start pushing the data, it first goes to an in-memory log file. Once the log files are considerably big, they are merged together and put in an in-memory store called as memstore.

Re: Region status.

2013-01-10 Thread ramkrishna vasudevan
Hi Jean The region transition states are not exposed to the end user. You can only know if the table is enabled, enabling, disabled or disabling. But if you want to do in any of your testcases then YES it is possible. Regards Ram On Thu, Jan 10, 2013 at 7:42 PM, Jean-Marc Spaggiari < jean-m...@

Region status.

2013-01-10 Thread Jean-Marc Spaggiari
Hi, How can I get the status of a region? I mean, I have a HRegionInfo and I want to know if this region is opened, closed, opening, in transit, etc. How can I retreive this information? >From the table I can get a fresh region info, but I'm not able to find the status into it. I just have "isOf