Re: Storing XML file in Hbase

2016-11-28 Thread Richard Startin
bzadeh <mich.talebza...@gmail.com> Sent: 28 November 2016 21:57 To: user@hbase.apache.org Subject: Re: Storing XML file in Hbase Thanks Richard. How would one decide on the number of column family and columns? Is there a ballpark approach Cheers Dr Mich Talebzadeh LinkedIn * https://www.li

Re: Storing XML file in Hbase

2016-11-28 Thread Mich Talebzadeh
Thanks Richard. How would one decide on the number of column family and columns? Is there a ballpark approach Cheers Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw

Re: Storing XML file in Hbase

2016-11-28 Thread Richard Startin
Hi Mich, If you want to store the file whole, you'll need to enforce a 10MB limit to the file size, otherwise you will flush too often (each time the me store fills up) which will slow down writes. Maybe you could deconstruct the xml by extracting columns from the xml using xpath? If the

Re: Storing XML file in Hbase

2016-11-28 Thread Dima Spivak
Hi Mich, How many files are you looking to store? How often do you need to read them? What's the total size of all the files you need to serve? Cheers, Dima On Mon, Nov 28, 2016 at 7:04 AM Mich Talebzadeh wrote: > Hi, > > Storing XML file in Big Data. Are there any

Storing XML file in Hbase

2016-11-28 Thread Mich Talebzadeh
Hi, Storing XML file in Big Data. Are there any strategies to create multiple column families or just one column family and in that case how many columns would be optional? thanks Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw