Mich, I am okay with replacing the columns data with some characters like asterisk. Thanks
On Thursday, March 17, 2016, Mich Talebzadeh <mich.talebza...@gmail.com> wrote: > Hi Ajay, > > Do you want to be able to unmask it (at any time) or just have it totally > scrambled (for example replace the column with random characters) in Hive? > > Dr Mich Talebzadeh > > > > LinkedIn * > https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw > <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>* > > > > http://talebzadehmich.wordpress.com > > > > On 17 March 2016 at 15:14, Ajay Chander <hadoopde...@gmail.com > <javascript:_e(%7B%7D,'cvml','hadoopde...@gmail.com');>> wrote: > >> Mich thbaks for looking into this. I have a 'csvfile.txt ' on hdfs. I >> have created an external table 'xyz' to load that data into it. One of the >> columns data 'ssn' needs to be masked. Is there any built in function is >> give that I could use? >> >> >> On Thursday, March 17, 2016, Mich Talebzadeh <mich.talebza...@gmail.com >> <javascript:_e(%7B%7D,'cvml','mich.talebza...@gmail.com');>> wrote: >> >>> Are you loading your CSV file from an External table into Hive table.? >>> >>> Basically you want to scramble that column before putting into Hive >>> table? >>> >>> Dr Mich Talebzadeh >>> >>> >>> >>> LinkedIn * >>> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw >>> <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>* >>> >>> >>> >>> http://talebzadehmich.wordpress.com >>> >>> >>> >>> On 17 March 2016 at 14:37, Ajay Chander <hadoopde...@gmail.com> wrote: >>> >>>> Tustin, Is there anyway I can deidentify it in hive ? >>>> >>>> >>>> On Thursday, March 17, 2016, Marcin Tustin <mtus...@handybook.com> >>>> wrote: >>>> >>>>> This is a classic transform-load problem. You'll want to anonymise it >>>>> once before making it available for analysis. >>>>> >>>>> On Thursday, March 17, 2016, Ajay Chander <hadoopde...@gmail.com> >>>>> wrote: >>>>> >>>>>> Hi Everyone, >>>>>> >>>>>> I have a csv.file which has some sensitive data in a particular >>>>>> column in it. Now I have to create a table in hive and load the data >>>>>> into >>>>>> it. But when loading the data I have to make sure that the data is >>>>>> masked. >>>>>> Is there any built in function is used ch supports this or do I have to >>>>>> write UDF ? Any suggestions are appreciated. Thanks >>>>> >>>>> >>>>> Want to work at Handy? Check out our culture deck and open roles >>>>> <http://www.handy.com/careers> >>>>> Latest news <http://www.handy.com/press> at Handy >>>>> Handy just raised $50m >>>>> <http://venturebeat.com/2015/11/02/on-demand-home-service-handy-raises-50m-in-round-led-by-fidelity/> >>>>> led >>>>> by Fidelity >>>>> >>>>> >>> >