RE: Replace anonymized data in string

2021-11-12 Thread Patrick FICHE
> On 12 Nov 2021, at 15:12, Patrick FICHE  wrote:

> Is there any way to do this ?

There was a presentation on the subject of anonymization and data masking at 
Fosdem PGDay 2019, maybe the slides from there can give any insights?

https://www.postgresql.eu/events/fosdem2019/schedule/session/2287-anonymization-and-data-masking-with-postgresql/

--

Thanks a lot for your answer.
This is a very good presentation for anonymization technics.
Unfortunately, my data has been already anonymized and I'm trying to random the 
anonymized part... which is a bit different from what I could find here 😊


Re: Replace anonymized data in string

2021-11-12 Thread Rob Sargent

On 11/12/21 7:12 AM, Patrick FICHE wrote:


Hi Team,

I have some data that has been anonymized and I would like to generate 
some test data from this. In some way, I would like to deanonymize 
this data with random data.


For example, phone numbers have been anonymized with changing the 5 
right digits with the 8 digit (preserving length).


Applying this, the number 390694802756 was changed to 390694808.

I would like to get random digits at the end of the phone number 
knowing that anonymized data can be a variable length.


So, I would like to change every sequence (at least 2) of 8 by random 
value of same length (I don’t worry if phone number contains 88 in the 
middle and this sequence is changed to random data)…


I tried to do this with replace / regexp_replace functions but could 
not achieve what I wanted to do.


I don’t want these digits to be changed with a single one (8 by 
11 or 9 but something like 42384)…


Ideally, the new string would be different if multiple sequences of 8 
appear in a string and would be different from one record to the other 
when applied to a full table…


Is there any way to do this ?

Thanks,

Patrick


Usual trick it to select floor(random()*10);


Re: Replace anonymized data in string

2021-11-12 Thread Daniel Gustafsson
> On 12 Nov 2021, at 15:12, Patrick FICHE  wrote:

> Is there any way to do this ?

There was a presentation on the subject of anonymization and data masking at
Fosdem PGDay 2019, maybe the slides from there can give any insights?

https://www.postgresql.eu/events/fosdem2019/schedule/session/2287-anonymization-and-data-masking-with-postgresql/

--
Daniel Gustafsson   https://vmware.com/





Replace anonymized data in string

2021-11-12 Thread Patrick FICHE
Hi Team,

I have some data that has been anonymized and I would like to generate some 
test data from this. In some way, I would like to deanonymize this data with 
random data.

For example, phone numbers have been anonymized with changing the 5 right 
digits with the 8 digit (preserving length).
Applying this, the number 390694802756 was changed to 390694808.

I would like to get random digits at the end of the phone number knowing that 
anonymized data can be a variable length.
So, I would like to change every sequence (at least 2) of 8 by random value of 
same length (I don't worry if phone number contains 88 in the middle and this 
sequence is changed to random data)...

I tried to do this with replace / regexp_replace functions but could not 
achieve what I wanted to do.
I don't want these digits to be changed with a single one (8 by 11 or 
9 but something like 42384)...
Ideally, the new string would be different if multiple sequences of 8 appear in 
a string and would be different from one record to the other when applied to a 
full table...

Is there any way to do this ?

Thanks,
Patrick