[ https://issues.apache.org/jira/browse/MAPREDUCE-3581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Allen Wittenauer updated MAPREDUCE-3581: ---------------------------------------- Affects Version/s: (was: 0.24.0) 3.0.0 > [Rumen] Rumen anonymizer should handle composite string data > ------------------------------------------------------------ > > Key: MAPREDUCE-3581 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-3581 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: security, tools/rumen > Affects Versions: 3.0.0 > Reporter: Amar Kamat > Assignee: Amar Kamat > Labels: anonymization, chunking, rumen > > Rumen's Anonymizer currently considers string as a single entity. At times, > strings can be composed of smaller sub-strings which can be anonymized > individually. Anonymizing sub-strings separately will result in retaining > certain statistics like frequency ('daily', 'weekly' etc). This was brought > up by Chris while developing the Anonymizer. -- This message was sent by Atlassian JIRA (v6.3.4#6332)