Gelesh created MAPREDUCE-4512: --------------------------------- Summary: TextInputFormat delimiter bug:- Input Text portion ends with & Delimiter starts with same char/char sequence Key: MAPREDUCE-4512 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4512 Project: Hadoop Map/Reduce Issue Type: Bug Components: contrib/mumak, mr-am, mrv1, mrv2, task Affects Versions: 2.0.0-alpha Environment: Lynux Reporter: Gelesh Fix For: 0.20.204.0
TextInputFormat delimiter bug scenario , a character sequence of the input text, in which the first character matches with the first character of delimiter, and reaming input text character sequence matches with the entire delimiter character sequence from the starting position of the delimiter. eg delimiter ="record"; and Text = record 1:- name = "Gelesh" e mail = gelesh.had...@gmail.com Location Bangalore record 2: name = sdf .. location =Bangalorrecord 3: name .... Here string "=Bangalorrecord 3: " satisfy two condition 1) contains the delimiter "record" 2) The character / character sequence immediately b4 the delimiter (ie 'r') matches with first character (or character sequence ) of delimiter. (ie "=Bangalor" ends with and Delimiter starts with same character/char sequence 'r' ), Hear the delimiter is skipped -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira