[ https://issues.apache.org/jira/browse/BEAM-7008?focusedWorklogId=240402&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-240402 ]

ASF GitHub Bot logged work on BEAM-7008:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 10/May/19 20:40
            Start Date: 10/May/19 20:40
    Worklog Time Spent: 10m
      Work Description: angoenka commented on issue #8544: Revert "Merge pull request #8228: [BEAM-7008] adding UTF8 String code…
URL: https://github.com/apache/beam/pull/8544#issuecomment-491423651

   Run Python PostCommit

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 240402)
    Time Spent: 6h 10m  (was: 6h)

> adding UTF8 String coder to Java SDK ModelCoders
> ------------------------------------------------
>
>                 Key: BEAM-7008
>                 URL: https://issues.apache.org/jira/browse/BEAM-7008
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-java-core
>            Reporter: Heejong Lee
>            Assignee: Heejong Lee
>            Priority: Major
>             Fix For: 2.13.0
>
>          Time Spent: 6h 10m
>  Remaining Estimate: 0h
>
> It looks like UTF-8 String Coder in Java and Python SDKs uses different
> encoding schemes. StringUtf8Coder in Java SDK puts the varint length of the
> input string before actual data bytes however StrUtf8Coder in Python SDK
> directly encodes the input string to bytes value. We should unify the
> encoding schemes of UTF8 strings across the different SDKs and make it a
> standard coder.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
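
To illustrate the mismatch described in the issue text above, here is a minimal Python sketch of the two schemes as the description words them: one writes a varint length before the UTF-8 bytes, the other writes the UTF-8 bytes alone. This is not code from either SDK. The function names are hypothetical, and the sketch assumes the length is the byte count of the UTF-8 data, written as a base-128 varint with the low-order group first.

```python
def encode_varint(n: int) -> bytes:
    """Base-128 varint: 7 bits per byte, low-order group first.

    Every byte except the last has its high bit set.
    """
    if n < 0:
        raise ValueError("length must be non-negative")
    out = bytearray()
    while True:
        low7 = n & 0x7F
        n >>= 7
        if n:
            out.append(low7 | 0x80)  # more groups follow
        else:
            out.append(low7)         # final group
            return bytes(out)


def encode_length_prefixed(s: str) -> bytes:
    """Scheme attributed to the Java coder: varint byte length, then data."""
    data = s.encode("utf-8")
    return encode_varint(len(data)) + data


def encode_raw(s: str) -> bytes:
    """Scheme attributed to the Python coder: the UTF-8 bytes alone."""
    return s.encode("utf-8")


def decode_length_prefixed(buf: bytes) -> str:
    """Read a varint length, then decode that many bytes as UTF-8."""
    length = 0
    shift = 0
    pos = 0
    while True:
        b = buf[pos]
        pos += 1
        length |= (b & 0x7F) << shift
        if not b & 0x80:
            break
        shift += 7
    return buf[pos:pos + length].decode("utf-8")
```

For the input "beam", the raw form is the four bytes b"beam", while the length-prefixed form is b"\x04beam". A decoder that expects one form will misread the other, which is why the issue asks for a single standard encoding.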