[ https://issues.apache.org/jira/browse/DRILL-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15226336#comment-15226336 ]
Vincent Uribe commented on DRILL-4558:
--------------------------------------

Quick update: setting the option to false fixes the character issue, but it also causes other issues, such as Double fields becoming String... We will have to make a patch to correct the BSON reader.

> When a query returns diacritics in a string, the string is cut
> --------------------------------------------------------------
>
>                 Key: DRILL-4558
>                 URL: https://issues.apache.org/jira/browse/DRILL-4558
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - MongoDB
>         Environment: Apache Drill 1.6
> MongoDB 3.2.1
>            Reporter: Vincent Uribe
>
> With the given document in a collection "Test" from a database testDb :
> {
>     "_id" : ObjectId("56e7f1bd0944228aab06d0e2"),
>     "ID_ATTRIBUT" : "3",
>     "VAL_ATTRIBUT" : "Végétaux",
>     "UPDATED" : ISODate("2016-01-09T23:00:00.000Z")
> }
> When querying select * from mongoStorage.testDb.Test I get
> _id: [B@affb65
> ID_ATTRIBUT: 3
> VAL_ATTRIBUT: *Végéta*
> UPDATED: 2016-01-09T23:00:00.000Z
> As you can see, the two 'é' cut the string "végétaux" by 2 characters, giving
> végéta.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
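
Note on the symptom: losing exactly one character per 'é' is what one would expect if a reader used the character count of a string where the UTF-8 byte count is needed, since 'é' takes two bytes in UTF-8. This is only an assumption about the cause; the report does not confirm it. The sketch below reproduces the symptom in plain Java. The class and method names are hypothetical and are not taken from the Drill code base.

```java
import java.nio.charset.StandardCharsets;

public class DiacriticTruncationSketch {

    // Hypothetical helper: decodes a UTF-8 buffer but (wrongly) uses the
    // number of characters as the number of bytes to read.
    static String decodeUsingCharCount(String value) {
        byte[] utf8 = value.getBytes(StandardCharsets.UTF_8);
        int wrongLength = Math.min(value.length(), utf8.length);
        return new String(utf8, 0, wrongLength, StandardCharsets.UTF_8);
    }

    // Hypothetical helper: decodes the same buffer using its byte length.
    static String decodeUsingByteCount(String value) {
        byte[] utf8 = value.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, 0, utf8.length, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // "Végétaux" written with escapes so the source encoding does not matter.
        String original = "V\u00e9g\u00e9taux";

        // 8 characters, but 10 bytes in UTF-8 (each 'é' is 2 bytes).
        System.out.println("chars: " + original.length());
        System.out.println("bytes: " + original.getBytes(StandardCharsets.UTF_8).length);

        // Reading only 8 of the 10 bytes drops the trailing "ux".
        System.out.println("char-count decode: " + decodeUsingCharCount(original));
        System.out.println("byte-count decode: " + decodeUsingByteCount(original));
    }
}
```

If this assumption holds, a patch to the reader would need to size the string by its encoded byte length rather than by its character count.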