Hey all, I am working with JSON that is, on the whole, fairly clean. I am trying to load it into Parquet files. The previous day's worth of data loaded just fine, but today's data has something wrong with it and I can't figure out what it is. Unfortunately, I can't post the data, which I know makes this hard for the community to troubleshoot. Hopefully I can provide some info here, get some pointers on where to look, and then report back on how we could potentially improve the error messages.
The error is below. Given the information reported, where should I look to troubleshoot this? Obviously the file 02ffc306e877_my_load_1446640931.json is where I am starting. This file has 3000 lines (records of data), so record 2402 falls somewhere within it. The index/length/expected range values don't mean anything to me; I could use some help there, because I am not even sure what I am looking for. Do the Record and/or Fragment values help me dig in? Since this is one record per line, I went to line 2402, but that record looks completely normal to me (like all the other ones). This is dense text, so I am obviously missing something, but is the record number the line number?

Any other pointers I can use to troubleshoot this?

Thanks!

Error:

Caused by: org.apache.drill.common.exceptions.UserRemoteException: DATA_READ ERROR: Error parsing JSON - index: 9604, length: 4 (expected: range(0, 8192))

File /etl/dev/my-metadata/mysqspull/loads/2015-11-04/02ffc306e877_my_load_1446640931.json
Record 2402
Fragment 1:5
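
In case it helps frame the question, here is a rough sketch of the check I can run on my side, outside of Drill. It assumes the file really is one JSON record per line. The function name scan_json_lines is just something I made up, and the 8192 limit is only a guess taken from the "expected: range(0, 8192)" part of the message, not a setting I know Drill to have. Since I can't tell whether "Record 2402" counts from 0 or from 1, it checks every line rather than just that one, and reports lines that are not valid JSON or that contain a string value larger than the limit.

```python
import json
import sys


def scan_json_lines(lines, limit=8192):
    """Return (line_number, problem) pairs for suspicious records.

    limit is a guess based on the error message, not a documented Drill setting.
    """
    problems = []
    for number, line in enumerate(lines, start=1):
        text = line.strip()
        if not text:
            continue  # ignore blank lines
        try:
            record = json.loads(text)
        except ValueError as exc:
            problems.append((number, "invalid JSON: %s" % exc))
            continue
        # Walk nested objects and arrays, flagging strings whose UTF-8 size exceeds limit.
        stack = [record]
        while stack:
            value = stack.pop()
            if isinstance(value, dict):
                stack.extend(value.values())
            elif isinstance(value, list):
                stack.extend(value)
            elif isinstance(value, str):
                size = len(value.encode("utf-8"))
                if size > limit:
                    problems.append((number, "string value of %d bytes" % size))
    return problems


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python scan.py 02ffc306e877_my_load_1446640931.json
    with open(sys.argv[1], encoding="utf-8") as handle:
        for number, problem in scan_json_lines(handle):
            print("line %d: %s" % (number, problem))
```

If this prints nothing for the file, then the problem is presumably something other than malformed JSON or an oversized string, and I would still need help reading the index/length values.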
