Hi,

I have an Avro file that support the following data/schema:

{"field":"some", "classification":{"variant":"Gæst"}}

When I select 10 rows from this file I get:

+---------------------+
|       EXPR$0        |
+---------------------+
| Gæst                |
| Voksen              |
| Voksen              |
| Invitation KIF KBH  |
| Invitation KIF KBH  |
| Ordinarie pris KBH  |
| Ordinarie pris KBH  |
| Biljetter 200 krBH  |
| Biljetter 200 krBH  |
| Biljetter 200 krBH  |
+---------------------+

The bug is that the field values are incorrectly de-serialized and the
value from the previous row is retained if the subsequent row is shorter.

The sql query:

"select s.classification.variant variant from dfs.<some> as s limit 10;"


That way the  "Ordinarie pris" becomes "Ordinarie pris KBH" because the
previous row had the value "Invitation KIF KBH".

Regards,
  -Stefán

Reply via email to