[ 
https://issues.apache.org/jira/browse/AVRO-2748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17042646#comment-17042646
 ] 

Erik Erlandson commented on AVRO-2748:
--------------------------------------

[~kojiromike] thanks for the heads up about py3!  I'll have to move my dev on 
AVRO-2474 to 'py'.

I'm not sure how to reproduce this visibly other than by adding print 
statements, but you can see that the 'match_schemas' method is called in 
'read_data' here:

[https://github.com/apache/avro/blob/master/lang/py/avro/io.py#L669]

And that's called, for example, on each iteration of '__next__':

[https://github.com/apache/avro/blob/master/lang/py/avro/datafile.py#L336]
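
As an alternative to print statements, a call counter makes the behavior 
visible. The following is only a toy sketch of the pattern described above, 
not the actual avro code: the class names, the 'match_calls' counter, and the 
simplified equality check are all hypothetical stand-ins. It compares checking 
the schemas on every read with checking them once at construction.

```python
class PerReadResolver:
    """Toy stand-in for a datum reader; NOT the real avro.io.DatumReader."""

    def __init__(self, writer_schema, reader_schema):
        self.writer_schema = writer_schema
        self.reader_schema = reader_schema
        self.match_calls = 0  # hypothetical counter, used instead of prints

    def match_schemas(self, writer_schema, reader_schema):
        # Placeholder check; real schema resolution is far more involved.
        self.match_calls += 1
        return writer_schema == reader_schema

    def read_data(self, datum):
        # Mirrors the reported pattern: the check runs on every read.
        if not self.match_schemas(self.writer_schema, self.reader_schema):
            raise ValueError("schemas do not match")
        return datum


class UpFrontResolver(PerReadResolver):
    """Variant that performs the check once, at construction."""

    def __init__(self, writer_schema, reader_schema):
        super().__init__(writer_schema, reader_schema)
        if not self.match_schemas(writer_schema, reader_schema):
            raise ValueError("schemas do not match")

    def read_data(self, datum):
        # The schemas were already checked in __init__.
        return datum


def count_match_calls(reader_cls, n_records):
    """Read n_records dummy records and report how often the check ran."""
    reader = reader_cls("string", "string")
    for i in range(n_records):
        reader.read_data(i)
    return reader.match_calls


if __name__ == "__main__":
    print("per read:", count_match_calls(PerReadResolver, 1000))
    print("up front:", count_match_calls(UpFrontResolver, 1000))
```

In this toy model the number of checks grows with the number of records in the 
first class and stays at one in the second. Whether the real reader can safely 
do the same depends on details of the avro code that this sketch does not 
capture.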

> py3 schema resolution occurs on every read
> ------------------------------------------
>
>                 Key: AVRO-2748
>                 URL: https://issues.apache.org/jira/browse/AVRO-2748
>             Project: Apache Avro
>          Issue Type: Bug
>          Components: python
>    Affects Versions: 1.9.2
>            Reporter: Erik Erlandson
>            Priority: Minor
>
> In Python, schema resolution appears to happen on each read operation. I'm 
> not an Avro expert, but in reading through the py3 io code I haven't yet 
> found a reason why schema resolution couldn't happen once up front, during 
> the construction of DataFileReader, when it first loads the write_schema.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
