This is a known problem. I would recommend fastavro, an alternative implementation, which is indeed faster.
Sam

On Thursday, March 24, 2016 4:05 PM, "Hong, Shunhui Welly" <shunhui.welly.h...@jpmorgan.com> wrote:

Hi guys,

I am trying to decode avro binary data (55M took 5 mins) in Python. It seems 10+ times slower than in Java. Any suggestions?

    bytes_reader = io.BytesIO(raw_bytes)
    decoder = avro.io.BinaryDecoder(bytes_reader)
    reader = avro.io.DatumReader(schema)
    decoded_data = reader.read(decoder)

Thank you,
Welly

This email is confidential and subject to important disclaimers and conditions including on offers for the purchase or sale of securities, accuracy and completeness of information, viruses, confidentiality, legal privilege, and legal entity disclaimers, available at http://www.jpmorgan.com/pages/disclosures/email