This is a known problem. I would recommend the alternate implementation called 
fastavro which is indeed faster.

Sam

 

    On Thursday, March 24, 2016 4:05 PM, "Hong, Shunhui Welly" 
<shunhui.welly.h...@jpmorgan.com> wrote:
 

  <!--#yiv7600624740 _filtered #yiv7600624740 {font-family:Calibri;panose-1:2 
15 5 2 2 2 4 3 2 4;} _filtered #yiv7600624740 {font-family:Consolas;panose-1:2 
11 6 9 2 2 4 3 2 4;}#yiv7600624740 #yiv7600624740 p.yiv7600624740MsoNormal, 
#yiv7600624740 li.yiv7600624740MsoNormal, #yiv7600624740 
div.yiv7600624740MsoNormal 
{margin:0in;margin-bottom:.0001pt;font-size:11.0pt;font-family:"Calibri", 
"sans-serif";}#yiv7600624740 a:link, #yiv7600624740 
span.yiv7600624740MsoHyperlink 
{color:blue;text-decoration:underline;}#yiv7600624740 a:visited, #yiv7600624740 
span.yiv7600624740MsoHyperlinkFollowed 
{color:purple;text-decoration:underline;}#yiv7600624740 code 
{font-family:Consolas;background:#F0F0F0;}#yiv7600624740 
span.yiv7600624740EmailStyle17 {font-family:"Calibri", 
"sans-serif";color:windowtext;}#yiv7600624740 .yiv7600624740MsoChpDefault 
{font-family:"Calibri", "sans-serif";} _filtered #yiv7600624740 {margin:1.0in 
1.0in 1.0in 1.0in;}#yiv7600624740 div.yiv7600624740WordSection1 {}-->Hi guys,   
 I am trying to decode avro binary data (55M took 5 mins) in Python. It seems 
10+ times slower than in Java. Any suggestions?    bytes_reader = 
io.BytesIO(raw_bytes) decoder = avro.io.BinaryDecoder(bytes_reader) reader = 
avro.io.DatumReader(schema) decoded_data = reader.read(decoder)    Thank you,   
 Welly This email is confidential and subject to important disclaimers and 
conditions including on offers for the purchase or sale of securities, accuracy 
and completeness of information, viruses, confidentiality, legal privilege, and 
legal entity disclaimers, available at 
http://www.jpmorgan.com/pages/disclosures/email

  

Reply via email to