[GitHub] spark issue #22966: [PARK-25965][SQL][TEST] Add avro read benchmark

2018-11-07 Thread dbtsai
Github user dbtsai commented on the issue:

https://github.com/apache/spark/pull/22966
  
jmh is a framework to write benchmark that can generate standardized 
reports to be consumed by Jenkins. 

Here is an example, 
https://github.com/pvillega/jmh-scala-test/blob/master/src/main/scala/com/perevillega/JMHTest.scala


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #22966: [PARK-25965][SQL][TEST] Add avro read benchmark

2018-11-07 Thread gengliangwang
Github user gengliangwang commented on the issue:

https://github.com/apache/spark/pull/22966
  
@dbtsai Great! 
I was thinking the benchmark in this PR is kind of simple, so I didn't add 
it for over months..
The benchmark you mentioned should also workable for other data sources, 
right?


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #22966: [PARK-25965][SQL][TEST] Add avro read benchmark

2018-11-07 Thread dbtsai
Github user dbtsai commented on the issue:

https://github.com/apache/spark/pull/22966
  
cc @jleach4 and @aokolnychyi 

We have a great success using 
[jmh](http://openjdk.java.net/projects/code-tools/jmh/) for this type of 
benchmarking; the benchmarks can be written in the unit test. This framework 
handles JVM warn-up, computes the latency, and throughput, etc, and then 
generates reports that can be consumed in Jenkins. We also use Jenkins to 
visualize the trend of performance changes which is very useful to find 
regressions. 





---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #22966: [PARK-25965][SQL][TEST] Add avro read benchmark

2018-11-07 Thread gengliangwang
Github user gengliangwang commented on the issue:

https://github.com/apache/spark/pull/22966
  
Done, @dongjoon-hyun PTAL.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org