neumarcx commented on issue #568: Add Aggregate Median to SPARQL ARQ syntax
URL: https://github.com/apache/jena/pull/568#issuecomment-497073501
 
 
   Yes, the run-time results* are clear. I have 
posted a question** on the ACM user mailing list; let's see if someone comes 
back with an explanation. Our use case in ARQ might not be a good fit for the ACM 
implementation. For now I will update the fork with my own implementation and 
continue to test it on the staging server.
   
   *Test runs:
   
   Total # of triple values -> 100,000,000
   Average of 10 runs
   my median() Arrays.sort/pivot:        260 ms
   ACM Median().evaluate:                606 ms
   ACM ds.getPercentile(50):             666 ms
   
   Total # of triple values -> 100,000,000
   Average of 100 runs
   my median() Arrays.sort/pivot:        256 ms
   ACM Median().evaluate:                408 ms
   ACM ds.getPercentile(50):             365 ms
   
   Total # of triple values -> 100,000,000
   Average of 1000 runs
   my median() Arrays.sort/pivot:        256 ms
   ACM Median().evaluate:                436 ms
   ACM ds.getPercentile(50):             451 ms
   
   Total # of triple values -> 1,000,000,000
   Average of 1 run
   my median() Arrays.sort/pivot:        2,852 ms
   ACM Median().evaluate:                20,262 ms
   ACM ds.getPercentile(50):             33,420 ms
   
   ACM = Apache Commons Math
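
For readers who want to see what a sort-based median could look like, here is a minimal sketch. It is only one reading of the "Arrays.sort/pivot" label above (sort, then pick the middle element or the mean of the two middle elements) and is not the code in the fork. The class name `MedianSketch` and the method `median` are hypothetical, and returning NaN for an empty input is an assumption made for the example.

```java
import java.util.Arrays;

// Hypothetical illustration only; not the implementation from the pull request.
final class MedianSketch {

    /**
     * Returns the median of the given values, or NaN if the array is empty.
     * The input is copied first, so the caller's array is left unchanged.
     */
    static double median(double[] values) {
        if (values.length == 0) {
            return Double.NaN; // assumption: an empty group has no median
        }
        double[] sorted = Arrays.copyOf(values, values.length);
        Arrays.sort(sorted);
        int mid = sorted.length / 2;
        if (sorted.length % 2 == 1) {
            // Odd count: the middle element is the median.
            return sorted[mid];
        }
        // Even count: take the mean of the two middle elements.
        return (sorted[mid - 1] + sorted[mid]) / 2.0;
    }

    public static void main(String[] args) {
        System.out.println(median(new double[] {5, 1, 4, 2, 3})); // prints 3.0
        System.out.println(median(new double[] {4, 1, 3, 2}));    // prints 2.5
    }
}
```

The two ACM rows presumably correspond to `org.apache.commons.math3.stat.descriptive.rank.Median#evaluate(double[])` and `DescriptiveStatistics#getPercentile(double)`. A fair comparison would time all three on the same input array.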
   
   
   **http://mail-archives.apache.org/mod_mbox/commons-user/201905.mbox/browser
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services
