madrob commented on a change in pull request #759:
URL: https://github.com/apache/lucene/pull/759#discussion_r833317662



##########
File path: gradle/datasets/external-datasets.gradle
##########
@@ -120,10 +120,9 @@ configure(project(":lucene:benchmark")) {
   task getReuters(type: Download) {
     ext {
       name = "reuters21578"
-      // note: there is no HTTPS url and we don't care because this is merely test/perf data
-      src = "http://www.daviddlewis.com/resources/testcollections/reuters21578/${name}.tar.gz";
+      src = "https://kdd.ics.uci.edu/databases/${name}/${name}.tar.gz";
       intermediate = file("${dataDir}/${name}.tar.gz")
-      dst = file("${dataDir}/${name}")

Review comment:
       I switched it to reuters-out because that is what most of the other 
algorithm files expect. My thinking was that we can either keep reuters21578 
and update it in 10 places, or change it to reuters-out and update a single 
algorithm file.
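
For illustration, a minimal sketch of the single-place change described above. It assumes the extracted data lands in a directory named reuters-out under dataDir; the replacement line is not visible in this hunk, so the dst value below is hypothetical rather than the PR's actual content.

```groovy
// Hypothetical sketch, not the actual PR content: point the extracted
// dataset at the directory name most benchmark algorithm files expect.
task getReuters(type: Download) {
  ext {
    name = "reuters21578"
    src = "https://kdd.ics.uci.edu/databases/${name}/${name}.tar.gz"
    intermediate = file("${dataDir}/${name}.tar.gz")
    // Assumed replacement for the removed line: dst = file("${dataDir}/${name}")
    dst = file("${dataDir}/reuters-out")
  }
}
```

With this shape, only the one algorithm file that still refers to reuters21578 would need updating, instead of the roughly ten that already refer to reuters-out.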




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


