2. *Apache mirrors are inconvenient to use.* When you download
    something from an Apache mirror, you get a link like this one
    
<https://www.apache.org/dyn/closer.lua/spark/spark-2.2.1/spark-2.2.1-bin-hadoop2.7.tgz>.
    Instead of automatically redirecting you to your download, though,
    you need to process the results you get back
    
<https://github.com/nchammas/flintrock/blob/67bf84a1b7cfa1c276cf57ecd8a0b27613ad2698/flintrock/scripts/download-hadoop.py#L21-L42>
    to find your download target. And you need to handle the high
    download failure rate, since sometimes the mirror you get doesn’t
    have the file it claims to have.

It's not a full answer, just a note:

You can also use mirrors.cgi instead of parsing the json from closer.lua:

https://www.apache.org/dyn/mirrors/mirrors.cgi?action=download&filename=spark/spark-2.2.1/spark-2.2.1-bin-hadoop2.7.tgz

(Unfortunatelly it doesn't check the availibility of the file. If it's moved to the archive you will be redirected to a 404)

Marton

---------------------------------------------------------------------
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org

Reply via email to