Stan Buren created SPARK-56818:
----------------------------------

             Summary: [DOCS] Binder build failure: openjdk-17-jre package not 
found in Ubuntu 22.04 base image
                 Key: SPARK-56818
                 URL: https://issues.apache.org/jira/browse/SPARK-56818
             Project: Spark
          Issue Type: Bug
          Components: Documentation
    Affects Versions: 4.1.1
         Environment: mybinder.org (Ubuntu 22.04 base image)
            Reporter: Stan Buren


h3. Description
The interactive Binder environments for all PySpark Quickstart tutorials are 
currently failing to build. This prevents users from utilizing the "Live 
Notebook" features directly from the official Spark documentation.

h3. Steps to Reproduce
1. Open the [Getting 
Started|https://spark.apache.org/docs/latest/api/python/getting_started/index.html]
 documentation.
2. Click on ANY of the following "Live Notebook" (Binder) buttons:
* ["Live Notebook: 
DataFrame"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_df.ipynb]
* ["Live Notebook: Spark 
Connect"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_connect.ipynb]
* ["Live Notebook: pandas API on 
Spark"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_ps.ipynb]
3. Wait for the build process on mybinder.org.


h3. Actual Result
All three notebooks fail at step 39 of the Dockerfile with the exact same 
error. 

{code:title=Build Log Snippet}
 > [7/8] RUN apt-get update && apt-get install -y openjdk-17-jre git coreutils:
 ...
 5.008 Package openjdk-17-jre is not available, but is referred to by another 
package.
 5.008 This may mean that the package is missing, has been obsoleted, or
 5.008 is only available from another source
 5.008 
 5.016 E: Package 'openjdk-17-jre' has no installation candidate
 ...
 ERROR: failed to solve: process "/bin/sh -c apt-get update && apt-get install 
-y openjdk-17-jre git coreutils" did not complete successfully: exit code: 100
{code}

{code:title=Affected Dockerfile Lines}
  37 |     USER root
  38 |     RUN chown -R ${NB_UID} ${HOME}
  39 | >>> RUN apt-get update && apt-get install -y openjdk-17-jre git coreutils
  40 |     USER ${NB_USER}
{code}

h3. Expected Result
The environment should build successfully for all Quickstart notebooks, 
allowing the user to interact with Jupyter.

h3. Root Cause Analysis
The build logs indicate that the environment is pulling packages from **Debian 
Trixie (Testing)** repositories:
{code}
Hit:1 http://deb.debian.org/debian trixie InRelease
{code}
The current {{binder/Dockerfile}} (at commit {{c0690c763ba}}) attempts to 
install {{openjdk-17-jre}}, but this package appears to be missing or 
unavailable in the current Trixie package index. Since this build is associated 
with the **v4.1.1-rc2** release preparation, it blocks the interactive 
documentation for the upcoming version.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to