Hi Ed,

Thanks for the detailed review. I wanted to address some of your questions.
Regarding JIRAs: for the last few releases, the activity on the JIRAs was 
minimal. I would create it and then after completing the implementation, I 
would close it. The community still has the option to weigh in on the code 
changes by reviewing the PRs but that hasn't happened as well. At this point, 
writing JIRAs just for myself doesn't help anybody. If the community is 
interested in suggesting new features, I would be happy to address them to the 
best of my abilities.

The typo in the wiki is fixed. Since it is not a part of the release, it is 
just a confluence page, I don't believe it affects the viability of this 
release.

Regarding python dependencies: The suggestion on the version numbers for 
dependencies is valid. I will update the installation instructions webpage with 
the relevant information. The python executable itself not being in the path is 
a corner case for most systems but is easy to fix. We should create a JIRA to 
track it and get it fixed for the next release.

Thanks,

Orhan Kislal
________________________________
From: Ed Espino <esp...@apache.org>
Sent: Tuesday, February 28, 2023 11:58 AM
To: dev@madlib.apache.org <dev@madlib.apache.org>
Subject: Re: Apache MADlib 1.21.0 Release (RC2)

!! External Email

Edward Espino


Table of Contents
_________________

1. Apache MADlib Version v1.21.0 RC2
2. Observations
.. 1. Reviewing Jira status - QUESTION
.. 2. 1.21.0 docs Review - PASSED
.. 3. MADlib 1.21.0 wiki - old release reference
.. 4. PGP signature verification - PASSED
.. 5. SHA512 checksum verification - PASSED
.. 6. RELEASE_NOTES reviewed - QUESTION
.. 7. Copyright review in NOTICE file - PASSED
.. 8. Apache RAT (mvn apache-rat:check) - PASSED
.. 9. Python dependencies - QUESTION/OBSERVATION
.. 10. Operating System: Rocky Linux 8.7 (Green Obsidian) x86_64 - PASSED
.. 11. Operating System: Ubuntu 22.04.2 LTS - PASSED
.. 12. Test cannot find "python" - FAILURE


1 Apache MADlib Version v1.21.0 RC2
===================================

  As I am not a member of PMC or a committer. Here are my observations
  of the release.


2 Observations
==============

2.1 Reviewing Jira status - QUESTION
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  Only a single Jira is fixed in this release. Is this expected?
  
<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fissues.apache.org%2Fjira%2Fbrowse%2FMADLIB-1506&data=05%7C01%7Cokislal%40vmware.com%7Cb5b835eef540432aa60308db196a0ddb%7Cb39138ca3cee4b4aa4d6cd83d9dd62f0%7C0%7C0%7C638131715521745368%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=tGKhJB8Hf%2BRTtemUo%2FSgsbmI4y5VXGcuyKTc8VpRJP0%3D&reserved=0>

  v1.21.0 Release Jira info
  
<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fissues.apache.org%2Fjira%2Fprojects%2FMADLIB%2Fversions%2F12352104&data=05%7C01%7Cokislal%40vmware.com%7Cb5b835eef540432aa60308db196a0ddb%7Cb39138ca3cee4b4aa4d6cd83d9dd62f0%7C0%7C0%7C638131715521901621%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=6XEJWiGxFRIBJyll%2FNPr5IApQB5%2BQ3bezLYfqK09N%2Fg%3D&reserved=0>

  1 Issues in version
  1 Issues done
  0 Issues in progress
  0 Issues to do

  I am curious why the v1.20.0 Jira release list is not in a finalized
  state.

  
<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fissues.apache.org%2Fjira%2Fprojects%2FMADLIB%2Fversions%2F12352103&data=05%7C01%7Cokislal%40vmware.com%7Cb5b835eef540432aa60308db196a0ddb%7Cb39138ca3cee4b4aa4d6cd83d9dd62f0%7C0%7C0%7C638131715521901621%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=fBz7AcPojrjorlxDlmqtUBABADYAt2apYO25txfEodY%3D&reserved=0>

  Is it possible some of these issues are actually fixed in the v1.21.0
  release?

  16 Issues in version
   2 Issues done
   0 Issues in progress
  14 Issues to do


2.2 1.21.0 docs Review - PASSED
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  
<https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fmadlib.apache.org%2Fdocs%2Frc%2Findex.html&data=05%7C01%7Cokislal%40vmware.com%7Cb5b835eef540432aa60308db196a0ddb%7Cb39138ca3cee4b4aa4d6cd83d9dd62f0%7C0%7C0%7C638131715521901621%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=2H8fpJDILF3sKpnTqpJqxAcfieEzqAIi7%2FPpXJWLA1M%3D&reserved=0>


2.3 MADlib 1.21.0 wiki - old release reference
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  
<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcwiki.apache.org%2Fconfluence%2Fdisplay%2FMADLIB%2FMADlib%2B1.21.0&data=05%7C01%7Cokislal%40vmware.com%7Cb5b835eef540432aa60308db196a0ddb%7Cb39138ca3cee4b4aa4d6cd83d9dd62f0%7C0%7C0%7C638131715521901621%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=0CzUJ2EqRDZ2FfbYxv%2BOy%2FsGtlDCtotiZ3SP9Cp4n0Q%3D&reserved=0>

  This page reference v1.20.0 release:

  Release Notes
  MADlib v1.20.0:


2.4 PGP signature verification - PASSED
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  --> Good signature from "Venkatesh Raghavan (G!) <raghava...@vmware.com>"
[unknown]


2.5 SHA512 checksum verification - PASSED
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~


2.6 RELEASE_NOTES reviewed - QUESTION
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  The v1.21.0 release notes indicate three New Features, two
  Improvements and three Bug Fixes are included. However, a single Jira
  (MADLIB-1506) is listed for the release.


2.7 Copyright review in NOTICE file - PASSED
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  Apache MADlib
  Copyright 2016-2023 The Apache Software Foundation.
  ...


2.8 Apache RAT (mvn apache-rat:check) - PASSED
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  [INFO] Building madlib 1.21.0
    ... (elided verbose output)
  [INFO] 289 resources included (use -debug for more details)
  [INFO] Rat check: Summary of files. Unapproved: 0 unknown: 0 generated: 0
approved: 280 licence.


2.9 Python dependencies - QUESTION/OBSERVATION
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  I was able to complete install-check and dev-check activities by
  installing the following Python modules. What exactly are the required
  components for testing and runtime use?  Minimally, these should be
  referenced in the release notes.

  keras
  tensorflow==1.14
  dill
  pandas
  hyperopt==0.2.5
  xgboost
  rtree
  scikit-learn

  When building against the Greenplum 6.23.1 release I had to set the
  following to find the appropriate Python 2.7 shared libraries embedded
  within the release installation directory.

  export
LIBRARY_PATH=/usr/local/greenplum-db-6.23.1/lib:/usr/local/greenplum-db-6.23.1/ext/python/lib


2.10 Operating System: Rocky Linux 8.7 (Green Obsidian) x86_64 - PASSED
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  - VMware Greenplum 6.23.1 (GA)
  - PostgreSQL 12.14 (built from source)
  - PostgreSQL 11.19 (built from source)
  - Apache MADlib apache-madlib-1.21.0-src.tar.gz Source tarball

  - install-check passed
  - dev-check passed


2.11 Operating System: Ubuntu 22.04.2 LTS - PASSED
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  - PostgreSQL 12.14 (built from source)
  - PostgreSQL 11.19 (built from source)
  - Apache MADlib apache-madlib-1.21.0-src.tar.gz Source tarball

  - install-check passed
  - dev-check passed


2.12 Test cannot find "python" - FAILURE
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  Python 2 on the system revealed a hardcoded legacy issue. On both
  Rocky 8 and Ubuntu 22 the Python 2 executable is not installed as
  /usr/bin/python. Thus the deep_learning madlib_keras_gpu_info test
  fails. The function get_gpu_info_from_tensorflow expects to find it
  via the executable name "python" which is not satisfied and the test
  fails.

  Here is the reference;

  <
https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Fmadlib%2Fblob%2Fmaster%2Fsrc%2Fports%2Fpostgres%2Fmodules%2Fdeep_learning%2Fmadlib_keras_gpu_info.py_in%23L67&data=05%7C01%7Cokislal%40vmware.com%7Cb5b835eef540432aa60308db196a0ddb%7Cb39138ca3cee4b4aa4d6cd83d9dd62f0%7C0%7C0%7C638131715521901621%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=VURQWkI2b3kOlmpxkTe1NdRyXopP9bZNyJJQ7JQjA5M%3D&reserved=0
>

  I was able to workaround this by creating the following symbolic
  link. This is possible because I did not install Python 3 which might
  have relied upon /usr/bin/python. This needs some investigation and a
  minimal reference in the release notes.

  ln -s /usr/bin/python2 /usr/bin/python

!! External Email: This email originated from outside of the organization. Do 
not click links or open attachments unless you recognize the sender.

Reply via email to