Thank you for being the release manager Jingyi. I used the rpm to upgrade from MADlib 1.13 to 1.14 on Greenplum 5. Works great! +1 (binding)
One issue, that does not affect my vote: - The link to all the `.asc` files in the email you sent is broken. I think the word 'SHA512' from the next line is getting included in the URL from the previous line for some reason. NJ On Fri, Apr 27, 2018 at 3:19 PM, Rashmi Raghu <rra...@pivotal.io> wrote: > Installed on Postgres 9.6 on MacOS using dmg. > Checked out the new additions to the summary function. Looks good. My > vote: +1 (binding). > > Some comments aside from the vote: > > - I followed this link in the email: https://cwiki.apache. > org/confluence/display/MADLIB/MADlib+1.14 > <https://cwiki.apache.org/confluence/display/MADLIB/MADlib+1.14> and > then from there clicked on https://dist.apache.org/ > repos/dist/release/madlib/1.14/ > <https://dist.apache.org/repos/dist/release/madlib/1.14/> which gives > a page-not-found error. > - I didn't see a link to documentation associated with this release - > it would be useful to have that also available (let me know if it was in > the email and I missed it or if it is not standard practice). For instance, > I wanted to briefly look at the new balanced datasets module and it would > have been easy to look it up in the web version of the docs. I did find the > docs through the function call e.g. madlib.balance_sample('usage') but that > requires knowing roughly what function name to look for (not hard in this > case but I can imagine other situations where it might not be > straightforward) > > Great to see all the new features and bug fixes! > > Thanks, > Rashmi > > > On Fri, Apr 27, 2018 at 1:40 PM, Orhan Kislal <okis...@pivotal.io> wrote: > >> Tested on PG 10.3 (src and dmg). Looks good. +1 (binding) >> >> Thanks for preparing the release Jingyi, >> >> Orhan Kislal >> >> On Fri, Apr 27, 2018 at 11:44 AM, Frank McQuillan <fmcquil...@pivotal.io> >> wrote: >> >>> Hi Jingyi, >>> >>> Thanks for posting the artifacts and sending out the vote. >>> >>> My findings: >>> >>> Installation and IC passed on postgres 9.6.7 >>> >>> Also I tested a cpl of the new features (personalized page rank and >>> mini-batch preprocessor) >>> and they worked OK for me with a small sample data set. >>> >>> +1 (binding) >>> >>> On Thu, Apr 26, 2018 at 2:57 PM, Jingyi Mei <j...@pivotal.io> wrote: >>> >>> > Hello Apache MADlib dev community, >>> > >>> > This is the vote for Apache MADlib 1.14 Release (RC1). It provides the >>> > source release tarball and convenience binaries. This is the third >>> > Apache MADlib release as an Apache Top Level Project (TLP). >>> > >>> > The vote will run for at least 72 working hours and will close on >>> > Tuesday, May 1st, 2018 @ 6pm PDT. A minimum of 3 binding +1 votes and >>> > more binding +1 than binding -1 are required to pass. >>> > >>> > The main goals of this release are: >>> > >>> > New features: >>> > >>> > - New module - Balanced datasets: A sampling module to balance >>> > classification >>> > datasets by resampling using various techniques including >>> > undersampling, >>> > oversampling, uniform sampling or user-defined proportion sampling >>> > (MADLIB-1168) >>> > - Mini-batch: Added a mini-batch optimizer for MLP and a >>> preprocessor >>> > function >>> > necessary to create batches from the data (MADLIB-1200, MADLIB-1206, >>> > MADLIB-1220, MADLIB-1224, MADLIB-1226, MADLIB-1227) >>> > - k-NN: Added weighted averaging/voting by distance (MADLIB-1181) >>> > - Summary: Added additional stats: number of positive, negative, >>> zero >>> > values and >>> > 95% confidence intervals for the mean (MADLIB-1167) >>> > - Encode categorical: Updated to produce lower-case column names >>> when >>> > possible >>> > (MADLIB-1202) >>> > - MLP: Added support for already one-hot encoded categorical >>> dependent >>> > variable >>> > in a classification task (MADLIB-1222) >>> > - Pagerank: Added option for personalized vertices that allows >>> higher >>> > weightage >>> > for a subset of vertices which will have a higher jump probability >>> as >>> > compared to other vertices and a random surfer is more likely to >>> > jump to these personalization vertices (MADLIB-1084) >>> > >>> > Bug fixes: >>> > >>> > - Fixed issue with invalid calls of construct_array that led to >>> > problems >>> > in Postgresql 10 (MADLIB-1185) >>> > - Added newline between file concatenation during PGXN install >>> > (MADLIB-1194) >>> > - Fixed upgrade issues in knn (MADLIB-1197) >>> > - Added fix to ensure RF variable importance are always non-negative >>> > - Fixed inconsistency in LDA output and improved usability >>> > (MADLIB-1160, MADLIB-1201) >>> > - Fixed MLP and RF predict for models trained in earlier versions to >>> > ensure missing optional parameters are given appropriate default >>> values >>> > (MADLIB-1207) >>> > - Fixed a scenario in DT where no features exist due categorical >>> > columns with single level being dropped led to the database crashing >>> > - Fixed step size initialization in MLP based on learning rate >>> policy >>> > (MADLIB-1212) >>> > - Fixed PCA issue that leads to failure when grouping column is a >>> TEXT >>> > type (MADLIB-1215) >>> > - Fixed cat levels output in DT when grouping is enabled >>> (MADLIB-1218) >>> > - Fixed and simplified initialization of model coefficients in MLP >>> > - Removed source table dependency for predicting regression models >>> in >>> > MLP (MADLIB-1223) >>> > - Print loss of first iteration in MLP (MADLIB-1228) >>> > - Fixed MLP failure on GPDB 4.3 when verbose=True (MADLIB-1209) >>> > - Fixed RF issue that showed up when var_importance=True with no >>> > continuous features (MADLIB-1219) >>> > - Fixed DT/RF issue for null_as_category=True and grouping enabled >>> > (MADLIB-1217) >>> > >>> > Other: >>> > >>> > - Reduced install-check runtime for PCA, DT, RF, elastic net >>> > (MADLIB-1216) >>> > - Added CentOS 7 PostgreSQL 9.6/10 docker files >>> > >>> > For additional information, please see: >>> > https://cwiki.apache.org/confluence/display/MADLIB/MADlib+1.14 >>> > >>> > Here are the release artifact details: >>> > >>> > Source release tag to be voted on: rc/1.14-rc1, located here: >>> > https://git-wip-us.apache.org/repos/asf?p=madlib.git;a=tag; >>> > h=refs/tags/rc/1.14-rc1 >>> > >>> > Source release tarball can be retrieved from the following locations: >>> > >>> > Package: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-src.tar.gz >>> > PGP Signature: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-src.tar.gz.asc >>> > SHA512 Hash: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-src.tar.gz.sha512 >>> > >>> > Convenience binary packages can be retrieved from the following >>> > locations: >>> > >>> > macOS: 10.* PostgreSQL 9.6 & 10.2 >>> > >>> > Package: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-bin-Darwin.dmg >>> > PGP Signature: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-bin-Darwin.dmg.asc >>> > SHA512 Hash: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-bin-Darwin.dmg.sha512 >>> > >>> > CentOS* GPDB 4.3.5+ >>> > >>> > Package: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-bin-Linux-GPDB43.rpm >>> > PGP Signature: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-bin-Linux-GPDB43.rpm.asc >>> > SHA512 Hash: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-bin-Linux-GPDB43.rpm.sha512 >>> > >>> > CentOS 6 &* GPDB 5.3.0, PostgreSQL 9.6 & 10.2 >>> > >>> > Package: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-bin-Linux.rpm >>> > PGP Signature: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-bin-Linux.rpm.asc >>> > SHA512 Hash: >>> > https://dist.apache.org/repos/dist/dev/madlib/1.14-RC1/ >>> > apache-madlib-1.14-bin-Linux.rpm.sha512 >>> > >>> > The PGP KEYS file used to validate the signature of the release >>> artifacts >>> > is available here: >>> > https://dist.apache.org/repos/dist/dev/madlib/KEYS >>> > >>> > To help in tallying the vote, PMC members please be sure to indicate >>> > “(binding)” with the vote. >>> > >>> > [ ] +1 approve >>> > [ ] +0 no opinion >>> > [ ] -1 disapprove (and reason why) >>> > >>> > Regards, >>> > Jingyi Mei >>> > >>> > Pivotal R&D Advanced Analytics >>> > >>> > >>> >> >> > > > -- > Rashmi Raghu, Ph.D. > Pivotal Data Science >