Orhan Kislal created MADLIB-1255:
------------------------------------

             Summary: MLP: NaN loss for some hyperparam settings
                 Key: MADLIB-1255
                 URL: https://issues.apache.org/jira/browse/MADLIB-1255
             Project: Apache MADlib
          Issue Type: Bug
          Components: Module: Neural Networks
            Reporter: Orhan Kislal


w/ [~njayaram] For the boston dataset (duplicated for multiple groups testing) 
the following query produces NaN for the loss.
{code:java}
SELECT setseed(0);
        DROP TABLE IF EXISTS temp3;
        DROP TABLE IF EXISTS temp3_summary;
        DROP TABLE IF EXISTS temp3_standardization;
        SELECT madlib.mlp_regression(
          'madlibtestdata.boston_grouping'::varchar,
          'temp3'::varchar,
          'ARRAY[crim, zn, indus, chas, nox, rm, age, dis, rad, tax, ptratio, 
b, lstat]'::varchar,
          'medv'::varchar,
          ARRAY[100]::integer[],
          'learning_rate_init=0.0025, lambda=0.00001, 
learning_rate_policy=step, gamma=0.8, iterations_per_step=250, 
n_iterations=1500, tolerance=0, momentum=0'::varchar,
          'tanh'::varchar,
          NULL,
          False,
          False,
          'grp_col'
          );
        SELECT loss FROM temp3 WHERE grp_col=2;
{code}
Dataset: [https://www.cs.toronto.edu/~delve/data/boston/bostonDetail.html]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to