[ https://issues.apache.org/jira/browse/SYSTEMML-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15904456#comment-15904456 ]
Mike Dusenberry commented on SYSTEMML-1384: ------------------------------------------- Yeah this is a fair thing to investigate. Just to be clear, the {{conv}} biases are column vectors, but the {{affine}}, {{rnn}}, and {{lstm}} are all row vectors. > Revisit the weight and bias of fully connected layer > ---------------------------------------------------- > > Key: SYSTEMML-1384 > URL: https://issues.apache.org/jira/browse/SYSTEMML-1384 > Project: SystemML > Issue Type: Sub-task > Reporter: Niketan Pansare > > Since all our bias are column vector (which is consistent with Keras/Caffe), > whereas bias of fully connected layer is a row-vector. Similarly, the weight > that is passed to caffe is transpose of weights passed to SystemML (since > both store in row-major NCHW format). > Making the dimensions consistent will simplify loading of Caffe/Keras models. > [~mwdus...@us.ibm.com] -- This message was sent by Atlassian JIRA (v6.3.15#6346)