Jingyi Mei created MADLIB-1213:
----------------------------------
Summary: Support logical expression for minibatch preprocessor
Key: MADLIB-1213
URL: https://issues.apache.org/jira/browse/MADLIB-1213
Project: Apache MADlib
Issue Type: Improvement
Components: Module: Utilities
Reporter: Jingyi Mei
Fix For: v1.14
The minibatch preprocessor currently does not support all logical expressions
for independent and dependent variables.
# Independent varname does not support any logical expression.
# Dependent varname only supports logical expressions on numerical columns.
For example, 'length > 1' is a valid expression, but creating an alias for it
is not supported.
This is the only form of expression that is currently supported:
{code}
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input',
'minibatch_preprocessing_out', 'y > 10', ' x1,x2', 4);
{code}
Not supported:
{code}
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input',
'minibatch_preprocessing_out', 'y > 10 as foo', 'x1,x2', 4);
{code}
{code}
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input',
'minibatch_preprocessing_out', 'y=''F''', 'x1,x2', 4);
{code}
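A possible interim workaround, sketched here and not verified against the preprocessor, is to materialize the logical expression as a named column in an intermediate table and pass that column name as the dependent variable. The table name minibatch_preprocessing_input_labeled and the column name foo are illustrative only, and it is an assumption that a boolean column is handled the same way as the supported 'y > 10' expression.
{code}
-- Hypothetical intermediate table: evaluate the expression once and name it.
-- For a text column, the same pattern would use (y = 'F') AS foo instead.
CREATE TABLE minibatch_preprocessing_input_labeled AS
SELECT x1, x2, (y > 10) AS foo
FROM minibatch_preprocessing_input;

-- Pass the aliased column name instead of the expression.
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input_labeled',
'minibatch_preprocessing_out', 'foo', 'x1,x2', 4);
{code}
This costs an extra copy of the input data, which is part of why direct support for aliases may still be worth considering.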
Open questions:
1. Do we need to support logical expressions for both independent and dependent
varname?
2. If yes, to what extent?
3. Should the user be allowed to create an alias for logical expressions?
4. Other modules may also only partially support logical expressions.
Should we find out which ones?