[ 
https://issues.apache.org/jira/browse/MADLIB-1129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16145733#comment-16145733
 ] 

Himanshu Pandey commented on MADLIB-1129:
-----------------------------------------

Hi [~fmcquillan],

So in any case, are we keeping the number of columns in output tables same? 

For Eg: if output_neighbors is FALSE are we still showing the column with no 
data : 

{code}
postgres=# select * from madlib_knn_result_classification;
 id |  data   | prediction | k_nearest_neighbours 
----+---------+------------+----------------------
  1 | {2,1}   |          1 | 
  2 | {2,6}   |          1 | 
  3 | {15,40} |          0 | 
  4 | {12,1}  |          1 | 
  5 | {2,90}  |          0 | 
  6 | {50,45} |          0 | 
(6 rows)

{code}

Or we are hiding the column completely like this?

{code}
postgres=# select * from madlib_knn_result_classification;
 id |  data   | prediction 
----+---------+------------
  1 | {2,1}   |          1 
  2 | {2,6}   |          1  
  3 | {15,40} |          0 
  4 | {12,1}  |          1  
  5 | {2,90}  |          0  
  6 | {50,45} |          0  
(6 rows)

{code}

Same will be the case for prediction column when label_column_name is Null.

> Additional output information for k-NN
> --------------------------------------
>
>                 Key: MADLIB-1129
>                 URL: https://issues.apache.org/jira/browse/MADLIB-1129
>             Project: Apache MADlib
>          Issue Type: Improvement
>          Components: k-NN
>            Reporter: Frank McQuillan
>            Assignee: Himanshu Pandey
>            Priority: Minor
>              Labels: starter
>             Fix For: v2.0
>
>
> Follow on to
> https://issues.apache.org/jira/browse/MADLIB-927
> List the k-nearest neighbors that were used in the voting/averaging, sorted 
> in ASC order according to the distance function used.  This could be added to 
> the current output table.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to