[
https://issues.apache.org/jira/browse/MADLIB-1129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16145733#comment-16145733
]
Himanshu Pandey commented on MADLIB-1129:
-----------------------------------------
Hi [~fmcquillan],
In every case, are we keeping the number of columns in the output table the
same? For example, if output_neighbors is FALSE, do we still show the
k_nearest_neighbours column with no data:
{code}
postgres=# select * from madlib_knn_result_classification;
 id |  data   | prediction | k_nearest_neighbours
----+---------+------------+----------------------
  1 | {2,1}   |          1 |
  2 | {2,6}   |          1 |
  3 | {15,40} |          0 |
  4 | {12,1}  |          1 |
  5 | {2,90}  |          0 |
  6 | {50,45} |          0 |
(6 rows)
{code}
Or are we hiding the column completely, like this?
{code}
postgres=# select * from madlib_knn_result_classification;
 id |  data   | prediction
----+---------+------------
  1 | {2,1}   |          1
  2 | {2,6}   |          1
  3 | {15,40} |          0
  4 | {12,1}  |          1
  5 | {2,90}  |          0
  6 | {50,45} |          0
(6 rows)
{code}
The same question applies to the prediction column when label_column_name is
NULL.
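To make the second option concrete, here is a minimal Python sketch of how the
output column list could be chosen. The helper name knn_output_columns is
hypothetical and is not existing MADlib code; it only illustrates dropping a
column when it would carry no data, based on the output_neighbors and
label_column_name parameters mentioned above.

```python
def knn_output_columns(output_neighbors, label_column_name):
    """Return the column names for the k-NN output table.

    Hypothetical helper for discussion only, not part of MADlib.
    """
    # id and data appear in both example tables above.
    columns = ["id", "data"]
    # Omit the prediction column when no label column was supplied.
    if label_column_name is not None:
        columns.append("prediction")
    # Omit the neighbours column unless the caller asked for it.
    if output_neighbors:
        columns.append("k_nearest_neighbours")
    return columns
```

With the first option the list would always contain all four names, and the
unused columns would simply hold NULLs.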
> Additional output information for k-NN
> --------------------------------------
>
> Key: MADLIB-1129
> URL: https://issues.apache.org/jira/browse/MADLIB-1129
> Project: Apache MADlib
> Issue Type: Improvement
> Components: k-NN
> Reporter: Frank McQuillan
> Assignee: Himanshu Pandey
> Priority: Minor
> Labels: starter
> Fix For: v2.0
>
>
> Follow on to
> https://issues.apache.org/jira/browse/MADLIB-927
> List the k-nearest neighbors that were used in the voting/averaging, sorted
> in ASC order according to the distance function used. This could be added to
> the current output table.