[ 
https://issues.apache.org/jira/browse/SPARK-21066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16055336#comment-16055336
 ] 

Yan Facai (颜发才) edited comment on SPARK-21066 at 6/20/17 8:27 AM:
------------------------------------------------------------------

[~sowen] I believe that the API has explained well in details.

 If unspecified or nonpositive, the number of features will be determined 
automatically at the cost of one additional pass.

The best way to solve the problem is to modify the misleading message of 
exception: to suggest user to specify `numFeatures`, rather than warn user to 
go away.


was (Author: facai):
[~sowen] I believe that the API has explained well in details.

 If unspecified or nonpositive, the number of features will be determined 
automatically at the cost of one additional pass.

> LibSVM load just one input file
> -------------------------------
>
>                 Key: SPARK-21066
>                 URL: https://issues.apache.org/jira/browse/SPARK-21066
>             Project: Spark
>          Issue Type: Bug
>          Components: ML
>    Affects Versions: 2.1.1
>            Reporter: darion yaphet
>
> Currently when we using SVM to train dataset we found the input files limit 
> only one .
> The file store on the Distributed File System such as HDFS is split into 
> mutil piece and I think this limit is not necessary .
>  We can join input paths into a string split with comma. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to