[ 
https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15952377#comment-15952377
 ] 

yuhao yang edited comment on SPARK-20180 at 4/1/17 8:14 PM:
------------------------------------------------------------

I assume user can achieve the same effect by setting maxPatternlength to a 
larger value. So the jira is really about changing the default behavior of 
PrefixSpan. 
Is there more background or context available, like why the current default 
length(10) is not good in practice? Thanks. We need to also consider the 
performance for larger dataset (in count and dimension).


was (Author: yuhaoyan):
I assume user can achieve the same effect by setting maxPatternlength to a 
larger value. So the jira is really about changing the default behavior of 
PrefixSpan. 
Is there more background or context available, like why the current default 
length(10) is not good in practice? Thanks.

> Unlimited max pattern length in Prefix span
> -------------------------------------------
>
>                 Key: SPARK-20180
>                 URL: https://issues.apache.org/jira/browse/SPARK-20180
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>    Affects Versions: 2.1.0
>            Reporter: Cyril de Vogelaere
>            Priority: Minor
>   Original Estimate: 0h
>  Remaining Estimate: 0h
>
> Right now, we need to use .setMaxPatternLength() method to
> specify is the maximum pattern length of a sequence. Any pattern longer than 
> that won't be outputted.
> The current default maxPatternlength value being 10.
> This should be changed so that with input 0, all pattern of any length would 
> be outputted. Additionally, the default value should be changed to 0, so that 
> a new user could find all patterns in his dataset without looking at this 
> parameter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to