[jira] [Updated] (SPARK-6177) Add note in LDA example to remind possible coalesce

2015-03-10 Thread Sean Owen (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen updated SPARK-6177:
-
Priority: Trivial  (was: Minor)
Assignee: yuhao yang

 Add note in LDA example to remind possible coalesce 
 

 Key: SPARK-6177
 URL: https://issues.apache.org/jira/browse/SPARK-6177
 Project: Spark
  Issue Type: Improvement
  Components: Examples, MLlib
Affects Versions: 1.2.1
Reporter: yuhao yang
Assignee: yuhao yang
Priority: Trivial
 Fix For: 1.4.0

   Original Estimate: 1h
  Remaining Estimate: 1h

 Add a comment to the LDA example recommending coalesce, to avoid the 
 potentially very large number of partitions produced by sc.textFile.
 sc.textFile creates an RDD with at least one partition per input file, and 
 a very large number of partitions degrades LDA performance.
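
 The suggestion above can be sketched as follows. This is a minimal 
 illustration in PySpark, not the comment or code that was actually added 
 to the LDA example. The helper names (target_partitions, load_corpus) and 
 the default cap of 64 are hypothetical; sc is assumed to be an existing 
 SparkContext. Only textFile, getNumPartitions and coalesce are Spark calls.

```python
def target_partitions(current, max_partitions):
    """Return the partition count to coalesce to: never more than
    max_partitions, never more than the current count, and at least 1."""
    return max(1, min(current, max_partitions))


def load_corpus(sc, path, max_partitions=64):
    """Read text files and cap the number of partitions.

    sc is an existing SparkContext; path and max_partitions are
    hypothetical values chosen for illustration.
    """
    rdd = sc.textFile(path)  # at least one partition per input file
    current = rdd.getNumPartitions()
    wanted = target_partitions(current, max_partitions)
    # coalesce merges existing partitions without a full shuffle;
    # skip the call when the RDD is already small enough.
    return rdd.coalesce(wanted) if wanted < current else rdd
```

 A directory of 10,000 small files would otherwise yield at least 10,000 
 partitions; with the cap above the RDD handed to LDA has 64. A suitable cap 
 depends on the cluster and data size, so the value here is only a 
 placeholder.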



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-6177) Add note for

2015-03-09 Thread yuhao yang (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

yuhao yang updated SPARK-6177:
--
Summary: Add note for   (was: LDA should check partitions size of the input)

 Add note for 
 -

 Key: SPARK-6177
 URL: https://issues.apache.org/jira/browse/SPARK-6177
 Project: Spark
  Issue Type: Improvement
  Components: Examples, MLlib
Affects Versions: 1.2.1
Reporter: yuhao yang
Priority: Minor
   Original Estimate: 1h
  Remaining Estimate: 1h

 Add a comment to the LDA example recommending coalesce, to avoid the 
 potentially very large number of partitions produced by sc.textFile.
 sc.textFile creates an RDD with at least one partition per input file, and 
 a very large number of partitions degrades LDA performance.






[jira] [Updated] (SPARK-6177) Add note in LDA example to remind possible coalesce

2015-03-09 Thread yuhao yang (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

yuhao yang updated SPARK-6177:
--
Summary: Add note in LDA example to remind possible coalesce   (was: Add note for )

 Add note in LDA example to remind possible coalesce 
 

 Key: SPARK-6177
 URL: https://issues.apache.org/jira/browse/SPARK-6177
 Project: Spark
  Issue Type: Improvement
  Components: Examples, MLlib
Affects Versions: 1.2.1
Reporter: yuhao yang
Priority: Minor
   Original Estimate: 1h
  Remaining Estimate: 1h

 Add a comment to the LDA example recommending coalesce, to avoid the 
 potentially very large number of partitions produced by sc.textFile.
 sc.textFile creates an RDD with at least one partition per input file, and 
 a very large number of partitions degrades LDA performance.


