[ 
https://issues.apache.org/jira/browse/SPARK-3434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161762#comment-14161762
 ] 

Ghousia Taj commented on SPARK-3434:
------------------------------------

Hi There,

We at Impetus Infotech are also working on a block matrix implementation. To 
start with, we are treating a matrix as an RDD of blocks. Each block includes 
a submatrix, a row range, and a column range. The work discussed in this JIRA 
closely connects with our current work on matrix operations. 
Your reference implementation would really help us progress faster. We are 
also looking forward to working in tandem with you all and contributing in this 
space.
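
A minimal sketch of the block layout described above, for illustration only. It is not the Impetus implementation and not an MLlib API: the names `Block`, `split_into_blocks`, and `assemble` are hypothetical, ranges are assumed to be half-open `[start, end)`, and a plain Python list stands in for the RDD of blocks so the example runs without Spark.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Block:
    """One tile of a larger matrix (hypothetical type, for illustration)."""
    row_range: Tuple[int, int]                 # [start, end) rows in the full matrix
    col_range: Tuple[int, int]                 # [start, end) columns in the full matrix
    submatrix: Tuple[Tuple[float, ...], ...]   # dense values, row-major


def split_into_blocks(matrix: List[List[float]],
                      block_rows: int,
                      block_cols: int) -> List[Block]:
    """Cut a dense matrix into blocks of at most block_rows x block_cols."""
    n_rows = len(matrix)
    n_cols = len(matrix[0]) if matrix else 0
    blocks = []
    for r0 in range(0, n_rows, block_rows):
        r1 = min(r0 + block_rows, n_rows)
        for c0 in range(0, n_cols, block_cols):
            c1 = min(c0 + block_cols, n_cols)
            sub = tuple(tuple(row[c0:c1]) for row in matrix[r0:r1])
            blocks.append(Block((r0, r1), (c0, c1), sub))
    return blocks


def assemble(blocks: List[Block], n_rows: int, n_cols: int) -> List[List[float]]:
    """Rebuild the dense matrix from its blocks, given in any order."""
    out = [[0.0] * n_cols for _ in range(n_rows)]
    for b in blocks:
        r0, _ = b.row_range
        c0, _ = b.col_range
        for i, row in enumerate(b.submatrix):
            for j, value in enumerate(row):
                out[r0 + i][c0 + j] = value
    return out
```

Because every block carries its own row and column range, the blocks can be stored and processed in any order, which is what makes a single unordered collection of blocks a workable representation.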

Many Thanks,
Ghousia. 

> Distributed block matrix
> ------------------------
>
>                 Key: SPARK-3434
>                 URL: https://issues.apache.org/jira/browse/SPARK-3434
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: Xiangrui Meng
>
> This JIRA is for discussing distributed matrices stored in block 
> sub-matrices. The main challenge is the partitioning scheme to allow adding 
> linear algebra operations in the future, e.g.:
> 1. matrix multiplication
> 2. matrix factorization (QR, LU, ...)
> Let's discuss the partitioning and storage and how they fit into the above 
> use cases.
> Questions:
> 1. Should it be backed by a single RDD that contains all of the sub-matrices 
> or by many RDDs, each containing only one sub-matrix?



