[ 
https://issues.apache.org/jira/browse/MAHOUT-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249309#comment-14249309
 ] 

ASF GitHub Bot commented on MAHOUT-1604:
----------------------------------------

Github user dlyubimov commented on a diff in the pull request:

    https://github.com/apache/mahout/pull/47#discussion_r21948535
  
    --- Diff: spark/pom.xml ---
    @@ -157,6 +157,27 @@
             </executions>
           </plugin>
     
    +      <!-- create job jar to include CLI driver deps-->
    +      <!-- leave this in even though there are no hadoop mapreduce jobs in 
this module -->
    +      <plugin>
    --- End diff --
    
    why does this keep getting added (even with comment "we don't use it????) I 
keep cleaning this out and it appears again. If it is not being used why it is 
here? It adds tons of time to compilation to build this jar and clogs artifact 
repo 


> Create a RowSimilarity for Spark
> --------------------------------
>
>                 Key: MAHOUT-1604
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1604
>             Project: Mahout
>          Issue Type: Bug
>          Components: CLI
>    Affects Versions: 1.0
>         Environment: Spark
>            Reporter: Pat Ferrel
>            Assignee: Pat Ferrel
>
> Using CooccurrenceAnalysis.cooccurrence create a driver that reads a text DRM 
> or two and produces LLR similarity/cross-similarity matrices.
> This will produce the same results as ItemSimilarity but take a Drm as input 
> instead of individual cells.
> The first version will only support LLR, other similarity measures will need 
> to be in separate Jiras



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to