[jira] [Created] (MAHOUT-2213) Add Braket back end

2024-07-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2213:


 Summary: Add Braket back end
 Key: MAHOUT-2213
 URL: https://issues.apache.org/jira/browse/MAHOUT-2213
 Project: Mahout
  Issue Type: New Feature
  Components: qumat
Reporter: Andrew Musselman
Assignee: Trevor Grant






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (MAHOUT-2211) Add roadmap to README

2024-04-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2211:


 Summary: Add roadmap to README
 Key: MAHOUT-2211
 URL: https://issues.apache.org/jira/browse/MAHOUT-2211
 Project: Mahout
  Issue Type: Improvement
Reporter: Andrew Musselman


Roadmap
* Q2
  * Qumat with hardened (tests, docs, CI/CD) cirq and qiskit backends
  * Kernel methods started
  * Submit public talk about Qumat
  * Classic in maintenance mode
* Q3 and beyond
  * Amazon Braket
  * Distributed quantum solvers





[jira] [Created] (MAHOUT-2210) Docs

2024-04-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2210:


 Summary: Docs
 Key: MAHOUT-2210
 URL: https://issues.apache.org/jira/browse/MAHOUT-2210
 Project: Mahout
  Issue Type: Sub-task
  Components: qumat
Reporter: Andrew Musselman








[jira] [Created] (MAHOUT-2209) Draw method

2024-04-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2209:


 Summary: Draw method
 Key: MAHOUT-2209
 URL: https://issues.apache.org/jira/browse/MAHOUT-2209
 Project: Mahout
  Issue Type: Sub-task
  Components: qumat
Reporter: Andrew Musselman








[jira] [Created] (MAHOUT-2208) Measure method

2024-04-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2208:


 Summary: Measure method
 Key: MAHOUT-2208
 URL: https://issues.apache.org/jira/browse/MAHOUT-2208
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman








[jira] [Created] (MAHOUT-2206) Next steps for qiskit and cirq back ends

2024-03-27 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2206:


 Summary: Next steps for qiskit and cirq back ends
 Key: MAHOUT-2206
 URL: https://issues.apache.org/jira/browse/MAHOUT-2206
 Project: Mahout
  Issue Type: New Feature
  Components: qumat
Reporter: Andrew Musselman
Assignee: Andrew Musselman


implementing execute on qiskit and cirq
examples
docs
tests (requires testing framework)





[jira] [Created] (MAHOUT-2205) Moving to GitHub Issues instead of JIRA

2024-03-27 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2205:


 Summary: Moving to GitHub Issues instead of JIRA
 Key: MAHOUT-2205
 URL: https://issues.apache.org/jira/browse/MAHOUT-2205
 Project: Mahout
  Issue Type: Task
Reporter: Andrew Musselman
Assignee: Andrew Musselman








[jira] [Created] (MAHOUT-2204) Splitting Mahout project code into discrete repos

2024-03-27 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2204:


 Summary: Splitting Mahout project code into discrete repos
 Key: MAHOUT-2204
 URL: https://issues.apache.org/jira/browse/MAHOUT-2204
 Project: Mahout
  Issue Type: Task
Reporter: Andrew Musselman
Assignee: Andrew Musselman


Suggestions:
mahout-website
mahout-classic
mahout-samsara
mahout-qumat





[jira] [Created] (MAHOUT-2177) Book club hook

2024-02-14 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2177:


 Summary: Book club hook
 Key: MAHOUT-2177
 URL: https://issues.apache.org/jira/browse/MAHOUT-2177
 Project: Mahout
  Issue Type: Documentation
  Components: website
Reporter: Andrew Musselman


Use the paper summarizer to publish articles to website

* Decide where to publish
* Possibly an RSS feed





[jira] [Created] (MAHOUT-2175) Create new `main` branch

2024-01-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2175:


 Summary: Create new `main` branch
 Key: MAHOUT-2175
 URL: https://issues.apache.org/jira/browse/MAHOUT-2175
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman


Create a new `main` branch off `trunk`, clear out all old materials, and bring 
in new poc work from github.com/rawkintrevo/qumat.





[jira] [Created] (MAHOUT-2174) Packaging

2024-01-10 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2174:


 Summary: Packaging
 Key: MAHOUT-2174
 URL: https://issues.apache.org/jira/browse/MAHOUT-2174
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman


Pip packaging, etc.

Poetry, environment mgmt, etc.





[jira] [Created] (MAHOUT-2173) Determine whether we can ship outputs from one library to the plotting capabilities of another

2023-12-06 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2173:


 Summary: Determine whether we can ship outputs from one library to 
the plotting capabilities of another
 Key: MAHOUT-2173
 URL: https://issues.apache.org/jira/browse/MAHOUT-2173
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman








[jira] [Created] (MAHOUT-2172) Look at whether to re-use any samsara work for this

2023-12-06 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2172:


 Summary: Look at whether to re-use any samsara work for this
 Key: MAHOUT-2172
 URL: https://issues.apache.org/jira/browse/MAHOUT-2172
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman








[jira] [Created] (MAHOUT-2171) Think about how to represent imaginary numbers

2023-12-06 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2171:


 Summary: Think about how to represent imaginary numbers
 Key: MAHOUT-2171
 URL: https://issues.apache.org/jira/browse/MAHOUT-2171
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman








[jira] [Created] (MAHOUT-2170) Determine what can be stripped down to "just matrices"

2023-12-06 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2170:


 Summary: Determine what can be stripped down to "just matrices"
 Key: MAHOUT-2170
 URL: https://issues.apache.org/jira/browse/MAHOUT-2170
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman








[jira] [Created] (MAHOUT-2169) Identify the data structures that are behind all the math and operation objects in all the libraries

2023-12-06 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2169:


 Summary: Identify the data structures that are behind all the math 
and operation objects in all the libraries
 Key: MAHOUT-2169
 URL: https://issues.apache.org/jira/browse/MAHOUT-2169
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman








[jira] [Created] (MAHOUT-2168) Make a Google Sheet for comparing API methods and objects across all the libraries

2023-12-06 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2168:


 Summary: Make a Google Sheet for comparing API methods and objects 
across all the libraries
 Key: MAHOUT-2168
 URL: https://issues.apache.org/jira/browse/MAHOUT-2168
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman








[jira] [Created] (MAHOUT-2167) Web site 404s

2023-12-05 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2167:


 Summary: Web site 404s
 Key: MAHOUT-2167
 URL: https://issues.apache.org/jira/browse/MAHOUT-2167
 Project: Mahout
  Issue Type: Improvement
Reporter: Andrew Musselman


We have lots of broken links on the site.

An overall refresh is in order, but for now let's just remove or fix any broken 
links.
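
One way to find the broken links could look like the sketch below. It is only an illustration using the Python standard library, not an existing Mahout tool; the names `extract_links`, `is_alive`, and `broken_links` are made up for this example, and only absolute URLs are handled (relative links are reported as broken).

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html):
    """Return all link targets found in an HTML string, in document order."""
    collector = LinkCollector()
    collector.feed(html)
    return collector.links


def is_alive(url, timeout=10):
    """Return True if a HEAD request to the URL succeeds (status below 400)."""
    try:
        with urlopen(Request(url, method="HEAD"), timeout=timeout) as resp:
            return resp.status < 400
    except (OSError, ValueError):
        # OSError covers HTTP errors, DNS failures, and timeouts;
        # ValueError is raised for URLs without a scheme (relative links).
        return False


def broken_links(html, check=is_alive):
    """Return the links in the page for which the check fails."""
    return [url for url in extract_links(html) if not check(url)]
```

Passing a different `check` function makes it possible to try the logic without touching the network.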





[jira] [Created] (MAHOUT-2166) Proof of concept of quantum front-end

2023-11-29 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2166:


 Summary: Proof of concept of quantum front-end
 Key: MAHOUT-2166
 URL: https://issues.apache.org/jira/browse/MAHOUT-2166
 Project: Mahout
  Issue Type: New Feature
  Components: Quantum
Affects Versions: 14.2
Reporter: Andrew Musselman


Write up a working proof of concept for a new Python-based `mahout-q` component 
that will perform some basic quantum operations.

E.g.:
* NOT Gate
* Hadamard Gate
* CNOT Gate
* Toffoli Gate
* SWAP Gate
* Pauli X Gate (X-Gate)
* Pauli Y Gate (Y-Gate)
* Pauli Z Gate (Z-Gate)

It would be nice to have the ability to pass off commands to any of the major 
libraries or compute platforms.
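
For reference, the gates listed above can be written as small matrices acting on state vectors. The following is a minimal sketch in plain Python, assuming a simple state-vector representation; it is not the `mahout-q` API, and the helper names `apply` and `probabilities` are made up for illustration.

```python
import math

S = 1 / math.sqrt(2)

# Single-qubit gates as 2x2 matrices (basis order |0>, |1>).
X = [[0, 1], [1, 0]]        # Pauli X, which is also the NOT gate
Y = [[0, -1j], [1j, 0]]     # Pauli Y
Z = [[1, 0], [0, -1]]       # Pauli Z
H = [[S, S], [S, -S]]       # Hadamard

# Two-qubit gates as 4x4 matrices (basis order |00>, |01>, |10>, |11>).
CNOT = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]
SWAP = [[1, 0, 0, 0], [0, 0, 1, 0], [0, 1, 0, 0], [0, 0, 0, 1]]

# Toffoli: the 8x8 identity with the rows for |110> and |111> exchanged.
TOFFOLI = [[1 if i == j else 0 for j in range(8)] for i in range(8)]
TOFFOLI[6], TOFFOLI[7] = TOFFOLI[7], TOFFOLI[6]


def apply(gate, state):
    """Multiply a gate matrix by a state vector."""
    return [sum(g * s for g, s in zip(row, state)) for row in gate]


def probabilities(state):
    """Measurement probabilities for each basis state."""
    return [abs(amplitude) ** 2 for amplitude in state]


zero = [1, 0]
one = apply(X, zero)     # NOT flips |0> to |1>
plus = apply(H, zero)    # equal superposition of |0> and |1>
```

Python's built-in `complex` type covers the imaginary entries of the Y gate here; a back end would translate these operations into its own circuit objects instead of multiplying matrices directly.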





[jira] [Created] (MAHOUT-2165) Docker image for web site build

2023-03-29 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2165:


 Summary: Docker image for web site build
 Key: MAHOUT-2165
 URL: https://issues.apache.org/jira/browse/MAHOUT-2165
 Project: Mahout
  Issue Type: Improvement
  Components: website
Reporter: Andrew Musselman
Assignee: Trevor Grant


Create a Docker image for updating web site





[jira] [Created] (MAHOUT-2164) Fix license issues for RAT

2023-02-27 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2164:


 Summary: Fix license issues for RAT
 Key: MAHOUT-2164
 URL: https://issues.apache.org/jira/browse/MAHOUT-2164
 Project: Mahout
  Issue Type: Dependency
  Components: build
Reporter: Andrew Musselman


Revisit license headers in json dir for rat errors per:

https://github.com/apache/mahout/pull/427





[jira] [Created] (MAHOUT-2162) Dependabot security PR

2023-02-27 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2162:


 Summary: Dependabot security PR
 Key: MAHOUT-2162
 URL: https://issues.apache.org/jira/browse/MAHOUT-2162
 Project: Mahout
  Issue Type: Dependency
Reporter: Andrew Musselman


https://github.com/apache/mahout/pull/417

Need to assess how important this is, decide to hard deprecate Hadoop 
MapReduce-based jobs, etc.





[jira] [Created] (MAHOUT-2163) Dependabot security PR #2

2023-02-27 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2163:


 Summary: Dependabot security PR #2
 Key: MAHOUT-2163
 URL: https://issues.apache.org/jira/browse/MAHOUT-2163
 Project: Mahout
  Issue Type: Dependency
Reporter: Andrew Musselman


https://github.com/apache/mahout/pull/417

Need to assess how important this is, decide to hard deprecate Hadoop 
MapReduce-based jobs, etc.





[jira] [Created] (MAHOUT-2161) Fix broken ASF link in Sidebar

2023-02-23 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2161:


 Summary: Fix broken ASF link in Sidebar
 Key: MAHOUT-2161
 URL: https://issues.apache.org/jira/browse/MAHOUT-2161
 Project: Mahout
  Issue Type: Improvement
  Components: website
Reporter: Andrew Musselman
 Attachments: Screen Shot 2023-02-23 at 1.29.22 PM.png

Broken link to ASF





[jira] [Created] (MAHOUT-2159) Add Docker instructions to nav

2023-02-22 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2159:


 Summary: Add Docker instructions to nav
 Key: MAHOUT-2159
 URL: https://issues.apache.org/jira/browse/MAHOUT-2159
 Project: Mahout
  Issue Type: Task
  Components: website
Reporter: Andrew Musselman


Add reference to 
https://mahout.apache.org/docs/latest/tutorials/misc/getting-started-with-zeppelin/
 for people to get started.





[jira] [Created] (MAHOUT-2158) Edit website

2023-02-22 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2158:


 Summary: Edit website
 Key: MAHOUT-2158
 URL: https://issues.apache.org/jira/browse/MAHOUT-2158
 Project: Mahout
  Issue Type: Task
  Components: website
Reporter: Andrew Musselman


Do a scrub on pages and language on the website, remove unneeded material and 
consolidate where viable.





[jira] [Created] (MAHOUT-2157) Add "first ticket" tag or type in JIRA

2023-02-22 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2157:


 Summary: Add "first ticket" tag or type in JIRA
 Key: MAHOUT-2157
 URL: https://issues.apache.org/jira/browse/MAHOUT-2157
 Project: Mahout
  Issue Type: Task
  Components: jira
Reporter: Andrew Musselman


Find or create "good first ticket" type for issues in JIRA





[jira] [Created] (MAHOUT-2156) Add "how to do your first JIRA" notes

2023-02-22 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2156:


 Summary: Add "how to do your first JIRA" notes
 Key: MAHOUT-2156
 URL: https://issues.apache.org/jira/browse/MAHOUT-2156
 Project: Mahout
  Issue Type: Task
  Components: website
Reporter: Andrew Musselman


Add some language for new contributors in 
https://mahout.apache.org/developers/how-to-contribute about how to grab a 
ticket and identify manageable first tasks.





[jira] [Created] (MAHOUT-2154) Update NOTICE

2023-02-10 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2154:


 Summary: Update NOTICE
 Key: MAHOUT-2154
 URL: https://issues.apache.org/jira/browse/MAHOUT-2154
 Project: Mahout
  Issue Type: Task
Reporter: Andrew Musselman


Our NOTICE is out of date. I know there are things like viennacl and others 
that we are making use of that aren't listed.





[jira] [Created] (MAHOUT-2153) Download page improvements

2023-02-09 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2153:


 Summary: Download page improvements
 Key: MAHOUT-2153
 URL: https://issues.apache.org/jira/browse/MAHOUT-2153
 Project: Mahout
  Issue Type: Improvement
  Components: website
Reporter: Andrew Musselman


The Announce (annou...@apache.org) list has rejected our release announcements 
previously due to the following non-compliance issues:
 
See [https://infra.apache.org/release-download-pages.html#download-page] for 
more info on what's required.
 
{quote}Hi! This is the ezmlm program. I'm managing the
[annou...@apache.org|mailto:annou...@apache.org] mailing list.

I'm sorry, your message (enclosed) was not accepted by the moderator.
If the moderator has made any comments, they are shown below.

The announce message currently links to

[http://www.apache.org/dist/mahout/0.14.0]

However, downloads must use the ASF mirrors for all but the KEYS, sigs and
hashes.

The download page
[https://mahout.apache.org/general/downloads.html]
has the same problem.

It also links to the git repo, which includes code that has not been
formally released.
Download pages must only link to released code.
[Pointers to source code repos should be on developer-oriented pages only]

Please have a look at the download pages for Tomcat or Httpd for examples of
how to do it.{quote}





[jira] [Created] (MAHOUT-2152) Web site check

2023-02-09 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2152:


 Summary: Web site check
 Key: MAHOUT-2152
 URL: https://issues.apache.org/jira/browse/MAHOUT-2152
 Project: Mahout
  Issue Type: Improvement
  Components: website
Reporter: Andrew Musselman


There are several low-hanging items we could fix on the web site to conform 
with ASF expectations; see https://whimsy.apache.org/site/project/mahout





[jira] [Created] (MAHOUT-2150) Mailing lists link broken on website

2023-02-06 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2150:


 Summary: Mailing lists link broken on website
 Key: MAHOUT-2150
 URL: https://issues.apache.org/jira/browse/MAHOUT-2150
 Project: Mahout
  Issue Type: Documentation
Reporter: Andrew Musselman


From [https://mahout.apache.org/developers/how-to-contribute]

[https://mahout.apache.org/general/mailing-lists,-irc-and-archives.html] is a 
404





[jira] [Created] (MAHOUT-2149) Migrate off Travis

2023-02-06 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2149:


 Summary: Migrate off Travis
 Key: MAHOUT-2149
 URL: https://issues.apache.org/jira/browse/MAHOUT-2149
 Project: Mahout
  Issue Type: Improvement
  Components: build
Reporter: Andrew Musselman


INFRA would like us to move off Travis and onto GitHub Actions, Jenkins, or 
Buildbot. Jenkins and Buildbot are in-house.





[jira] [Created] (MAHOUT-2146) Web site not publishing

2022-11-28 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2146:


 Summary: Web site not publishing
 Key: MAHOUT-2146
 URL: https://issues.apache.org/jira/browse/MAHOUT-2146
 Project: Mahout
  Issue Type: Bug
  Components: website
Reporter: Andrew Musselman


I updated the markdown for [https://mahout.apache.org/general/who-we-are]  and 
tried two ways to trigger a website publish:

(1) Bare commit

(2) Pull request

But the changes aren't showing up on the live site.

Source change: 
[https://github.com/apache/mahout/commit/a27628c3fadde7a9c57fc85cff01cdd455c5d45b]

See website/general/who-we-are.md for edited version.





[jira] [Created] (MAHOUT-2145) Research Ethereum Data Storage

2022-01-26 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2145:


 Summary: Research Ethereum Data Storage
 Key: MAHOUT-2145
 URL: https://issues.apache.org/jira/browse/MAHOUT-2145
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman


Research feasibility and usefulness of:
 # Pulling down an entire chain
 # Sampling a chain
 # Parsing and projecting useful fields, discarding unused data
 # Selecting date range
 # Filtering to just fetch specific types of blocks, such as ERC-20 tokens
 # Storing locally or persistently (e.g., on S3)
 # Storing a small example data file in source control with the project



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (MAHOUT-2144) Research Ethereum Indexer

2022-01-26 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2144:


 Summary: Research Ethereum Indexer
 Key: MAHOUT-2144
 URL: https://issues.apache.org/jira/browse/MAHOUT-2144
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman


Research and write proof of concept search indexer for ethereum-compatible 
network data files.





[jira] [Created] (MAHOUT-2143) Research Ethereum Parser

2022-01-26 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2143:


 Summary: Research Ethereum Parser
 Key: MAHOUT-2143
 URL: https://issues.apache.org/jira/browse/MAHOUT-2143
 Project: Mahout
  Issue Type: Sub-task
Reporter: Andrew Musselman


Research and write proof of concept parser for ethereum-compatible networks.





[jira] [Created] (MAHOUT-2142) Discussion and planning epic for adding blockchain data sources and analytics use cases

2022-01-07 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2142:


 Summary: Discussion and planning epic for adding blockchain data 
sources and analytics use cases
 Key: MAHOUT-2142
 URL: https://issues.apache.org/jira/browse/MAHOUT-2142
 Project: Mahout
  Issue Type: Epic
Reporter: Andrew Musselman
Assignee: Andrew Musselman


*About*

Proposal is to provide a new data source, namely any number of 
ethereum-compatible ledgers, and pick a few compelling use cases to build out 
this year.

We will add children to this epic for specific work items.

*Example Use Cases*
 # Search-indexes of given ledgers
 # Computed similarity to other accounts on the same ledger based on activity 
history
 # Time-series analysis of gas (transaction) fees across multiple ledgers
 # Time-series analysis of transactions (overall # per week/month/year/custom 
period, by user account etc.) for a list of ledgers. (Comparative analysis of 
usage)
 # Max/Min range of transactions for different ledgers
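
As an illustration of the weekly transaction counts in the fourth use case, here is a minimal sketch. It assumes the transaction timestamps have already been read out of a ledger as Unix epoch seconds; the function name `transactions_per_week` is hypothetical and no ledger library is involved.

```python
from collections import Counter
from datetime import datetime, timezone


def transactions_per_week(timestamps):
    """Count transactions per ISO week, keyed by (ISO year, ISO week number)."""
    counts = Counter()
    for ts in timestamps:
        iso = datetime.fromtimestamp(ts, tz=timezone.utc).isocalendar()
        counts[(iso[0], iso[1])] += 1
    # Return the weeks in chronological order.
    return dict(sorted(counts.items()))
```

Running the same function over several ledgers and comparing the resulting dictionaries would give the comparative usage view described above.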

 
*How to Get Started*
To explore ledger operations and data, get a copy of go-ethereum (geth: 
[https://geth.ethereum.org/docs/install-and-build/installing-geth]) and run it 
against a network to get all historical records. The Goerli test network's 
entire three years of data is only 32GB, so there are small enough data sets to 
play with, and the data files are stored on your local disk, by default in 
~/.ethereum.
 
There are libraries that interact live with any given ledger including Web3JS 
([https://web3js.readthedocs.io/en/v1.5.2/]) and Web3.py 
([https://web3py.readthedocs.io/en/stable/]), so reading out of ledgers is 
simple.
 
Reading and indexing the actual data might mean writing custom parsers for 
Mahout and Lucene, and possibly getting into decompiling bytecode back into 
readable Solidity code, so there are pieces we would need to plan out.





[jira] [Created] (MAHOUT-2141) Discussion and planning epic for adding blockchain data sources and analytics use cases

2022-01-07 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2141:


 Summary: Discussion and planning epic for adding blockchain data 
sources and analytics use cases
 Key: MAHOUT-2141
 URL: https://issues.apache.org/jira/browse/MAHOUT-2141
 Project: Mahout
  Issue Type: Epic
Reporter: Andrew Musselman
Assignee: Andrew Musselman


*About*

Discussion point for adding ethereum-compatible blockchains as data sources and 
some pertinent use cases.

We will add stories as children to this epic.

Proposal is to use ethereum-compatible ledgers as they adhere to the same 
standard for tokens 
([https://ethereum.org/en/developers/docs/standards/tokens]),
 for instance for smart contracts 
([https://ethereum.org/en/developers/docs/smart-contracts]).

*How to Get Started*
To explore concepts and data, get a copy of go-ethereum (geth: 
[https://geth.ethereum.org/docs/install-and-build/installing-geth]). Run it 
against a network and it will grab historical records. The Goerli test 
network's entire three years of data is only 32GB, so there are small enough 
data sets to play with, and the data files are stored on your local disk, by 
default in ~/.ethereum.
 
There are libraries that interact live with any given ledger including Web3JS 
([https://web3js.readthedocs.io/en/v1.5.2/]) and Web3.py 
([https://web3py.readthedocs.io/en/stable/]), so reading out of ledgers is 
simple.
 
Reading and indexing the actual data might mean writing custom parsers for 
Mahout and Lucene, and possibly getting into decompiling bytecode back into 
readable Solidity code.
 
*Some Starter Discussions*
 * Is there a place to persist each ledger we use as a data source, or would 
this pull down the ledger data every time for a new instance?
 * Should we build a live demo of this to run on the mahout.a.o web site?

 

*Example Use Cases*
 # Search-indexes of given ledgers
 # Computed similarity to other accounts on the same ledger based on activity 
history
 # Time-series analysis of gas (transaction) fees across multiple ledgers
 # Time-series analysis of transactions (overall # per week/month/year/custom 
period, by user account etc.) for a list of ledgers. (Comparative analysis of 
usage)
 # Max/Min range of transactions for different ledgers





[jira] [Created] (MAHOUT-2131) Commit hook for website

2020-10-16 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2131:


 Summary: Commit hook for website
 Key: MAHOUT-2131
 URL: https://issues.apache.org/jira/browse/MAHOUT-2131
 Project: Mahout
  Issue Type: Task
Affects Versions: 14.1
Reporter: Andrew Musselman
Assignee: Trevor Grant


Turn off website build on pull request



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (MAHOUT-2130) Add talks page to website

2020-10-16 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2130:


 Summary: Add talks page to website
 Key: MAHOUT-2130
 URL: https://issues.apache.org/jira/browse/MAHOUT-2130
 Project: Mahout
  Issue Type: Task
Affects Versions: 14.1
Reporter: Andrew Musselman


Add page with talks and slides, add posts in the news section to them





[jira] [Created] (MAHOUT-2129) Rebuild docs and add to website nav

2020-10-16 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2129:


 Summary: Rebuild docs and add to website nav
 Key: MAHOUT-2129
 URL: https://issues.apache.org/jira/browse/MAHOUT-2129
 Project: Mahout
  Issue Type: Task
Affects Versions: 14.1
Reporter: Andrew Musselman


Nav still has 0.13.0 docs only





[jira] [Created] (MAHOUT-2128) Fix download page

2020-10-16 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2128:


 Summary: Fix download page
 Key: MAHOUT-2128
 URL: https://issues.apache.org/jira/browse/MAHOUT-2128
 Project: Mahout
  Issue Type: Task
Reporter: Andrew Musselman


From: announce-ow...@apache.org
Date: Thu, Oct 8, 12:41 PM (8 days ago)
To: akm

 Hi! This is the ezmlm program. I'm managing the
 [annou...@apache.org|mailto:annou...@apache.org] mailing list.
 
 I'm sorry, your message (enclosed) was not accepted by the moderator.
 If the moderator has made any comments, they are shown below.
 
 I'm afraid this announcement is not valid, as the download links must be to a 
project-specific download page that supports the mirror system.
 
 Specifically, the announcement must not include "download the release 
artifacts and signatures from [https://downloads.apache.org/mahout/14.1/]"
 
 Further in the announcement is a proper link "Download the release artifacts 
and signatures at [https://mahout.apache.org/general/downloads.html]"
 
 But the downloads.html does not support the mirror system including 
instructions on obtaining the KEYS file for verification.
 
 Please correct the download page and the announcement and resubmit.
 
 Regards,
 Craig





[jira] [Created] (MAHOUT-2117) RAT scan without exclusions shows: "443 Unknown Licenses"

2020-07-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2117:


 Summary: RAT scan without exclusions shows: "443 Unknown Licenses" 
 Key: MAHOUT-2117
 URL: https://issues.apache.org/jira/browse/MAHOUT-2117
 Project: Mahout
  Issue Type: Task
  Components: build
Affects Versions: 0.14.1
Reporter: Andrew Musselman
 Fix For: 14.1








[jira] [Created] (MAHOUT-2116) NOTICE.txt does not contain current date

2020-07-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2116:


 Summary: NOTICE.txt does not contain current date
 Key: MAHOUT-2116
 URL: https://issues.apache.org/jira/browse/MAHOUT-2116
 Project: Mahout
  Issue Type: Task
  Components: build
Affects Versions: 0.14.1
Reporter: Andrew Musselman
 Fix For: 14.1








[jira] [Created] (MAHOUT-2115) Add RELEASE_NOTES

2020-07-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2115:


 Summary: Add RELEASE_NOTES
 Key: MAHOUT-2115
 URL: https://issues.apache.org/jira/browse/MAHOUT-2115
 Project: Mahout
  Issue Type: Task
  Components: build
Affects Versions: 0.14.1
Reporter: Andrew Musselman
 Fix For: 14.1


Is this something we can do through JIRA?





[jira] [Created] (MAHOUT-2114) Archive contains LICENSE and LICENSE.txt as well as NOTICE and NOTICE.txt

2020-07-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2114:


 Summary: Archive contains LICENSE and LICENSE.txt as well as 
NOTICE and NOTICE.txt
 Key: MAHOUT-2114
 URL: https://issues.apache.org/jira/browse/MAHOUT-2114
 Project: Mahout
  Issue Type: Task
  Components: build
Affects Versions: 0.14.1
Reporter: Andrew Musselman
 Fix For: 14.1


Perhaps removing the file extension from NOTICE.txt and LICENSE.txt would 
eliminate this duplication.





[jira] [Created] (MAHOUT-2113) MD5 and SHA1 hashes are considered deprecated and SHA512 should be used

2020-07-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2113:


 Summary:  MD5 and SHA1 hashes are considered deprecated and SHA512 
should be used
 Key: MAHOUT-2113
 URL: https://issues.apache.org/jira/browse/MAHOUT-2113
 Project: Mahout
  Issue Type: Task
  Components: build
Affects Versions: 0.14.1
Reporter: Andrew Musselman
 Fix For: 14.1








[jira] [Created] (MAHOUT-2112) Your keys don't seem to be in the ASF web of trust

2020-07-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2112:


 Summary: Your keys don't seem to be in the ASF web of trust
 Key: MAHOUT-2112
 URL: https://issues.apache.org/jira/browse/MAHOUT-2112
 Project: Mahout
  Issue Type: Task
Reporter: Andrew Musselman


Perhaps attending a Key-Signing party would be a good idea.

In this time of quarantine, I think a virtual key-signing party would be great.





[jira] [Created] (MAHOUT-2111) The name of the source bundle doesn't contain "apache" in it

2020-07-17 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2111:


 Summary: The name of the source bundle doesn't contain "apache" in 
it
 Key: MAHOUT-2111
 URL: https://issues.apache.org/jira/browse/MAHOUT-2111
 Project: Mahout
  Issue Type: Task
  Components: build
Affects Versions: 0.14.1
Reporter: Andrew Musselman
 Fix For: 14.1








[jira] [Created] (MAHOUT-2110) Deprecate MD5, upgrade to SHA512

2020-06-30 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2110:


 Summary: Deprecate MD5, upgrade to SHA512
 Key: MAHOUT-2110
 URL: https://issues.apache.org/jira/browse/MAHOUT-2110
 Project: Mahout
  Issue Type: Task
  Components: build
Affects Versions: 14.2
Reporter: Andrew Musselman
 Fix For: 14.1


Per https://infra.apache.org/release-distribution.html
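
A minimal sketch of producing a SHA-512 checksum for a release artifact with Python's standard `hashlib` module. The helper names `sha512_of` and `checksum_line` are made up for this example, and the actual release build may generate checksums differently (for instance through the build tooling).

```python
import hashlib
import os


def sha512_of(path, chunk_size=1 << 20):
    """Return the hex SHA-512 digest of a file, reading it in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_line(path):
    """Format the digest as '<hex digest>  <file name>', the layout sha512sum uses."""
    return f"{sha512_of(path)}  {os.path.basename(path)}"
```

The output of `checksum_line` can be written to a `.sha512` file next to the artifact, replacing any `.md5` or `.sha1` files.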





[jira] [Created] (MAHOUT-2109) Add release notes to source bundle

2020-06-30 Thread Andrew Musselman (Jira)
Andrew Musselman created MAHOUT-2109:


 Summary: Add release notes to source bundle
 Key: MAHOUT-2109
 URL: https://issues.apache.org/jira/browse/MAHOUT-2109
 Project: Mahout
  Issue Type: Task
  Components: build
Affects Versions: 14.2
Reporter: Andrew Musselman
 Fix For: 14.1


Typically RELEASE_NOTES are added alongside the source bundle.





[jira] [Created] (MAHOUT-2068) Release not rolled back, download link broken

2019-05-30 Thread Andrew Musselman (JIRA)
Andrew Musselman created MAHOUT-2068:


 Summary: Release not rolled back, download link broken
 Key: MAHOUT-2068
 URL: https://issues.apache.org/jira/browse/MAHOUT-2068
 Project: Mahout
  Issue Type: Bug
  Components: build
Affects Versions: 0.14.0
Reporter: Andrew Musselman
Assignee: Andrew Palumbo
 Fix For: 0.14.0


http://mahout.apache.org shows 0.14.1 as the current version, and the link to 
download it is broken.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (MAHOUT-2064) Jars not published to Maven repos

2019-03-26 Thread Andrew Musselman (JIRA)
Andrew Musselman created MAHOUT-2064:


 Summary: Jars not published to Maven repos
 Key: MAHOUT-2064
 URL: https://issues.apache.org/jira/browse/MAHOUT-2064
 Project: Mahout
  Issue Type: Bug
  Components: build
Affects Versions: 0.14.0
Reporter: Andrew Musselman
Assignee: Andrew Musselman
 Fix For: 0.14.1


Need to build a binary artifact along with source artifact so it can be 
released by nexus.





[jira] [Created] (MAHOUT-2059) Update web site with 0.14.0 release, fix download button

2019-03-05 Thread Andrew Musselman (JIRA)
Andrew Musselman created MAHOUT-2059:


 Summary: Update web site with 0.14.0 release, fix download button
 Key: MAHOUT-2059
 URL: https://issues.apache.org/jira/browse/MAHOUT-2059
 Project: Mahout
  Issue Type: Documentation
Affects Versions: 0.14.0
Reporter: Andrew Musselman


The download button on the home page also currently links to a broken mirror 
(http://mirror.stjschools.org/public/apache/mahout/0.13.0/apache-mahout-distribution-0.13.0.tar.gz).

Change it to use a dynamic mirror.





[jira] [Created] (MAHOUT-2057) Example in README results in class not found

2019-03-05 Thread Andrew Musselman (JIRA)
Andrew Musselman created MAHOUT-2057:


 Summary: Example in README results in class not found
 Key: MAHOUT-2057
 URL: https://issues.apache.org/jira/browse/MAHOUT-2057
 Project: Mahout
  Issue Type: Bug
Affects Versions: 0.14.0
Reporter: Andrew Musselman
 Fix For: 0.14.1


Running the example in the README gives a class not found:
"java.lang.NoClassDefFoundError: 
it/unimi/dsi/fastutil/ints/Int2DoubleOpenHashMap"
 
If that's just us still using something that's been removed, it's not a 
deal-breaker for me as long as we fix it in a quick point release.
 
Pending that being a simple fix my vote is +1 binding, and if Andy's not back 
from vacation and his proxy works that's +2 binding from me and Andy.
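
To narrow down a NoClassDefFoundError like the one quoted above, it can help to check whether the class is visible on the classpath at all. The following is a minimal Java sketch; the class ClasspathProbe and its methods are hypothetical and not part of Mahout or fastutil. It only relies on Class.forName from the JDK, and it converts the slash-separated name printed in the error into the dotted name that Class.forName expects.

```java
/** Hypothetical diagnostic helper; not part of Mahout or fastutil. */
final class ClasspathProbe {
    private ClasspathProbe() {}

    /** Converts the slash-separated name from the error message to a dotted binary name. */
    static String toBinaryName(String internalName) {
        return internalName.replace('/', '.');
    }

    /** Returns true if the named class can be located, without running its static initializer. */
    static boolean isLoadable(String name) {
        try {
            Class.forName(toBinaryName(name), false, ClasspathProbe.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Class name copied from the error message in this issue.
        String missing = "it/unimi/dsi/fastutil/ints/Int2DoubleOpenHashMap";
        System.out.println(toBinaryName(missing) + " loadable: " + isLoadable(missing));
    }
}
```

Running the probe with the same classpath as the failing shell would show whether the fastutil jar is missing entirely or only missing on the executors.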
 
 
bob $ export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
bob $ export MAHOUT_HOME=//home/akm/a/src/test/repository.apache.org/content/repositories/orgapachemahout-1052/org/apache/mahout/mahout/0.14.0
bob $ export SPARK_HOME=/home/akm/a/src/spark-2.1.0-bin-hadoop2.7
bob $ MASTER=local[2] mahout-0.14.0/bin/mahout spark-shell
Adding lib/ to CLASSPATH
Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
Setting default log level to "WARN".
To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use 
setLogLevel(newLevel).
19/03/04 09:07:44 WARN NativeCodeLoader: Unable to load native-hadoop library 
for your platform... using builtin-java classes where applicable
19/03/04 09:07:44 WARN Utils: Your hostname, Bob resolves to a loopback 
address: 127.0.1.1; using 10.0.1.2 instead (on interface eno1)
19/03/04 09:07:44 WARN Utils: Set SPARK_LOCAL_IP if you need to bind to another 
address
19/03/04 09:07:53 WARN ObjectStore: Failed to get database global_temp, 
returning NoSuchObjectException
Spark context Web UI available at http://10.0.1.2:4040
Spark context available as 'sc' (master = local[2], app id = local-1551719265339).
Spark session available as 'spark'.
Loading /home/akm/a/src/test/repository.apache.org/content/repositories/orgapachemahout-1052/org/apache/mahout/mahout/0.14.0/mahout-0.14.0/bin/load-shell.scala...
import org.apache.mahout.math._
import org.apache.mahout.math.scalabindings._
import org.apache.mahout.math.drm._
import org.apache.mahout.math.scalabindings.RLikeOps._
import org.apache.mahout.math.drm.RLikeDrmOps._
import org.apache.mahout.sparkbindings._
sdc: org.apache.mahout.sparkbindings.SparkDistributedContext = 
org.apache.mahout.sparkbindings.SparkDistributedContext@749ffdc7

    _ _
_ __ ___   __ _| |__   ___  _   _| |_
 '_ ` _ \ / _` | '_ \ / _ \| | | | __|
 | | | | (_| | | | | (_) | |_| | |_
_| |_| |_|\__,_|_| |_|\___/ \__,_|\__|  version 0.14.0



That file does not exist

Welcome to
      ____              __
     / __/__  ___ _____/ /__
    _\ \/ _ \/ _ `/ __/  '_/
   /___/ .__/\_,_/_/ /_/\_\   version 2.1.0
      /_/
 
Using Scala version 2.11.8 (OpenJDK 64-Bit Server VM, Java 1.8.0_191)
Type in expressions to have them evaluated.
Type :help for more information.

scala> :load /home/akm/a/src/test/repository.apache.org/content/repositories/orgapachemahout-1052/org/apache/mahout/mahout/0.14.0/mahout-0.14.0/examples/bin/SparseSparseDrmTimer.mscala
Loading /home/akm/a/src/test/repository.apache.org/content/repositories/orgapachemahout-1052/org/apache/mahout/mahout/0.14.0/mahout-0.14.0/examples/bin/SparseSparseDrmTimer.mscala...
timeSparseDRMMMul: (m: Int, n: Int, s: Int, para: Int, pctDense: Double, seed: 
Long)Long

scala> timeSparseDRMMMul(1000,1000,1000,1,.02,1234L)
19/03/04 09:13:13 ERROR Executor: Exception in task 0.0 in stage 0.0 (TID 1)
java.lang.NoClassDefFoundError: it/unimi/dsi/fastutil/ints/Int2DoubleOpenHashMap
    at 
org.apache.mahout.math.RandomAccessSparseVector.<init>(RandomAccessSparseVector.java:49)
    at 
org.apache.mahout.math.RandomAccessSparseVector.<init>(RandomAccessSparseVector.java:44)
    at 
org.apache.mahout.sparkbindings.SparkEngine$$anonfun$11$$anonfun$apply$2.apply(SparkEngine.scala:200)
    at 
org.apache.mahout.sparkbindings.SparkEngine$$anonfun$11$$anonfun$apply$2.apply(SparkEngine.scala:200)
    at 
scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
    at 
scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
    

[jira] [Created] (MAHOUT-2058) Website publishing README

2019-03-05 Thread Andrew Musselman (JIRA)
Andrew Musselman created MAHOUT-2058:


 Summary: Website publishing README
 Key: MAHOUT-2058
 URL: https://issues.apache.org/jira/browse/MAHOUT-2058
 Project: Mahout
  Issue Type: Documentation
Reporter: Andrew Musselman
 Fix For: 0.14.1


Would be good to have info on how to publish changes to the website in the 
website/README file.





[jira] [Created] (MAHOUT-1964) Logo in the spark-shell is broken

2017-04-15 Thread Andrew Musselman (JIRA)
Andrew Musselman created MAHOUT-1964:


 Summary: Logo in the spark-shell is broken
 Key: MAHOUT-1964
 URL: https://issues.apache.org/jira/browse/MAHOUT-1964
 Project: Mahout
  Issue Type: Bug
  Components: Mahout spark shell
Affects Versions: 0.13.0
Reporter: Andrew Musselman
Assignee: Andrew Musselman
Priority: Minor
 Fix For: 0.13.1


Mahout logo in the shell has a few characters misplaced.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (MAHOUT-1948) A release step causes errors for some people

2017-03-01 Thread Andrew Musselman (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15891549#comment-15891549
 ] 

Andrew Musselman commented on MAHOUT-1948:
--

This step causes errors on some environments:

mvn -Pmahout-release,apache-release,hadoop2 package

> A release step causes errors for some people
> 
>
> Key: MAHOUT-1948
> URL: https://issues.apache.org/jira/browse/MAHOUT-1948
> Project: Mahout
>  Issue Type: Bug
>  Components: build
>Affects Versions: 0.13.0
>Reporter: Andrew Musselman
>






[jira] [Created] (MAHOUT-1948) A release step causes errors for some people

2017-03-01 Thread Andrew Musselman (JIRA)
Andrew Musselman created MAHOUT-1948:


 Summary: A release step causes errors for some people
 Key: MAHOUT-1948
 URL: https://issues.apache.org/jira/browse/MAHOUT-1948
 Project: Mahout
  Issue Type: Bug
  Components: build
Affects Versions: 0.13.0
Reporter: Andrew Musselman








[jira] [Updated] (MAHOUT-1904) Create a test harness to test mahout across different hardware configurations

2017-02-28 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1904:
-
Fix Version/s: (was: 0.13.0)
   0.13.1

> Create a test harness to test mahout across different hardware configurations
> -
>
> Key: MAHOUT-1904
> URL: https://issues.apache.org/jira/browse/MAHOUT-1904
> Project: Mahout
>  Issue Type: Task
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Palumbo
>Priority: Critical
>  Labels: release, test
> Fix For: 0.13.1
>
>
> Create a set of simple Scala programs to be run as a test harness for Linux 
> AMD/Intel, Mac, and AVX2 (default).





[jira] [Resolved] (MAHOUT-1944) remove derby metadb from dir structure before cutting RC

2017-02-27 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman resolved MAHOUT-1944.
--
Resolution: Fixed

Just remember to do this when releasing.

> remove derby metadb from dir structure before cutting RC
> 
>
> Key: MAHOUT-1944
> URL: https://issues.apache.org/jira/browse/MAHOUT-1944
> Project: Mahout
>  Issue Type: Task
>Reporter: Andrew Palumbo
>Assignee: Andrew Palumbo
>Priority: Blocker
>
> spark-shell leaves some Derby directories around, such as {{metastore_db/}}. Make 
> sure these are gone before packaging up the final RC.
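
The cleanup described above could be automated before packaging. The following is a minimal Java sketch under stated assumptions: the class DerbyLeftoverCleaner is hypothetical and not part of the Mahout release process, and it only looks for directories named metastore_db, the one leftover named in this issue.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical pre-release cleanup helper; not part of the Mahout build. */
final class DerbyLeftoverCleaner {
    static final String LEFTOVER_DIR = "metastore_db";

    /** Deletes every directory named metastore_db under root; returns how many were removed. */
    static int clean(Path root) {
        List<Path> leftovers;
        try (Stream<Path> walk = Files.walk(root)) {
            // Collect first so the tree is not modified while it is being walked.
            leftovers = walk
                .filter(Files::isDirectory)
                .filter(p -> p.getFileName() != null
                        && p.getFileName().toString().equals(LEFTOVER_DIR))
                .collect(Collectors.toList());
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        int removed = 0;
        for (Path dir : leftovers) {
            if (Files.exists(dir)) { // may already be gone if nested in another match
                deleteRecursively(dir);
                removed++;
            }
        }
        return removed;
    }

    private static void deleteRecursively(Path dir) {
        try (Stream<Path> walk = Files.walk(dir)) {
            // Reverse order visits children before their parent directory.
            walk.sorted(Comparator.reverseOrder()).forEach(p -> {
                try {
                    Files.delete(p);
                } catch (IOException e) {
                    throw new UncheckedIOException(e);
                }
            });
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Path root = Paths.get(args.length > 0 ? args[0] : ".");
        System.out.println("Removed " + clean(root) + " " + LEFTOVER_DIR + " directories");
    }
}
```

Running it from the checkout root before cutting the RC would remove the leftovers in one pass; any other files spark-shell may leave behind are outside the scope of this sketch.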





[jira] [Resolved] (MAHOUT-1945) mvn -Pmahout-release,apache-release,hadoop2 package fails

2017-02-27 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman resolved MAHOUT-1945.
--
Resolution: Fixed

Works on my dev machine.

> mvn -Pmahout-release,apache-release,hadoop2 package fails
> -
>
> Key: MAHOUT-1945
> URL: https://issues.apache.org/jira/browse/MAHOUT-1945
> Project: Mahout
>  Issue Type: Task
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Blocker
>
> The first command for the release fails in {{mahout math}}:  
> {code}
> $ mvn -Pmahout-release,apache-release,hadoop2 package
>  {...}
>  [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-javadoc-plugin:2.9.1:jar (attach-javadocs) on 
> project mahout-math: MavenReportException: Error while creating archive: 
> [ERROR] Exit code: 1 - 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Matrix.java:184:
>  warning: no @return [ERROR] Matrix like(int rows, int columns); [ERROR] ^ 
> [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Matrix.java:412:
>  warning: no @return [ERROR] MatrixFlavor getFlavor(); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:230:
>  error: bad use of '>' [ERROR] * @param power The power to use. Must be >= 0. 
> May also be {@link Double#POSITIVE_INFINITY}. See the Wikipedia link [ERROR] 
> ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:246:
>  error: bad use of '>' [ERROR] * @param power The power to use. Must be > 1. 
> Cannot be {@link Double#POSITIVE_INFINITY}. [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:252:
>  error: self-closing element not allowed [ERROR] * Return the k-norm of the 
> vector.  See http://en.wikipedia.org/wiki/Lp_space  [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:260:
>  warning: no @return [ERROR] double norm(double power); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:412:
>  warning: no @return [ERROR] double getLengthSquared(); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:417:
>  warning: no @param for v [ERROR] double getDistanceSquared(Vector v); 
> [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:417:
>  warning: no @return [ERROR] double getDistanceSquared(Vector v); [ERROR] ^ 
> [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:422:
>  warning: no @return [ERROR] double getLookupCost(); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:428:
>  warning: no @return [ERROR] double getIteratorAdvanceCost(); [ERROR] ^ 
> [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:433:
>  warning: no @return [ERROR] boolean isAddConstantTime(); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Algebra.java:40:
>  warning: no @param for a [ERROR] public static double hypot(double a, double 
> b) { [ERROR] ^
> {...}
> {code}
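
The three kinds of Javadoc complaints in the log above (a bare '>' in a comment, a self-closing HTML element, and missing @param/@return tags) can be illustrated with a small Java sketch. It assumes the messages come from the doclint checks of the JDK 8 javadoc tool. The class NormExample is hypothetical: it is not Mahout's Vector interface and not the fix that was committed, only a comment written in a form those checks would be expected to accept.

```java
/** Hypothetical example class; it is not Mahout's Vector interface. */
final class NormExample {
    private NormExample() {}

    /**
     * Returns the k-norm of the vector. See
     * <a href="http://en.wikipedia.org/wiki/Lp_space">Lp space</a>.
     *
     * <p>The comparison operator below is wrapped in an inline code tag instead of
     * being written as a bare character, and the link uses an explicit closing tag
     * rather than a self-closing element.
     *
     * @param values the vector entries
     * @param power  the power to use. Must be {@code >= 1}. May also be
     *               {@link Double#POSITIVE_INFINITY}, which yields the largest
     *               absolute entry.
     * @return the norm of {@code values} for the given power
     */
    static double norm(double[] values, double power) {
        if (Double.isNaN(power) || power < 1.0) {
            throw new IllegalArgumentException("power must be >= 1, got " + power);
        }
        if (Double.isInfinite(power)) {
            double max = 0.0;
            for (double v : values) {
                max = Math.max(max, Math.abs(v));
            }
            return max;
        }
        double sum = 0.0;
        for (double v : values) {
            sum += Math.pow(Math.abs(v), power);
        }
        return Math.pow(sum, 1.0 / power);
    }

    public static void main(String[] args) {
        System.out.println(norm(new double[] {3.0, 4.0}, 2.0));
    }
}
```

Every parameter has an @param tag and the method has an @return tag, which addresses the "no @param" and "no @return" warnings in the same log.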





[jira] [Work started] (MAHOUT-1945) mvn -Pmahout-release,apache-release,hadoop2 package fails

2017-02-27 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on MAHOUT-1945 started by Andrew Musselman.

> mvn -Pmahout-release,apache-release,hadoop2 package fails
> -
>
> Key: MAHOUT-1945
> URL: https://issues.apache.org/jira/browse/MAHOUT-1945
> Project: Mahout
>  Issue Type: Task
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Blocker
>
> The first command for the release fails in {{mahout math}}:  
> {code}
> $ mvn -Pmahout-release,apache-release,hadoop2 package
>  {...}
>  [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-javadoc-plugin:2.9.1:jar (attach-javadocs) on 
> project mahout-math: MavenReportException: Error while creating archive: 
> [ERROR] Exit code: 1 - 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Matrix.java:184:
>  warning: no @return [ERROR] Matrix like(int rows, int columns); [ERROR] ^ 
> [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Matrix.java:412:
>  warning: no @return [ERROR] MatrixFlavor getFlavor(); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:230:
>  error: bad use of '>' [ERROR] * @param power The power to use. Must be >= 0. 
> May also be {@link Double#POSITIVE_INFINITY}. See the Wikipedia link [ERROR] 
> ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:246:
>  error: bad use of '>' [ERROR] * @param power The power to use. Must be > 1. 
> Cannot be {@link Double#POSITIVE_INFINITY}. [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:252:
>  error: self-closing element not allowed [ERROR] * Return the k-norm of the 
> vector.  See http://en.wikipedia.org/wiki/Lp_space  [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:260:
>  warning: no @return [ERROR] double norm(double power); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:412:
>  warning: no @return [ERROR] double getLengthSquared(); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:417:
>  warning: no @param for v [ERROR] double getDistanceSquared(Vector v); 
> [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:417:
>  warning: no @return [ERROR] double getDistanceSquared(Vector v); [ERROR] ^ 
> [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:422:
>  warning: no @return [ERROR] double getLookupCost(); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:428:
>  warning: no @return [ERROR] double getIteratorAdvanceCost(); [ERROR] ^ 
> [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Vector.java:433:
>  warning: no @return [ERROR] boolean isAddConstantTime(); [ERROR] ^ [ERROR] 
> /home/andy/sandbox/mahout/math/src/main/java/org/apache/mahout/math/Algebra.java:40:
>  warning: no @param for a [ERROR] public static double hypot(double a, double 
> b) { [ERROR] ^
> {...}
> {code}





[jira] [Resolved] (MAHOUT-1913) Clean Up of VCL bindings

2017-02-27 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman resolved MAHOUT-1913.
--
Resolution: Fixed

> Clean Up of VCL bindings
> 
>
> Key: MAHOUT-1913
> URL: https://issues.apache.org/jira/browse/MAHOUT-1913
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Blocker
> Fix For: 0.13.0
>
>
> Much cleanup of vcl bindings commit





[jira] [Created] (MAHOUT-1947) Make a profile for flink module

2017-02-27 Thread Andrew Musselman (JIRA)
Andrew Musselman created MAHOUT-1947:


 Summary: Make a profile for flink module
 Key: MAHOUT-1947
 URL: https://issues.apache.org/jira/browse/MAHOUT-1947
 Project: Mahout
  Issue Type: Bug
Reporter: Andrew Musselman
Priority: Minor


We removed the flink module from the root pom due to intermittent OOM errors, 
so let's add a flink profile for a release soon.





[jira] [Commented] (MAHOUT-1929) Add Generalized Linear Models

2017-02-27 Thread Andrew Musselman (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886554#comment-15886554
 ] 

Andrew Musselman commented on MAHOUT-1929:
--

Sounds good; let's shoot for a point release soon if you can, or definitely for 
0.14.

Thanks!

> Add Generalized Linear Models
> -
>
> Key: MAHOUT-1929
> URL: https://issues.apache.org/jira/browse/MAHOUT-1929
> Project: Mahout
>  Issue Type: Wish
>  Components: Algorithms
>Affects Versions: 0.13.1
>Reporter: Trevor Grant
>
> Implement Generalized Linear Models (GLM)
> https://en.wikipedia.org/wiki/Generalized_linear_model





[jira] [Updated] (MAHOUT-1946) ViennaCL not being picked up by JNI

2017-02-26 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1946:
-
Sprint: Jan/Feb-2017

> ViennaCL not being picked up by JNI
> ---
>
> Key: MAHOUT-1946
> URL: https://issues.apache.org/jira/browse/MAHOUT-1946
> Project: Mahout
>  Issue Type: Bug
>Reporter: Andrew Musselman
>Priority: Blocker
> Fix For: 0.13.0
>
>
> Using the PR for MAHOUT-1938 but probably in master as well:
> scala> :load ./examples/bin/SparseSparseDrmTimer.mscala
> Loading ./examples/bin/SparseSparseDrmTimer.mscala...
> timeSparseDRMMMul: (m: Int, n: Int, s: Int, para: Int, pctDense: Double, 
> seed: Long)Long
> scala> timeSparseDRMMMul(100,100,100,1,.02,1234L)
> [INFO] Creating org.apache.mahout.viennacl.opencl.GPUMMul solver
> [INFO] Successfully created org.apache.mahout.viennacl.opencl.GPUMMul solver
> gpuRWCW
> 17/02/26 13:18:54 ERROR Executor: Exception in task 0.0 in stage 3.0 (TID 3)
> java.lang.UnsatisfiedLinkError: no jniViennaCL in java.library.path
>   at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1867)
>   at java.lang.Runtime.loadLibrary0(Runtime.java:870)
>   at java.lang.System.loadLibrary(System.java:1122)
>   at org.bytedeco.javacpp.Loader.loadLibrary(Loader.java:726)
>   at org.bytedeco.javacpp.Loader.load(Loader.java:501)
>   at org.bytedeco.javacpp.Loader.load(Loader.java:434)
>   at 
> org.apache.mahout.viennacl.opencl.javacpp.Context$.loadLib(Context.scala:63)
>   at 
> org.apache.mahout.viennacl.opencl.javacpp.Context$.<init>(Context.scala:65)
>   at 
> org.apache.mahout.viennacl.opencl.javacpp.Context$.<clinit>(Context.scala)
>   at 
> org.apache.mahout.viennacl.opencl.GPUMMul$.org$apache$mahout$viennacl$opencl$GPUMMul$$gpuRWCW(GPUMMul.scala:171)
>   at 
> org.apache.mahout.viennacl.opencl.GPUMMul$$anonfun$11.apply(GPUMMul.scala:77)
>   at 
> org.apache.mahout.viennacl.opencl.GPUMMul$$anonfun$11.apply(GPUMMul.scala:77)
>   at org.apache.mahout.viennacl.opencl.GPUMMul$.apply(GPUMMul.scala:127)
>   at org.apache.mahout.viennacl.opencl.GPUMMul$.apply(GPUMMul.scala:33)
>   at 
> org.apache.mahout.math.scalabindings.RLikeMatrixOps.$percent$times$percent(RLikeMatrixOps.scala:37)
>   at 
> org.apache.mahout.sparkbindings.blas.ABt$.org$apache$mahout$sparkbindings$blas$ABt$$mmulFunc$1(ABt.scala:98)
>   at 
> org.apache.mahout.sparkbindings.blas.ABt$$anonfun$6.apply(ABt.scala:113)
>   at 
> org.apache.mahout.sparkbindings.blas.ABt$$anonfun$6.apply(ABt.scala:113)
>   at 
> org.apache.mahout.sparkbindings.blas.ABt$$anonfun$pairwiseApply$1.apply(ABt.scala:209)
>   at 
> org.apache.mahout.sparkbindings.blas.ABt$$anonfun$pairwiseApply$1.apply(ABt.scala:209)
>   at scala.collection.Iterator$$anon$11.next(Iterator.scala:328)
>   at 
> org.apache.spark.util.collection.ExternalSorter.insertAll(ExternalSorter.scala:191)
>   at 
> org.apache.spark.shuffle.sort.SortShuffleWriter.write(SortShuffleWriter.scala:64)
>   at 
> org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:73)
>   at 
> org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:41)
>   at org.apache.spark.scheduler.Task.run(Task.scala:89)
>   at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:227)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>   at java.lang.Thread.run(Thread.java:745)
> 17/02/26 13:18:54 ERROR SparkUncaughtExceptionHandler: Uncaught exception in 
> thread Thread[Executor task launch worker-0,5,main]
> java.lang.UnsatisfiedLinkError: no jniViennaCL in java.library.path
>   at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1867)
>   at java.lang.Runtime.loadLibrary0(Runtime.java:870)
>   at java.lang.System.loadLibrary(System.java:1122)
>   at org.bytedeco.javacpp.Loader.loadLibrary(Loader.java:726)
>   at org.bytedeco.javacpp.Loader.load(Loader.java:501)
>   at org.bytedeco.javacpp.Loader.load(Loader.java:434)
>   at 
> org.apache.mahout.viennacl.opencl.javacpp.Context$.loadLib(Context.scala:63)
>   at 
> org.apache.mahout.viennacl.opencl.javacpp.Context$.<init>(Context.scala:65)
>   at 
> org.apache.mahout.viennacl.opencl.javacpp.Context$.<clinit>(Context.scala)
>   at 
> org.apache.mahout.viennacl.opencl.GPUMMul$.org$apache$mahout$viennacl$opencl$GPUMMul$$gpuRWCW(GPUMMul.scala:171)
>   at 
> org.apache.mahout.viennacl.opencl.GPUMMul$$anonfun$11.apply(GPUMMul.scala:77)
>   at 
> org.apache.mahout.viennacl.opencl.GPUMMul$$anonfun$11.apply(GPUMMul.scala:77)
>   at org.apache.mahout.viennacl.opencl.GPUMMul$.apply(GPUMMul.scala:127)
>   at 

[jira] [Created] (MAHOUT-1946) ViennaCL not being picked up by JNI

2017-02-26 Thread Andrew Musselman (JIRA)
Andrew Musselman created MAHOUT-1946:


 Summary: ViennaCL not being picked up by JNI
 Key: MAHOUT-1946
 URL: https://issues.apache.org/jira/browse/MAHOUT-1946
 Project: Mahout
  Issue Type: Bug
Reporter: Andrew Musselman
Priority: Blocker
 Fix For: 0.13.0


Using the PR for MAHOUT-1938 but probably in master as well:

scala> :load ./examples/bin/SparseSparseDrmTimer.mscala
Loading ./examples/bin/SparseSparseDrmTimer.mscala...
timeSparseDRMMMul: (m: Int, n: Int, s: Int, para: Int, pctDense: Double, seed: 
Long)Long

scala> timeSparseDRMMMul(100,100,100,1,.02,1234L)
[INFO] Creating org.apache.mahout.viennacl.opencl.GPUMMul solver
[INFO] Successfully created org.apache.mahout.viennacl.opencl.GPUMMul solver
gpuRWCW
17/02/26 13:18:54 ERROR Executor: Exception in task 0.0 in stage 3.0 (TID 3)
java.lang.UnsatisfiedLinkError: no jniViennaCL in java.library.path
at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1867)
at java.lang.Runtime.loadLibrary0(Runtime.java:870)
at java.lang.System.loadLibrary(System.java:1122)
at org.bytedeco.javacpp.Loader.loadLibrary(Loader.java:726)
at org.bytedeco.javacpp.Loader.load(Loader.java:501)
at org.bytedeco.javacpp.Loader.load(Loader.java:434)
at 
org.apache.mahout.viennacl.opencl.javacpp.Context$.loadLib(Context.scala:63)
at 
org.apache.mahout.viennacl.opencl.javacpp.Context$.<init>(Context.scala:65)
at 
org.apache.mahout.viennacl.opencl.javacpp.Context$.<clinit>(Context.scala)
at 
org.apache.mahout.viennacl.opencl.GPUMMul$.org$apache$mahout$viennacl$opencl$GPUMMul$$gpuRWCW(GPUMMul.scala:171)
at 
org.apache.mahout.viennacl.opencl.GPUMMul$$anonfun$11.apply(GPUMMul.scala:77)
at 
org.apache.mahout.viennacl.opencl.GPUMMul$$anonfun$11.apply(GPUMMul.scala:77)
at org.apache.mahout.viennacl.opencl.GPUMMul$.apply(GPUMMul.scala:127)
at org.apache.mahout.viennacl.opencl.GPUMMul$.apply(GPUMMul.scala:33)
at 
org.apache.mahout.math.scalabindings.RLikeMatrixOps.$percent$times$percent(RLikeMatrixOps.scala:37)
at 
org.apache.mahout.sparkbindings.blas.ABt$.org$apache$mahout$sparkbindings$blas$ABt$$mmulFunc$1(ABt.scala:98)
at 
org.apache.mahout.sparkbindings.blas.ABt$$anonfun$6.apply(ABt.scala:113)
at 
org.apache.mahout.sparkbindings.blas.ABt$$anonfun$6.apply(ABt.scala:113)
at 
org.apache.mahout.sparkbindings.blas.ABt$$anonfun$pairwiseApply$1.apply(ABt.scala:209)
at 
org.apache.mahout.sparkbindings.blas.ABt$$anonfun$pairwiseApply$1.apply(ABt.scala:209)
at scala.collection.Iterator$$anon$11.next(Iterator.scala:328)
at 
org.apache.spark.util.collection.ExternalSorter.insertAll(ExternalSorter.scala:191)
at 
org.apache.spark.shuffle.sort.SortShuffleWriter.write(SortShuffleWriter.scala:64)
at 
org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:73)
at 
org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:41)
at org.apache.spark.scheduler.Task.run(Task.scala:89)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:227)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
17/02/26 13:18:54 ERROR SparkUncaughtExceptionHandler: Uncaught exception in 
thread Thread[Executor task launch worker-0,5,main]
java.lang.UnsatisfiedLinkError: no jniViennaCL in java.library.path
at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1867)
at java.lang.Runtime.loadLibrary0(Runtime.java:870)
at java.lang.System.loadLibrary(System.java:1122)
at org.bytedeco.javacpp.Loader.loadLibrary(Loader.java:726)
at org.bytedeco.javacpp.Loader.load(Loader.java:501)
at org.bytedeco.javacpp.Loader.load(Loader.java:434)
at 
org.apache.mahout.viennacl.opencl.javacpp.Context$.loadLib(Context.scala:63)
at 
org.apache.mahout.viennacl.opencl.javacpp.Context$.<init>(Context.scala:65)
at 
org.apache.mahout.viennacl.opencl.javacpp.Context$.<clinit>(Context.scala)
at 
org.apache.mahout.viennacl.opencl.GPUMMul$.org$apache$mahout$viennacl$opencl$GPUMMul$$gpuRWCW(GPUMMul.scala:171)
at 
org.apache.mahout.viennacl.opencl.GPUMMul$$anonfun$11.apply(GPUMMul.scala:77)
at 
org.apache.mahout.viennacl.opencl.GPUMMul$$anonfun$11.apply(GPUMMul.scala:77)
at org.apache.mahout.viennacl.opencl.GPUMMul$.apply(GPUMMul.scala:127)
at org.apache.mahout.viennacl.opencl.GPUMMul$.apply(GPUMMul.scala:33)
at 
org.apache.mahout.math.scalabindings.RLikeMatrixOps.$percent$times$percent(RLikeMatrixOps.scala:37)
at 
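
The trace above ends in System.loadLibrary failing to find jniViennaCL on java.library.path. As a diagnostic aid, the following minimal Java sketch lists which entries of a search path actually contain the platform-specific file for a library name. The class NativeLibraryProbe is hypothetical and not part of Mahout or JavaCPP; it covers only the java.library.path lookup shown in the trace, not any other way the native library might be located.

```java
import java.io.File;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

/** Hypothetical diagnostic helper; not part of Mahout or JavaCPP. */
final class NativeLibraryProbe {
    private NativeLibraryProbe() {}

    /**
     * Returns each file in the given search path whose name matches the
     * platform-specific file name of the library.
     */
    static List<Path> locate(String libraryName, String searchPath) {
        // System.mapLibraryName adds the platform prefix and suffix to the bare name.
        String fileName = System.mapLibraryName(libraryName);
        List<Path> hits = new ArrayList<>();
        for (String dir : searchPath.split(Pattern.quote(File.pathSeparator))) {
            if (dir.isEmpty()) {
                continue;
            }
            Path candidate = Paths.get(dir, fileName);
            if (Files.isRegularFile(candidate)) {
                hits.add(candidate);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Library name copied from the UnsatisfiedLinkError in this issue.
        String path = System.getProperty("java.library.path", "");
        List<Path> hits = locate("jniViennaCL", path);
        System.out.println("java.library.path = " + path);
        System.out.println(hits.isEmpty() ? "jniViennaCL not found" : "found: " + hits);
    }
}
```

If the probe reports nothing, either the native library was never built or its directory would need to be added to java.library.path for the driver and executors.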

[jira] [Commented] (MAHOUT-1791) Automatic threading for java based mmul in the front end and the backend.

2017-02-24 Thread Andrew Musselman (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15883740#comment-15883740
 ] 

Andrew Musselman commented on MAHOUT-1791:
--

Where's the best place to start with this? For things that reference "MMul" I 
see:

JvmBackend
RootSolverFactory
MMul
GPUMMul

> Automatic threading for java based mmul in the front end and the backend.
> -
>
> Key: MAHOUT-1791
> URL: https://issues.apache.org/jira/browse/MAHOUT-1791
> Project: Mahout
>  Issue Type: Improvement
>Affects Versions: 0.11.1, 0.12.0, 0.11.2
>Reporter: Dmitriy Lyubimov
>Assignee: Andrew Musselman
>Priority: Critical
> Fix For: 0.13.0
>
>
> As we know, we are still struggling to decide which path to take for 
> bare-metal acceleration of in-core math. 
> Meanwhile, a simple, no-brainer improvement is to add decision paths 
> and apply multithreaded matrix-matrix multiplication (and maybe other 
> operations; but mmul is perhaps the most prominent beneficiary at the moment, 
> since it is both easy to do and yields a statistically significant improvement). 
> So adding multithreaded logic to mmul is one path. 
> Another path is automatic adjustment of multithreading. 
> In the front end, we probably want to utilize all available cores. 
> In the back end, we can oversubscribe cores, but doing so by more than 
> 2x or 3x is probably inadvisable because of diminishing returns driven by 
> the growing likelihood of context-switching overhead.
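
The two ideas in the description, a multithreaded mmul and an automatic choice of thread count, can be sketched in plain Java. Everything here is an assumption made for illustration: the class ThreadedMMul, its method names, the dense double[][] representation, and the cap of 3x on back-end oversubscription are hypothetical and are not Mahout's MMul or its solver selection logic.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/** Hypothetical sketch of thread-count selection plus a row-parallel dense multiply. */
final class ThreadedMMul {
    private ThreadedMMul() {}

    /** Front end: use every available core. */
    static int frontEndThreads(int cores) {
        return Math.max(1, cores);
    }

    /** Back end: allow oversubscription, but cap the factor at 3x (assumed limit). */
    static int backEndThreads(int cores, int oversubscription) {
        int factor = Math.min(Math.max(1, oversubscription), 3);
        return Math.max(1, cores) * factor;
    }

    /** Computes a * b, with each row of the result produced by one pool task. */
    static double[][] multiply(double[][] a, double[][] b, int threads) {
        int m = a.length;
        int k = b.length;
        if (m == 0 || k == 0 || a[0].length != k) {
            throw new IllegalArgumentException("incompatible or empty matrices");
        }
        int n = b[0].length;
        double[][] c = new double[m][n];
        ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, threads));
        try {
            List<Future<?>> pending = new ArrayList<>();
            for (int row = 0; row < m; row++) {
                final int i = row;
                // Each task writes only to row i, so tasks never touch the same cells.
                pending.add(pool.submit(() -> {
                    for (int p = 0; p < k; p++) {
                        double aip = a[i][p];
                        for (int j = 0; j < n; j++) {
                            c[i][j] += aip * b[p][j];
                        }
                    }
                }));
            }
            for (Future<?> f : pending) {
                f.get(); // waits for the row and makes its writes visible here
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } catch (ExecutionException e) {
            throw new IllegalStateException(e.getCause());
        } finally {
            pool.shutdown();
        }
        return c;
    }

    public static void main(String[] args) {
        int cores = Runtime.getRuntime().availableProcessors();
        double[][] c = multiply(new double[][] {{1, 2}, {3, 4}},
                                new double[][] {{5, 6}, {7, 8}},
                                frontEndThreads(cores));
        System.out.println(c[0][0] + " " + c[0][1] + " / " + c[1][0] + " " + c[1][1]);
    }
}
```

Partitioning by output row keeps the tasks independent, which is why no locking is needed; a real implementation would also have to handle sparse operands and decide when a matrix is too small for threading to pay off.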





[jira] [Work started] (MAHOUT-1913) Clean Up of VCL bindings

2017-02-24 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on MAHOUT-1913 started by Andrew Musselman.

> Clean Up of VCL bindings
> 
>
> Key: MAHOUT-1913
> URL: https://issues.apache.org/jira/browse/MAHOUT-1913
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Blocker
> Fix For: 0.13.0
>
>
> Much cleanup of vcl bindings commit





[jira] [Commented] (MAHOUT-1913) Clean Up of VCL bindings

2017-02-24 Thread Andrew Musselman (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15883652#comment-15883652
 ] 

Andrew Musselman commented on MAHOUT-1913:
--

I don't see that string at all; any other stuff to look for?

> Clean Up of VCL bindings
> 
>
> Key: MAHOUT-1913
> URL: https://issues.apache.org/jira/browse/MAHOUT-1913
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Blocker
> Fix For: 0.13.0
>
>
> Much cleanup of vcl bindings commit



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (MAHOUT-1910) Remove .zip archive from build

2017-02-24 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman resolved MAHOUT-1910.
--
Resolution: Fixed

commit a2cbd15709e8f5789711d171afc5206fa6ec63ed
Author: Andrew Musselman 
Date:   Fri Feb 24 14:27:05 2017 -0800

MAHOUT-1910: Remove .zip archive from build

> Remove .zip archive from build
> --
>
> Key: MAHOUT-1910
> URL: https://issues.apache.org/jira/browse/MAHOUT-1910
> Project: Mahout
>  Issue Type: Task
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Blocker
> Fix For: 0.13.0
>
>
> Remove the .zip archives from the Mahout binary distributions and keep only the 
> .tar.gz.
> In {{assembly/bin.xml}}:
> {code}
>   <formats>
>     <format>dir</format>
>     <format>tar.gz</format>
>     <format>zip</format>
>   </formats>
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Work started] (MAHOUT-1910) Remove .zip archive from build

2017-02-24 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on MAHOUT-1910 started by Andrew Musselman.

> Remove .zip archive from build
> --
>
> Key: MAHOUT-1910
> URL: https://issues.apache.org/jira/browse/MAHOUT-1910
> Project: Mahout
>  Issue Type: Task
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Blocker
> Fix For: 0.13.0
>
>
> Remove the .zip archives from the Mahout binary distributions and keep only the 
> .tar.gz.
> In {{assembly/bin.xml}}:
> {code}
>   <formats>
>     <format>dir</format>
>     <format>tar.gz</format>
>     <format>zip</format>
>   </formats>
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (MAHOUT-1894) Add support for Spark 2x backend

2017-02-23 Thread Andrew Musselman (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15881664#comment-15881664
 ] 

Andrew Musselman commented on MAHOUT-1894:
--

I'm getting "That file does not exist" after the shell opens with spark 1.6.3.

> Add support for Spark 2x backend
> 
>
> Key: MAHOUT-1894
> URL: https://issues.apache.org/jira/browse/MAHOUT-1894
> Project: Mahout
>  Issue Type: Task
>  Components: spark
>Affects Versions: 0.13.0
>Reporter: Suneel Marthi
>Assignee: Trevor Grant
>Priority: Critical
> Fix For: 1.0.0, 0.13.0, 0.14.0
>
>
> add support for Spark 2.x as backend execution engine.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (MAHOUT-1682) Create a documentation page for SPCA

2017-02-03 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman resolved MAHOUT-1682.
--
Resolution: Fixed

> Create a documentation page for SPCA
> 
>
> Key: MAHOUT-1682
> URL: https://issues.apache.org/jira/browse/MAHOUT-1682
> Project: Mahout
>  Issue Type: Documentation
>  Components: Documentation
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
> Fix For: 0.13.0
>
>
> Following the template of the SSVD and QR pages, create a page for SPCA.  This 
> page would go under Algorithms -> Distributed Matrix Decomposition.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Work started] (MAHOUT-1682) Create a documentation page for SPCA

2017-02-02 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on MAHOUT-1682 started by Andrew Musselman.

> Create a documentation page for SPCA
> 
>
> Key: MAHOUT-1682
> URL: https://issues.apache.org/jira/browse/MAHOUT-1682
> Project: Mahout
>  Issue Type: Documentation
>  Components: Documentation
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
> Fix For: 0.13.0
>
>
> Following the template of the SSVD and QR pages, create a page for SPCA.  This 
> page would go under Algorithms -> Distributed Matrix Decomposition.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (MAHOUT-1893) Fix Algorithm list on mahout.apache.org

2017-02-02 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman resolved MAHOUT-1893.
--
Resolution: Fixed

> Fix Algorithm list on mahout.apache.org
> ---
>
> Key: MAHOUT-1893
> URL: https://issues.apache.org/jira/browse/MAHOUT-1893
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Critical
> Fix For: 0.13.0
>
> Attachments: text.html
>
>
> mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
> current algorithms.
> ALS, SPCA, point them to 
> https://mahout.apache.org/users/sparkbindings/home.html



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Work started] (MAHOUT-1893) Fix Algorithm list on mahout.apache.org

2017-02-02 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on MAHOUT-1893 started by Andrew Musselman.

> Fix Algorithm list on mahout.apache.org
> ---
>
> Key: MAHOUT-1893
> URL: https://issues.apache.org/jira/browse/MAHOUT-1893
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Critical
> Fix For: 0.13.0
>
> Attachments: text.html
>
>
> mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
> current algorithms.
> ALS, SPCA, point them to 
> https://mahout.apache.org/users/sparkbindings/home.html



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (MAHOUT-1930) Add test for StandardScaler

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1930:
-
Fix Version/s: (was: 0.13.1)
   0.13.0

> Add test for StandardScaler
> ---
>
> Key: MAHOUT-1930
> URL: https://issues.apache.org/jira/browse/MAHOUT-1930
> Project: Mahout
>  Issue Type: Test
>  Components: Algorithms
>Affects Versions: 0.13.0
>Reporter: Trevor Grant
>  Labels: beginner
> Fix For: 0.13.0
>
>
> Add test for StandardScaler



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (MAHOUT-1927) Need test for Durbin-Watson Test Statistic

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1927:
-
Fix Version/s: (was: 0.13.1)
   0.13.0

> Need test for Durbin-Watson Test Statistic
> --
>
> Key: MAHOUT-1927
> URL: https://issues.apache.org/jira/browse/MAHOUT-1927
> Project: Mahout
>  Issue Type: Test
>Reporter: Trevor Grant
>  Labels: beginner
> Fix For: 0.13.0
>
>
> The Durbin-Watson test statistic needs an R prototype and a unit test in 
> the RegressionsTestsSuiteBase.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Assigned] (MAHOUT-1923) dqrThin Propagates cache hint

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman reassigned MAHOUT-1923:


Assignee: Trevor Grant

> dqrThin Propagates cache hint
> -
>
> Key: MAHOUT-1923
> URL: https://issues.apache.org/jira/browse/MAHOUT-1923
> Project: Mahout
>  Issue Type: Improvement
>  Components: Math
>Affects Versions: 0.12.0
>Reporter: Trevor Grant
>Assignee: Trevor Grant
>Priority: Minor
>  Labels: beginner
> Fix For: 0.13.0
>
>
> The user should be able to pass a checkpointing hint, since relying on the 
> default can lead to dramatic performance issues in some cases.
> https://github.com/apache/mahout/blob/b5fe4aab22e7867ae057a6cdb1610cfa17555311/math-scala/src/main/scala/org/apache/mahout/math/decompositions/DQR.scala#L50



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Assigned] (MAHOUT-1922) DSPCA Propagates cache hint

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman reassigned MAHOUT-1922:


Assignee: Trevor Grant

> DSPCA Propagates cache hint
> ---
>
> Key: MAHOUT-1922
> URL: https://issues.apache.org/jira/browse/MAHOUT-1922
> Project: Mahout
>  Issue Type: Improvement
>  Components: Math
>Affects Versions: 0.13.0
>Reporter: Trevor Grant
>Assignee: Trevor Grant
>Priority: Minor
>  Labels: beginner
> Fix For: 0.13.0
>
>
> The DSPCA does lots of checkpointing, but currently only the default 
> checkpoint cacheHint is used.
> The user should be able to pass a checkpointing hint, since relying on the 
> default can lead to dramatic performance issues in some cases.
> https://github.com/apache/mahout/blob/master/math-scala/src/main/scala/org/apache/mahout/math/decompositions/DSPCA.scala



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Assigned] (MAHOUT-1921) DSSVD Propagates cache hint

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman reassigned MAHOUT-1921:


Assignee: Trevor Grant

> DSSVD Propagates cache hint
> ---
>
> Key: MAHOUT-1921
> URL: https://issues.apache.org/jira/browse/MAHOUT-1921
> Project: Mahout
>  Issue Type: Improvement
>  Components: Math
>Affects Versions: 0.13.0
>Reporter: Trevor Grant
>Assignee: Trevor Grant
>Priority: Minor
>  Labels: beginner
> Fix For: 0.13.0
>
>
> The DSSVD does lots of checkpointing, but currently only the default 
> checkpoint cacheHint is used.  
> The user should be able to pass a checkpointing hint, since relying on the 
> default can lead to dramatic performance issues in some cases.
> https://github.com/apache/mahout/blob/master/math-scala/src/main/scala/org/apache/mahout/math/decompositions/DSSVD.scala



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Assigned] (MAHOUT-1913) Clean Up of VCL bindings

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman reassigned MAHOUT-1913:


Assignee: Andrew Musselman  (was: Andrew Palumbo)

> Clean Up of VCL bindings
> 
>
> Key: MAHOUT-1913
> URL: https://issues.apache.org/jira/browse/MAHOUT-1913
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Blocker
> Fix For: 0.13.0
>
>
> Much cleanup of vcl bindings commit



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Assigned] (MAHOUT-1910) Remove .zip archive from build

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman reassigned MAHOUT-1910:


Assignee: Andrew Musselman  (was: Andrew Palumbo)

> Remove .zip archive from build
> --
>
> Key: MAHOUT-1910
> URL: https://issues.apache.org/jira/browse/MAHOUT-1910
> Project: Mahout
>  Issue Type: Task
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Blocker
> Fix For: 0.13.0
>
>
> Remove the .zip archives from the Mahout binary distributions and keep only the 
> .tar.gz.
> In {{assembly/bin.xml}}:
> {code}
>   <formats>
>     <format>dir</format>
>     <format>tar.gz</format>
>     <format>zip</format>
>   </formats>
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (MAHOUT-1897) Mahout Shell is running with a lag

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1897:
-
Sprint:   (was: Jan/Feb-2017)

> Mahout Shell is running with a lag
> --
>
> Key: MAHOUT-1897
> URL: https://issues.apache.org/jira/browse/MAHOUT-1897
> Project: Mahout
>  Issue Type: Bug
>  Components: Mahout spark shell
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
> Fix For: 0.13.1
>
>
> There is a noticeable lag in the {{mahout spark-shell}}, which is easy to see 
> when compared to Spark's {{spark-shell}}. This makes things like autocomplete 
> difficult to work with.  
> Since the {{mahout spark-shell}} is just an extension of the Spark 
> {{spark-shell}}, this shouldn't be happening.  
> This slowdown makes the shell clunky and a bit difficult to work with.  
> People who want to try out Mahout often start by playing around with the 
> shell, so it would be good to fix this so that they don't leave with a 
> bad impression of Mahout. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (MAHOUT-1893) Fix Algorithm list on mahout.apache.org

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1893:
-
Description: 
mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
current algorithms.

ALS, SPCA, point them to https://mahout.apache.org/users/sparkbindings/home.html

  was:
mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
current algorithms.

ALS, SPCA


> Fix Algorithm list on mahout.apache.org
> ---
>
> Key: MAHOUT-1893
> URL: https://issues.apache.org/jira/browse/MAHOUT-1893
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Critical
> Fix For: 0.13.0
>
> Attachments: text.html
>
>
> mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
> current algorithms.
> ALS, SPCA, point them to 
> https://mahout.apache.org/users/sparkbindings/home.html



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (MAHOUT-1893) Fix Algorithm list on mahout.apache.org

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1893:
-
Description: 
mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
current algorithms.

ALS, SPCA

  was:
mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
current algorithms.

DALS, DS


> Fix Algorithm list on mahout.apache.org
> ---
>
> Key: MAHOUT-1893
> URL: https://issues.apache.org/jira/browse/MAHOUT-1893
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Critical
> Fix For: 0.13.0
>
> Attachments: text.html
>
>
> mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
> current algorithms.
> ALS, SPCA



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (MAHOUT-1893) Fix Algorithm list on mahout.apache.org

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1893:
-
Description: 
mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
current algorithms.

DALS, DS

  was:mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with 
all current algorithms.


> Fix Algorithm list on mahout.apache.org
> ---
>
> Key: MAHOUT-1893
> URL: https://issues.apache.org/jira/browse/MAHOUT-1893
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Critical
> Fix For: 0.13.0
>
> Attachments: text.html
>
>
> mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
> current algorithms.
> DALS, DS



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (MAHOUT-1873) Use densityAnalysis() in all necessary operations

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1873:
-
Fix Version/s: (was: 0.13.0)
   0.13.1

> Use densityAnalysis() in all necessary operations
> -
>
> Key: MAHOUT-1873
> URL: https://issues.apache.org/jira/browse/MAHOUT-1873
> Project: Mahout
>  Issue Type: Improvement
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Palumbo
>Priority: Critical
> Fix For: 0.13.1
>
>
> Find all places in which {{densityAnalysis(...)}} can be used to determine 
> the ideal matrix structure, and use it there, e.g. in {{ABt}}, {{AtB}}, and 
> possibly the Kryo serializers.  Ensure that the call is not redundant, 
> i.e. that it is not made by both the Kryo serializer and the distributed 
> operation.
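
As a rough illustration of what a density check of this kind decides, here is a minimal sketch on a plain `double[][]`. It assumes a simple rule: store a matrix densely once its fraction of non-zero entries exceeds a threshold. The class `DensitySketch`, its method names, and the 0.25 threshold are hypothetical and are not taken from Mahout's `densityAnalysis`.

```java
public class DensitySketch {
    // Fraction of entries that are non-zero (0.0 for an empty matrix).
    static double density(double[][] m) {
        long nonZero = 0;
        long total = 0;
        for (double[] row : m) {
            for (double v : row) {
                total++;
                if (v != 0.0) {
                    nonZero++;
                }
            }
        }
        return total == 0 ? 0.0 : (double) nonZero / total;
    }

    // Assumed rule: prefer a dense structure above the threshold, sparse otherwise.
    static boolean preferDense(double[][] m, double threshold) {
        return density(m) > threshold;
    }

    public static void main(String[] args) {
        double[][] mostlyZero = {{1, 0, 0, 0}, {0, 0, 0, 0}}; // density 0.125
        double[][] mostlyFull = {{1, 2}, {0, 4}};             // density 0.75
        System.out.println(preferDense(mostlyZero, 0.25)); // prints false
        System.out.println(preferDense(mostlyFull, 0.25)); // prints true
    }
}
```

A full scan like this costs one pass over the matrix, which is why the issue asks that the check not be repeated by both the serializer and the distributed operation.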



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (MAHOUT-1873) Use densityAnalysis() in all necessary operations

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1873:
-
Sprint:   (was: Jan/Feb-2017)

> Use densityAnalysis() in all necessary operations
> -
>
> Key: MAHOUT-1873
> URL: https://issues.apache.org/jira/browse/MAHOUT-1873
> Project: Mahout
>  Issue Type: Improvement
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Assignee: Andrew Palumbo
>Priority: Critical
> Fix For: 0.13.1
>
>
> Find all places in which {{densityAnalysis(...)}} can be used to determine 
> the ideal matrix structure, and use it there, e.g. in {{ABt}}, {{AtB}}, and 
> possibly the Kryo serializers.  Ensure that the call is not redundant, 
> i.e. that it is not made by both the Kryo serializer and the distributed 
> operation.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (MAHOUT-1919) Flink Module breaks the build regularly

2017-02-01 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman updated MAHOUT-1919:
-
Sprint: Jan/Feb-2017

> Flink Module breaks the build regularly
> ---
>
> Key: MAHOUT-1919
> URL: https://issues.apache.org/jira/browse/MAHOUT-1919
> Project: Mahout
>  Issue Type: Bug
>Affects Versions: 0.12.2
>Reporter: Andrew Palumbo
>Priority: Critical
> Fix For: 0.13.0
>
>
> OOM errors thrown by the Flink module regularly break the nightly build.  
> These should be addressed before the 0.13.0 release. 
> One possibility is to downgrade the Flink dependency in the root pom to the 
> original development dependency (1.0.x, I believe). 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (MAHOUT-1849) Update home page language

2016-12-19 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman resolved MAHOUT-1849.
--
Resolution: Fixed

> Update home page language
> -
>
> Key: MAHOUT-1849
> URL: https://issues.apache.org/jira/browse/MAHOUT-1849
> Project: Mahout
>  Issue Type: Documentation
>Affects Versions: 0.12.2
>Reporter: Andrew Musselman
>Assignee: Andrew Musselman
> Fix For: 0.13.0
>
>
> Update this to remove the MapReduce reference and replace it with Samsara:
> Apache Mahout software includes three major features:
> * A simple and extensible programming environment and framework for building 
> scalable algorithms
> * A wide variety of premade algorithms for Scala + Apache Spark, H2O, Apache 
> Flink
> * Mahout's mature Hadoop MapReduce algorithms



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (MAHOUT-1893) Fix Algorithm list on mahout.apache.org

2016-12-10 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman reassigned MAHOUT-1893:


Assignee: Andrew Musselman

> Fix Algorithm list on mahout.apache.org
> 
>
> Key: MAHOUT-1893
> URL: https://issues.apache.org/jira/browse/MAHOUT-1893
> Project: Mahout
>  Issue Type: Bug
>Reporter: Andrew Palumbo
>Assignee: Andrew Musselman
>Priority: Critical
>
> mahout.apache.org ->Algorithms lists only thin-QR and DSSVD.  Update with all 
> current algorithms.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MAHOUT-1890) Infinite loop in OpenLongObjectHashMap

2016-10-21 Thread Andrew Musselman (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15596137#comment-15596137
 ] 

Andrew Musselman commented on MAHOUT-1890:
--

Thanks anyway Michael :)

> Infinite loop in OpenLongObjectHashMap
> --
>
> Key: MAHOUT-1890
> URL: https://issues.apache.org/jira/browse/MAHOUT-1890
> Project: Mahout
>  Issue Type: Bug
>  Components: collections
>Affects Versions: 1.0.0
> Environment: java version "1.8.0_66"
> Java(TM) SE Runtime Environment (build 1.8.0_66-b17)
> Java HotSpot(TM) 64-Bit Server VM (build 25.66-b17, mixed mode)
>Reporter: Michael M.
>
> It seems that OpenLongObjectHashMap<> 
> (org.apache.mahout:mahout-collections:1.0) can enter a state where 
> containsKey (indexOfKey) ends up in an infinite loop, stuck in this loop:
> {code:title=OpenLongObjectHashMap.java|borderStyle=solid}
> while (state[i] != FREE && (state[i] == REMOVED || table[i] != key)) {
>   i -= decrement;
>   //hashCollisions++;
>   if (i < 0) {
>     i += length;
>   }
> }
> {code}
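
One general property of a probe loop with this shape: it exits only when it reaches a FREE slot or finds the key, so if every slot is FULL or REMOVED and the key is absent, it never terminates. The sketch below reproduces that behavior on a five-slot table. It is not the Mahout implementation and is not claimed to be the cause of this issue; the class `ProbeLoopSketch`, the step cap, and the sample table are hypothetical.

```java
public class ProbeLoopSketch {
    static final byte FREE = 0, FULL = 1, REMOVED = 2;

    // Same loop condition and stepping as the quoted snippet, plus a step cap
    // so the demonstration always returns.
    // Returns the number of probe steps taken, or -1 if the cap was hit.
    static int probe(long[] table, byte[] state, long key, int start, int decrement) {
        int length = table.length;
        int i = start;
        int steps = 0;
        while (state[i] != FREE && (state[i] == REMOVED || table[i] != key)) {
            if (++steps > 4 * length) {
                return -1; // without this cap the loop would spin forever
            }
            i -= decrement;
            if (i < 0) {
                i += length;
            }
        }
        return steps;
    }

    public static void main(String[] args) {
        long[] table = {10, 11, 12, 13, 14};
        byte[] state = {FULL, REMOVED, FULL, REMOVED, FULL}; // no FREE slot left
        System.out.println(probe(table, state, 99L, 3, 2)); // prints -1

        state[1] = FREE; // a single FREE slot lets the probe stop
        System.out.println(probe(table, state, 99L, 3, 2)); // prints 1
    }
}
```
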
> I haven't identified a minimum set of operations necessary to reach this 
> state, but I have generated a (fairly large) test that achieves it:
> {code:title=TestOpenLongObjectHashMap.java|borderStyle=solid}
> import java.util.Arrays;
> import java.util.List;
> import java.util.function.Consumer;
> import org.apache.mahout.math.map.OpenLongObjectHashMap;
> import org.junit.Test;
> public class TestOpenLongObjectHashMap {
> private static final List> 
> transcript =
> Arrays.asList(add(66546), add(66847), add(71319), del(71319), 
> add(80177), del(80177), add(88428),
>   add(88861), add(92709), del(92709), add(94392), 
> del(94392), add(99506), del(99506),
>   add(104232), add(104968), del(104232), del(104968), 
> add(111042), del(111042), add(123271),
>   del(123271), add(130887), del(66847), add(131387), 
> add(131537), add(131569), del(131537),
>   add(135253), del(135253), add(138781), del(138781), 
> add(141689), del(141689), add(144224),
>   del(144224), add(147237), del(147237), add(152945), 
> add(153646), del(152945), add(154915),
>   del(154915), add(155621), del(155621), add(158464), 
> add(158724), del(158724), del(158464),
>   add(174017), del(174017), add(176818), del(176818), 
> add(178344), add(178716), del(178344),
>   add(178956), del(178956), del(178716), add(181714), 
> del(181714), add(188533), del(188533),
>   add(189152), del(189152), del(131569), add(193603), 
> add(193614), add(193632), del(193614),
>   add(193650), add(193661), add(193662), del(193662), 
> add(193691), add(193761), del(193661),
>   del(193761), add(193801), add(193812), del(193650), 
> del(131387), del(193603), add(193837),
>   del(193837), add(194160), add(194175), del(194160), 
> add(194224), del(194175), add(195507),
>   add(195617), add(195838), add(196272), add(196402), 
> add(196410), del(196272), del(195507),
>   add(196426), add(196427), del(193691), add(196439), 
> add(196440), del(195838), add(196449),
>   del(196426), add(196460), add(196634), del(196449), 
> del(196402), del(196439), del(196440),
>   add(197250), add(197482), add(197531), del(193632), 
> add(197983), del(197250), del(197983),
>   del(196634), add(199455), add(199648), add(200356), 
> add(200397), add(200711), del(200356),
>   del(199648), add(201133), del(200711), add(201209), 
> del(201209), del(196410), del(200397),
>   add(205160), del(205160), del(197531), del(196427), 
> add(207533), add(207546), del(196460),
>   del(197482), add(207555), add(207556), add(207660), 
> add(207820), del(207555), del(207556),
>   del(207660), add(208887), add(208944), del(208944), 
> add(210976), del(210976), add(212301),
>   del(208887), add(213198), del(207546), del(213198), 
> add(213321), del(212301), add(213402),
>   add(214597), del(214597), add(224378), add(225915), 
> add(229080), del(213402), add(229346),
>   del(225915), del(224378), add(231030), add(231151), 
> add(231373), del(231030), del(231373),
>   del(229080), add(232821), del(232821), add(233121), 
> add(233146), del(233146), del(233121),
>   add(233947), del(233947), del(229346), add(234011), 
> add(234078), 

[jira] [Commented] (MAHOUT-1869) Create a runtime performance measuring framework for mahout

2016-08-27 Thread Andrew Musselman (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15442806#comment-15442806
 ] 

Andrew Musselman commented on MAHOUT-1869:
--

Will take a look this weekend; thanks!

> Create a runtime performance measuring framework for mahout
> ---
>
> Key: MAHOUT-1869
> URL: https://issues.apache.org/jira/browse/MAHOUT-1869
> Project: Mahout
>  Issue Type: Story
>  Components: build, Classification, Collaborative Filtering, Math
>Affects Versions: 1.0.0
>Reporter: Saikat Kanjilal
>  Labels: build
> Fix For: 1.0.0
>
>   Original Estimate: 1,008h
>  Remaining Estimate: 1,008h
>
> This proposal outlines a runtime performance module used to measure the 
> performance of various algorithms in Mahout in three major areas: 
> clustering, regression, and classification. The module will be a 
> spray/scala/akka application that can be run for any current or new 
> algorithm in Mahout and will produce a CSV file and a set of Zeppelin plots 
> showing the various performance criteria. The goal is that releasing any new 
> Mahout build will involve running a set of tests for each of the algorithms to 
> compare benchmarks from one release to another.
> The GitHub repo is here: https://github.com/skanjila/mahout. I will send a pull 
> request when I have one algorithm operational.
> Architecture
> The runtime performance application will run on top of spray/scala and akka 
> and will make async API calls into the various Mahout algorithms to generate 
> a CSV file containing the runtime performance measurements 
> for each algorithm of interest, as well as a set of Zeppelin 
> plots displaying some of these results. The spray/scala architecture will 
> leverage the Zeppelin server to create the visualizations. The discussion 
> below centers on two types of algorithms to be addressed by the 
> application.
> Clustering
> The application will consist of a set of REST APIs to do the following:
> a) A method to load and execute the runtime performance module. It takes as 
> inputs the name of the algorithm (kmeans, fuzzy kmeans), the location of a set 
> of files containing data sets of various sizes, and a set of values for the 
> number of clusters to use for each of the different data set sizes:
> /algorithm=clustering/fileLocation=/path/to/files/of/different/datasets/clusters=12,20,30,40
> The above API call will return a runId, which the client program can then use 
> to monitor the module.
> b) A method to monitor the application to ensure that it is making progress 
> towards generating the Zeppelin plots:
> /monitor/runId=456
> The above method will execute asynchronously by calling into the Mahout 
> kmeans (fuzzy kmeans) clustering implementations and will generate Zeppelin 
> plots showing the normalized time on the y-axis and the number of clusters on 
> the x-axis. The spray/scala akka framework will allow the client application 
> to receive a callback when the runtime performance calculations are 
> completed. For now, the runtime performance calculations will 
> contain: a) the ratio of the number of points clustered correctly to the 
> total number of points, and b) the total time taken for the algorithm to run. 
> These items will be shown in separate Zeppelin plots.
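
The two clustering measurements named above can be sketched as follows. This is a minimal illustration, not code from the proposed module: the class `ClusteringMetricsSketch` and its methods are hypothetical, and the sketch assumes the predicted cluster labels have already been aligned with the reference labels.

```java
public class ClusteringMetricsSketch {
    // Metric a): ratio of correctly clustered points to the total number of points.
    // Assumes predicted[i] and expected[i] use the same label numbering.
    static double correctRatio(int[] predicted, int[] expected) {
        if (predicted.length != expected.length || predicted.length == 0) {
            throw new IllegalArgumentException("label arrays must be non-empty and equal in length");
        }
        int correct = 0;
        for (int i = 0; i < predicted.length; i++) {
            if (predicted[i] == expected[i]) {
                correct++;
            }
        }
        return (double) correct / predicted.length;
    }

    // Metric b): wall-clock time of one algorithm run, in milliseconds.
    static long elapsedMillis(Runnable run) {
        long start = System.nanoTime();
        run.run();
        return (System.nanoTime() - start) / 1_000_000L;
    }

    public static void main(String[] args) {
        int[] predicted = {0, 0, 1, 1, 2, 2};
        int[] expected = {0, 0, 1, 2, 2, 2};
        // 5 of the 6 labels match.
        System.out.println("correct ratio: " + correctRatio(predicted, expected));

        // Stand-in workload; a real run would call the clustering algorithm here.
        long millis = elapsedMillis(() -> {
            double sum = 0;
            for (int i = 0; i < 1_000_000; i++) {
                sum += Math.sqrt(i);
            }
        });
        System.out.println("elapsed ms: " + millis);
    }
}
```

Each run of this kind would yield one (number of clusters, time) point and one (number of clusters, ratio) point for the separate plots.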
> Regression
> a) The runtime performance module will run the likelihood ratio test with a 
> different set of features in every run. We will introduce a REST API to run 
> the likelihood ratio test and return the results; this will once again be an 
> async call through the spray/akka stack.
> b) The runtime performance module will report the following metrics for 
> every algorithm: 1) CPU usage, 2) memory usage, and 3) time taken for the 
> algorithm to converge and run to completion. These metrics will be reported on 
> top of the Zeppelin graphs for both the regression and the different clustering 
> algorithms mentioned above.
> How does the application get run? The runtime performance measuring 
> application will be invoked from the command line. Eventually it would be 
> worthwhile to hook this into some sort of integration test suite to certify 
> the different Mahout releases.
>  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MAHOUT-1874) Broken markdown in Downloads section on Mahout site

2016-08-27 Thread Andrew Musselman (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15442717#comment-15442717
 ] 

Andrew Musselman commented on MAHOUT-1874:
--

Thanks Sanjeev

> Broken markdown in Downloads section on Mahout site
> ---
>
> Key: MAHOUT-1874
> URL: https://issues.apache.org/jira/browse/MAHOUT-1874
> Project: Mahout
>  Issue Type: Documentation
>  Components: Documentation
>Reporter: Kyriakos Georgiou
>Assignee: Andrew Musselman
>Priority: Trivial
>  Labels: Documentation, Markdown, Site
> Attachments: diff.png
>
>   Original Estimate: 5m
>  Remaining Estimate: 5m
>
> There's a space that breaks the closing back-ticks around 
> ```~/.bash_profile`` ` at 
> https://svn.apache.org/viewvc/mahout/site/mahout_cms/trunk/content/general/downloads.mdtext?view=markup#l22



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Work started] (MAHOUT-1874) Broken markdown in Downloads section on Mahout site

2016-08-27 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on MAHOUT-1874 started by Andrew Musselman.

> Broken markdown in Downloads section on Mahout site
> ---
>
> Key: MAHOUT-1874
> URL: https://issues.apache.org/jira/browse/MAHOUT-1874
> Project: Mahout
>  Issue Type: Documentation
>  Components: Documentation
>Reporter: Kyriakos Georgiou
>Assignee: Andrew Musselman
>Priority: Trivial
>  Labels: Documentation, Markdown, Site
> Attachments: diff.png
>
>   Original Estimate: 5m
>  Remaining Estimate: 5m
>
> There's a space that breaks the closing back-ticks around 
> ```~/.bash_profile`` ` at 
> https://svn.apache.org/viewvc/mahout/site/mahout_cms/trunk/content/general/downloads.mdtext?view=markup#l22



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (MAHOUT-1874) Broken markdown in Downloads section on Mahout site

2016-08-27 Thread Andrew Musselman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Musselman resolved MAHOUT-1874.
--
Resolution: Fixed

Fixed in svn revision 1758086.

> Broken markdown in Downloads section on Mahout site
> ---
>
> Key: MAHOUT-1874
> URL: https://issues.apache.org/jira/browse/MAHOUT-1874
> Project: Mahout
>  Issue Type: Documentation
>  Components: Documentation
>Reporter: Kyriakos Georgiou
>Assignee: Andrew Musselman
>Priority: Trivial
>  Labels: Documentation, Markdown, Site
> Attachments: diff.png
>
>   Original Estimate: 5m
>  Remaining Estimate: 5m
>
> There's a space that breaks the closing back-ticks around 
> ```~/.bash_profile`` ` at 
> https://svn.apache.org/viewvc/mahout/site/mahout_cms/trunk/content/general/downloads.mdtext?view=markup#l22



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

