[jira] [Comment Edited] (MAHOUT-2020) Maven repo structure malformed

2017-11-01 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234411#comment-16234411 ] Pat Ferrel edited comment on MAHOUT-2020 at 11/1/17 5:2

[jira] [Commented] (MAHOUT-2020) Maven repo structure malformed

2017-11-01 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234411#comment-16234411 ] Pat Ferrel commented on MAHOUT-2020: Nothing to do with SBT, look in the parent

[jira] [Commented] (MAHOUT-2023) Drivers broken, scopt classes not found

2017-10-31 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233555#comment-16233555 ] Pat Ferrel commented on MAHOUT-2023: To be clear we have to fix MAHOUT-2020 befo

[jira] [Resolved] (MAHOUT-1951) Drivers don't run with remote Spark

2017-10-31 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel resolved MAHOUT-1951. Resolution: Fixed > Drivers don't run with remo

[jira] [Issue Comment Deleted] (MAHOUT-1951) Drivers don't run with remote Spark

2017-10-31 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel updated MAHOUT-1951: --- Comment: was deleted (was: odd, I did not resolve this as history says, at least did not mean to

[jira] [Reopened] (MAHOUT-1951) Drivers don't run with remote Spark

2017-10-31 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel reopened MAHOUT-1951: odd, I did not resolve this as history says, at least did not mean to. > Drivers don't

[jira] [Comment Edited] (MAHOUT-2020) Maven repo structure malformed

2017-10-31 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16190340#comment-16190340 ] Pat Ferrel edited comment on MAHOUT-2020 at 11/1/17 1:2

[jira] [Updated] (MAHOUT-2020) Maven repo structure malformed

2017-10-31 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel updated MAHOUT-2020: --- Description: The maven repo is built with scala 2.10 always in the parent pom&#

[jira] [Updated] (MAHOUT-2020) Maven repo structure malformed

2017-10-31 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel updated MAHOUT-2020: --- Summary: Maven repo structure malformed (was: Maven repo structure compatibility with SBT) > Ma

Re: Error while building E-Commerce Recommendation engine with PIO 0.12.0-incubating

2017-10-31 Thread Pat Ferrel
The versions of services installed must work with each other and be compatible with the version built into PIO. So I install the latest stable services and build PIO using the latest artifacts. Unless there has been a major version change (Google Semantic Versioning) you will usually be able to

[jira] [Commented] (MAHOUT-2023) Drivers broken, scopt classes not found

2017-10-26 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16221279#comment-16221279 ] Pat Ferrel commented on MAHOUT-2023: I suspect the PR listed above is mistak

[jira] [Commented] (MAHOUT-2023) Drivers broken, scopt classes not found

2017-10-26 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16221276#comment-16221276 ] Pat Ferrel commented on MAHOUT-2023: While I agree this is a bug, I don't u

[jira] [Comment Edited] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-26 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16221263#comment-16221263 ] Pat Ferrel edited comment on MAHOUT-2020 at 10/26/17 9:3

[jira] [Comment Edited] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-26 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16221263#comment-16221263 ] Pat Ferrel edited comment on MAHOUT-2020 at 10/26/17 9:3

[jira] [Comment Edited] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-26 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16221263#comment-16221263 ] Pat Ferrel edited comment on MAHOUT-2020 at 10/26/17 9:3

[jira] [Comment Edited] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-26 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16221263#comment-16221263 ] Pat Ferrel edited comment on MAHOUT-2020 at 10/26/17 9:2

[jira] [Commented] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-26 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16221263#comment-16221263 ] Pat Ferrel commented on MAHOUT-2020: my script for creating a repo: {

[jira] [Commented] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-26 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16221258#comment-16221258 ] Pat Ferrel commented on MAHOUT-2020: The problem here is with the multi-arti

[jira] [Updated] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-26 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel updated MAHOUT-2020: --- Priority: Blocker (was: Minor) > Maven repo structure compatibility with

Re: PIO Template for User Label Inference

2017-10-25 Thread Pat Ferrel
No the current separation of concerns is: Recommender returns recommended item ids User’s DB keeps all item metadata, much of which has nothing to do with data the recommender needs. The application queries the Recommenders for item-ids which are keys into the DB that retrieve item metadata If

Re: PIO 0.12.0 on cloud server: ImportError: No module named predictionio

2017-10-24 Thread Pat Ferrel
Ok, you said, "the UR's handmade's integration test I got the following error message” so naturally assumed, On Oct 24, 2017, at 12:36 AM, Noelia Osés Fernández wrote: Hi Donald and Pat, I'm following the installation instructions at http://predictionio.incubator.apache.org/install/install-s

Re: PIO 0.12.0 on cloud server: ImportError: No module named predictionio

2017-10-23 Thread Pat Ferrel
Also you won’t be able to use the UR test data with the Apache ECom template. They are fundamentally different and the UR is currently being ported to PIO 0.12.0 ready for testing this week. On Oct 23, 2017, at 7:33 AM, Noelia Osés Fernández wrote: Hi all, I have made a clean installation of

Templates First

2017-10-20 Thread Pat Ferrel
PredictionIO is completely useless without a Template yet we seem as a group too focused on releasing PIO without regard for Templates. This IMO must change. 90% of users will never touch the code of a template and only 1% will actually create a template. These guesses come from list questions.

Templates First

2017-10-20 Thread Pat Ferrel
PredictionIO is completely useless without a Template yet we seem as a group too focused on releasing PIO without regard for Templates. This IMO must change. 90% of users will never touch the code of a template and only 1% will actually create a template. These guesses come from list questions.

Re: [ERROR] [TaskSetManager] Task 2.0 in stage 10.0 had a not serializable result

2017-10-20 Thread Pat Ferrel
is for me to read the documentation and understand how the algorithm works. Then, try again with a slightly larger dataset. Thank you very much! On 19 October 2017 at 17:15, Pat Ferrel mailto:p...@occamsmachete.com>> wrote: This sample dataset is too small with too few cooccurrences. U1 will

Re: [ERROR] [TaskSetManager] Task 2.0 in stage 10.0 had a not serializable result

2017-10-19 Thread Pat Ferrel
net/pferrel/unified-recommender-39986309 <http://www.slideshare.net/pferrel/unified-recommender-39986309>) - [Multi-domain predictive AI or how to make one thing predict another](https://developer.ibm.com/dwblog/2017/mahout-spark-correlated-cross-occurences/ <https://developer.ibm.com/dw

Re: [ERROR] [TaskSetManager] Task 2.0 in stage 10.0 had a not serializable result

2017-10-19 Thread Pat Ferrel
g/users/algorithms/intro-cooccurrence-spark.html <http://mahout.apache.org/users/algorithms/intro-cooccurrence-spark.html>) - [The Universal Recommender Slide Deck](http://www.slideshare.net/pferrel/unified-recommender-39986309 <http://www.slideshare.net/pferrel/unified-recommender-39986309>) -

Re: installing environment (stops when compiling "compiler-interface" for Scala)

2017-10-18 Thread Pat Ferrel
Memory depends on your data and the engine you are using. Spark puts all data into memory across the Spark cluster so if that is one machine, 4g will not allow more than toy or example data. Remember that PIO and Machine Learning in general works best with big data. BTW my laptop has 16g and I

Upgrading to PredictionIO 0.12.0

2017-10-18 Thread Pat Ferrel
PIO-0.12.0 by default, compiles and runs expecting ES5. If you are upgrading (not installing from clean) you will have an issue because ES1 indexes are not upgradable in any simple way. The simplest way to upgrade to pio-0.12.0 and ES5 is to do `pio export` to backup BEFORE upgrading—so export w

Upgrading to PredictionIO 0.12.0

2017-10-18 Thread Pat Ferrel
PIO-0.12.0 by default, compiles and runs expecting ES5. If you are upgrading (not installing from clean) you will have an issue because ES1 indexes are not upgradable in any simple way. The simplest way to upgrade to pio-0.12.0 and ES5 is to do `pio export` to backup BEFORE upgrading—so export w

Re: [ERROR] [TaskSetManager] Task 2.0 in stage 10.0 had a not serializable result

2017-10-18 Thread Pat Ferrel
it work with a very small dataset is because I want to be able to follow the calculations. I want to understand what the UR is doing and understand the impact of changing this or that, here or there... I find that easier to achieve with a small example in which I know exactly what's hap

Re: [ERROR] [TaskSetManager] Task 2.0 in stage 10.0 had a not serializable result

2017-10-16 Thread Pat Ferrel
3:1.0})); not retrying [ERROR] [TaskSetManager] Task 2.0 in stage 10.0 (TID 24) had a not serializable result: org.apache.mahout.math.RandomAccessSparseVector Serialization stack: ... Any ideas? On 15 October 2017 at 19:09, Pat Ferrel mailto:p...@occamsmachete.com>> wrote: This is probab

Re: [ERROR] [TaskSetManager] Task 2.0 in stage 10.0 had a not serializable result

2017-10-15 Thread Pat Ferrel
"item":"Surface","score":0.0}]} Isn't this odd? Can you guess what's going on? Thank you very much for all your support! noelia On 5 October 2017 at 19:22, Pat Ferrel mailto:p...@occamsmachete.com>> wrote: Ok, that config should work. Does the integra

Re: PredictionIO Universal Recommender user rating

2017-10-09 Thread Pat Ferrel
Yes, this is a very important point. We have found that the % of video viewed is indeed a very important factor but rather than sending some fraction to indicate the length viewed we have taken the approach before to determine the % that indicates the user liked the video. This we do by trigger

Re: Universal Recommender and PredictionIO 0.12.0 incompatibility

2017-10-06 Thread Pat Ferrel
: Hi Pat, On 4 October 2017 at 22:04, Pat Ferrel mailto:p...@actionml.com>> wrote: It looks like PIO 0.12.0 will require a code change in the UR. PIO changed ES1 support drastically when ES5 support was added and broke the UR code. We will do a quick fix to the template to address this.

Re: [ERROR] Timeout when read recent events

2017-10-06 Thread Pat Ferrel
When you query for all users in batch, the system is easily overloaded. This is the worst case query situation where no caching applies (for instance). 1) run batch queries at low input load time, because you are competing with input for access to HBase 2) throttle your query speed and/or numbe

[jira] [Created] (MAHOUT-2023) Drivers broken, scopt classes not found

2017-10-05 Thread Pat Ferrel (JIRA)
Pat Ferrel created MAHOUT-2023: -- Summary: Drivers broken, scopt classes not found Key: MAHOUT-2023 URL: https://issues.apache.org/jira/browse/MAHOUT-2023 Project: Mahout Issue Type: Bug

[jira] [Created] (MAHOUT-2023) Drivers broken, scopt classes not found

2017-10-05 Thread Pat Ferrel (JIRA)
Pat Ferrel created MAHOUT-2023: -- Summary: Drivers broken, scopt classes not found Key: MAHOUT-2023 URL: https://issues.apache.org/jira/browse/MAHOUT-2023 Project: Mahout Issue Type: Bug

Universal Recommender and PredictionIO 0.12.0 incompatibility

2017-10-04 Thread Pat Ferrel
It looks like PIO 0.12.0 will require a code change in the UR. PIO changed ES1 support drastically when ES5 support was added and broke the UR code. We will do a quick fix to the template to address this. In the meantime stay on PIO 0.11.0 if you need the UR.

Re: [ERROR] [TaskSetManager] Task 2.0 in stage 10.0 had a not serializable result

2017-10-04 Thread Pat Ferrel
What version of Scala. Spark, PIO, and UR are you using? On Oct 4, 2017, at 6:10 AM, Noelia Osés Fernández wrote: Hi all, I'm still trying to create a very simple app to learn to use PredictionIO and still having trouble. I have done pio build no problem. But when I do pio train I get a very

Re: Running Mahout on a Spark cluster

2017-10-03 Thread Pat Ferrel
wrote: The spark is included via maven classifier- the sbt line should be libraryDependencies += "org.apache.mahout" % "mahout-spark_2.11" % "0.13.1-SNAPSHOT" classifier "spark_2.1" On Tue, Oct 3, 2017 at 2:55 PM, Pat Ferrel wrote: > I’m the aforem

[jira] [Issue Comment Deleted] (MAHOUT-2019) SparseRowMatrix assign ops user for loops instead of iterateNonZero and so can be optimized

2017-10-03 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel updated MAHOUT-2019: --- Comment: was deleted (was: This may be a non-issue: Trevor said in email: {quote}The spark is

[jira] [Updated] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-03 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel updated MAHOUT-2020: --- Priority: Minor (was: Critical) > Maven repo structure compatibility with

[jira] [Commented] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-03 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16190340#comment-16190340 ] Pat Ferrel commented on MAHOUT-2020: This may be a non-issue. Trevor said in e

[jira] [Commented] (MAHOUT-2019) SparseRowMatrix assign ops user for loops instead of iterateNonZero and so can be optimized

2017-10-03 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16190338#comment-16190338 ] Pat Ferrel commented on MAHOUT-2019: This may be a non-issue: Trevor said in e

[jira] [Updated] (MAHOUT-2019) SparseRowMatrix assign ops user for loops instead of iterateNonZero and so can be optimized

2017-10-03 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel updated MAHOUT-2019: --- Priority: Major (was: Minor) > SparseRowMatrix assign ops user for loops instead of iterateNonZ

[jira] [Updated] (MAHOUT-2019) SparseRowMatrix assign ops user for loops instead of iterateNonZero and so can be optimized

2017-10-03 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel updated MAHOUT-2019: --- Priority: Minor (was: Major) > SparseRowMatrix assign ops user for loops instead of iterateNonZ

[jira] [Created] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-03 Thread Pat Ferrel (JIRA)
Pat Ferrel created MAHOUT-2020: -- Summary: Maven repo structure compatibility with SBT Key: MAHOUT-2020 URL: https://issues.apache.org/jira/browse/MAHOUT-2020 Project: Mahout Issue Type: Bug

[jira] [Created] (MAHOUT-2020) Maven repo structure compatibility with SBT

2017-10-03 Thread Pat Ferrel (JIRA)
Pat Ferrel created MAHOUT-2020: -- Summary: Maven repo structure compatibility with SBT Key: MAHOUT-2020 URL: https://issues.apache.org/jira/browse/MAHOUT-2020 Project: Mahout Issue Type: Bug

Re: Running Mahout on a Spark cluster

2017-10-03 Thread Pat Ferrel
Actually if you require scala 2.11 and spark 2.1 you have to use the current master (o.13.0 does not support these) and also can’t use sbt, unless you have some trick I haven’t discovered. On Oct 3, 2017, at 12:55 PM, Pat Ferrel wrote: I’m the aforementioned pferrel @Hoa, thanks for that

Re: Running Mahout on a Spark cluster

2017-10-03 Thread Pat Ferrel
I’m the aforementioned pferrel @Hoa, thanks for that reference, I forgot I had that example. First don’t use the Hadoop part of Mahout, it is not supported and will be deprecated. The Spark version of cooccurrence will be supported. You find it in the SimilarityAnalysis object. If you go back

[jira] [Created] (MAHOUT-2019) SparseRowMatrix assign ops user for loops instead of iterateNonZero and so can be optimized

2017-10-02 Thread Pat Ferrel (JIRA)
Pat Ferrel created MAHOUT-2019: -- Summary: SparseRowMatrix assign ops user for loops instead of iterateNonZero and so can be optimized Key: MAHOUT-2019 URL: https://issues.apache.org/jira/browse/MAHOUT-2019

[jira] [Created] (MAHOUT-2019) SparseRowMatrix assign ops user for loops instead of iterateNonZero and so can be optimized

2017-10-02 Thread Pat Ferrel (JIRA)
Pat Ferrel created MAHOUT-2019: -- Summary: SparseRowMatrix assign ops user for loops instead of iterateNonZero and so can be optimized Key: MAHOUT-2019 URL: https://issues.apache.org/jira/browse/MAHOUT-2019

Re: [DISCUSS] Proposed resolution to graduate the PredictionIO podling

2017-09-29 Thread Pat Ferrel
oh, nm I found it. Pasted below, there were no dissenters to Donald’s detailed assessment. On Sep 29, 2017, at 3:27 PM, Pat Ferrel wrote: Actually we did go over the maturity checklist ourselves. Donald, maybe you can forwards the thread here. pasted from the thread on d

Re: [DISCUSS] Proposed resolution to graduate the PredictionIO podling

2017-09-29 Thread Pat Ferrel
oh, nm I found it. Pasted below, there were no dissenters to Donald’s detailed assessment. On Sep 29, 2017, at 3:27 PM, Pat Ferrel wrote: Actually we did go over the maturity checklist ourselves. Donald, maybe you can forwards the thread here. pasted from the thread on dev

Re: [DISCUSS] Proposed resolution to graduate the PredictionIO podling

2017-09-29 Thread Pat Ferrel
Actually we did go over the maturity checklist ourselves. Donald, maybe you can forwards the thread here. On Sep 29, 2017, at 2:04 PM, Bertrand Delacretaz wrote: Hi John, On Fri, Sep 29, 2017 at 2:59 PM, John D. Ament wrote: > ...I wouldn't conflate lack of mentor engagement with a project'

Re: [RESULT][VOTE] Resolution to create a TLP from graduating Incubator podling

2017-09-29 Thread Pat Ferrel
>> Apache PredictionIO Project: >> >> * Alex Merritt >> * Andrew Kyle Purtell >> * Chan Lee >> * Donald Szeto >> * Felipe Oliveira >> * James Taylor >> * Justin Yip >> * Kenneth C

Re: Eventserver API in an Engine?

2017-09-23 Thread Pat Ferrel
sion with this thread is to: Enable single-process, single network-listener PredictionIO app deployment (i.e. Queries & Events APIs in the same process.) Attempting to address some previous questions & statements… From Pat Ferrel on Tue, 11 Jul 2017 10:53:48 -0700 (PDT): > h

Re: How to training and deploy on different machine?

2017-09-21 Thread Pat Ferrel
e template from machine [TrainingServer] (only need to do this once) Then run `pio deploy` It is not a Spark driver or executor for training Write a cron job of `pio deploy` It is permanent. Thanks Brian On Wed, Sep 20, 2017 at 11:16 PM, Pat Ferrel wrote: > Yes, this is the recom

Re: How to training and deploy on different machine?

2017-09-20 Thread Pat Ferrel
Yes, this is the recommended config (Postgres is not, but later). Spark is only needed during training but the `pio train` process creates drives and executors in Spark. The driver will be the `pio train` machine so you must install pio on it. You should have 2 Spark machines at least because th

Re: Unable to connect to all storage backends successfully

2017-09-20 Thread Pat Ferrel
meaning is “firstcluster” the cluster name in your Elasticsearch configuration? On Sep 19, 2017, at 8:54 PM, Vaghawan Ojha wrote: I think the problem is with Elasticsearch, are you sure the cluster exists in elasticsearch configuration? On Wed, Sep 20, 2017 at 8:17 AM, Jim Miller mailto:jemi

[jira] [Closed] (PIO-32) create component upgrade releases

2017-09-19 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/PIO-32?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat Ferrel closed PIO-32. - Resolution: Fixed > create component upgrade releases > - > >

Re: [VOTE] Apache PredictionIO (incubating) 0.12.0 Release (RC2)

2017-09-14 Thread Pat Ferrel
The last release was hung up by the IPMC regarding content licensing issues and libraries used by the doc site, which we promised to address in this release. Have these been resolved, don’t recall the specifics? It would be great to fly through the IPMC vote without issue. On Sep 14, 2017, at

Re: Universal Recommender - search by subtext/Unicode

2017-09-13 Thread Pat Ferrel
the “subtext”? Not sure what you mean by “subtext” - I mean if the I look for an item like "ifone" instead of "iphone", ie. make an error in the spelling, would it still work ?? On Wed, Sep 13, 2017 at 1:37 PM, Pat Ferrel mailto:p...@occamsmachete.com>> wrote: 1) Yes, of

Re: Universal Recommender : seasonality of product

2017-09-13 Thread Pat Ferrel
s out of the data"? On Wed, Sep 13, 2017 at 1:42 PM, Pat Ferrel mailto:p...@occamsmachete.com>> wrote: This is done with blacklisting. The default config blacklists all items in the training data that the users has taken the primary event on. So if your primary event is “buy” then

Re: Universal Recommender : seasonality of product

2017-09-13 Thread Pat Ferrel
This is done with blacklisting. The default config blacklists all items in the training data that the users has taken the primary event on. So if your primary event is “buy” then once a user has bought a particular table they will not be recommended that table again until the “buy” event ages ou

Re: Universal Recommender - search by subtext/Unicode

2017-09-13 Thread Pat Ferrel
1) Yes, of course. Use UTF-8 encoding. 2) I don’t understand this question. The UR is not a search engine, what kind of recommendation are looking for? The best recommendation from a list of items? The best recommendation that contains some text in the “subtext”? Not sure what you mean by “subt

Re: How to give bias on newer item?

2017-09-08 Thread Pat Ferrel
With the Universal Recommender there are a robust set of business rules including boosting by item properties. If you add a year+week to every item, then in the query you boost items from a range of weeks by differing amounts you can implement any “decay function you want without modifying code.

Re: Graduation to TLP

2017-09-07 Thread Pat Ferrel
themselves. It may be good. In fact, most of committers carry out a task with limited time. Anyway I hope the project will progress in a good direction. 2017-09-06 3:43 GMT+09:00 Pat Ferrel : > I personally don’t see much benefit in removing people unless they prove the > exception. AFA

Re: Validate the built model

2017-09-06 Thread Pat Ferrel
We do cross-validation tests to see how well the model predicts actual behavior. As to the best data mix, cross-validation works with any engine tuning or data input. Typically this requires re-traiing between test runs so make sure you use exatly the same training/test split. If you want to exa

Re: Train a model without stopping

2017-09-06 Thread Pat Ferrel
The UR does this automatically. Once deployed you never have to deploy a second time. When a new `pio train` happens the new model is hot-swapped to replace the old, which is then erased, so there is no re-deploy and no downtime. Yes, it uses Elasticsearch aliases but most other Templates do not

Re: Graduation to TLP

2017-09-05 Thread Pat Ferrel
rom > main ASF doc) > - IN10, IN20 > > Let me know what you think. > > On Fri, Sep 1, 2017 at 10:32 AM, Pat Ferrel wrote: > >> The Chair, PMC, and Committers may be different after graduation. >> PMC/committers are sometimes not active committers but can have a &g

Re: Recommender for social media

2017-09-05 Thread Pat Ferrel
Actually IMO it is not more complex, it is just far better documented and more flexible. If you don’t need the features it is just as simple as the Apache PIO Templates. I could argue the UR is simpler since you don’t need to $set every item and user, they are determined automatically from the d

Re: Error while running pio train + Universal Recommender

2017-09-04 Thread Pat Ferrel
Those are just the Mahout code checks for GPUs. It falls back to using CPUs when they are not found so no error and nothing to fix. On Sep 3, 2017, at 11:27 PM, Saarthak Chandra wrote: Hi, I face the following error while running `pio train` command: [INFO] [RootSolverFactory$] org.apache.m

Re: Securing Event Server on Heroku?

2017-09-01 Thread Pat Ferrel
TLS/SSL is required along with authentication of the HTTPS requests. I’m not familiar with Heroku but the Proxy must authenticate the incoming connections. Nginx has basic auth and is a fast proxy, for instance. A cheap, dirty, and not recommended unless it is your only option, is to set your s

Re: Graduation to TLP

2017-09-01 Thread Pat Ferrel
The Chair, PMC, and Committers may be different after graduation. PMC/committers are sometimes not active committers but can have a valuable role as mentors, in non-technical roles, as support people on the mailing list, or as sometimes committers who don’t seem very active but come in every so

Re: Error: Could not find or load main class org.apache.predictionio.tools.console.Console

2017-08-31 Thread Pat Ferrel
Ubuntu 16.04.3 x64 -- Paritosh Piplewar Sent with Airmail On 30 August 2017 at 5:22:43 PM, Pat Ferrel (p...@occamsmachete.com <mailto:p...@occamsmachete.com>) wrote: > Can you explain how you installed and what your problem is? The link below > doesn’t contain much information.

Re: Graduation to TLP

2017-08-30 Thread Pat Ferrel
while due to family issues, but I'd be happy to volunteer for release management. On Wed, Aug 30, 2017 at 4:30 PM, Pat Ferrel wrote: > Along with the link Donald gave, check this out: > http://community.apache.org/apache-way/apache-project-maturity-model.html > <http://com

Re: Graduation to TLP

2017-08-30 Thread Pat Ferrel
what the VP vs Member responsibilities look like. Let's graduate. +1 *Mars On Wed, Aug 30, 2017 at 15:21 Pat Ferrel wrote: > I have had several people tell me they want to wait until PIO is not > incubating before using it. This even after explaining that “incubating” > has mor

Re: Graduation to TLP

2017-08-30 Thread Pat Ferrel
I have had several people tell me they want to wait until PIO is not incubating before using it. This even after explaining that “incubating” has more to do with getting into the Apache Way of doing things and has no direct link to quality or community. I can only conclude from this that “incuba

Re: spark-itemsimilarity scalability / Spark parallelism issues (SimilarityAnalysis.cooccurrencesIDSs)

2017-08-30 Thread Pat Ferrel
Matt, I’m interested in following up on this. If you can’t do a PR, can you describe what you did a bit more? On Aug 21, 2017, at 12:05 PM, Pat Ferrel wrote: Matt I’ll create a feature branch of Mahout in my git repo for simplicity (we are in code freeze for Mahout right now) Then if you

Re: Error: Could not find or load main class org.apache.predictionio.tools.console.Console

2017-08-30 Thread Pat Ferrel
Can you explain how you installed and what your problem is? The link below doesn’t contain much information. On Aug 29, 2017, at 9:02 PM, Paritosh Piplewar wrote: Yes I'm running pio from inside bin directory. Sent from my iPhone On 30-Aug-2017, at 3:41 AM, Mars Hall mailto:mars.h...@sales

Re: sbt.ResolveException: unresolved dependency: org.apache.predictionio#pio-build;0.11.0-incubrating

2017-08-22 Thread Pat Ferrel
haha yeah and fix the typo too but I think the other instructions apply also. Thanks Yevgeny. On Aug 22, 2017, at 9:44 AM, Yevgeny Khodorkovsky wrote: ...incubRating... :) On Tue, Aug 22, 2017 at 09:40 Pat Ferrel mailto:p...@occamsmachete.com>> wrote: Oh, I see which template no

Re: sbt.ResolveException: unresolved dependency: org.apache.predictionio#pio-build;0.11.0-incubrating

2017-08-22 Thread Pat Ferrel
core" % pioVersion % "provided", When you are done and it works, consider submitting a PR to the original repo. On Aug 22, 2017, at 9:22 AM, Pat Ferrel wrote: One error is the version of PIO in your templates build.sbt does not match the one you have installed. But not all tem

Re: sbt.ResolveException: unresolved dependency: org.apache.predictionio#pio-build;0.11.0-incubrating

2017-08-22 Thread Pat Ferrel
You template is linking to org.apache.predictionio#pio-build;0.10.0-incubrating, what do you have installed? org.apache.predictionio#pio-build;0.11.0-incubrating? Looks like you have to change your templates build.sbt to link to the artifact you have built. On Aug 22, 2017, at 3:52 AM, Abhima

Re: spark-itemsimilarity scalability / Spark parallelism issues (SimilarityAnalysis.cooccurrencesIDSs)

2017-08-21 Thread Pat Ferrel
se e.g.: in AtB. https://github.com/apache/mahout/blob/08e02602e947ff945b9bd73ab5f0b45863df3e53/math-scala/src/main/scala/org/apache/mahout/math/scalabindings/package.scala#L431 +1 to PR. thanks for pointing this out. --andy ____ From: Pat Ferrel Sent: Monday

Re: spark-itemsimilarity scalability / Spark parallelism issues (SimilarityAnalysis.cooccurrencesIDSs)

2017-08-21 Thread Pat Ferrel
SparseRowMatrix instances to a custom subclass of SparseRowMatrix; this new class overrides the AbstractMatrix implementation of apply(matrix, function). FWIW, all of my app's existing tests pass and it produces the same results as before these changes. Matt On 8/21/17, 1:53 PM,

Re: spark-itemsimilarity scalability / Spark parallelism issues (SimilarityAnalysis.cooccurrencesIDSs)

2017-08-21 Thread Pat Ferrel
through the implications. If I can also test it I have some large real-world data where I can test real-world speedup. On Aug 21, 2017, at 10:53 AM, Pat Ferrel wrote: Interesting indeed. What is “massive”? Does the change pass all unit tests? On Aug 17, 2017, at 1:04 PM, Scruggs, Matt wrote

Re: spark-itemsimilarity scalability / Spark parallelism issues (SimilarityAnalysis.cooccurrencesIDSs)

2017-08-21 Thread Pat Ferrel
you have to move the data to CPU and back > to memory to distributed it around possibly multiple times, you may wind up > with something much slower than you would have had if you were to attack > the problem directly. > > > > On Wed, Aug 16, 2017 at 4:47 PM, Pat Ferrel wrote:

Re: Getting error in

2017-08-18 Thread Pat Ferrel
That template has not been moved to use the new Apache namespace. It is still looking for classes with io.prediciton instead of org.apache.predictionio… You will have to update all the imports and build.sbt to use the apache namespace. This fact is noted in the template gallery entry. On Aug 1

Re: spark-itemsimilarity scalability / Spark parallelism issues (SimilarityAnalysis.cooccurrencesIDSs)

2017-08-16 Thread Pat Ferrel
adoc on those methods mentions they shouldn't be used unless absolutely necessary due to their O(log n) complexity. Thanks for your time...this is fun stuff! Matt On 8/15/17, 10:15 AM, "Pat Ferrel" wrote: > Great, this is the best way to use the APIs. The big win with CCO, the alg

Re: spark-itemsimilarity scalability / Spark parallelism issues (SimilarityAnalysis.cooccurrencesIDSs)

2017-08-15 Thread Pat Ferrel
n) operations I mentioned seem to take >95% of runtime. > > Thanks, > Matt > > From: Pat Ferrel > Sent: Monday, August 14, 2017 11:02:42 PM > To: user@mahout.apache.org > Subject: Re: spark-itemsimilarity scalability / Spark parallelism issues > (SimilarityAnal

Re: spark-itemsimilarity scalability / Spark parallelism issues (SimilarityAnalysis.cooccurrencesIDSs)

2017-08-14 Thread Pat Ferrel
Are you using the CLI? If so it’s likely that there is only one partition of the data. If you use Mahout in the Spark shell or using it as a lib, do a repartition on the input data before passing it into SimilarityAnalysis.cooccurrencesIDSs. I repartition to 4*total cores to start with and set

Re: Hi I would like to join the google group

2017-08-11 Thread Pat Ferrel
Subscribe here: http://predictionio.incubator.apache.org/support/ The site has lot of answers and docs too. On Aug 11, 2017, at 11:39 AM, Trey Zhong wrote: Hi I would like to join the google group of predictionIO user

[Bug 1512992] Re: package zlib1g-dev 1:1.2.8.dfsg-2ubuntu4 failed to install/upgrade: trying to overwrite '/usr/include/i386-linux-gnu/zconf.h', which is also in package lib32z1-dev 1:1.2.8.dfsg-2ubu

2017-08-10 Thread Pat Ferrel
Just installed Xenial with all updates and can't install zlib1g-dev. It does not exist in proposed (tried from there) so I assume it's in the usual update repos. It looks like it is still broken. Is there a work around? sudo apt-get -o Dpkg::Options::="--force-overwrite" install zlib1g-dev:amd64 R

[Touch-packages] [Bug 1512992] Re: package zlib1g-dev 1:1.2.8.dfsg-2ubuntu4 failed to install/upgrade: trying to overwrite '/usr/include/i386-linux-gnu/zconf.h', which is also in package lib32z1-dev 1

2017-08-10 Thread Pat Ferrel
Just installed Xenial with all updates and can't install zlib1g-dev. It does not exist in proposed (tried from there) so I assume it's in the usual update repos. It looks like it is still broken. Is there a work around? sudo apt-get -o Dpkg::Options::="--force-overwrite" install zlib1g-dev:amd64 R

Re: How to config a high availability eventserver for PredictionIO ?

2017-08-07 Thread Pat Ferrel
A truly HA cluster is often not required depending on what you use it for. Can you share what your application is? The EventServer in my experience (I wrote the pages references below) has never crashed because of input. I think the only crash modes I’ve seen involved disk full on some service

Re: Train and deploy on the fly dynamically

2017-08-05 Thread Pat Ferrel
PredictionIO only supports batch training of models. If you want online training that updates models with every new input be aware that DL4J is not fast enough to guarantee what some call “kappa” style online learning and PIO does not directly support it. One hack I have used with another onli

Re: Batch Queries

2017-08-04 Thread Pat Ferrel
This is a question for the Apache PredictionIO mailing list. I think there has been work done on implementing batch queries in the next version of PIO. Not sure if it will meet your needs. On Aug 4, 2017, at 2:59 PM, l...@platterz.ca wrote: The article about implementing batch queries to PIO

Re: Error when importing data

2017-08-03 Thread Pat Ferrel
splitting this apart for production. On Aug 3, 2017, at 8:32 AM, Pat Ferrel wrote: It should be easy to try a smaller batch of data first since we are just guessing On Aug 2, 2017, at 11:22 PM, Carlos Vidal mailto:carlos.vi...@beeva.com>> wrote: Hello Mahesh, Pat Thanks for your answers.

<    1   2   3   4   5   6   7   8   9   10   >