Re: Not able to train data

2017-10-26 Thread Donald Szeto
How is your HBase deployment like? Is it running on a single machine, or clustered? Is it configured to use the embedded ZooKeeper? If you are using a single machine deployment with embedded ZooKeeper, it is known to be a pretty unstable combination. If the input data that you are testing with alr

Re: Not able to train data

2017-10-26 Thread Abhimanyu Nagrath
Hi Donald, Checked pio.log and found the following error while training : 1. ERROR org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher [main] - hconnection-0x21325036, quorum=localhost:2181, baseZNode=/hbase Received unexpected KeeperException, re-throwing exception org.apache.zookeeper.KeeperExc

Re: Not able to train data

2017-10-26 Thread Donald Szeto
Send an empty e-mail to user-unsubscr...@predictionio.apache.org and follow instructions in the reply. On Thu, Oct 26, 2017 at 12:10 AM Seshachalam Malisetti wrote: > how do unsubscribe from this list ? please help > > Sent from Nylas Mail >

Re: Not able to train data

2017-10-26 Thread Donald Szeto
Hi Abhimanyu, Is there more information from Spark web UI, or pio.log from where you run the `pio train` command? Also, sharing your full modifications somewhere on GitHub will be very helpful. Regards, Donald On Thu, Oct 26, 2017 at 2:22 AM Abhimanyu Nagrath < abhimanyunagr...@gmail.com> wrote:

Re: Not able to train data

2017-10-26 Thread Abhimanyu Nagrath
HI Vaghawan, Thanks for the reply. Yes already tried that but still its same getting same error . Regards, Abhimanyu On Thu, Oct 26, 2017 at 2:40 PM, Vaghawan Ojha wrote: > Hi Abhimanyu, > > I've never tried the classification template, So I'm not sure about how > much time would it exactly tak

Re: Not able to train data

2017-10-26 Thread Vaghawan Ojha
Hi Abhimanyu, I've never tried the classification template, So I'm not sure about how much time would it exactly take. But as per your error, your model is not going any far from stage 1. "Task 0 in stage 1.0 failed 1 times, " . Probably something to do with the OOMs. https://stackoverflow.com/qu

Re: Not able to train data

2017-10-26 Thread Abhimanyu Nagrath
Hi Vaghawan, For debugging I just made a change I just reduced the number if features to 1 record count being the same as 1 Million and hardware is (240 GB RAM , 32 cores and 100 GB SWAP) and training is still going on since 2 hrs.Is it an expected behavior. On which factors does the training tim

Re: Not able to train data

2017-10-26 Thread Vaghawan Ojha
Hi Abhimanyu, In that case, you should probably wait for someone else to help, I'd done the same thing with one of the old recommendation template, but that was with 0.10.0 . Thanks Vaghawan On Thu, Oct 26, 2017 at 12:56 PM, Abhimanyu Nagrath < abhimanyunagr...@gmail.com> wrote: > Hi Vaghawan,

Re: Not able to train data

2017-10-26 Thread Abhimanyu Nagrath
Hi Vaghawan, I have made that template compatible with the version mentioned above. Changed versions of engine.json and changed packages name. Regards, Abhimanyu On Thu, Oct 26, 2017 at 12:39 PM, Vaghawan Ojha wrote: > Hi Abhimanyu, > > I don't think this template works with version 0.11.0. A

Re: Not able to train data

2017-10-26 Thread Seshachalam Malisetti
how do unsubscribe from this list ? please helpSent from Nylas Mail, the best free email app for work On Oct 26 2017, at 12:39 pm, Vaghawan Ojha wrote: Hi Abhimanyu, I don't think this template works with version 0.11.0. A

Re: Not able to train data

2017-10-26 Thread Vaghawan Ojha
Hi Abhimanyu, I don't think this template works with version 0.11.0. As per the template : update for PredictionIO 0.9.2, including: I don't think it supports the latest pio. You rather switch it to 0.9.2 if you want to experiment it. On Thu, Oct 26, 2017 at 12:52 PM, Abhimanyu Nagrath < abhima

Re: Not able to train data

2017-10-26 Thread Abhimanyu Nagrath
Hi Vaghawan , I am using v0.11.0-incubating with (ES - v5.2.1 , Hbase - 1.2.6 , Spark - 2.1.0). Regards, Abhimanyu On Thu, Oct 26, 2017 at 12:31 PM, Vaghawan Ojha wrote: > Hi Abhimanyu, > > Ok, which version of pio is this? Because the template looks old to me. > > On Thu, Oct 26, 2017 at 12:4

Re: Not able to train data

2017-10-26 Thread Vaghawan Ojha
Hi Abhimanyu, Ok, which version of pio is this? Because the template looks old to me. On Thu, Oct 26, 2017 at 12:44 PM, Abhimanyu Nagrath < abhimanyunagr...@gmail.com> wrote: > Hi Vaghawan, > > yes, the spark master connection string is correct I am getting executor > fails to connect to spark m

Re: Not able to train data

2017-10-26 Thread Abhimanyu Nagrath
Hi Vaghawan, yes, the spark master connection string is correct I am getting executor fails to connect to spark master after 4-5 hrs. Regards, Abhimanyu On Thu, Oct 26, 2017 at 12:17 PM, Sachin Kamkar wrote: > It should be correct, as the user got the exception after 3-4 hours of > starting.

Re: Not able to train data

2017-10-25 Thread Sachin Kamkar
It should be correct, as the user got the exception after 3-4 hours of starting. So looks like something else broke. OOM? With Regards, Sachin ⚜KTBFFH⚜ On Thu, Oct 26, 2017 at 12:15 PM, Vaghawan Ojha wrote: > "Executor failed to connect with master ", are you sure the --master > spark://*

Re: Not able to train data

2017-10-25 Thread Vaghawan Ojha
"Executor failed to connect with master ", are you sure the --master spark://*.*.*.*:7077 is correct? Like the one you copied from the spark master's web ui? sometimes having that wrong fails to connect with the spark master. Thanks On Thu, Oct 26, 2017 at 12:02 PM, Abhimanyu Nagrath < abhimanyu

Not able to train data

2017-10-25 Thread Abhimanyu Nagrath
I am new to predictionIO . I am using template https://github.com/EmergentOrder/template-scala-probabilistic-classifier-batch-lbfgs . My training dataset count is 1184603 having approx 6500 features. I am using ec2 r4.8xlarge system (240 GB RAM, 32 Cores, 200 GB Swap). I tried two ways for train