Re: [Dev] [ML] Getting error while creating a dataset

2015-10-01 Thread Malintha Adikari
Hi Maheshakya,

Thank you very much for the reply. Yes that was the reason and I could
solve the issue.

Regards,
Malintha

On Fri, Oct 2, 2015 at 9:34 AM, Maheshakya Wijewardena 
wrote:

> Hi Malintha,
>
> Here is the excerpt from the first few lines of the
> sloan-school-of-management dataset:
>
>  *88, 92,  2, 99, 16, 66, 94, 37, 70,  0,  0, 24, 42, 65,100,100, 8*
>  80,100, 18, 98, 60, 66,100, 29, 42,  0,  0, 23, 42, 61, 56, 98, 8
>   0, 94,  9, 57, 20, 19,  7,  0, 20, 36, 70, 68,100,100, 18, 92, 8
>  95, 82, 71,100, 27, 77, 77, 73,100, 80, 93, 42, 56, 13,  0,  0, 9
>  68,100,  6, 88, 47, 75, 87, 82, 85, 56,100, 29, 75,  6,  0,  0, 9
>  70,100,100, 97, 70, 81, 45, 65, 30, 49, 20, 33,  0, 16,  0,  0, 1
>  40,100,  0, 81, 15, 58,100, 57, 47, 87, 50, 88, 40, 42, 36,  0, 4
>   3, 71,  0, 95, 45,100,100, 99, 79, 78, 48, 53, 31, 24, 54,  0, 7
>
> As you can see, there is no header row (a row with feature names) in this
> csv file. At the dataset creation, if you did not specify that there is no
> header row in the dataset, ML will automatically take the first row as the
> header row and the feature names are derived from that.
> If the first row is taken as the header row, you can see that there are
> duplicate entries: 0, 100
> In ML, there cannot be multiple features with the same name.
>
> At dataset creation, please select "No" for "Column header available", or
> add a header row manually into the data file before uploading.
>
> Best regards.
>
> On Fri, Oct 2, 2015 at 8:54 AM, Nirmal Fernando  wrote:
>
>> Hi Malintha,
>>
>> Thanks for trying ML. @Wije can you please check?
>>
>> On Fri, Oct 2, 2015 at 1:09 AM, Malintha Adikari 
>> wrote:
>>
>>> Hi,
>>>
>>> I am trying to create a dataset from 748KB sized data file [1] and
>>> getting following error.
>>>
>>> [2015-10-02 01:03:38,769]  INFO
>>> {org.wso2.carbon.ml.core.impl.MLDatasetProcessor} -  [Created] MLDataset
>>> [id=1, name=digitdd, tenantId=-1234, userName=admin, dataSourceType=file,
>>> dataTargetType=file, sourcePath=null, dataType=csv, comments=,
>>> version=1.0.0, containsHeader=true, status=null]
>>> [2015-10-02 01:03:40,537]  WARN
>>> {org.wso2.carbon.ml.database.internal.MLDatabaseUtils} -  An error occurred
>>> while enabling autocommit: PooledConnection has already been closed.
>>> java.sql.SQLException: PooledConnection has already been closed.
>>> at
>>> org.apache.tomcat.jdbc.pool.DisposableConnectionFacade.invoke(DisposableConnectionFacade.java:86)
>>> at com.sun.proxy.$Proxy16.setAutoCommit(Unknown Source)
>>> at
>>> org.wso2.carbon.ml.database.internal.MLDatabaseUtils.enableAutoCommit(MLDatabaseUtils.java:153)
>>> at
>>> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2370)
>>> at
>>> org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
>>> at
>>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>>> at
>>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>> at java.lang.Thread.run(Thread.java:745)
>>> [2015-10-02 01:03:40,550] ERROR
>>> {org.wso2.carbon.ml.core.impl.SummaryStatsGenerator} -  Error occurred
>>> while calculating summary statistics for dataset version 1: An error
>>> occurred while updating the database with summary statistics of the dataset
>>> 1: 16
>>> org.wso2.carbon.ml.database.exceptions.DatabaseHandlerException: An
>>> error occurred while updating the database with summary statistics of the
>>> dataset 1: 16
>>> at
>>> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2366)
>>> at
>>> org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
>>> at
>>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>>> at
>>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>> at java.lang.Thread.run(Thread.java:745)
>>> Caused by: java.lang.ArrayIndexOutOfBoundsException: 16
>>> at
>>> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2329)
>>> ... 4 more
>>>
>>> What could be the possible reason for this error ?
>>>
>>> [1]
>>> http://ocw.mit.edu/courses/sloan-school-of-management/15-097-prediction-machine-learning-and-statistics-spring-2012/datasets/digits.csv
>>>
>>> Regards,
>>> Malintha
>>>
>>> --
>>> *Malintha Adikari*
>>> Software Engineer
>>> WSO2 Inc.; http://wso2.com
>>> lean.enterprise.middleware
>>>
>>> Mobile: +94 71 2312958
>>> Blog:http://malinthas.blogspot.com
>>> Page:   http://about.me/malintha
>>>
>>
>>
>>
>> --
>>
>> Thanks & regards,
>> Nirmal
>>
>> Team Lead - WSO2 Machine Learner
>> Associate Technical Lead - Data Technologies Team, WSO2 Inc.
>> Mobile: +94715779733
>> Blog: http://nirmalfdo.blogspot.com/
>>
>>
>>
>
>
> --
> Pruthuvi Maheshakya Wijewardena
> Software Engineer
> WSO2 : http://wso2.com/
> Email: mahesha...@wso2.com
> Mobile: +94711228855
>

Re: [Dev] [ML] Getting error while creating a dataset

2015-10-01 Thread Maheshakya Wijewardena
Hi Malintha,

Here is the excerpt from the first few lines of the
sloan-school-of-management dataset:

 *88, 92,  2, 99, 16, 66, 94, 37, 70,  0,  0, 24, 42, 65,100,100, 8*
 80,100, 18, 98, 60, 66,100, 29, 42,  0,  0, 23, 42, 61, 56, 98, 8
  0, 94,  9, 57, 20, 19,  7,  0, 20, 36, 70, 68,100,100, 18, 92, 8
 95, 82, 71,100, 27, 77, 77, 73,100, 80, 93, 42, 56, 13,  0,  0, 9
 68,100,  6, 88, 47, 75, 87, 82, 85, 56,100, 29, 75,  6,  0,  0, 9
 70,100,100, 97, 70, 81, 45, 65, 30, 49, 20, 33,  0, 16,  0,  0, 1
 40,100,  0, 81, 15, 58,100, 57, 47, 87, 50, 88, 40, 42, 36,  0, 4
  3, 71,  0, 95, 45,100,100, 99, 79, 78, 48, 53, 31, 24, 54,  0, 7

As you can see, there is no header row (a row with feature names) in this
csv file. At the dataset creation, if you did not specify that there is no
header row in the dataset, ML will automatically take the first row as the
header row and the feature names are derived from that.
If the first row is taken as the header row, you can see that there are
duplicate entries: 0, 100
In ML, there cannot be multiple features with the same name.

At dataset creation, please select "No" for "Column header available", or
add a header row manually into the data file before uploading.

Best regards.

On Fri, Oct 2, 2015 at 8:54 AM, Nirmal Fernando  wrote:

> Hi Malintha,
>
> Thanks for trying ML. @Wije can you please check?
>
> On Fri, Oct 2, 2015 at 1:09 AM, Malintha Adikari 
> wrote:
>
>> Hi,
>>
>> I am trying to create a dataset from 748KB sized data file [1] and
>> getting following error.
>>
>> [2015-10-02 01:03:38,769]  INFO
>> {org.wso2.carbon.ml.core.impl.MLDatasetProcessor} -  [Created] MLDataset
>> [id=1, name=digitdd, tenantId=-1234, userName=admin, dataSourceType=file,
>> dataTargetType=file, sourcePath=null, dataType=csv, comments=,
>> version=1.0.0, containsHeader=true, status=null]
>> [2015-10-02 01:03:40,537]  WARN
>> {org.wso2.carbon.ml.database.internal.MLDatabaseUtils} -  An error occurred
>> while enabling autocommit: PooledConnection has already been closed.
>> java.sql.SQLException: PooledConnection has already been closed.
>> at
>> org.apache.tomcat.jdbc.pool.DisposableConnectionFacade.invoke(DisposableConnectionFacade.java:86)
>> at com.sun.proxy.$Proxy16.setAutoCommit(Unknown Source)
>> at
>> org.wso2.carbon.ml.database.internal.MLDatabaseUtils.enableAutoCommit(MLDatabaseUtils.java:153)
>> at
>> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2370)
>> at
>> org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
>> at
>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>> at
>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>> at java.lang.Thread.run(Thread.java:745)
>> [2015-10-02 01:03:40,550] ERROR
>> {org.wso2.carbon.ml.core.impl.SummaryStatsGenerator} -  Error occurred
>> while calculating summary statistics for dataset version 1: An error
>> occurred while updating the database with summary statistics of the dataset
>> 1: 16
>> org.wso2.carbon.ml.database.exceptions.DatabaseHandlerException: An error
>> occurred while updating the database with summary statistics of the dataset
>> 1: 16
>> at
>> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2366)
>> at
>> org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
>> at
>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>> at
>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>> at java.lang.Thread.run(Thread.java:745)
>> Caused by: java.lang.ArrayIndexOutOfBoundsException: 16
>> at
>> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2329)
>> ... 4 more
>>
>> What could be the possible reason for this error ?
>>
>> [1]
>> http://ocw.mit.edu/courses/sloan-school-of-management/15-097-prediction-machine-learning-and-statistics-spring-2012/datasets/digits.csv
>>
>> Regards,
>> Malintha
>>
>> --
>> *Malintha Adikari*
>> Software Engineer
>> WSO2 Inc.; http://wso2.com
>> lean.enterprise.middleware
>>
>> Mobile: +94 71 2312958
>> Blog:http://malinthas.blogspot.com
>> Page:   http://about.me/malintha
>>
>
>
>
> --
>
> Thanks & regards,
> Nirmal
>
> Team Lead - WSO2 Machine Learner
> Associate Technical Lead - Data Technologies Team, WSO2 Inc.
> Mobile: +94715779733
> Blog: http://nirmalfdo.blogspot.com/
>
>
>


-- 
Pruthuvi Maheshakya Wijewardena
Software Engineer
WSO2 : http://wso2.com/
Email: mahesha...@wso2.com
Mobile: +94711228855
___
Dev mailing list
Dev@wso2.org
http://wso2.org/cgi-bin/mailman/listinfo/dev


Re: [Dev] [ML] Getting error while creating a dataset

2015-10-01 Thread Nirmal Fernando
Hi Malintha,

Thanks for trying ML. @Wije can you please check?

On Fri, Oct 2, 2015 at 1:09 AM, Malintha Adikari  wrote:

> Hi,
>
> I am trying to create a dataset from 748KB sized data file [1] and getting
> following error.
>
> [2015-10-02 01:03:38,769]  INFO
> {org.wso2.carbon.ml.core.impl.MLDatasetProcessor} -  [Created] MLDataset
> [id=1, name=digitdd, tenantId=-1234, userName=admin, dataSourceType=file,
> dataTargetType=file, sourcePath=null, dataType=csv, comments=,
> version=1.0.0, containsHeader=true, status=null]
> [2015-10-02 01:03:40,537]  WARN
> {org.wso2.carbon.ml.database.internal.MLDatabaseUtils} -  An error occurred
> while enabling autocommit: PooledConnection has already been closed.
> java.sql.SQLException: PooledConnection has already been closed.
> at
> org.apache.tomcat.jdbc.pool.DisposableConnectionFacade.invoke(DisposableConnectionFacade.java:86)
> at com.sun.proxy.$Proxy16.setAutoCommit(Unknown Source)
> at
> org.wso2.carbon.ml.database.internal.MLDatabaseUtils.enableAutoCommit(MLDatabaseUtils.java:153)
> at
> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2370)
> at
> org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:745)
> [2015-10-02 01:03:40,550] ERROR
> {org.wso2.carbon.ml.core.impl.SummaryStatsGenerator} -  Error occurred
> while calculating summary statistics for dataset version 1: An error
> occurred while updating the database with summary statistics of the dataset
> 1: 16
> org.wso2.carbon.ml.database.exceptions.DatabaseHandlerException: An error
> occurred while updating the database with summary statistics of the dataset
> 1: 16
> at
> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2366)
> at
> org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:745)
> Caused by: java.lang.ArrayIndexOutOfBoundsException: 16
> at
> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2329)
> ... 4 more
>
> What could be the possible reason for this error ?
>
> [1]
> http://ocw.mit.edu/courses/sloan-school-of-management/15-097-prediction-machine-learning-and-statistics-spring-2012/datasets/digits.csv
>
> Regards,
> Malintha
>
> --
> *Malintha Adikari*
> Software Engineer
> WSO2 Inc.; http://wso2.com
> lean.enterprise.middleware
>
> Mobile: +94 71 2312958
> Blog:http://malinthas.blogspot.com
> Page:   http://about.me/malintha
>



-- 

Thanks & regards,
Nirmal

Team Lead - WSO2 Machine Learner
Associate Technical Lead - Data Technologies Team, WSO2 Inc.
Mobile: +94715779733
Blog: http://nirmalfdo.blogspot.com/
___
Dev mailing list
Dev@wso2.org
http://wso2.org/cgi-bin/mailman/listinfo/dev


[Dev] [ML] Getting error while creating a dataset

2015-10-01 Thread Malintha Adikari
Hi,

I am trying to create a dataset from 748KB sized data file [1] and getting
following error.

[2015-10-02 01:03:38,769]  INFO
{org.wso2.carbon.ml.core.impl.MLDatasetProcessor} -  [Created] MLDataset
[id=1, name=digitdd, tenantId=-1234, userName=admin, dataSourceType=file,
dataTargetType=file, sourcePath=null, dataType=csv, comments=,
version=1.0.0, containsHeader=true, status=null]
[2015-10-02 01:03:40,537]  WARN
{org.wso2.carbon.ml.database.internal.MLDatabaseUtils} -  An error occurred
while enabling autocommit: PooledConnection has already been closed.
java.sql.SQLException: PooledConnection has already been closed.
at
org.apache.tomcat.jdbc.pool.DisposableConnectionFacade.invoke(DisposableConnectionFacade.java:86)
at com.sun.proxy.$Proxy16.setAutoCommit(Unknown Source)
at
org.wso2.carbon.ml.database.internal.MLDatabaseUtils.enableAutoCommit(MLDatabaseUtils.java:153)
at
org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2370)
at
org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)
[2015-10-02 01:03:40,550] ERROR
{org.wso2.carbon.ml.core.impl.SummaryStatsGenerator} -  Error occurred
while calculating summary statistics for dataset version 1: An error
occurred while updating the database with summary statistics of the dataset
1: 16
org.wso2.carbon.ml.database.exceptions.DatabaseHandlerException: An error
occurred while updating the database with summary statistics of the dataset
1: 16
at
org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2366)
at
org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.ArrayIndexOutOfBoundsException: 16
at
org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2329)
... 4 more

What could be the possible reason for this error ?

[1]
http://ocw.mit.edu/courses/sloan-school-of-management/15-097-prediction-machine-learning-and-statistics-spring-2012/datasets/digits.csv

Regards,
Malintha

-- 
*Malintha Adikari*
Software Engineer
WSO2 Inc.; http://wso2.com
lean.enterprise.middleware

Mobile: +94 71 2312958
Blog:http://malinthas.blogspot.com
Page:   http://about.me/malintha
___
Dev mailing list
Dev@wso2.org
http://wso2.org/cgi-bin/mailman/listinfo/dev