Hi Maheshakya,

Thank you very much for the reply. Yes, that was the cause, and I was able
to resolve the issue.

Regards,
Malintha

On Fri, Oct 2, 2015 at 9:34 AM, Maheshakya Wijewardena <mahesha...@wso2.com>
wrote:

> Hi Malintha,
>
> Here is an excerpt of the first few lines of the
> sloan-school-of-management dataset:
>
>  *88, 92,  2, 99, 16, 66, 94, 37, 70,  0,  0, 24, 42, 65,100,100, 8*
>  80,100, 18, 98, 60, 66,100, 29, 42,  0,  0, 23, 42, 61, 56, 98, 8
>   0, 94,  9, 57, 20, 19,  7,  0, 20, 36, 70, 68,100,100, 18, 92, 8
>  95, 82, 71,100, 27, 77, 77, 73,100, 80, 93, 42, 56, 13,  0,  0, 9
>  68,100,  6, 88, 47, 75, 87, 82, 85, 56,100, 29, 75,  6,  0,  0, 9
>  70,100,100, 97, 70, 81, 45, 65, 30, 49, 20, 33,  0, 16,  0,  0, 1
>  40,100,  0, 81, 15, 58,100, 57, 47, 87, 50, 88, 40, 42, 36,  0, 4
>   3, 71,  0, 95, 45,100,100, 99, 79, 78, 48, 53, 31, 24, 54,  0, 7
>
> As you can see, this CSV file has no header row (a row of feature names).
> When you create a dataset without specifying that the file has no header
> row, ML treats the first row as the header and derives the feature names
> from it. Here, that first row contains duplicate entries (0 and 100), and
> ML does not allow multiple features with the same name.
>
> When creating the dataset, please select "No" for "Column header available",
> or add a header row manually to the data file before uploading.
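
For anyone who hits the same error, here is a minimal Python sketch of the
check described above. It reads one line as if it were a header row and
reports any names that occur more than once. The function name
duplicate_header_names is my own and is not part of WSO2 ML; this only
illustrates the duplicate-name problem and is not how ML itself parses the
file.

```python
import csv
import io
from collections import Counter


def duplicate_header_names(first_line):
    """Return the values that repeat when first_line is read as a header row.

    Hypothetical helper for illustration only; not a WSO2 ML API.
    """
    # Parse the line the way a CSV reader would, then trim the padding spaces.
    row = next(csv.reader(io.StringIO(first_line)))
    names = [cell.strip() for cell in row]
    counts = Counter(names)
    # Keep first-seen order so the report matches the column order.
    return [name for name in counts if counts[name] > 1]


# First row of the excerpt quoted above.
first_row = "88, 92,  2, 99, 16, 66, 94, 37, 70,  0,  0, 24, 42, 65,100,100, 8"
print(duplicate_header_names(first_row))
```

If the list is non-empty for a file with no real header, selecting "No" for
"Column header available" (or adding a header row) avoids the clash.
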
>
> Best regards.
>
> On Fri, Oct 2, 2015 at 8:54 AM, Nirmal Fernando <nir...@wso2.com> wrote:
>
>> Hi Malintha,
>>
>> Thanks for trying ML. @Wije can you please check?
>>
>> On Fri, Oct 2, 2015 at 1:09 AM, Malintha Adikari <malin...@wso2.com>
>> wrote:
>>
>>> Hi,
>>>
>>> I am trying to create a dataset from a 748 KB data file [1] and am
>>> getting the following error.
>>>
>>> [2015-10-02 01:03:38,769]  INFO
>>> {org.wso2.carbon.ml.core.impl.MLDatasetProcessor} -  [Created] MLDataset
>>> [id=1, name=digitdd, tenantId=-1234, userName=admin, dataSourceType=file,
>>> dataTargetType=file, sourcePath=null, dataType=csv, comments=,
>>> version=1.0.0, containsHeader=true, status=null]
>>> [2015-10-02 01:03:40,537]  WARN
>>> {org.wso2.carbon.ml.database.internal.MLDatabaseUtils} -  An error occurred
>>> while enabling autocommit: PooledConnection has already been closed.
>>> java.sql.SQLException: PooledConnection has already been closed.
>>> at
>>> org.apache.tomcat.jdbc.pool.DisposableConnectionFacade.invoke(DisposableConnectionFacade.java:86)
>>> at com.sun.proxy.$Proxy16.setAutoCommit(Unknown Source)
>>> at
>>> org.wso2.carbon.ml.database.internal.MLDatabaseUtils.enableAutoCommit(MLDatabaseUtils.java:153)
>>> at
>>> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2370)
>>> at
>>> org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
>>> at
>>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>>> at
>>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>> at java.lang.Thread.run(Thread.java:745)
>>> [2015-10-02 01:03:40,550] ERROR
>>> {org.wso2.carbon.ml.core.impl.SummaryStatsGenerator} -  Error occurred
>>> while calculating summary statistics for dataset version 1: An error
>>> occurred while updating the database with summary statistics of the dataset
>>> 1: 16
>>> org.wso2.carbon.ml.database.exceptions.DatabaseHandlerException: An
>>> error occurred while updating the database with summary statistics of the
>>> dataset 1: 16
>>> at
>>> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2366)
>>> at
>>> org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
>>> at
>>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>>> at
>>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>> at java.lang.Thread.run(Thread.java:745)
>>> Caused by: java.lang.ArrayIndexOutOfBoundsException: 16
>>> at
>>> org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2329)
>>> ... 4 more
>>>
>>> What could be the reason for this error?
>>>
>>> [1]
>>> http://ocw.mit.edu/courses/sloan-school-of-management/15-097-prediction-machine-learning-and-statistics-spring-2012/datasets/digits.csv
>>>
>>> Regards,
>>> Malintha
>>>
>>> --
>>> *Malintha Adikari*
>>> Software Engineer
>>> WSO2 Inc.; http://wso2.com
>>> lean.enterprise.middleware
>>>
>>> Mobile: +94 71 2312958
>>> Blog:    http://malinthas.blogspot.com
>>> Page:   http://about.me/malintha
>>>
>>
>>
>>
>> --
>>
>> Thanks & regards,
>> Nirmal
>>
>> Team Lead - WSO2 Machine Learner
>> Associate Technical Lead - Data Technologies Team, WSO2 Inc.
>> Mobile: +94715779733
>> Blog: http://nirmalfdo.blogspot.com/
>>
>>
>>
>
>
> --
> Pruthuvi Maheshakya Wijewardena
> Software Engineer
> WSO2 : http://wso2.com/
> Email: mahesha...@wso2.com
> Mobile: +94711228855
>
>
>


-- 
*Malintha Adikari*
Software Engineer
WSO2 Inc.; http://wso2.com
lean.enterprise.middleware

Mobile: +94 71 2312958
Blog:    http://malinthas.blogspot.com
Page:   http://about.me/malintha