Hi Maheshakya,

Thank you very much for the reply. Yes, that was the reason, and I was able to resolve the issue.
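For anyone who runs into the same error: the cause described in the reply quoted below (a data row being read as the header, which yields duplicate feature names) can be checked before uploading. Here is a minimal Python sketch of such a check. It is not ML's own validation logic; the function names first_row, duplicate_names and looks_like_data are made up for illustration, and SAMPLE is just the first two rows of the excerpt.

```python
import csv
import io
from collections import Counter

# First two rows of the excerpt quoted in this thread (no header row).
SAMPLE = (
    "88, 92, 2, 99, 16, 66, 94, 37, 70, 0, 0, 24, 42, 65,100,100, 8\n"
    "80,100, 18, 98, 60, 66,100, 29, 42, 0, 0, 23, 42, 61, 56, 98, 8\n"
)


def first_row(csv_text):
    """Return the cells of the first CSV row, stripped of whitespace."""
    row = next(csv.reader(io.StringIO(csv_text)))
    return [cell.strip() for cell in row]


def duplicate_names(csv_text):
    """Return the values that appear more than once in the first row."""
    counts = Counter(first_row(csv_text))
    return sorted(name for name, n in counts.items() if n > 1)


def looks_like_data(csv_text):
    """Return True if every cell in the first row parses as a number."""
    try:
        for cell in first_row(csv_text):
            float(cell)
    except ValueError:
        return False
    return True


if __name__ == "__main__":
    print(looks_like_data(SAMPLE))   # True: the first row is data, not a header
    print(duplicate_names(SAMPLE))   # ['0', '100']
```

If the first row looks like data, selecting "No" for "Column header available" or adding a header row by hand, as suggested in the reply, should avoid the problem.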

Regards,
Malintha

On Fri, Oct 2, 2015 at 9:34 AM, Maheshakya Wijewardena <mahesha...@wso2.com> wrote:

> Hi Malintha,
>
> Here is the excerpt from the first few lines of the
> sloan-school-of-management dataset:
>
> *88, 92, 2, 99, 16, 66, 94, 37, 70, 0, 0, 24, 42, 65,100,100, 8*
> 80,100, 18, 98, 60, 66,100, 29, 42, 0, 0, 23, 42, 61, 56, 98, 8
> 0, 94, 9, 57, 20, 19, 7, 0, 20, 36, 70, 68,100,100, 18, 92, 8
> 95, 82, 71,100, 27, 77, 77, 73,100, 80, 93, 42, 56, 13, 0, 0, 9
> 68,100, 6, 88, 47, 75, 87, 82, 85, 56,100, 29, 75, 6, 0, 0, 9
> 70,100,100, 97, 70, 81, 45, 65, 30, 49, 20, 33, 0, 16, 0, 0, 1
> 40,100, 0, 81, 15, 58,100, 57, 47, 87, 50, 88, 40, 42, 36, 0, 4
> 3, 71, 0, 95, 45,100,100, 99, 79, 78, 48, 53, 31, 24, 54, 0, 7
>
> As you can see, there is no header row (a row with feature names) in this
> csv file. At the dataset creation, if you did not specify that there is no
> header row in the dataset, ML will automatically take the first row as the
> header row and the feature names are derived from that.
> If the first row is taken as the header row, you can see that there are
> duplicate entries: 0, 100
> In ML, there cannot be multiple features with the same name.
>
> At dataset creation, please select "No" for "Column header available", or
> add a header row manually into the data file before uploading.
>
> Best regards.
>
> On Fri, Oct 2, 2015 at 8:54 AM, Nirmal Fernando <nir...@wso2.com> wrote:
>
>> Hi Malintha,
>>
>> Thanks for trying ML. @Wije can you please check?
>>
>> On Fri, Oct 2, 2015 at 1:09 AM, Malintha Adikari <malin...@wso2.com>
>> wrote:
>>
>>> Hi,
>>>
>>> I am trying to create a dataset from 748KB sized data file [1] and
>>> getting following error.
>>>
>>> [2015-10-02 01:03:38,769] INFO
>>> {org.wso2.carbon.ml.core.impl.MLDatasetProcessor} - [Created] MLDataset
>>> [id=1, name=digitdd, tenantId=-1234, userName=admin, dataSourceType=file,
>>> dataTargetType=file, sourcePath=null, dataType=csv, comments=,
>>> version=1.0.0, containsHeader=true, status=null]
>>> [2015-10-02 01:03:40,537] WARN
>>> {org.wso2.carbon.ml.database.internal.MLDatabaseUtils} - An error occurred
>>> while enabling autocommit: PooledConnection has already been closed.
>>> java.sql.SQLException: PooledConnection has already been closed.
>>>     at org.apache.tomcat.jdbc.pool.DisposableConnectionFacade.invoke(DisposableConnectionFacade.java:86)
>>>     at com.sun.proxy.$Proxy16.setAutoCommit(Unknown Source)
>>>     at org.wso2.carbon.ml.database.internal.MLDatabaseUtils.enableAutoCommit(MLDatabaseUtils.java:153)
>>>     at org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2370)
>>>     at org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
>>>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>>>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>>     at java.lang.Thread.run(Thread.java:745)
>>> [2015-10-02 01:03:40,550] ERROR
>>> {org.wso2.carbon.ml.core.impl.SummaryStatsGenerator} - Error occurred
>>> while calculating summary statistics for dataset version 1: An error
>>> occurred while updating the database with summary statistics of the dataset
>>> 1: 16
>>> org.wso2.carbon.ml.database.exceptions.DatabaseHandlerException: An
>>> error occurred while updating the database with summary statistics of the
>>> dataset 1: 16
>>>     at org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2366)
>>>     at org.wso2.carbon.ml.core.impl.SummaryStatsGenerator.run(SummaryStatsGenerator.java:130)
>>>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>>>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>>     at java.lang.Thread.run(Thread.java:745)
>>> Caused by: java.lang.ArrayIndexOutOfBoundsException: 16
>>>     at org.wso2.carbon.ml.database.internal.MLDatabaseService.updateSummaryStatistics(MLDatabaseService.java:2329)
>>>     ... 4 more
>>>
>>> What could be the possible reason for this error ?
>>>
>>> [1]
>>> http://ocw.mit.edu/courses/sloan-school-of-management/15-097-prediction-machine-learning-and-statistics-spring-2012/datasets/digits.csv
>>>
>>> Regards,
>>> Malintha
>>>
>>> --
>>> *Malintha Adikari*
>>> Software Engineer
>>> WSO2 Inc.; http://wso2.com
>>> lean.enterprise.middleware
>>>
>>> Mobile: +94 71 2312958
>>> Blog: http://malinthas.blogspot.com
>>> Page: http://about.me/malintha
>>>
>>
>> --
>>
>> Thanks & regards,
>> Nirmal
>>
>> Team Lead - WSO2 Machine Learner
>> Associate Technical Lead - Data Technologies Team, WSO2 Inc.
>> Mobile: +94715779733
>> Blog: http://nirmalfdo.blogspot.com/
>>
>
> --
> Pruthuvi Maheshakya Wijewardena
> Software Engineer
> WSO2 : http://wso2.com/
> Email: mahesha...@wso2.com
> Mobile: +94711228855
>

--
*Malintha Adikari*
Software Engineer
WSO2 Inc.; http://wso2.com
lean.enterprise.middleware

Mobile: +94 71 2312958
Blog: http://malinthas.blogspot.com
Page: http://about.me/malintha
_______________________________________________
Dev mailing list
Dev@wso2.org
http://wso2.org/cgi-bin/mailman/listinfo/dev