Hi all, There was a issue with the time-series extension where all events keep getting added to the extension and all subsequent regressions consider ALL previous events for the regression calculation. This obviously leads to 'out of memory' issues and the solution was to implement an upper limit to the number of events that will be held in memory for regression calculation.
This has been implemented and the user can specify the batchsize as part of the siddhi query for the time series regression. Furthermore, I felt it useful to allow the user the ability to calculate regression every 'n' events, instead of at every event (especially in the case of extremely low latency data streams such as stock market data). This too has been implemented, in a way that user can specify the calculation interval as part of the siddhi query for timeseries regression. So an example of a time series regression query would be as follows... *from DataStream#transform.timeseries:regress( 1000, 1000000, 0.95, y, x1, x2, x3, x4 )* *select ** *insert into RegressionResult* where, 1000 is the calculation interval 1000000 is the batchsize 0.95 is the confidence interval y is the dependent variabe (which we are forecasting) x1, x2, x3, x4 are the independent variablese that are used for forecasting y. Let me know if you have any thoughts/suggestions on this. Cheers, Seshika
_______________________________________________ Architecture mailing list Architecture@wso2.org https://mail.wso2.org/cgi-bin/mailman/listinfo/architecture