[ https://issues.apache.org/jira/browse/STATISTICS-7?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16803193#comment-16803193 ]
Salman Hussain edited comment on STATISTICS-7 at 3/28/19 12:28 AM: ------------------------------------------------------------------- Thanks for the guidance [~ericbarnhill], I have been taking a look at the examples in scikit-learn - and perhaps instead of porting the regression library in commons.math.stat I could implement [sklearn.linear_model|https://scikit-learn.org/stable/modules/classes.html#module-sklearn.linear_model] in Java. The selection of linear regression methods seems to be more comprehensive and this is much needed within the package. The codebase would however be significantly larger. > Stream-based Java statistical processing > ---------------------------------------- > > Key: STATISTICS-7 > URL: https://issues.apache.org/jira/browse/STATISTICS-7 > Project: Apache Commons Statistics > Issue Type: New Feature > Reporter: Eric Barnhill > Priority: Major > Labels: GSoC2019, gsoc2019, statistics, streams > > The new component aims to be a library of commons statistics functions > synchronized with the latest developments in the Java language, in particular > Java's functional programming syntax. > The library will make commonly used statistical functions available to an end > user through a simple grammar comparable to commons-math-statistics or > scikit-learn, while under the hood will implement Java's mapping, streaming, > and other producer and consumer functions to ensure the statistical methods > run optimally in new Java implementations. > Developers working on the project will have the opportunity to demonstrate > Java programming, functional programming, algorithm design, and data science > skills and receive authorship on a commons project that is likely to be > widely used. > The ideal contributor will also be able to help with important architectural > decision making. The old source of these libraries, commons-math, grew too > large, hierarchically complex and interdependent for the commons mission. The > developers on this project need to make architectural choices that will > enable the statiscal code to be lightweight and reusable, with a minimum of > outside dependencies while avoiding redundancy. -- This message was sent by Atlassian JIRA (v7.6.3#76005)