Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/334#discussion_r234051252 --- Diff: src/ports/postgres/modules/utilities/minibatch_preprocessing.py_in --- @@ -487,10 +487,16 @@ class MiniBatchDocumentation: ---------------------------------------------------------------- SUMMARY ---------------------------------------------------------------- - MiniBatch Preprocessor is a utility function to pre process the input - data for use with models that support mini-batching as an optimization + The mini-batch preprocessor is a utility that prepares input data for + use by models that support mini-batch as an optimization option. (This + is currently only the case for Neural Networks.) It is effectively a + packing operation that builds arrays of dependent and independent + variables from the source data table. - #TODO add more here + The advantage of using mini-batching is that it can perform better than + stochastic gradient descent (default MADlib optimizer) because it uses + more than one training example at a time, typically resulting faster --- End diff -- missing the word in `resulting in faster .....`
---