Re: Any plans to migrate Transformer API to Spark SQL (closer to DataFrames)?

2016-03-28 Thread Michał Zieliński
Hi Maciej, Absolutely. We had to copy HasInputCol/s, HasOutputCol/s (along with a couple of others like HasProbabilityCol) to our repo. That is good enough for most use cases, but for some (e.g. operating on any Transformer that accepts either our HasInputCol or Spark's) it makes the code clunky.
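
To illustrate the clunkiness, here is a minimal sketch with no Spark dependency. It assumes, as the message implies, that Spark's own trait cannot be reused from user code, so the repo copy and the original share no common supertype. Every name below (SparkHasInputCol, OurHasInputCol, SparkStage, OurStage, inputColOf) is a hypothetical stand-in, not Spark's actual code.

```scala
// Stand-ins only: neither trait below is Spark's real code.
object InputColSketch {
  // Hypothetical stand-in for Spark's own HasInputCol.
  trait SparkHasInputCol { def getInputCol: String }

  // Hypothetical stand-in for the copy kept in our repo.
  trait OurHasInputCol { def getInputCol: String }

  // Example stages, one per trait.
  final class SparkStage(col: String) extends SparkHasInputCol {
    def getInputCol: String = col
  }
  final class OurStage(col: String) extends OurHasInputCol {
    def getInputCol: String = col
  }

  // The two traits are unrelated types, so generic code has to
  // pattern-match on both and handle everything else separately.
  def inputColOf(stage: Any): Option[String] = stage match {
    case s: SparkHasInputCol => Some(s.getInputCol)
    case o: OurHasInputCol   => Some(o.getInputCol)
    case _                   => None
  }
}
```

Every helper that needs the input column ends up repeating this two-way match (and again for the output column, probability column, and so on), which is the duplication described above.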

Re: Any plans to migrate Transformer API to Spark SQL (closer to DataFrames)?

2016-03-26 Thread Michał Zieliński
The Spark ML Pipelines API (not just Transformers, but Estimators and custom Pipeline classes as well) is definitely not machine-learning specific. We use it heavily in our development. We're building machine learning pipelines *BUT* many steps involve joining, schema manipulation,