PhilippeMoussalli opened a new pull request, #22587:
URL: https://github.com/apache/beam/pull/22587

   - Adds a notebook that demonstrates using the Beam DataFrame API as a 
preprocessing tool for ML training
   
   WIP:
   - [ ] **Find a way to one-hot encode categorical variables:** related to 
issue [#22268](https://github.com/apache/beam/issues/22268)
   - [ ] **Fix the bug that raises `ValueError: No producer for 
ref_PCollection_PCollection_265` when merging two deferred datasets:** 
related to issue [#22267](https://github.com/apache/beam/issues/22267)
   - [ ] Use a single installation script for Beam that includes the latest 
DataFrame API functions, instead of installing from source
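
For the one-hot-encoding item, a minimal sketch of the idea in plain Python (not the Beam DataFrame API), assuming the list of categories is known ahead of time so the output columns are fixed before any data is read. The helper name `one_hot_encode` and the sample rows are hypothetical and only illustrate the transformation the notebook needs:

```python
from typing import Any, Dict, List, Sequence


def one_hot_encode(
    rows: Sequence[Dict[str, Any]],
    column: str,
    categories: Sequence[str],
) -> List[Dict[str, Any]]:
    """Replace `column` with one 0/1 indicator column per known category.

    Hypothetical helper: the category list is passed in explicitly, so the
    set of output columns does not depend on the data itself.
    """
    encoded = []
    for row in rows:
        # Copy every field except the categorical one being encoded.
        out = {key: value for key, value in row.items() if key != column}
        for category in categories:
            out[f"{column}_{category}"] = 1 if row[column] == category else 0
        encoded.append(out)
    return encoded


# Made-up sample data for illustration only.
rows = [{"id": 1, "color": "red"}, {"id": 2, "color": "blue"}]
encoded = one_hot_encode(rows, "color", ["blue", "red"])
```

A value outside the supplied category list yields all-zero indicator columns in this sketch; whether that is the right behavior for the notebook is an open design choice.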
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
