Re: [scikit-learn] About the Boston housing prices dataset

2020-10-14 Thread Christian Lorentzen
Hi As was recently mentioned in PR #18594, the problem with the boston housing dataset does not go away, just because we remove it from scikit-learn. On the contrary, it is a valuable dataset to show and teach bias and discrimination - issue #16715 is still waiting for someone to write an exa

Re: [scikit-learn] About the Boston housing prices dataset

2020-10-14 Thread Adrin
Most of those are not talking about the ethical issues of the dataset. Let's talk about the alternatives we have: Keep the loader, but raise a warning: - this will result in most people not changing their code/material, and IMO mostly ignore the warning. Some people may see the warning and care ab

Re: [scikit-learn] About the Boston housing prices dataset

2020-10-14 Thread Olivier Grisel
Le mar. 13 oct. 2020 à 16:19, Adrin a écrit : > > Isn't the Boston dataset available through openml? Maybe here: > https://www.openml.org/d/531 > > I'm happy to have the dataset out there on opemml, and for any material that > addresses some of the issues with it. > But for educational purposes,