GitHub user kno10 commented on the pull request:
https://github.com/apache/flink/pull/696#issuecomment-131123325
On low-dimensional data, exact kNN may be feasible using grid-based
approaches even for very large data sets. It's not very "sexy" to implement
this, but it's also not very hard.
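
To illustrate the grid-based idea, here is a minimal single-machine sketch in Python. It is not Flink code and not what the pull request implements. The `GridIndex` class, its `cell_size` parameter, and the ring-by-ring search are assumptions chosen for illustration. Points are bucketed into uniform cells; a query visits rings of cells around its own cell and stops once the k-th best distance is no larger than the radius already covered, so the result stays exact.

```python
import heapq
import math
from collections import defaultdict
from itertools import product


class GridIndex:
    """Uniform grid over low-dimensional points (hypothetical helper)."""

    def __init__(self, points, cell_size):
        self.points = [tuple(p) for p in points]
        self.cell_size = cell_size
        self.cells = defaultdict(list)
        for idx, p in enumerate(self.points):
            self.cells[self._cell_of(p)].append(idx)

    def _cell_of(self, p):
        # Integer cell coordinates of a point.
        return tuple(math.floor(x / self.cell_size) for x in p)

    def _ring(self, center, r):
        # Yield the cells at Chebyshev distance exactly r from center.
        # Enumerates the full cube and filters, which is fine for a sketch.
        for offset in product(range(-r, r + 1), repeat=len(center)):
            if max(abs(o) for o in offset) == r:
                yield tuple(c + o for c, o in zip(center, offset))

    def knn(self, query, k):
        """Return the k exact nearest neighbours as sorted (distance, index) pairs."""
        k = min(k, len(self.points))
        if k <= 0:
            return []
        center = self._cell_of(query)
        heap = []  # max-heap via negated distances; holds the best k so far
        seen = 0
        r = 0
        while seen < len(self.points):
            for cell in self._ring(center, r):
                for idx in self.cells.get(cell, ()):
                    d = math.dist(query, self.points[idx])
                    seen += 1
                    if len(heap) < k:
                        heapq.heappush(heap, (-d, idx))
                    elif d < -heap[0][0]:
                        heapq.heapreplace(heap, (-d, idx))
            # Any point in an unvisited cell is farther than r * cell_size away,
            # so the current k best cannot be improved once this holds.
            if len(heap) == k and -heap[0][0] <= r * self.cell_size:
                break
            r += 1
        return sorted((-nd, idx) for nd, idx in heap)
```

The number of cells per ring grows exponentially with the dimension, which is why this only pays off on low-dimensional data. A distributed variant would additionally need to partition cells across workers, which this sketch does not attempt.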
Also, a lot of users will be using data sets where pairwise distance
computations are still possible (it's not as if everybody has exabyte vector
data), so why deprive them of this option, even if it is too costly for others?
Last but not least, for evaluation purposes, exact kNN can be useful as a
baseline, too.
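
As a rough sketch of what such a baseline could look like (plain Python, not Flink code; the function names `brute_force_knn` and `recall_at_k` are made up for illustration), the exact result comes from scanning all points, and an approximate method can then be scored by how many of the true neighbours it recovers:

```python
import heapq
import math


def brute_force_knn(points, query, k):
    """Exact kNN by scanning every point: one distance computation per point."""
    scored = ((math.dist(query, p), i) for i, p in enumerate(points))
    return heapq.nsmallest(k, scored)  # sorted (distance, index) pairs


def recall_at_k(approx_ids, exact_ids):
    """Fraction of the exact neighbours that the approximate result recovered."""
    exact = set(exact_ids)
    if not exact:
        return 1.0
    return len(exact & set(approx_ids)) / len(exact)


points = [(0, 0), (1, 0), (0, 2), (3, 3)]
exact = brute_force_knn(points, (0, 0), 2)
exact_ids = [i for _, i in exact]
# Pretend some approximate method returned indices 0 and 2.
print(recall_at_k([0, 2], exact_ids))
```

Running this over all queries costs one distance computation per pair of points, which is exactly the pairwise cost mentioned above, so it is only meant for data sets where that is affordable.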