alamb commented on a change in pull request #1166:
URL: https://github.com/apache/arrow-datafusion/pull/1166#discussion_r734961145
##########
File path: docs/source/specification/roadmap.md
##########
@@ -92,8 +92,28 @@ Note: There are some additional thoughts on a datafusion-cli vision on [#1096](h
- publishing to apt, brew, and possible NuGet registry so that people can use it more easily
- adopt a shorter name, like dfcli?
-## Ballista
+# Ballista
-# Vision
+Ballista is a distributed compute platform based on Apache Arrow and DataFusion. It provides a query scheduler that
+breaks a physical plan into stages and tasks and then schedules tasks for execution across the available executors
+in the cluster.
-TBD
+Having Ballista as part of the DataFusion codebase helps ensure that DataFusion remains suitable for distributed
+compute. For example, it helps ensure that physical query plans can be serialized to protobuf format and that they
Review comment:
Thank you for this context.