Fabian Hueske created FLINK-1444:
------------------------------------
Summary: Add data properties for data sources
Key: FLINK-1444
URL: https://issues.apache.org/jira/browse/FLINK-1444
Project: Flink
Issue Type: New Feature
Components: Java API, JobManager, Optimizer
Affects Versions: 0.9
Reporter: Fabian Hueske
Priority: Minor
This issue proposes to add support for attaching data properties to data
sources. These data properties are defined with respect to input splits.
Possible properties are:
- partitioning across splits: all elements of the same key (combination) are
contained in one split
- sorting / grouping with splits: elements are sorted or grouped on certain
keys within a split
- key uniqueness: a certain key (combination) is unique for all elements of the
data source. This property is not defined wrt. input splits.
The optimizer can leverage this information to generate more efficient
execution plans.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)