Fabian Hueske created FLINK-1444:
------------------------------------

             Summary: Add data properties for data sources
                 Key: FLINK-1444
                 URL: https://issues.apache.org/jira/browse/FLINK-1444
             Project: Flink
          Issue Type: New Feature
          Components: Java API, JobManager, Optimizer
    Affects Versions: 0.9
            Reporter: Fabian Hueske
            Priority: Minor


This issue proposes to add support for attaching data properties to data 
sources. These data properties are defined with respect to input splits.
Possible properties are:

- partitioning across splits: all elements of the same key (combination) are 
contained in one split
- sorting / grouping with splits: elements are sorted or grouped on certain 
keys within a split
- key uniqueness: a certain key (combination) is unique for all elements of the 
data source. This property is not defined wrt. input splits.

The optimizer can leverage this information to generate more efficient 
execution plans.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to