[jira] [Created] (TEZ-4571) Shared fetch enabled fetches all partitions on task 0000s directly to disk for non-broadcast edges

2024-06-14 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4571:
---

 Summary: Shared fetch enabled fetches all partitions on task 0000s 
directly to disk for non-broadcast edges
 Key: TEZ-4571
 URL: https://issues.apache.org/jira/browse/TEZ-4571
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Jonathan Turner Eagles


I think the idea of shared fetch is to fetch once per node for broadcast input. 
However, the enabled check in the fetcher doesn't look at the edge type; it only 
checks that 1) shared fetch is enabled and 2) the task for the vertex is 0000. 
For a broadcast edge this is perhaps correct, but for non-broadcast edges, all 
partitions are fetched to disk with no possibility of sharing.
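
To illustrate the condition described above, here is a minimal sketch, not the 
actual Tez fetcher code. All names (SharedFetchSketch, EdgeType, currentCheck, 
proposedCheck) are hypothetical, and it assumes the task in question is the 
first task of the vertex (index 0), as the "0000s" in the subject suggests. The 
second method shows one possible way the check could also consider the edge type.

```java
// Hypothetical sketch; names are illustrative and not taken from the Tez source.
public class SharedFetchSketch {

    // Assumed edge categories for illustration only.
    enum EdgeType { BROADCAST, SCATTER_GATHER }

    // Behavior as described in this issue: the edge type is never consulted.
    // Assumption: "task 0000" means task index 0 of the vertex.
    static boolean currentCheck(boolean sharedFetchEnabled, int taskIndex) {
        return sharedFetchEnabled && taskIndex == 0;
    }

    // One possible tightening: fetch to disk for sharing only on broadcast edges.
    static boolean proposedCheck(boolean sharedFetchEnabled, int taskIndex, EdgeType edge) {
        return sharedFetchEnabled && taskIndex == 0 && edge == EdgeType.BROADCAST;
    }

    public static void main(String[] args) {
        // The current check ignores the edge type entirely.
        System.out.println(currentCheck(true, 0));
        // With the edge type considered, a non-broadcast edge skips the disk path.
        System.out.println(proposedCheck(true, 0, EdgeType.SCATTER_GATHER));
        System.out.println(proposedCheck(true, 0, EdgeType.BROADCAST));
    }
}
```

Under these assumptions, a task 0 on a non-broadcast edge takes the to-disk path 
with the first check but not with the second.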



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (TEZ-4570) Implement data-via-events for ordered outputs

2024-06-14 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4570:
---

 Summary: Implement data-via-events for ordered outputs
 Key: TEZ-4570
 URL: https://issues.apache.org/jira/browse/TEZ-4570
 Project: Apache Tez
  Issue Type: New Feature
Reporter: Jonathan Turner Eagles


Currently, data-via-events is only implemented by the unordered outputs and 
unordered fetch.


