[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2023-02-07 Thread Ethan Guo (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-3495:

Fix Version/s: 0.13.0

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.13.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2023-02-07 Thread Ethan Guo (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-3495:

Fix Version/s: (was: 0.13.0)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-10-01 Thread Zhaojing Yu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhaojing Yu updated HUDI-3495:
--
Fix Version/s: 0.13.0
   (was: 0.12.1)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.13.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-08-22 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Sprint:   (was: 2022/09/19)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.12.1
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-08-20 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Sprint: 2022/09/19  (was: 2022/08/22)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.12.1
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-08-18 Thread sivabalan narayanan (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sivabalan narayanan updated HUDI-3495:
--
Fix Version/s: 0.12.1
   (was: 0.13.0)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.12.1
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-07-29 Thread Sagar Sumit (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sagar Sumit updated HUDI-3495:
--
Status: Open  (was: In Progress)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.13.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-07-29 Thread Sagar Sumit (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sagar Sumit updated HUDI-3495:
--
Fix Version/s: 0.13.0
   (was: 0.12.0)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.13.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-29 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Fix Version/s: (was: 0.11.0)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Major
> Fix For: 0.12.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-29 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Priority: Blocker  (was: Major)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.12.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-27 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Fix Version/s: 0.12.0

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Major
> Fix For: 0.11.0, 0.12.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-26 Thread Ethan Guo (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-3495:

Priority: Major  (was: Blocker)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Major
> Fix For: 0.11.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-26 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Reviewers: Ethan Guo

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.11.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-24 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Status: In Progress  (was: Open)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.11.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-22 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Sprint: Hudi-Sprint-Mar-14, Hudi-Sprint-Mar-21  (was: Hudi-Sprint-Mar-14)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.11.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-14 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Story Points: 3

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.11.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-14 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Sprint: Hudi-Sprint-Mar-14

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Yue Zhang
>Priority: Blocker
> Fix For: 0.11.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-03-08 Thread sivabalan narayanan (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sivabalan narayanan updated HUDI-3495:
--
Epic Link: HUDI-1822

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Assignee: Ethan Guo
>Priority: Blocker
> Fix For: 0.11.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-02-27 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Fix Version/s: 0.11.0

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Priority: Blocker
> Fix For: 0.11.0
>
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-02-27 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Priority: Blocker  (was: Major)

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Priority: Blocker
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HUDI-3495) Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists

2022-02-27 Thread Raymond Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3495:
-
Component/s: metadata

> Reading keys in parallel from HoodieMetadataMergedLogRecordReader may lead to 
> empty results even if key exists
> --
>
> Key: HUDI-3495
> URL: https://issues.apache.org/jira/browse/HUDI-3495
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: metadata
>Reporter: Prashant Wason
>Priority: Major
>
> [HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
>  has two function which lookup keys:
> getRecordByKey(String key) - lookups the key in member variable map "records"
> getRecordsByKeys(List keys) - clears member variable map "records" 
> and scans the log files again.
> If the two functions are called in parallel, the getRecordByKey() may return 
> an empty key because the records was cleared in another thread calling 
> getRecordsByKeys()



--
This message was sent by Atlassian Jira
(v8.20.1#820001)