[ 
https://issues.apache.org/jira/browse/DRILL-7388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16976464#comment-16976464
 ] 

ASF GitHub Bot commented on DRILL-7388:
---------------------------------------

arina-ielchiieva commented on pull request #1901: DRILL-7388: Kafka improvements
URL: https://github.com/apache/drill/pull/1901
 
 
   Jira - [DRILL-7388](https://issues.apache.org/jira/browse/DRILL-7388).
   
   1. Upgraded Kafka libraries to 2.3.1 (DRILL-6739).
   2. Added new options to support the same features as the native JSON reader:
     a. store.kafka.reader.skip_invalid_records, default: false (DRILL-6723);
     b. store.kafka.reader.allow_nan_inf, default: true;
     c. store.kafka.reader.allow_escape_any_char, default: false.
   3. Fixed issue when Kafka topic contains only one message (DRILL-7388).
   4. Replaced the Gson parser with Jackson so that JSON is parsed in the same manner as the Drill native JSON reader.
   5. Performance improvements: Kafka consumers are now closed asynchronously, fixed a resource leak (DRILL-7290), and moved unnecessary info logging to debug level.
   6. Updated bootstrap-storage-plugins.json to reflect actual Kafka connection properties.
   7. Added unit tests.
   8. Refactoring and code cleanup.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Apache Drill Kafka Storage module fails to return results for partitions 
> containing single offset record
> --------------------------------------------------------------------------------------------------------
>
>                 Key: DRILL-7388
>                 URL: https://issues.apache.org/jira/browse/DRILL-7388
>             Project: Apache Drill
>          Issue Type: Bug
>    Affects Versions: 1.16.0
>            Reporter: daniel kelly
>            Assignee: Arina Ielchiieva
>            Priority: Major
>             Fix For: 1.17.0
>
>
> If a partition contains only one record, e.g.
> [topicName=myTopic, partitionId=117, startOffset=0, endOffset=1]
> then no data is returned.
> I fixed this locally with the following code change in contrib/storage-kafka:
> {code:java}
> git diff src/main/java/org/apache/drill/exec/store/kafka/KafkaRecordReader.java
> @@ -109,7 +109,7 @@ public class KafkaRecordReader extends AbstractRecordReader {
>      currentMessageCount = 0;
>  
>      try {
> -      while (currentOffset < subScanSpec.getEndOffset() - 1 && msgItr.hasNext()) {
> +      while (currentOffset < subScanSpec.getEndOffset() && msgItr.hasNext()) {
>          ConsumerRecord<byte[], byte[]> consumerRecord = msgItr.next();
> {code}
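
The effect of the loop bound in the diff above can be illustrated with a small standalone sketch. It is not the Drill code: `LoopBoundSketch` and `countRead` are hypothetical names, the Kafka iterator is replaced by a plain iterator over the offsets in [startOffset, endOffset), and it assumes that `currentOffset` starts at the start offset and is then set to the offset of each consumed record.

```java
import java.util.Iterator;
import java.util.stream.LongStream;

class LoopBoundSketch {

  // Hypothetical helper: counts how many records the loop consumes for the
  // half-open offset range [startOffset, endOffset).
  // subtractOne = true mimics the original bound (getEndOffset() - 1),
  // subtractOne = false mimics the bound from the reporter's local fix.
  static int countRead(long startOffset, long endOffset, boolean subtractOne) {
    // Stand-in for msgItr: yields exactly the offsets startOffset .. endOffset - 1.
    Iterator<Long> msgItr = LongStream.range(startOffset, endOffset).iterator();
    // Assumption: the reader starts with currentOffset equal to the start offset.
    long currentOffset = startOffset;
    long bound = subtractOne ? endOffset - 1 : endOffset;
    int count = 0;
    while (currentOffset < bound && msgItr.hasNext()) {
      // Assumption: currentOffset tracks the offset of the record just consumed.
      currentOffset = msgItr.next();
      count++;
    }
    return count;
  }

  public static void main(String[] args) {
    // Single-record partition from the report: startOffset=0, endOffset=1.
    System.out.println("original bound: " + countRead(0, 1, true));
    System.out.println("changed bound:  " + countRead(0, 1, false));
  }
}
```

Under these assumptions, the original bound skips the loop entirely for a single-record partition, because `0 < 1 - 1` is false before anything is read, while the changed bound consumes the one record. For longer ranges both bounds consume the same records in this sketch.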



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
