GitHub user kumarvishal09 opened a pull request:
https://github.com/apache/incubator-carbondata/pull/108
[CARBONDATA-190] Data mismatch issue in case of filter query
Issue steps: create a table, restart the server, and then load data. After
that, a filter query returns the wrong record count.
Problem: When a user creates a table without explicitly disabling the
inverted index for a key column, we set the inverted index to true in the
column schema object. This information is not persisted in the schema file,
so after the server restarts the useInvertedIndex property is false in the
column schema object. During data loading the column data is therefore not
sorted, but filter execution still does a binary search. Because the data is
not sorted, the binary search fails and skips some of the records.
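The failure mode described above can be reproduced in isolation. The sketch
below is not CarbonData code; the function name and the sample column are
hypothetical, and it only assumes that the filter finds one match by binary
search and then collects the adjacent equal values.

```python
def filter_rows_binary_search(column, value):
    """Return row indexes equal to value, ASSUMING column is sorted ascending.

    Hypothetical illustration only: find one match by binary search, then
    widen to the neighbouring rows that hold the same value.
    """
    lo, hi = 0, len(column) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if column[mid] < value:
            lo = mid + 1
        elif column[mid] > value:
            hi = mid - 1
        else:
            start = mid
            while start > 0 and column[start - 1] == value:
                start -= 1
            end = mid
            while end < len(column) - 1 and column[end + 1] == value:
                end += 1
            return list(range(start, end + 1))
    return []


# Column values in load order (not sorted), as after the restart.
load_order = [5, 2, 9, 2, 7, 5, 2]
# The same values sorted, as when the inverted index is enabled.
sorted_column = sorted(load_order)

rows_when_sorted = filter_rows_binary_search(sorted_column, 2)
rows_when_unsorted = filter_rows_binary_search(load_order, 2)

print(len(rows_when_sorted))    # all three matching rows are found
print(len(rows_when_unsorted))  # matching rows are skipped
```

On the sorted column the search returns every row equal to 2. On the unsorted
column it stops at the first match it happens to hit and misses the others,
which is the record-count mismatch seen in the filter query.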
Solution: In this PR the default value of useInvertedIndex is set to true.
One more PR will be raised to handle the case where the inverted index is
disabled. By default, the inverted index will be enabled for all columns for
better query performance.
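As a rough sketch of why changing the default helps, assume the schema file
stores only some fields and the flag is not among them. The class, the field
names and the save/load helpers below are hypothetical and are not the
CarbonData schema classes.

```python
from dataclasses import dataclass

# Assumption: only these fields are written to the schema file.
PERSISTED_FIELDS = ("column_name",)


@dataclass
class ColumnSchema:
    column_name: str
    # Not persisted, so after a reload this falls back to the default.
    # With a default of False, a reloaded schema would disagree with the
    # one used at table creation; a default of True keeps them consistent.
    use_inverted_index: bool = True


def save(schema):
    """Keep only the persisted fields, as a stand-in for the schema file."""
    return {name: getattr(schema, name) for name in PERSISTED_FIELDS}


def load(persisted):
    """Rebuild the schema object; missing fields take their defaults."""
    return ColumnSchema(**persisted)


created = ColumnSchema("city", use_inverted_index=True)
reloaded = load(save(created))  # simulates a server restart
print(reloaded.use_inverted_index)
```

This only covers the default-enabled case. A column created with the inverted
index explicitly disabled would still be reloaded as enabled, which matches
the follow-up PR mentioned above.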
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/kumarvishal09/incubator-carbondata DataMismatchIssue
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-carbondata/pull/108.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #108
----
commit 6c41f27f39bfe3308732cced743507276f3f6da6
Author: kumarvishal <[email protected]>
Date: 2016-08-29T15:10:00Z
Fixed Data mismatch issue when inverted index is disabled
----