Github user emlaver commented on a diff in the pull request:
https://github.com/apache/bahir/pull/41#discussion_r110706772
--- Diff: sql-cloudant/README.md ---
@@ -62,6 +62,8 @@ cloudant.protocol|https|protocol to use to transfer data:
http or https
cloudant.host||cloudant host url
cloudant.username||cloudant userid
cloudant.password||cloudant password
+cloudant.useQuery|false|When enabled, for query not using index or view,
_find will be used instead of _all_docs, some query predicates will be driven
into datastore. However, RDD partition is ONE during _find, so parallel loading
is not achieved
--- End diff --
Ok - I'm still having a hard time understanding the statement `some of the
query predicates will be driven into datastore`. What do you mean by driven?
Does it have to do with `partitions used to drive JsonStoreRDD loading query
result in parallel` ?
Also, `As RDD partition is ONE during _find` - one what? set to one
partition for the `jsonstore.rdd.partitions` configuration?
And for `parallel loading is not achieved` - This sounds awkward to me. Is
it not achieved because the `_find` endpoint only supports one partition?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---