[ 
https://issues.apache.org/jira/browse/DRILL-5411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lucas Alvarez Argüero closed DRILL-5411.
----------------------------------------
    Resolution: Not A Problem

> Getting 0 rows when there are more than 100000 in the mongoDB collection
> ------------------------------------------------------------------------
>
>                 Key: DRILL-5411
>                 URL: https://issues.apache.org/jira/browse/DRILL-5411
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - MongoDB
>    Affects Versions: 1.10.0
>         Environment: VM1("ubuntu/trusty64"): mongo1
> •     mongoS (mongo server)
> •     MongoD shard1 (Primary, secondary,secondary)
> •     Mongo config server 
> •     Drillbit
> VM2("ubuntu/trusty64"): mongo2
> •     MongoD shard2 (Primary, secondary,secondary)
> •     Mongo config server 
> •     Drillbit
> VM3("ubuntu/trusty64"): mongo3
> •     MongoD shard3 (Primary, secondary,secondary)
> •     Mongo config server 
> •     Drillbit
> VM4("ubuntu/trusty64"): zk1
> •     Zookeeper in quorum
> VM5("ubuntu/trusty64"): zk2
> •     Zookeeper in quorum
> VM6("ubuntu/trusty64"): zk3
> •     Zookeeper in quorum
>            Reporter: Lucas Alvarez Argüero
>
> Getting 0 rows when there are more than 100000 in the mongoDB collection
> Drills works perfectly when I am using mongo as storage when there are less 
> than 100000(aprox) documents in the collection (partitioned) but when there 
> are more documents, drill return zero rows but still can count all documents 
> (but it can’t count documents using where).
> Less than 100000:
> select v.measInfo_id,v.endTime from mongo.mandarinaTime3.MeasValue v    limit 
> 3; 
> +--------------+-------------+
> | measInfo_id  |   endTime   |
> +--------------+-------------+
> | [B@1a7d4b45  | 2016-09-19  |
> | [B@17d8ac99  | 2016-09-19  |
> | [B@122b7d0a  | 2016-09-19  |
> +--------------+-------------+
> 3 rows selected (0.313 seconds)
> More  than 100000:
> 0: jdbc:drill:> select v.measInfo_id,v.endTime from 
> mongo.mandarinaTime3.MeasValue v    limit 3;                                  
>                                       
> +--------------+----------+
> | measInfo_id  | endTime  |
> +--------------+----------+
> +--------------+----------+
> No rows selected (0.341 seconds)
> 0: jdbc:drill:> select count() from mongo.mandarinaTime3.MeasValue v    ;     
>                    
> +---------+
> | EXPR$0  |
> +---------+
> | 502068  |
> +---------+
> 1 row selected (0.426 seconds)
> 0: jdbc:drill:> select count() from mongo.mandarinaTime3.MeasValue v    Where 
> endtime='2016-09-19';
> +---------+
> | EXPR$0  |
> +---------+
> | 0       |
> +---------+
> 1 row selected (0.98 seconds)
> If the collection isn’t partitioned, drill also works perfectly
> drill mongo plugin:
> {
>   "type": "mongo",
>   "connection": "mongodb://mongo1:27017/",
>   "enabled": true
> }
> mongo sharded collection:
>  {  "_id" : "mandarinaTime3",  "primary" : "b",  "partitioned" : true }
>                 mandarinaTime3.MeasCollecFile
>                         shard key: { "_id" : 1 }
>                         unique: false
>                         balancing: true
>                         chunks:
>                                 b       1
>                         { "_id" : { "$minKey" : 1 } } -->> { "_id" : { 
> "$maxKey" : 1 } } on : b Timestamp(1, 0) 
>                 mandarinaTime3.MeasInfo
>                         shard key: { "_id" : 1 }
>                         unique: false
>                         balancing: true
>                         chunks:
>                                 a       1
>                                 b       1
>                                 c       1
>                         { "_id" : { "$minKey" : 1 } } -->> { "_id" : 
> ObjectId("58e364dddc7a033f5c08c7c6") } on : a Timestamp(2, 0) 
>                         { "_id" : ObjectId("58e364dddc7a033f5c08c7c6") } -->> 
> { "_id" : ObjectId("58e364e0dc7a033f5c08c8b0") } on : c Timestamp(3, 0) 
>                         { "_id" : ObjectId("58e364e0dc7a033f5c08c8b0") } -->> 
> { "_id" : { "$maxKey" : 1 } } on : b Timestamp(3, 1) 
>                 mandarinaTime3.MeasValue
>                         shard key: { "_id" : 1 }
>                         unique: false
>                         balancing: true
>                         chunks:
>                                 a       7
>                                 b       7
>                                 c       7
>                         too many chunks to print, use verbose if you want to 
> force print



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to