Re: what are the implications of setting `schemaSampleSize = -1` on cloudant connector?

2017-05-16 Thread Esteban M Laver
Hi Chris, When schemaSampleSize is set to -1, the connector scans all the documents in the database. A value of 1 scans only the first document. Using the value -1 would therefore add the most overhead. Any other value N scans N documents in the database (if N is greater than the n
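For reference, here is a minimal sketch of how the option might be passed from PySpark. It assumes the Apache Bahir sql-cloudant connector; the format name `org.apache.bahir.cloudant`, the `cloudant.*` option keys, the helper `cloudant_read_options`, and the placeholder host, credentials, and database name are all assumptions for illustration and are not taken from this thread.

```python
from typing import Dict


def cloudant_read_options(host: str, username: str, password: str,
                          schema_sample_size: int) -> Dict[str, str]:
    """Build reader options for a Cloudant load.

    Hypothetical helper: the option keys follow the Apache Bahir
    sql-cloudant naming and should be checked against the connector
    version actually in use.
    """
    return {
        "cloudant.host": host,
        "cloudant.username": username,
        "cloudant.password": password,
        # -1 asks the connector to scan every document when inferring the
        # schema; 1 uses only the first document.
        "schemaSampleSize": str(schema_sample_size),
    }


if __name__ == "__main__":
    # Placeholder values, not real credentials.
    opts = cloudant_read_options("ACCOUNT.cloudant.com", "USER", "PASSWORD", -1)
    print(opts["schemaSampleSize"])
    # With a SparkSession named `spark` and the connector on the classpath
    # (both assumed), the options would be applied roughly like this:
    # df = (spark.read.format("org.apache.bahir.cloudant")
    #       .options(**opts)
    #       .load("my_database"))
```

Changing the last argument from -1 to 1 is the quickest way to compare load times, at the cost of a schema inferred from a single document.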

what are the implications of setting `schemaSampleSize = -1` on cloudant connector?

2017-05-16 Thread chris snow
How does this option work? I'm guessing that it may add some overhead to my Cloudant data load into Spark, because the connector will need to read in all the data before it creates the DataFrame?