Good idea to introduce in Zeppelin a way to download full datasets without actually visualizing them.
Not sure if this helps, we taught our users to use %sh hadoop fs -getmerge /hadoop/path/dir/ /some/nfs/mount/ for large files (they sometimes have to download datasets with millions of records). They run Zeppelin on edge nodes that have NFS mounts to a drop zone. ps. Hue has a limit too, by default 100k rows https://github.com/cloudera/hue/blob/release-3.12.0/desktop/conf.dist/hue.ini#L905 Not sure how much it scales up. -- Ruslan Dautkhanov On Tue, May 2, 2017 at 10:41 AM, Paul Brenner <pbren...@placeiq.com> wrote: > There are limits to how much data the download to csv button will download > (1.5MB? 3500 rows?) which limit zeppelin’s usefulness for our BI teams. > This limit comes up far before we run into issues with showing too many > rows of data in zeppelin. > > Unfortunately (fortunately?) Hue is the other tool the BI team has been > using and there they have no problem downloading much larger datasets to > csv. This is definitely not a requirement I’ve ever run into in the way I > use zeppelin since I would just use spark to write the data out. However, > the BI team is not allowed to run spark jobs (they use hive via jdbc) so > that download to csv button is pretty important to them. > > Would it be possible to significantly increase the limit? Even better > would it be possible to download more data than is shown? I assume this is > the type of thing I would need to open a ticket for, but I wanted to ask > here first. > > <http://www.placeiq.com/> <http://www.placeiq.com/> > <http://www.placeiq.com/> Paul Brenner <https://twitter.com/placeiq> > <https://twitter.com/placeiq> <https://twitter.com/placeiq> > <https://www.facebook.com/PlaceIQ> <https://www.facebook.com/PlaceIQ> > <https://www.linkedin.com/company/placeiq> > <https://www.linkedin.com/company/placeiq> > DATA SCIENTIST > *(217) 390-3033 <(217)%20390-3033> * > > <http://www.placeiq.com/2015/05/26/placeiq-named-winner-of-prestigious-2015-oracle-data-cloud-activate-award/> > <http://placeiq.com/2015/12/18/accuracy-vs-precision-in-location-data-mma-webinar/> > <http://placeiq.com/2015/12/18/accuracy-vs-precision-in-location-data-mma-webinar/> > <http://placeiq.com/2015/12/18/accuracy-vs-precision-in-location-data-mma-webinar/> > <http://placeiq.com/2015/12/18/accuracy-vs-precision-in-location-data-mma-webinar/> > <http://placeiq.com/2016/03/08/measuring-addressable-tv-campaigns-is-now-possible/> > <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/> > <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/> > <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/> > <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/> > <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/> > <http://pages.placeiq.com/Location-Data-Accuracy-Whitepaper-Download.html?utm_source=Signature&utm_medium=Email&utm_campaign=AccuracyWP> > <http://placeiq.com/2016/08/03/placeiq-bolsters-location-intelligence-platform-with-mastercard-insights/> > <http://placeiq.com/2016/10/26/the-making-of-a-location-data-industry-milestone/>[image: > PlaceIQ:Location Data Accuracy] > <http://placeiq.com/2016/12/07/placeiq-introduces-landmark-a-groundbreaking-offering-that-delivers-access-to-the-highest-quality-location-data-for-insights-that-fuel-limitless-business-decisions/> >