> https://github.com/hortonworks/hive-testbench
>
> The official procedure to generate and upload the data has never worked
> for me (and it looks like it isn't supported software), so it could be
> a bit tricky to do it manually and on a single host.

I wrote the MapReduce jobs for that (tpcds-gen/tpch-gen) after waiting a
whole weekend for 1 TB of data to be generated on a single machine.

If you or anyone else has issues with it, I can take a look at it.

Cheers,
Gopal

