Re: JSON in Kafka -> ORC in HDFS - Thoughts on different tools?

2023-12-10 Thread Aaron Grubb
t of the box (Hive) does it. Why exactly do you need to replace it? Good luck, M. On Fri, Dec 1, 2023 at 11:38 AM Aaron Grubb <aa...@kaden.ai> wrote: Hi all, Posting this here to avoid biases from the individual mailing lists on why the product they're using is the best. I

JSON in Kafka -> ORC in HDFS - Thoughts on different tools?

2023-12-01 Thread Aaron Grubb
Hi all, Posting this here to avoid biases from the individual mailing lists on why the product they're using is the best. I'm analyzing tools to replace a section of our pipeline with something more efficient. Currently we're using Kafka Connect to take data from Kafka and put it into S3 (not HD

YARN Node Labels - Effective Capacity is 0% on labeled partition and infinity% on DEFAULT_PARTITION

2023-08-11 Thread Aaron Grubb
Hi all, Having some issues getting node labels working correctly. I'm going to link to a forum post I made because it has screenshots and formatted/highlighted config files, so I think it's easier to read that way; hopefully that isn't breaking list etiquette! https://community.cloudera.com/t5/

RE: setting up Load balancer for name nodes

2020-08-21 Thread Aaron Grubb
It is important to always read the documentation for your version, as documentation for other versions will not specify which versions contain the feature being documented. https://hadoop.apache.org/docs/r2.9.2/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithQJM.html From: Muthupandiyan, Kamaraj