Fantastic input Stamatis. Thank you
On 2024/09/19 11:40:15 Stamatis Zampetakis wrote: > There are lots of great ideas in this thread and I am +1 for all of them. > > For content management, I am familiar with https://jekyllrb.com/, > https://gohugo.io/, and https://docusaurus.io/ and I know that these > are used by many Apache projects. Worth Mentioning that Hive recently > migrated to Hugo so picking this option may be easier to get some help > from currently active Hive contributors. Apart from that, I don't have > strong preference for one or the other. > > In general, I am not a big fan of the Confluence wiki since it makes > contributions more cumbersome. It is difficult to do reviews, history > is hard to track, and people's contributions are not easily > noticeable. In the past, I have experimented in transforming the wiki > pages to markdown files so I can definitely help getting the content > on the website if needed. > > Indeed, it would be very helpful to have a getting started guide which > hides the installation complexities and gets more hands-on with Tez. > Most people got introduced to Map Reduce via the word count example > and there are tons of articles online about that. In Tez, we have a > word count example in the git repo and I think we should give it a > prominent place in the Getting started page. Possibly other classes > from the tez-examples module would be a good fit for the > documentation. > > For hiding the installation complexities the obvious choice is to > build or use some docker containers that could get someone ready to go > very fast. Possibly the already published Hadoop images could be of > use here. > > Best, > Stamatis > > On Wed, Sep 18, 2024 at 4:33 PM Lewis John McGibbney <[email protected]> > wrote: > > > > Thanks Ayush for the input. > > I’ve gathered a fair amount of information and have a ‘plan’ of sorts… > > I’ll summarize it here and seek input in due course. > > lewismc > > > > On 2024/09/13 08:29:46 Ayush Saxena wrote: > > > Tez does maintain some documentation like one here: [1], the first > > > time I deployed Tez locally, I used this doc, but I think there are > > > some stuff outdated or a couple of more tweaks required. Maybe > > > validating or improving the existing ones maybe a good start.. > > > > > > Thanx Lewis for volunteering, in any case whether it requires anything > > > on the Tez side or Hive side, I will be happy to help or pull in > > > people who can help > > > > > > -Ayush > > > > > > [1] https://tez.apache.org/install.html > > > > > > On Fri, 13 Sept 2024 at 13:49, Denys Kuzmenko <[email protected]> > > > wrote: > > > > > > > > I think that could be helpful if we could consolidate existing Tez > > > > documentation (wiki pages) and migrate into the Tez site space. > > > > > > > > +1 on simple getting started, as it's the first place new users would > > > > check > > > > > > > > Also few additional resource might be added into the user guides: > > > > 1. https://blog.cloudera.com/optimizing-hive-on-tez-performance/ > > > > 2. > > > > https://community.cloudera.com/t5/Community-Articles/Demystify-Apache-Tez-Memory-Tuning-Step-by-Step/ta-p/245279 > > > >
