There are lots of great ideas in this thread and I am +1 for all of them. For content management, I am familiar with https://jekyllrb.com/, https://gohugo.io/, and https://docusaurus.io/ and I know that these are used by many Apache projects. Worth Mentioning that Hive recently migrated to Hugo so picking this option may be easier to get some help from currently active Hive contributors. Apart from that, I don't have strong preference for one or the other.
In general, I am not a big fan of the Confluence wiki since it makes contributions more cumbersome. It is difficult to do reviews, history is hard to track, and people's contributions are not easily noticeable. In the past, I have experimented in transforming the wiki pages to markdown files so I can definitely help getting the content on the website if needed. Indeed, it would be very helpful to have a getting started guide which hides the installation complexities and gets more hands-on with Tez. Most people got introduced to Map Reduce via the word count example and there are tons of articles online about that. In Tez, we have a word count example in the git repo and I think we should give it a prominent place in the Getting started page. Possibly other classes from the tez-examples module would be a good fit for the documentation. For hiding the installation complexities the obvious choice is to build or use some docker containers that could get someone ready to go very fast. Possibly the already published Hadoop images could be of use here. Best, Stamatis On Wed, Sep 18, 2024 at 4:33 PM Lewis John McGibbney <[email protected]> wrote: > > Thanks Ayush for the input. > I’ve gathered a fair amount of information and have a ‘plan’ of sorts… > I’ll summarize it here and seek input in due course. > lewismc > > On 2024/09/13 08:29:46 Ayush Saxena wrote: > > Tez does maintain some documentation like one here: [1], the first > > time I deployed Tez locally, I used this doc, but I think there are > > some stuff outdated or a couple of more tweaks required. Maybe > > validating or improving the existing ones maybe a good start.. > > > > Thanx Lewis for volunteering, in any case whether it requires anything > > on the Tez side or Hive side, I will be happy to help or pull in > > people who can help > > > > -Ayush > > > > [1] https://tez.apache.org/install.html > > > > On Fri, 13 Sept 2024 at 13:49, Denys Kuzmenko <[email protected]> wrote: > > > > > > I think that could be helpful if we could consolidate existing Tez > > > documentation (wiki pages) and migrate into the Tez site space. > > > > > > +1 on simple getting started, as it's the first place new users would > > > check > > > > > > Also few additional resource might be added into the user guides: > > > 1. https://blog.cloudera.com/optimizing-hive-on-tez-performance/ > > > 2. > > > https://community.cloudera.com/t5/Community-Articles/Demystify-Apache-Tez-Memory-Tuning-Step-by-Step/ta-p/245279 > >
