There are lots of great ideas in this thread and I am +1 for all of them.

For content management, I am familiar with https://jekyllrb.com/,
https://gohugo.io/, and https://docusaurus.io/ and I know that these
are used by many Apache projects. It is worth mentioning that Hive
recently migrated to Hugo, so picking this option may make it easier to
get help from currently active Hive contributors. Apart from that, I
don't have a strong preference for one or the other.

In general, I am not a big fan of the Confluence wiki since it makes
contributions more cumbersome. It is difficult to do reviews, history
is hard to track, and people's contributions are not easily
noticeable. In the past, I have experimented with converting the wiki
pages to Markdown files, so I can definitely help get the content
onto the website if needed.

Indeed, it would be very helpful to have a getting started guide which
hides the installation complexities and gets more hands-on with Tez.
Most people were introduced to MapReduce via the word count example,
and there are tons of articles online about that. In Tez, we have a
word count example in the git repo, and I think we should give it a
prominent place on the Getting Started page. Other classes from the
tez-examples module might also be a good fit for the documentation.

To hide the installation complexities, the obvious choice is to
build or reuse some Docker containers that could get someone up and
running very quickly. Possibly the already published Hadoop images
could be of use here.

Best,
Stamatis

On Wed, Sep 18, 2024 at 4:33 PM Lewis John McGibbney <[email protected]> wrote:
>
> Thanks Ayush for the input.
> I’ve gathered a fair amount of information and have a ‘plan’ of sorts…
> I’ll summarize it here and seek input in due course.
> lewismc
>
> On 2024/09/13 08:29:46 Ayush Saxena wrote:
> > Tez does maintain some documentation, like the one here: [1]. The
> > first time I deployed Tez locally, I used this doc, but I think some
> > of it is outdated or needs a couple more tweaks. Maybe validating or
> > improving the existing docs would be a good start.
> >
> > Thanks, Lewis, for volunteering. In any case, whether it requires
> > anything on the Tez side or the Hive side, I will be happy to help or
> > pull in people who can help.
> >
> > -Ayush
> >
> > [1] https://tez.apache.org/install.html
> >
> > On Fri, 13 Sept 2024 at 13:49, Denys Kuzmenko <[email protected]> wrote:
> > >
> > > I think it would be helpful to consolidate the existing Tez
> > > documentation (wiki pages) and migrate it into the Tez site space.
> > >
> > > +1 on a simple getting started guide, as it's the first place new
> > > users would check.
> > >
> > > Also, a few additional resources might be added to the user guides:
> > > 1. https://blog.cloudera.com/optimizing-hive-on-tez-performance/
> > > 2. https://community.cloudera.com/t5/Community-Articles/Demystify-Apache-Tez-Memory-Tuning-Step-by-Step/ta-p/245279
> >
