Hi Zhiyuan,
Thanks for the guidance. I'm making progress but I am still battling initial
configuration management issues.
I'm running HDFS and YARN v3.1.4 in pseudo-distributed mode.
My tez-site.xml contains the following:
<configuration>
<property>
<name>tez.lib.uris</name>
<value>${fs.defaultFS}/apps/tez-0.10.1-SNAPSHOT,${fs.defaultFS}/apps/tez-0.10.1-SNAPSHOT/lib,${fs.defaultFS}/apps/nutch</value>
</property>
</configuration>
N.B. When I attempted to use the compressed Tez tar.gz, I was running into
classpath issues which are largely documented in the installation documentation
you pointed me to. I overcame these issues by simply uploading the minimal
directory. All seems fine at this stage as I can run all of the Tez examples.
I run into trouble when I try to run any job from the Nutch application. For
example, when I run the Injector, one of the Nutch plugin extension points (x
point org.apache.nutch.net.URLNormalizer) cannot be found. The relevant log
can be seen at https://paste.apache.org/4whoe.
I should note that the entire Nutch .job is available on HDFS at the URI
defined in the tez-site.xml above.
The output of jar -tf on the nutch.job artifact can be seen at
https://paste.apache.org/hl8tk.
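In case a condensed view of that listing is useful, here is a minimal sketch
(Python, standard library only) that counts the entries of a jar-style archive
by top-level directory. The function name summarize_archive and the "(root)"
label are my own, purely for illustration, and it assumes the .job file is an
ordinary zip archive, as the jar -tf output suggests.

```python
import zipfile
from collections import Counter


def summarize_archive(path):
    """Count file entries in a zip/jar-style archive by top-level directory.

    Hypothetical helper for illustration; assumes `path` is a zip archive.
    """
    counts = Counter()
    with zipfile.ZipFile(path) as archive:
        for name in archive.namelist():
            if name.endswith("/"):
                continue  # skip explicit directory entries
            # Entries without a slash live at the archive root.
            top = name.split("/", 1)[0] if "/" in name else "(root)"
            counts[top] += 1
    return counts
```

Calling summarize_archive("nutch.job") on a local copy returns a Counter that
maps each top-level directory in the archive to the number of files under it.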
Am I required to somehow describe the structural hierarchy of this artifact in
the tez.lib.uris.classpath configuration property?
Thank you again for any guidance.
lewismc
On 2020/12/14 03:23:48, Zhiyuan Yang <[email protected]> wrote:
> Hi Lewis,
>
> If there is no incompatibility, your existing job will run well on Tez
> without code change. You can just follow this guide
> <https://tez.apache.org/install.html> (especially step 4) to try it out.
>
> Thanks,
> Zhiyuan
>
> On Mon, Dec 14, 2020 at 9:04 AM Lewis John McGibbney <[email protected]>
> wrote:
>