Hi Markus, thanks!
Could you share the files in .ivy2/cache/org.apache.httpcomponents/httpasyncclient/ and maybe also the logs of a Nutch build starting with an empty ~/.ivy2/cache ? I'll have a look and compare it what I find on my system - maybe use a new thread on user@ or a Jira issue, I'll plan to close the vote over the weekend, so let's keep this thread for the release vote alone. Best, Sebastian On 8/29/22 14:17, Markus Jelsma wrote: > Hello Sebastian, > > No, the JAR isn't present. Multiple JARs are missing, probably because they > are loaded after httpasyncclient. I checked the previously emptied Ivy > cache. The Ivy files are there, but the JAR is missing there too. > > markus@midas:~$ ls .ivy2/cache/org.apache.httpcomponents/httpasyncclient/ > ivy-4.1.4.xml ivy-4.1.4.xml.original ivydata-4.1.4.properties > > I manually downloaded the JAR from [1] and added it to the jars/ directory > in the Ivy cache. It still cannot find the JAR, perhaps the Ivy cache needs > some more things than just adding the JAR manually. > > The odd thing is, that i got the URL below FROM the ivydata-4.1.4.properties > file in the cache. > > Since Ralf can compile it without problems, it seems to be an issue on my > machine only. So Nutch seems fine, therefore +1. > > Regards, > Markus > > [1] > https://repo1.maven.org/maven2/org/apache/httpcomponents/httpasyncclient/4.1.4/ > > > Op zo 28 aug. 2022 om 12:05 schreef Sebastian Nagel > <wastl.na...@googlemail.com.invalid>: > >> Hi Ralf, >> >>> It fetches it parses >> >> So a +1 ? >> >> Best, >> Sebastian >> >> On 8/25/22 05:22, BlackIce wrote: >>> nevermind I made a typo... >>> >>> It fetches it parses >>> >>> On Thu, Aug 25, 2022 at 3:42 AM BlackIce <blackice...@gmail.com> wrote: >>>> >>>> so far... it doesn't select anything when creating segments: >>>> 0 records selected for fetching, exiting >>>> >>>> On Wed, Aug 24, 2022 at 3:02 PM BlackIce <blackice...@gmail.com> wrote: >>>>> >>>>> I have been able to compile under OpenJDK 11 >>>>> Have not done anything further so far >>>>> I'm gonna try to get to it this evening >>>>> >>>>> Greetz >>>>> Ralf >>>>> >>>>> On Wed, Aug 24, 2022 at 1:29 PM Markus Jelsma >>>>> <markus.jel...@openindex.io> wrote: >>>>>> >>>>>> Hi, >>>>>> >>>>>> Everything seems fine, the crawler seems fine when trying the binary >>>>>> distribution. The source won't work because this computer still cannot >>>>>> compile it. Clearing the local Ivy cache did not do much. This is the >> known >>>>>> compiler error with the elastic-indexer plugin: >>>>>> compile: >>>>>> [echo] Compiling plugin: indexer-elastic >>>>>> [javac] Compiling 3 source files to >>>>>> /home/markus/temp/apache-nutch-1.19/build/indexer-elastic/classes >>>>>> [javac] >>>>>> >> /home/markus/temp/apache-nutch-1.19/src/plugin/indexer-elastic/src/java/org/apache/nutch/indexwriter/elastic/ElasticIndexWriter.java:39: >>>>>> error: package org.apache.http.impl.nio.client does not exist >>>>>> [javac] import >> org.apache.http.impl.nio.client.HttpAsyncClientBuilder; >>>>>> [javac] ^ >>>>>> [javac] 1 error >>>>>> >>>>>> >>>>>> The binary distribution works fine though. I do see a lot of new >> messages >>>>>> when fetching: >>>>>> 2022-08-24 13:21:15,867 INFO o.a.n.n.URLExemptionFilters >> [LocalJobRunner >>>>>> Map Task Executor #0] Found 0 extensions at >>>>>> point:'org.apache.nutch.net.URLExemptionFilter' >>>>>> >>>>>> This is also new at start of each task: >>>>>> SLF4J: Class path contains multiple SLF4J bindings. >>>>>> SLF4J: Found binding in >>>>>> >> [jar:file:/home/markus/temp/apache-nutch-1.19/lib/log4j-slf4j-impl-2.18.0.jar!/org/slf4j/impl/StaticLoggerBinder.class] >>>>>> >>>>>> SLF4J: Found binding in >>>>>> >> [jar:file:/home/markus/temp/apache-nutch-1.19/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class] >>>>>> >>>>>> SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an >>>>>> explanation. >>>>>> SLF4J: Actual binding is of type >>>>>> [org.apache.logging.slf4j.Log4jLoggerFactory] >>>>>> >>>>>> And this one at the end of fetcher: >>>>>> log4j:WARN No appenders could be found for logger >>>>>> (org.apache.commons.httpclient.params.DefaultHttpParams). >>>>>> log4j:WARN Please initialize the log4j system properly. >>>>>> log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig >> for >>>>>> more info. >>>>>> >>>>>> I am worried about the indexer-elastic plugin, maybe others have that >>>>>> problem too? Otherwise everything seems fine. >>>>>> >>>>>> Markus >>>>>> >>>>>> Op ma 22 aug. 2022 om 17:30 schreef Sebastian Nagel < >> sna...@apache.org>: >>>>>> >>>>>>> Hi Folks, >>>>>>> >>>>>>> A first candidate for the Nutch 1.19 release is available at: >>>>>>> >>>>>>> https://dist.apache.org/repos/dist/dev/nutch/1.19/ >>>>>>> >>>>>>> The release candidate is a zip and tar.gz archive of the binary and >>>>>>> sources in: >>>>>>> https://github.com/apache/nutch/tree/release-1.19 >>>>>>> >>>>>>> In addition, a staged maven repository is available here: >>>>>>> >> https://repository.apache.org/content/repositories/orgapachenutch-1020 >>>>>>> >>>>>>> We addressed 87 issues: >>>>>>> https://s.apache.org/lf6li >>>>>>> >>>>>>> >>>>>>> Please vote on releasing this package as Apache Nutch 1.19. >>>>>>> The vote is open for the next 72 hours and passes if a majority >>>>>>> of at least three +1 Nutch PMC votes are cast. >>>>>>> >>>>>>> [ ] +1 Release this package as Apache Nutch 1.19. >>>>>>> [ ] -1 Do not release this package becauseā¦ >>>>>>> >>>>>>> Cheers, >>>>>>> Sebastian >>>>>>> (On behalf of the Nutch PMC) >>>>>>> >>>>>>> P.S. >>>>>>> Here is my +1. >>>>>>> - tested most of Nutch tools and run a test crawl on a single-node >> cluster >>>>>>> running Hadoop 3.3.4, see >>>>>>> https://github.com/sebastian-nagel/nutch-test-single-node-cluster/ >> ) >>>>>>> >> >