[RESULT] [VOTE] Release Apache ManifoldCF 2.13, RC1
Three +1's, >72 hours. Vote passes! Karl On Wed, May 1, 2019 at 11:37 AM Antonio David Pérez Morales < adperezmora...@gmail.com> wrote: > Built and ran tests > > +1 for me > > El mar., 30 abr. 2019 8:34, Karl Wright escribió: > > > Ran tests. > > +1 from me. > > Karl > > > > > > On Mon, Apr 29, 2019 at 4:32 AM Rafa Haro wrote: > > > > > Downloaded source code and built the release correctly. +1 > > > > > > On Thu, Apr 25, 2019 at 11:54 PM Karl Wright > wrote: > > > > > > > Please vote on whether to release Apache ManifoldCF 2.13, RC0. The > > > release > > > > artifact can be found at: > > > > > > https://dist.apache.org/repos/dist/dev/manifoldcf/apache-manifoldcf-2.13 > > > . > > > > There is also a release tag at > > > > https://svn.apache.org/repos/asf/manifoldcf/tags/release-2.13-RC1. > > > > > > > > This release contains primarily a redeveloped Jcifs connector, to > work > > > with > > > > jcifs-ng, plus a modest number of bug fixes. > > > > > > > > The release has been respun due to a syntax error in the jcifs > > connector > > > > pom. > > > > > > > > Karl > > > > > > > > > >
Re: [VOTE] Release Apache ManifoldCF 2.13, RC1
Built and ran tests +1 for me El mar., 30 abr. 2019 8:34, Karl Wright escribió: > Ran tests. > +1 from me. > Karl > > > On Mon, Apr 29, 2019 at 4:32 AM Rafa Haro wrote: > > > Downloaded source code and built the release correctly. +1 > > > > On Thu, Apr 25, 2019 at 11:54 PM Karl Wright wrote: > > > > > Please vote on whether to release Apache ManifoldCF 2.13, RC0. The > > release > > > artifact can be found at: > > > > https://dist.apache.org/repos/dist/dev/manifoldcf/apache-manifoldcf-2.13 > > . > > > There is also a release tag at > > > https://svn.apache.org/repos/asf/manifoldcf/tags/release-2.13-RC1. > > > > > > This release contains primarily a redeveloped Jcifs connector, to work > > with > > > jcifs-ng, plus a modest number of bug fixes. > > > > > > The release has been respun due to a syntax error in the jcifs > connector > > > pom. > > > > > > Karl > > > > > >
[jira] [Commented] (CONNECTORS-1519) CLIENTPROTOCOLEXCEPTION is thrown with 2.10 -> ES 6.x.y
[ https://issues.apache.org/jira/browse/CONNECTORS-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16830948#comment-16830948 ] Karl Wright commented on CONNECTORS-1519: - [~svanschalkwyk], are you following this? > CLIENTPROTOCOLEXCEPTION is thrown with 2.10 -> ES 6.x.y > --- > > Key: CONNECTORS-1519 > URL: https://issues.apache.org/jira/browse/CONNECTORS-1519 > Project: ManifoldCF > Issue Type: Bug > Components: Elastic Search connector >Affects Versions: ManifoldCF 2.10 >Reporter: Steph van Schalkwyk >Assignee: Steph van Schalkwyk >Priority: Major > Fix For: ManifoldCF 2.13 > > > Investigating CLIENTPROTOCOLEXCEPTION when using 2.10 with ES 6.x.y > More information to follow. > Fails when using security , i.e. > [http://user:password@elasticsearch:9200.|http://user:password@elasticsearch:9200./] > Remedy: > # Disable x-pack security. > # Use http://elasticsearch:9200. > > > |07-27-2018 17:53:19.010|Indexation > (ES)|file:/var/manifoldcf/corpus/14.html|CLIENTPROTOCOLEXCEPTION|38053|23| -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (CONNECTORS-1602) Continuous crawling doesn't recrawl everything
[ https://issues.apache.org/jira/browse/CONNECTORS-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wright resolved CONNECTORS-1602. - Resolution: Not A Problem > Continuous crawling doesn't recrawl everything > -- > > Key: CONNECTORS-1602 > URL: https://issues.apache.org/jira/browse/CONNECTORS-1602 > Project: ManifoldCF > Issue Type: Bug > Components: Web connector >Reporter: Donald Van den Driessche >Priority: Major > > When crawling a website in continuous crawling mode we saw that not all > documents are recrawled. > The site is quite extensive. We figured out that after crawling a > document/page gets a recrawl timestamp in between the recrawl interval and > max recrawl interval. > But if these values occur within the first crawl, Manifold starts recrawling > those, but seems to ignore the rest of the website. Also sometimes documents > get recrawled 5 times while other don't get recrawled. Apparently due to the > same issue. > > Is it possible to shed a bit more light on the continuous crawling? > Is it a good system to use for crawling a (extensive) website? -- This message was sent by Atlassian JIRA (v7.6.3#76005)