[RESULT] [VOTE] Release Apache ManifoldCF 2.13, RC1

2019-05-01 Thread Karl Wright
Three +1's, >72 hours.  Vote passes!

Karl

On Wed, May 1, 2019 at 11:37 AM Antonio David Pérez Morales <
adperezmora...@gmail.com> wrote:

> Built and ran tests
>
> +1 for me
>
> On Tue, Apr 30, 2019 at 8:34, Karl Wright wrote:
>
> > Ran tests.
> > +1 from me.
> > Karl
> >
> >
> > On Mon, Apr 29, 2019 at 4:32 AM Rafa Haro  wrote:
> >
> > > Downloaded source code and built the release correctly. +1
> > >
> > > > On Thu, Apr 25, 2019 at 11:54 PM Karl Wright wrote:
> > >
> > > > > Please vote on whether to release Apache ManifoldCF 2.13, RC1.  The release
> > > > > artifact can be found at:
> > > > > https://dist.apache.org/repos/dist/dev/manifoldcf/apache-manifoldcf-2.13.
> > > > There is also a release tag at
> > > > https://svn.apache.org/repos/asf/manifoldcf/tags/release-2.13-RC1.
> > > >
> > > > > This release contains primarily a redeveloped Jcifs connector, to work with
> > > > > jcifs-ng, plus a modest number of bug fixes.
> > > > >
> > > > > The release has been respun due to a syntax error in the jcifs connector
> > > > > pom.
> > > >
> > > > Karl
> > > >
> > >
> >
>


Re: [VOTE] Release Apache ManifoldCF 2.13, RC1

2019-05-01 Thread Antonio David Pérez Morales
Built and ran tests

+1 for me

On Tue, Apr 30, 2019 at 8:34, Karl Wright wrote:

> Ran tests.
> +1 from me.
> Karl
>
>
> On Mon, Apr 29, 2019 at 4:32 AM Rafa Haro  wrote:
>
> > Downloaded source code and built the release correctly. +1
> >
> > On Thu, Apr 25, 2019 at 11:54 PM Karl Wright  wrote:
> >
> > > Please vote on whether to release Apache ManifoldCF 2.13, RC1.  The release
> > > artifact can be found at:
> > > https://dist.apache.org/repos/dist/dev/manifoldcf/apache-manifoldcf-2.13.
> > > There is also a release tag at
> > > https://svn.apache.org/repos/asf/manifoldcf/tags/release-2.13-RC1.
> > >
> > > This release contains primarily a redeveloped Jcifs connector, to work with
> > > jcifs-ng, plus a modest number of bug fixes.
> > >
> > > The release has been respun due to a syntax error in the jcifs connector
> > > pom.
> > >
> > > Karl
> > >
> >
>


[jira] [Commented] (CONNECTORS-1519) CLIENTPROTOCOLEXCEPTION is thrown with 2.10 -> ES 6.x.y

2019-05-01 Thread Karl Wright (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16830948#comment-16830948
 ] 

Karl Wright commented on CONNECTORS-1519:
-

[~svanschalkwyk], are you following this?

> CLIENTPROTOCOLEXCEPTION is thrown with 2.10 -> ES 6.x.y
> ---
>
>                 Key: CONNECTORS-1519
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1519
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Elastic Search connector
>    Affects Versions: ManifoldCF 2.10
>            Reporter: Steph van Schalkwyk
>            Assignee: Steph van Schalkwyk
>            Priority: Major
>             Fix For: ManifoldCF 2.13
>
>
> Investigating CLIENTPROTOCOLEXCEPTION when using 2.10 with ES 6.x.y
> More information to follow.
> Fails when using security, i.e. http://user:password@elasticsearch:9200.
> Remedy:
>  # Disable x-pack security.
>  # Use http://elasticsearch:9200.
>  
>  
> |07-27-2018 17:53:19.010|Indexation (ES)|file:/var/manifoldcf/corpus/14.html|CLIENTPROTOCOLEXCEPTION|38053|23|



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (CONNECTORS-1602) Continuous crawling doesn't recrawl everything

2019-05-01 Thread Karl Wright (JIRA)


 [ 
https://issues.apache.org/jira/browse/CONNECTORS-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wright resolved CONNECTORS-1602.
-
Resolution: Not A Problem

> Continuous crawling doesn't recrawl everything
> --
>
>                 Key: CONNECTORS-1602
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1602
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Web connector
>            Reporter: Donald Van den Driessche
>            Priority: Major
>
> When crawling a website in continuous crawling mode, we saw that not all
> documents are recrawled.
> The site is quite extensive. We figured out that, after crawling, a
> document/page gets a recrawl timestamp between the recrawl interval and the
> max recrawl interval.
> But if these timestamps fall within the first crawl, ManifoldCF starts
> recrawling those documents but seems to ignore the rest of the website. Also,
> some documents get recrawled 5 times while others don't get recrawled at all,
> apparently due to the same issue.
>  
> Is it possible to shed a bit more light on continuous crawling?
> Is it a good system to use for crawling an (extensive) website?


