Re: Documents Out Of Scope and hop count

2023-09-26 Thread Marisol Redondo
ick up the documents you excluded before with the > hopcount mechanism. Only when the documents need to be rescanned for some > other reason would that happen as it stands now. But I will get back to > you after a review at the end of the week. > > Karl > > Karl > > > O

Re: Documents Out Of Scope and hop count

2023-09-26 Thread Marisol Redondo
ld need to > have recorded for this is gone. The only way to get it back is if you can > convince the ManifoldCF to recrawl all documents in the job. > > > On Tue, Sep 26, 2023 at 4:51 AM Marisol Redondo < > marisol.redondo.gar...@gmail.com> wrote: > >> >&

Documents Out Of Scope and hop count

2023-09-26 Thread Marisol Redondo
Hi, I had a problem with documents out of scope. I changed the Maximum hop count for type "redirect" in one of my jobs to 5 and saw that the job was not processing some pages because of that, so I removed the value to get them injected into the output connector (Solr connector). After that, the same

Re: Duplicate key value violates unique constraint "repohistory_pkey"

2023-06-16 Thread Marisol Redondo
Hi, did you find a solution for that, or do you still have the history disabled? I'm having the same problem, and we are using PostgreSQL as the DB. Regards On Sun, 29 Jan 2023 at 05:48, Artem Abeleshev wrote: > Hi everyone! > > We are using ManifoldCF 2.22.1 with multiple nodes in our

Solr connector authentication issue

2023-06-07 Thread Marisol Redondo
Hi, we are using Solr 8 with basic authentication, and when checking the output connection I'm getting the exception "Solr authorization failure, code 401: aborting job". The Solr type is SolrCloud, as we have 3 servers (installed in AWS Kubernetes containers). I have set the user ID and password

Unreachable documents not deleted from solr

2017-09-14 Thread Marisol Redondo
Hi. I'm using ManifoldCF 2.x (2.5 in one VM and 2.6 in another) and crawling a web site to index into Solr 6. I thought that when the "Delete unreachable documents" check box in the "Hop Filters" tab of the job is checked, all the documents indexed in my Solr instance that have been removed or

Re: UTF-8 Format from Confluence to Solr

2017-06-12 Thread Marisol Redondo
. > > Thanks > > Regards > > 2017-06-01 16:28 GMT+02:00 Marisol Redondo <marisol.redondo.garcia@gmail. > com>: > >> I fixed the problem. >> >> The problem is that the Confluence connector is getting the entity of the >> request with the defa

Re: UTF-8 Format from Confluence to Solr

2017-06-01 Thread Marisol Redondo
s.toString(entity,*"UTF-8"*) Thanks On 31 May 2017 at 10:13, Marisol Redondo <marisol.redondo.gar...@gmail.com> wrote: > Hi. > > I'm having problems with the encoding when injecting in Solr 6 in > standalone mode from a Confluence wiki. > > I have Manifold 2.5 with
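
The fix quoted in this message passes "UTF-8" explicitly when converting the response entity to a string. As a rough illustration of why that matters, here is a minimal sketch using only the JDK, not the connector's HTTP library. The class and method names are hypothetical, and ISO-8859-1 is assumed as the mismatched default purely for the sake of the example.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class CharsetDecodeSketch {
    // Hypothetical helper: decode raw response bytes with an explicitly
    // chosen charset instead of relying on whatever default applies.
    static String decode(byte[] body, Charset charset) {
        return new String(body, charset);
    }

    public static void main(String[] args) {
        // "café" written with a Unicode escape so the source file encoding does not matter.
        String original = "caf\u00e9";
        byte[] body = original.getBytes(StandardCharsets.UTF_8);

        // Assumed mismatched default (ISO-8859-1): each UTF-8 byte becomes one character.
        String garbled = decode(body, StandardCharsets.ISO_8859_1);
        // Explicit UTF-8: the two-byte sequence is read back as a single character.
        String correct = decode(body, StandardCharsets.UTF_8);

        System.out.println("round-trips with UTF-8: " + correct.equals(original));
        System.out.println("round-trips with ISO-8859-1: " + garbled.equals(original));
    }
}
```

With the explicit UTF-8 charset the accented character survives the round trip. With the assumed single-byte default it is split into two characters, which is the kind of garbling that shows up in the index.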

UTF-8 Format from Confluence to Solr

2017-05-31 Thread Marisol Redondo
Hi. I'm having problems with the encoding when injecting into Solr 6 in standalone mode from a Confluence wiki. I have ManifoldCF 2.5 with Tomcat 8. The job's repository connector takes the information from a Confluence wiki and the output connector is Solr, using the Tika transformation, a

Re: Metadata adjuster

2017-02-22 Thread Marisol Redondo
Solr log for this document? That > should help clear up any confusion. > > Also, for what it is worth, MCF 1.10 is not using a SolrJ that is up to > date with Solr 6.x. That could be the source of the problem. Is there any > reason you are using a 1.x version of

Metadata adjuster

2017-02-21 Thread Marisol Redondo
Hi. I'm trying to use the Metadata Adjuster to add one field to the Solr index, but it doesn't inject the field into a Solr field. Maybe I'm misunderstanding the use of the Metadata Adjuster, but I have read in the documentation (https://manifoldcf.apache.org/release/release-1.10/en_