Re: Job Multiple Outputs

2019-09-10 Thread Julien Massiera
of the same document. The framework is smart enough to not hand a document to a connector if it hasn't changed (according to how the connector computes the connector-specific output version string). Karl On Tue, Sep 10, 2019 at 11:00 AM Julien Massiera <mailto:julien.massi...@francelabs.

Job Multiple Outputs

2019-09-10 Thread Julien Massiera
document waiting for the current ingestion to be completed for both outputs?). Also, if one output is configured to send a commit at the end of the job, is this commit pending until the last ingestion has occurred in the other output? Thanks for your help, Julien

RE: Web connector empty session cookie cache

2019-06-04 Thread Julien
Hi Karl, I understand, I'll check that. Thanks, Julien From: Karl Wright Sent: Monday, June 3, 2019 20:37 To: user@manifoldcf.apache.org Subject: Re: Web connector empty session cookie cache Hi Julien, When the session-based web crawl detects entry into a login sequence, the session cookies

Web connector empty session cookie cache

2019-06-03 Thread Julien Massiera
the saved cookies. Did I miss an easy way to empty the session cookie cache? If not, wouldn't it make sense to add a button for it? Regards, Julien

RE: Long running queries on jobqueue

2019-05-28 Thread Julien
the following Postgres properties: max_connections = 405 shared_buffers = 1024MB checkpoint_timeout = 900s max_wal_size = 14GB autovacuum = off Julien From: Karl Wright Sent: Tuesday, May 28, 2019 15:02 To: user@manifoldcf.apache.org Subject: Re: Long running queries on jobqueue Hi Julien, "

Long running queries on jobqueue

2019-05-28 Thread Julien
was only expecting low performance on the job status UI. Is it possible that it is responsible for the long-running queries? Thanks for the help, Julien --- This email has been checked for viruses by Avast antivirus software. https://www.avast.com/antivirus

Re: Solr examples with long metadata needed

2018-09-26 Thread Julien Massiera
xels}{add=[file:/localhost/OCR/HOT%20Balloon%20Trip_Ultra%20HD.jpg (1612689210913849344)]} Julien On 26/09/2018 17:09, Karl Wright wrote: Hi ManifoldCF Community, I need one or two concrete examples of solr [INFO] log messages that include very long metadata (>8192).  This is app

Parallelize jobs

2018-02-26 Thread Julien
(Solr). When I start them both, the execution is sequential: one job somehow waits until the other one is done. I tested it on MCF v2.7. Regards, Julien

Re: User rights for Sharepoint connector

2017-12-23 Thread Julien Massiera
Hi Karl, No problem, it is what I would have proposed anyway! Julien On 23/12/2017 at 16:27, Karl Wright wrote: Do you mind if I include this in the SharePoint connector documentation? Thanks, Karl On Sat, Dec 23, 2017 at 10:13 AM, Julien Massiera <julien.massi...@francelabs.

User rights for Sharepoint connector

2017-12-21 Thread Julien Massiera
rks but I would like to avoid using an admin user to crawl my site. Thanks for your help! Julien Massiera

MCF 2.8.1 agent logs

2017-09-27 Thread Julien Massiera
The problem is that the log file is correctly created but is never filled. Any idea what's wrong? Did I miss something? Thanks for your help, Julien

Update Tika

2017-09-14 Thread Julien
that MCF 2.8.1 uses version 21 of the Guava library whereas Tika uses version 17, but there have been some major changes since version 19 that could potentially break something in Tika. Can you help me a little bit with this, please? Thanks, Julien From: Karl Wright Sent: Friday, September 8

Re: Download MCF 2.8.1

2017-09-12 Thread Julien Massiera
Hi Karl, Ok I understand now. Thanks for the answer. Julien On 12/09/2017 11:54, Karl Wright wrote: Hi Julien, The 2nd is when the artifact was uploaded to the release server. There is then a voting process before actual release. The voting process was completed on the 7th. Karl

Download MCF 2.8.1

2017-09-12 Thread Julien Massiera
and I can safely download MCF from this website or is it better to take it from the MCF website? Regards -- Julien MASSIERA Search technology expert France Labs – The Search experts Winner of the EY Internal Search challenge at Viva Technologies 2016 www.francelabs.com Tel : +33

Re: Delete IDs with JDBC connector

2017-04-27 Thread julien . massiera
Oh yes indeed, never mind, I missed that point. So all is OK. Thanks Karl On 27.04.2017 13:28, Karl Wright wrote: > Hi Julien, > > The JDBC connector uses MODEL_ADD_CHANGE. The requirement for > MODEL_ADD_CHANGE is that seeding includes all documents between the time > r

Re: Delete IDs with JDBC connector

2017-04-27 Thread julien . massiera
E) and $(ENDTIME) variables are not present in the 'Seeding query' and the 'Version check query' is empty. What do you think? Anyway, thanks for your quick fix, Julien On 26.04.2017 19:12, Karl Wright wrote: > I committed a fix to trunk, and also uploaded a patch to the ticket. Please > l

Re: Delete IDs with JDBC connector

2017-04-26 Thread julien . massiera
Oh OK, so I don't have to investigate after all :) Thanks Karl! Julien On 26.04.2017 17:20, Karl Wright wrote: > Oh, never mind. I see the issue, which is that without the version query, > documents that don't appear in the result list *at all* are never removed > from the m

Delete IDs with JDBC connector

2017-04-26 Thread julien . massiera
he seeding of the job, but it is very time- and resource-consuming. Best regards, Julien Massiera

[JCIFS Connector] crawl job stop on access error

2016-12-09 Thread Julien Massiera
eally want to prevent the job from stopping if a lock file is encountered and not filtered. Thanks -- Julien MASSIERA Search technology expert France Labs – The Search experts Winner of the EY Internal Search challenge at Viva Technologies 2016 www.francelabs.com Tel : +33 (0) 663778847

[MCF API] DELETE not available for some connectors

2016-11-28 Thread Julien Massiera
nnections" while "GET" and "PUT" are available (see the method "executeDeleteCommand" in the class "org/apache/manifoldcf/crawler/system/ManifoldCF"). Is there a specific reason for this? If not, do you plan to implement it any time soon? Than

Re: Multiple output documents from one input document in transformation connector

2016-05-20 Thread Julien Massiera
re. Am I missing something? Julien On 19/05/2016 21:14, Karl Wright wrote: This sounds like it would work. Karl Sent from my Windows Phone From: Julien Massiera Sent: 5/19/2016 12:44 PM To: user@manifoldcf.apache.org Subject: Multiple output documents from one input document in transformation co

Multiple output documents from one input document in transformation connector

2016-05-19 Thread Julien Massiera
the emails and send them for Solr ingestion through the activities object. Is my approach correct, or do I need to consider another solution? Thanks for your help. Julien Massiera