Re: RE : Contribution to ManifoldCF webcrawler

2023-09-26 Thread Karl Wright
version 5 / Discover our >> version 5 >> www.datafari.com >> >> From: Furkan KAMACI >> Sent: Monday, 25 September 2023 09:28 >> To: dev@manifoldcf.apache.org >> Cc: olivier.tav...@francelabs.com; France Labs >> Subject: Re: Contribution to Manifol

Re: RE : Contribution to ManifoldCF webcrawler

2023-09-25 Thread Karl Wright
tav...@francelabs.com; France Labs > Subject: Re: Contribution to ManifoldCF webcrawler > > Hi Emeric, > > First of all, thank you for your effort and suggestion. Do you have a Pull > Request for that improvement? > > Kind regards, > Furkan Kamaci > > On Mon, Sep 25, 2

RE : Contribution to ManifoldCF webcrawler

2023-09-25 Thread Emeric Bernet-Rollande
September 2023 09:28 To: dev@manifoldcf.apache.org Cc: olivier.tav...@francelabs.com; France Labs Subject: Re: Contribution to ManifoldCF webcrawler Hi Emeric, First of all, thank you for your effort and suggestion. Do you have a Pull Request for that improvement? Kind regards, Furkan Kamaci On Mon

Re: Contribution to ManifoldCF webcrawler

2023-09-25 Thread Furkan KAMACI
Hi Emeric, First of all, thank you for your effort and suggestion. Do you have a Pull Request for that improvement? Kind regards, Furkan Kamaci On Mon, Sep 25, 2023 at 10:23 AM Emeric Bernet-Rollande <emeric.ber...@francelabs.com> wrote: > Hi Karl and all! > > I’ve been working on the

Contribution to ManifoldCF webcrawler

2023-09-25 Thread Emeric Bernet-Rollande
Hi Karl and all! I’ve been working on the MCF webcrawler component for our Datafari project, and I made some developments that might interest the MCF community. Currently, if a website redirects the user with a 301 or 302 status code and the “limit to seed” option is checked, the website (the one