Re: [MarkLogic Dev General] Large job processing question.

2017-08-23 Thread Ladner, Eric (Eric.Ladner)
On Behalf Of Sam Mefford Sent: August 22, 2017 16:18 To: MarkLogic Developer Discussion Subject: [**EXTERNAL**] Re: [MarkLogic Dev General] Large job processing question. We generally write external applications for long-running jobs. Java is a popular language for such jobs, and our Data Mov

Re: [MarkLogic Dev General] Large job processing question.

2017-08-23 Thread Geert Josten
o: MarkLogic Developer Discussion <general@developer.marklogic.com> Date: Tuesday, August 22, 2017 at 10:33 PM To: MarkLogic Developer Discussion <general@developer.marklogic.com> Subject: Re: [MarkLogic Dev General] Large job processing question. Is it smart enough not

Re: [MarkLogic Dev General] Large job processing question.

2017-08-22 Thread Sam Mefford
r your cooperation. From: general-boun...@developer.marklogic.com [general-boun...@developer.marklogic.com] on behalf of Ladner, Eric (Eric.Ladner) [eric.lad...@chevron.com] Sent: Tuesday, August 22, 2017 8:36 AM To: general@developer.marklogic.com Subject: [MarkLogic Dev General]

Re: [MarkLogic Dev General] Large job processing question.

2017-08-22 Thread Eliot Kimber
f Of Geert Josten Sent: August 22, 2017 13:59 To: MarkLogic Developer Discussion Subject: [**EXTERNAL**] Re: [MarkLogic Dev General] Large job processing question. Hi Eric, Personally, I would probably let go of the all-docs-at-once approach, and spawn processes for each input (sub)f

Re: [MarkLogic Dev General] Large job processing question.

2017-08-22 Thread Ladner, Eric (Eric.Ladner)
August 22, 2017 13:59 To: MarkLogic Developer Discussion Subject: [**EXTERNAL**] Re: [MarkLogic Dev General] Large job processing question. Hi Eric, Personally, I would probably let go of the all-docs-at-once approach, and spawn processes for each input (sub)folder, and potentially for batc

Re: [MarkLogic Dev General] Large job processing question.

2017-08-22 Thread Geert Josten
vron.com>> Reply-To: MarkLogic Developer Discussion <general@developer.marklogic.com> Date: Tuesday, August 22, 2017 at 4:36 PM To: "general@developer.marklogic.com" <general@developer.marklogic.com> Subject: [Ma

[MarkLogic Dev General] Large job processing question.

2017-08-22 Thread Ladner, Eric (Eric.Ladner)
We have some large jobs (ingestion and validation of unstructured documents) that have timeout issues. The way the jobs are structured is that the first job checks that all the existing documents are valid (still exist on the file system). It does this in two steps: 1) gath
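For illustration only, here is a minimal sketch of the batching idea raised in the replies (dropping the all-docs-at-once approach in favor of smaller units of work), applied to the "still exists on the file system" check. It is plain Python and does not use any MarkLogic API; the function names and the batch size of 500 are assumptions made for this example, not anything taken from the thread.

```python
import os
from typing import Iterable, Iterator, List


def batches(items: Iterable[str], size: int) -> Iterator[List[str]]:
    """Yield successive lists of at most `size` items (hypothetical helper)."""
    if size < 1:
        raise ValueError("size must be at least 1")
    batch: List[str] = []
    for item in items:
        batch.append(item)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        # Emit the final, possibly shorter, batch.
        yield batch


def missing_paths(paths: Iterable[str]) -> List[str]:
    """Return the paths that no longer exist on the file system."""
    return [p for p in paths if not os.path.exists(p)]


def validate_in_batches(paths: Iterable[str], batch_size: int = 500) -> List[str]:
    """Check paths one batch at a time instead of all at once.

    The batch size of 500 is an arbitrary assumption; each batch is a small,
    independent unit of work that could be handed to a separate task.
    """
    missing: List[str] = []
    for batch in batches(paths, batch_size):
        missing.extend(missing_paths(batch))
    return missing
```

Because each batch is handled on its own, no single unit of work has to cover the whole document set, which is the general point of splitting a long-running job. How the batches would actually be dispatched (spawned tasks, an external application, and so on) is left open here.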