HI Geoff,

Great that you are getting across the MarkLogic Data Hub.

Which version of MarkLogic have you got installed to run the Data Hub framework 
against?

Regards,
Chris Day

Chris Day - Sales Engineer
[email protected]
 

Mobile:  +61 433 370 083

Phone:  +61 2 8315 1556

Skype:  chrisday-aus
Twitter:  @ML_ChrisDay

MarkLogic Pty Ltd
www.marklogic.com<http://www.marklogic.com/>

What’s new in MarkLogic 9? MLU self-paced course -  https://goo.gl/tMWkoq

[cid:[email protected]]<http://www.marklogic.com/social>
This e-mail and any accompanying attachments are confidential. The information 
is intended solely for the use of the individual to whom it is addressed. Any 
review, disclosure, copying, distribution, or use of this e-mail communication 
by others is strictly prohibited. If you are not the intended recipient, please 
notify us immediately by returning this message to the sender and delete all 
copies. Thank you for your cooperation.


From: <[email protected]> on behalf of Geoffrey Skellams 
<[email protected]>
Reply-To: MarkLogic Developer Discussion <[email protected]>
Date: Monday, 31 July 2017 at 3:59 pm
To: "[email protected]" <[email protected]>
Subject: [MarkLogic Dev General] MarkLogic Data Hub Tutorial problems

Afternoon all,

I’m brand new to using MarkLogic, and I’m trying to work my way through the 
MarkLogic Data Hub tutorial on GitHub 
(https://marklogic-community.github.io/marklogic-data-hub/). At this point, I 
am simply just following the instructions as written to get a feel for what the 
process is and make sure that we can get some data into a database correctly.

I’m running MarkLogic on a Windows Server 2016 VM in the Microsoft Azure cloud, 
using the current version of Java 8. Last week I tried using the DataHub 
v.1.1.4 .war file, but ran into errors with it, and thought I had done 
something wrong, so we destroyed the virtual machine, created a brand new one, 
installed the software, but used the v.1.1.3 .war file on advice from 
colleagues who had some success with it last week (but are unable to do so 
again today).


Everything seems to go well, until I get to step 8, ingesting the Acme Tech 
data. The DataHub’s log shows the import as failing, and I’ve included the log 
file below. If I try to ingest the GlobalTech records, I get similar problems 
(although a lot more of them, considering the larger data file size).

It turns out that the error messages from the v1.1.4 hub were the same as the 
ones below.

I’ve tried googling the problem, but can’t seem to find anything that resembles 
this issue.

Can anyone suggest a reason why this is occurring and what we can do to fix it? 
Is it simply a PEBKAC issue (i.e: both myself and my colleague have somehow 
missed important steps), or does the tutorial documentation not match up with 
the latest versions of the hub?


----- BEGIN LOG FILE CONTENTS -----
8:22.876 [main] INFO  c.m.contentpump.LocalJobRunner - Content type: JSON
03:48:24.043 [main] INFO  c.marklogic.contentpump.ContentPump - Job name: 
local_702976126_1
03:48:24.076 [main] INFO  c.m.c.FileAndDirectoryInputFormat - Total input paths 
to process : 2
03:48:25.552 [pool-1-thread-2] ERROR c.m.contentpump.TransformWriter - 
QueryException:XDMP-AS: (err:XPTY0004) $transform-option as map:map? -- Invalid 
coercion: "/34324.json" as map:map
03:48:25.552 [pool-1-thread-2] WARN  c.m.contentpump.TransformWriter - Failed 
document /34324.json in file:/C:/data-hub/input/AcmeTech/34324.json
03:48:25.555 [pool-1-thread-1] ERROR c.m.contentpump.TransformWriter - 
QueryException:XDMP-AS: (err:XPTY0004) $transform-option as map:map? -- Invalid 
coercion: "/32920.json" as map:map
03:48:25.555 [pool-1-thread-1] WARN  c.m.contentpump.TransformWriter - Failed 
document /32920.json in file:/C:/data-hub/input/AcmeTech/32920.json
03:48:25.564 [Thread-4] INFO  c.m.contentpump.LocalJobRunner -  completed 100%
03:48:25.575 [main] INFO  c.m.contentpump.LocalJobRunner - 
com.marklogic.mapreduce.MarkLogicCounter:
03:48:25.576 [main] INFO  c.m.contentpump.LocalJobRunner - INPUT_RECORDS: 2
03:48:25.576 [main] INFO  c.m.contentpump.LocalJobRunner - OUTPUT_RECORDS: 2
03:48:25.577 [main] INFO  c.m.contentpump.LocalJobRunner - 
OUTPUT_RECORDS_COMMITTED: 0
03:48:25.577 [main] INFO  c.m.contentpump.LocalJobRunner - 
OUTPUT_RECORDS_FAILED: 2
03:48:25.578 [main] INFO  c.m.contentpump.LocalJobRunner - Total execution 
time: 1 sec


----- END LOG FILE CONTENTS -----

Regards
Geoff



This message is for the designated recipient only and may contain privileged, 
proprietary, or otherwise private information. If you have received it in 
error, please notify the sender immediately and delete the original. Any other 
use of the email by you is prohibited.
_______________________________________________
General mailing list
[email protected]
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to