Hi Geoff,

Great that you are getting across the MarkLogic Data Hub.
Which version of MarkLogic have you installed to run the Data Hub framework against?

Regards,
Chris Day
Sales Engineer, MarkLogic Pty Ltd

From: Geoffrey Skellams
Reply-To: MarkLogic Developer Discussion
Date: Monday, 31 July 2017 at 3:59 pm
Subject: [MarkLogic Dev General] MarkLogic Data Hub Tutorial problems

Afternoon all,

I’m brand new to using MarkLogic, and I’m working my way through the MarkLogic Data Hub tutorial on GitHub (https://marklogic-community.github.io/marklogic-data-hub/). At this point, I am simply following the instructions as written to get a feel for the process and to make sure that we can get some data into a database correctly.

I’m running MarkLogic on a Windows Server 2016 VM in the Microsoft Azure cloud, using the current version of Java 8. Last week I tried the Data Hub v1.1.4 .war file but ran into errors with it and thought I had done something wrong. We destroyed the virtual machine, created a brand new one, and reinstalled the software, this time using the v1.1.3 .war file on advice from colleagues who had some success with it last week (but are unable to do so again today).

Everything seems to go well until I get to step 8, ingesting the Acme Tech data. The Data Hub’s log shows the import failing, and I’ve included the log file below. If I try to ingest the GlobalTech records, I get similar problems (although a lot more of them, given the larger data file). It turns out that the error messages from the v1.1.4 hub were the same as the ones below.

I’ve tried googling the problem but can’t find anything that resembles this issue. Can anyone suggest why this is occurring and what we can do to fix it? Is it simply a PEBKAC issue (i.e. both my colleague and I have somehow missed important steps), or does the tutorial documentation not match the latest versions of the hub?

----- BEGIN LOG FILE CONTENTS -----
8:22.876 [main] INFO c.m.contentpump.LocalJobRunner - Content type: JSON
03:48:24.043 [main] INFO c.marklogic.contentpump.ContentPump - Job name: local_702976126_1
03:48:24.076 [main] INFO c.m.c.FileAndDirectoryInputFormat - Total input paths to process : 2
03:48:25.552 [pool-1-thread-2] ERROR c.m.contentpump.TransformWriter - QueryException:XDMP-AS: (err:XPTY0004) $transform-option as map:map? -- Invalid coercion: "/34324.json" as map:map
03:48:25.552 [pool-1-thread-2] WARN c.m.contentpump.TransformWriter - Failed document /34324.json in file:/C:/data-hub/input/AcmeTech/34324.json
03:48:25.555 [pool-1-thread-1] ERROR c.m.contentpump.TransformWriter - QueryException:XDMP-AS: (err:XPTY0004) $transform-option as map:map? -- Invalid coercion: "/32920.json" as map:map
03:48:25.555 [pool-1-thread-1] WARN c.m.contentpump.TransformWriter - Failed document /32920.json in file:/C:/data-hub/input/AcmeTech/32920.json
03:48:25.564 [Thread-4] INFO c.m.contentpump.LocalJobRunner - completed 100%
03:48:25.575 [main] INFO c.m.contentpump.LocalJobRunner - com.marklogic.mapreduce.MarkLogicCounter:
03:48:25.576 [main] INFO c.m.contentpump.LocalJobRunner - INPUT_RECORDS: 2
03:48:25.576 [main] INFO c.m.contentpump.LocalJobRunner - OUTPUT_RECORDS: 2
03:48:25.577 [main] INFO c.m.contentpump.LocalJobRunner - OUTPUT_RECORDS_COMMITTED: 0
03:48:25.577 [main] INFO c.m.contentpump.LocalJobRunner - OUTPUT_RECORDS_FAILED: 2
03:48:25.578 [main] INFO c.m.contentpump.LocalJobRunner - Total execution time: 1 sec
----- END LOG FILE CONTENTS -----

Regards
Geoff
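
For larger runs such as the GlobalTech ingest, it can help to tally the failures in the log rather than read them line by line. The following is a minimal Python sketch, assuming the log lines look like the excerpt above. The function name summarize_mlcp_log and both regular expressions are hypothetical, written only for this illustration; they are not part of MLCP or the Data Hub.

```python
import re

# Lines copied from the log excerpt above. The patterns below are
# assumptions derived from this sample only, not a documented log format.
SAMPLE_LOG = """\
03:48:25.552 [pool-1-thread-2] WARN c.m.contentpump.TransformWriter - Failed document /34324.json in file:/C:/data-hub/input/AcmeTech/34324.json
03:48:25.555 [pool-1-thread-1] WARN c.m.contentpump.TransformWriter - Failed document /32920.json in file:/C:/data-hub/input/AcmeTech/32920.json
03:48:25.576 [main] INFO c.m.contentpump.LocalJobRunner - INPUT_RECORDS: 2
03:48:25.576 [main] INFO c.m.contentpump.LocalJobRunner - OUTPUT_RECORDS: 2
03:48:25.577 [main] INFO c.m.contentpump.LocalJobRunner - OUTPUT_RECORDS_COMMITTED: 0
03:48:25.577 [main] INFO c.m.contentpump.LocalJobRunner - OUTPUT_RECORDS_FAILED: 2
"""

# Matches counter lines such as "... - OUTPUT_RECORDS_FAILED: 2".
COUNTER_RE = re.compile(r"- ([A-Z_]+): (\d+)\s*$")
# Matches "Failed document <uri> in <source file>".
FAILED_RE = re.compile(r"Failed document (\S+) in (\S+)")


def summarize_mlcp_log(text):
    """Return (counters, failed_uris) extracted from log text."""
    counters = {}
    failed_uris = []
    for line in text.splitlines():
        failed = FAILED_RE.search(line)
        if failed:
            failed_uris.append(failed.group(1))
            continue
        counter = COUNTER_RE.search(line)
        if counter:
            counters[counter.group(1)] = int(counter.group(2))
    return counters, failed_uris


counters, failed_uris = summarize_mlcp_log(SAMPLE_LOG)
print("Counters:", counters)
print("Failed documents:", failed_uris)
```

In the excerpt, every input record fails with the same XDMP-AS error, so the summary mainly confirms that nothing was committed; it does not by itself explain the cause.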
