There is a commercial MongoDB JDBC connector for Sqoop from DataDirect: https://www.progress.com/products/data-sources/mongodb
It will work with the Generic JDBC Connector to import data and supports timestamps. On May 5, 2015, at 3:30 AM, Zhou, Richard <[email protected]<mailto:[email protected]>> wrote: Hey Swati: Unfortunately, there is no MongoDB connector in Sqoop 1/2 now. We will discuss the road map in the Sqoop developer meeting @22th May, and the Key-Value DB support is in the topic list. I will keep you posted if there is any plan on it. Thanks. Regards Richard From: Swati Ambulkar -X (sambulka - PERSISTENT SYSTEMS INC at Cisco) [mailto:[email protected]] Sent: Tuesday, May 05, 2015 12:04 AM To: [email protected]<mailto:[email protected]> Subject: RE: Need info : Sqoop from MongoDB Hi: I assume , I will have to use objectId timestamp to get incremental data. Still waiting to get access to new data source. So do not know if there is any timestamp field in JSON document body yet. Did not find much info on MongoDB & Sqoop connectivity. Does sqoop work with MongoDB Hadoop Connector? If sqoop works with mongoDB Hadoop connector how should the query look like to pull incremental data? Thanks, Swati From: Bradley Wright [mailto:[email protected]] Sent: Friday, May 01, 2015 11:41 AM To: [email protected]<mailto:[email protected]> Subject: Re: Need info : Sqoop from MongoDB Do you mean the document objectId timestamp or a timestamp field in the JSON document body? From: "Swati Ambulkar -X (sambulka - PERSISTENT SYSTEMS INC at Cisco)" <[email protected]<mailto:[email protected]>> Reply-To: "[email protected]<mailto:[email protected]>" <[email protected]<mailto:[email protected]>> Date: Friday, May 1, 2015 12:31 PM To: "[email protected]<mailto:[email protected]>" <[email protected]<mailto:[email protected]>> Subject: Need info : Sqoop from MongoDB Hi: I am trying to find any information on using Sqoop to import data from MongoDB. I am looking to do incremental import based on timestamp from MongoDB and want to import all document sets into Hadoop. If anyone is doing this already, any pointers on how to do it, sample code, any issues encountered will be helpful. Thanks in advance, Swati
