Fwd: Numbering RDD members Sequentially

2015-03-11 Thread Steve Lewis
-- Forwarded message --
From: Steve Lewis <lordjoe2...@gmail.com>
Date: Wed, Mar 11, 2015 at 9:13 AM
Subject: Re: Numbering RDD members Sequentially
To: Daniel, Ronald (ELS-SDG) <r.dan...@elsevier.com>
perfect - exactly what I was looking for, not quite sure why it is called

Re: Numbering RDD members Sequentially

2015-03-11 Thread Mark Hamstra
Lewis <lordjoe2...@gmail.com>
Date: Wed, Mar 11, 2015 at 9:13 AM
Subject: Re: Numbering RDD members Sequentially
To: Daniel, Ronald (ELS-SDG) <r.dan...@elsevier.com>
perfect - exactly what I was looking for, not quite sure why it is called zipWithIndex since zipping is not involved
my code

Numbering RDD members Sequentially

2015-03-10 Thread Steve Lewis
I have a Hadoop InputFormat which reads records and produces JavaPairRDD<String,String> locatedData, where _1() is a formatted version of the file location - like 12690,, 24386 .27523 ... - and _2() is the data to be processed. For historical reasons I want to convert _1() into an integer
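
For readers following the thread, here is a rough plain-Java sketch of the idea behind sequential numbering with zipWithIndex. The name makes sense if you think of it as zipping the data with the sequence 0, 1, 2, ... This is not Spark code and not the poster's code: the class, the method numberSequentially and the sample strings are all hypothetical, and nested lists stand in for partitions. In Spark itself, JavaRDD.zipWithIndex() returns a JavaPairRDD<T, Long>, so something like locatedData.values().zipWithIndex() would likely be the starting point.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;

public class SequentialNumbering {

    // Hypothetical helper (not a Spark API): pairs every element with a
    // global 0-based index, walking the "partitions" in order.
    public static <T> List<Map.Entry<T, Long>> numberSequentially(List<List<T>> partitions) {
        // Pass 1: count each partition to learn where its numbering starts.
        long[] startOffsets = new long[partitions.size()];
        long running = 0;
        for (int p = 0; p < partitions.size(); p++) {
            startOffsets[p] = running;
            running += partitions.get(p).size();
        }

        // Pass 2: index = partition start offset + position inside the partition.
        List<Map.Entry<T, Long>> numbered = new ArrayList<>();
        for (int p = 0; p < partitions.size(); p++) {
            List<T> partition = partitions.get(p);
            for (int i = 0; i < partition.size(); i++) {
                numbered.add(new SimpleEntry<>(partition.get(i), startOffsets[p] + i));
            }
        }
        return numbered;
    }

    public static void main(String[] args) {
        // Made-up records standing in for the data side of the pair RDD.
        List<List<String>> partitions = Arrays.asList(
                Arrays.asList("record-a", "record-b"),
                Arrays.asList("record-c"));

        for (Map.Entry<String, Long> entry : numberSequentially(partitions)) {
            System.out.println(entry.getValue() + " -> " + entry.getKey());
        }
    }
}
```

The two passes mirror why a distributed version needs to know every partition's size before it can hand out consecutive numbers; if gaps in the numbering are acceptable, Spark's zipWithUniqueId() avoids that extra counting step.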