Use zipWithIndex but cache the data before you run zipWithIndex...that way
your ordering will be consistent (unless the bug has been fixed where you
don't have to cache the data)...
Normally these operations are used for dictionary building and so I am
hoping you can cache the dictionary of
On Tue, Nov 18, 2014 at 9:06 AM, Debasish Das debasish.da...@gmail.com wrote:
Use zipWithIndex but cache the data before you run zipWithIndex...that way
your ordering will be consistent (unless the bug has been fixed where you
don't have to cache the data)...
Could you point some link about
On Tue, Nov 18, 2014 at 8:26 PM, Davies Liu dav...@databricks.com wrote:
On Tue, Nov 18, 2014 at 9:06 AM, Debasish Das debasish.da...@gmail.com
wrote:
Use zipWithIndex but cache the data before you run zipWithIndex...that way
your ordering will be consistent (unless the bug has been fixed
I see, thanks!
On Tue, Nov 18, 2014 at 12:12 PM, Sean Owen so...@cloudera.com wrote:
On Tue, Nov 18, 2014 at 8:26 PM, Davies Liu dav...@databricks.com wrote:
On Tue, Nov 18, 2014 at 9:06 AM, Debasish Das debasish.da...@gmail.com
wrote:
Use zipWithIndex but cache the data before you run