Do the Spark efforts support all Mahout Vector types, including Named and 
Property Vectors? It occurred to me that data frames in R are a related but 
more general solution. If all rows and columns of a DRM and their corresponding 
Vectors (row or column vectors) supported arbitrary properties attached to 
them, in such a way that the properties are preserved during transpose, Vector 
extraction, and any other operations where that makes sense, there would be a 
huge benefit for users.
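
To make the idea concrete, here is a rough sketch in plain Python rather than Mahout code. `LabeledMatrix` and its methods are hypothetical names invented for illustration and are not part of any Mahout or Spark API; the sketch only shows what "labels preserved during transpose and Vector extraction" could look like.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class LabeledMatrix:
    """Hypothetical stand-in for a DRM whose rows and columns carry external IDs."""
    row_ids: List[str]
    col_ids: List[str]
    values: List[List[float]]  # values[i][j] belongs to (row_ids[i], col_ids[j])

    def transpose(self) -> "LabeledMatrix":
        # Swap the label lists together with the data so IDs stay attached.
        flipped = [list(col) for col in zip(*self.values)]
        return LabeledMatrix(self.col_ids, self.row_ids, flipped)

    def row(self, row_id: str) -> Dict[str, float]:
        # Extract one row as a vector keyed by column ID, not by integer index.
        i = self.row_ids.index(row_id)
        return dict(zip(self.col_ids, self.values[i]))


# Made-up example data: two users, three items.
m = LabeledMatrix(
    row_ids=["user-a", "user-b"],
    col_ids=["item-x", "item-y", "item-z"],
    values=[[1.0, 0.0, 2.0], [0.0, 3.0, 0.0]],
)
t = m.transpose()
```

After the transpose, `t.row("item-z")` is still keyed by the original user IDs, so no separate index-to-ID dictionary has to be carried alongside the matrix.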

One of the constant problems with input to Mahout is translation of IDs: 
external IDs to Mahout IDs going in, and Mahout IDs back to external IDs coming 
out. Most of this would be unneeded if Mahout supported data frames; some would 
be avoided by supporting named or property vectors universally.
