dataframe implementations

Jay Norwood via Digitalmars-d-learn Mon, 02 Nov 2015 05:56:09 -0800

I was reading about the Julia dataframe implementation yesterday,trying to understand their decisions and how D might implement.


From my notes,
1. they are currently using a dictionary of column vectors.

2. for NA (not available) they are currently using an array ofbytes, effectively as a Boolean flag, rather than a bitVector,for performance reasons.

3. they are not currently implementing hierarchical headers.

4. they are transforming non-valid symbol header strings (readfrom csv, for example) to valid symbols by replacing '.' withunderscore and prefixing numbers with 'x', as examples. Thisallows use in expressions.5. Along with 4., they currently have @with for DataVector, toallow expressions to use, for example, :symbol_name instead ofdv[:symbol_name].6. They have operation symbols for per element operations on twovectors, for example a ./ b expresses applying the operation tothe vector.

7. They currently only have row indexes,  no row names or symbols.

I saw someone posting that they were working on DataFrameimplementation here, but haven't been able to locate any code ingithub, and was wondering what implementation decisions are beingmade here. Thanks.

dataframe implementations

Reply via email to