Re: dataframe implementations

Jay Norwood via Digitalmars-d-learn Wed, 18 Nov 2015 10:06:18 -0800

On Wednesday, 18 November 2015 at 17:15:38 UTC, Laeeth Isharc wrote:

What do you think about the use of NaN for missing floats? In theory I could imagine wanting to distinguish between an NaN in the source file and a missing value, but in my world I never felt the need for this. For integers and bools, that is different of course.

The Julia discussions mention another dataframe implementation, I believe it was for R, where NaN was used. There was some mention of the virtues of their own choice and the problems with NaN. I think the NaN used for NA was one particular encoding of NaN. Other implementations they mentioned used some reserved value in each of the numeric data types to represent NA. In the Julia case, I believe what they use is a separate byte vector for each column that holds the NA status. They discussed some other possible enhancements, but I don't know what they implemented. For example, if the single byte holds the NA flag, the cell value can hold additional info ... maybe the reason for the NA. There was also some discussion of having the associated cell hold repeat counts for the NA status, which I suppose meant repeating it for the following cells in the column vector. I'll try to find the discussions and post the link.
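
To make the byte-vector idea concrete, here is a rough sketch of how I picture it. It is written in Python only for brevity and is not taken from the Julia code or any other library; the FloatColumn class, its method names, and the reason codes are all made up for illustration. Each column keeps its values in one vector and a one-byte NA flag per cell in a second vector, so an NaN read from the source file stays distinct from a missing value, and the otherwise unused cell of an NA entry can carry a reason code.

```python
import math
from array import array

# Hypothetical reason codes, stored in the otherwise unused cell of an NA entry.
REASON_EMPTY_FIELD = 1.0
REASON_PARSE_ERROR = 2.0


class FloatColumn:
    """Illustrative float column with a separate one-byte-per-cell NA vector."""

    def __init__(self):
        self.values = array("d")  # cell values (or a reason code when NA)
        self.na = array("B")      # 0 = value present, 1 = NA

    def append(self, value):
        # A real value, which may itself be NaN if the source file said so.
        self.values.append(value)
        self.na.append(0)

    def append_na(self, reason=0.0):
        # The NA flag lives in the byte vector, so the cell is free to hold
        # extra information such as why the value is missing.
        self.values.append(reason)
        self.na.append(1)

    def is_na(self, i):
        return self.na[i] == 1

    def present(self):
        # All cells that are not flagged NA, in order.
        return [v for v, flag in zip(self.values, self.na) if flag == 0]


col = FloatColumn()
col.append(1.5)
col.append(float("nan"))          # NaN that was really in the source file
col.append_na(REASON_PARSE_ERROR) # missing value, with a reason attached
col.append(2.5)
```

With this layout, cell 1 is an NaN but not NA, while cell 2 is NA and its value slot holds the reason code. The cost is one extra byte per cell, compared with the reserved-value or NaN-encoding approaches, which need no extra storage but give up one bit pattern in each type.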
