Re: Empty VS null array?

Regan Heath Mon, 21 Oct 2013 03:36:32 -0700

On Sat, 19 Oct 2013 10:56:02 +0100, Kagamin <s...@here.lot> wrote:

On Friday, 18 October 2013 at 10:44:11 UTC, Regan Heath wrote:
This comes up time and again. The use of, and ability to distinguishempty from null is very useful. Yes, you run the risk of things likenull pointer exceptions etc, but we have that risk now without thereward of being able to distinguish these cases.
In C# code null strings are a plague.

I code in C# every day for work and I never have any problems with nullstrings. The conflated empty/null cases are the real nightmare for me(more below).

null strings are no different to null class references, they're not aspecial case. People seem to have this odd idea that null is somehow aninvalid state for a string /reference/ (c# strings are reference types),it's not.

People also seem to elevate empty strings to some sort of special status,that's like saying 0 has some special status for int - it doesn't it'sjust one of a number of possible values.

In fact, int having no null like state is a "problem" causing solutionslike boxing to elevate the value type to a reference in order to allow anull state for int.

Yet, in D we've decided to inconsistently remove that functionality fromstring for no gain. If string could not actually be null then we'd gainsomething from the limitation, instead we lose functionality and gainnothing - you still have to check your strings for null in D.

We ought to go one way or the other, this middle ground is worse thaneither of the other options.

In my code I don't have to check for or treat empty strings anydifferently to other values. I simply have to check for null.Remembering to check for null on reference types is automatic for me,strings are not special in this regard.

Most of the time you don't need them

Sure, and if I don't have access to null (like when using a value typelike int), I can code around that lack, but it's never as straight forwarda solution.

but still must check for them just in order to not get an exception.


Sure, you must check for the possible states of a reference type.

Also business logic makes no difference between null and empty


This is simply not true.  Example at the end.

both of them are just "no data", so you end up typingif(string.IsNullOrEmpty(mystr)) every time everywhere.

I only have to code like this when I use 3rd party code which hasconflated empty and null. In my code when it's null it means notspecified, and empty is just one type of value - for which I do no specialhandling.

And, yeah, only one small feature in this big mess ever needs todifferentiate between null and empty.


Untrue, null allows many alternate and IMO more direct/obvious designs.

I found this one case trivially implementable, but nulls still plagueall remaining code.


Which one case?  The readline() one below?

Take this simple design:

  string readline();

This function would like to be able to:
 - return null for EOF
 - return [] for a blank line

but it cannot, because as soon as you write:

  foo(readline())

the null/[] case merges.

This is a horrible design. You better throw an exception on eof insteadof null:

No, no, no. You should only throw in exceptional circumstances or yourisk using exceptions for flow control, and that is just plain horrid.

this null will break the caller anyway possibly in a contrived way.

Never a contrived way, always a blatantly obvious one and only if you'renot doing your job properly. If you want a contrived, unpredictable anddifficult to debug breakage look no further than heap or stackcorruption. Null is never a difficult bug to find and fix, and is nodifferent to forgetting to handle one of the integer return values of afunction.


I use this all the time:
http://msdn.microsoft.com/en-us/library/system.io.streamreader.readline.aspx

It has never caused me any issues. It explicitly states that null is apossible output, and so I check for it - doing anything less is simply badprogramming.

It works if you read one line per loop cycle, but if you read severallines and assume they're not null (some multiline data format),

There is your problem, never "assume" - the documentation is very clear onthe issue.

you're screwed or your code becomes littered with null checks, but whoaccounts for all alternative scenarios from the start?

Me, and IMO any competent programmer. It is misguided to think you canignore valid states, null is a valid state in C, C++, C#, and D.. Youshould be thinking about and handling it.

You don't have to check for it on every access to the variable, but you doneed to check for it once where the variable is assigned, or passed (inprivate functions you can skip this). From that point onward you canassume non-null, valid, job done.

There are plenty of other such design/cases that can be imagined, andwhile you can work around them all they add complexity for zero gain.
I believe there's no problem domain, which would like to differentiatebetween null and empty string instead of treating them as "no data".


null means not specified, non existent, was not there.
empty means, present but set to empty/blank.

Databases have this distinction for a reason.

If you get input from a user a field called "foo" may be:
 - not specified
 - specified

and if specified, may be:
 - empty
 - not empty

If foo is not specified you may want to assign a default value for it, ifyour business logic is using empty to mean "not specified" you prevent theuser actually setting foo to empty and that limitation is a right pain inmany cases.

You can code around this by using a boolean a dictionary to indicate thespecified/not specified distinction, but this is less direct than simplyusing null.

If we have null, lets use it, if we want to remove null the lets removeit, but can we get out of this horrid middle ground please.


Regan

--
Using Opera's revolutionary email client: http://www.opera.com/mail/

Re: Empty VS null array?

Reply via email to