Re: Empty VS null array?

Kagamin Fri, 25 Oct 2013 04:46:30 -0700

On Monday, 21 October 2013 at 10:33:01 UTC, Regan Heath wrote:

null strings are no different to null class references, they'renot a special case.

True. That's an implementation detail which has no meaning forbusiness logic. When implementation deviates from business logic,one ends up fixing the implementation details everywhere in orderto implement business logic. That's why string.IsNullOrEmpty isused.

People seem to have this odd idea that null is somehow aninvalid state for a string /reference/ (c# strings arereference types), it's not.

That's the very problem: null and empty are valid states and mustbe treated equally as "no data", but they can't for purelytechnical reasons.

People also seem to elevate empty strings to some sort ofspecial status, that's like saying 0 has some special statusfor int - it doesn't it's just one of a number of possiblevalues.
In fact, int having no null like state is a "problem" causingsolutions like boxing to elevate the value type to a referencein order to allow a null state for int.


You want to check ints for null everywhere too?

Yet, in D we've decided to inconsistently remove thatfunctionality from string for no gain. If string could notactually be null then we'd gain something from the limitation,instead we lose functionality and gain nothing - you still haveto check your strings for null in D.

Huh? Null slices work just like empty ones - that's why thistopic was started in the first place. One doesn't have to checkslices for nulls, only for length.

If you want clear nullable semantics, you have Nullable, it worksfor everything, including strings and ints. You would want thisfeature only in rare cases, so it doesn't make sense to make itdefault, or it will be a nuisance.

both of them are just "no data", so you end up typingif(string.IsNullOrEmpty(mystr)) every time everywhere.
I only have to code like this when I use 3rd party code whichhas conflated empty and null. In my code when it's null itmeans not specified, and empty is just one type of value - forwhich I do no special handling.

Equivalence between null and empty is a business logic'srequirement, that's why it's done.

And, yeah, only one small feature in this big mess ever needsto differentiate between null and empty.
Untrue, null allows many alternate and IMO more direct/obviousdesigns.

The need for those designs is rare and trivially implementablefor all value types.

I found this one case trivially implementable, but nulls stillplague all remaining code.
Which one case?  The readline() one below?

No, it was an authentication system in third-party code for onespecial case. I also had to specify this null value in app.config- guess how, explicitly specify, not substitute missing parameterwith a default.


Another possibility for readline is to return a tuple

{bool eof, string line(non-null)} - this way you have easy checkfor eof and don't have to check for null when you don't need it.

I use this all the time:
http://msdn.microsoft.com/en-us/library/system.io.streamreader.readline.aspx
It has never caused me any issues. It explicitly states thatnull is a possible output, and so I check for it - doinganything less is simply bad programming.
It works if you read one line per loop cycle, but if you readseveral lines and assume they're not null (some multiline dataformat),
There is your problem, never "assume" - the documentation isvery clear on the issue.
you're screwed or your code becomes littered with null checks,but who accounts for all alternative scenarios from the start?
Me, and IMO any competent programmer. It is misguided to thinkyou can ignore valid states, null is a valid state in C, C++,C#, and D.. You should be thinking about and handling it.

Here null is a valid state for readline, not for the caller: ifthe caller parses a multiline data format, unexpected end of fileis an invalid state.

And what do you gain by littering your code with those nullchecks? Just making runtime happy and adding noise to the code?You could use that time to improve the code or add features oreven relax. It's exactly nullable strings, which gain you only atime waste.

You don't have to check for it on every access to the variable,but you do need to check for it once where the variable isassigned, or passed (in private functions you can skip this).From that point onward you can assume non-null, valid, job done.

You just said "never assume". The assumption may fail, becausethe string type is still nullable, compiler doesn't save youhere, this sucks. And in order to check for everything everywhereon a level near that of the compiler, you must be not justcompetent, but perfect.

I believe there's no problem domain, which would like todifferentiate between null and empty string instead oftreating them as "no data".
null means not specified, non existent, was not there.
empty means, present but set to empty/blank.

Databases have this distinction for a reason.

Oracle makes no distinction between null and empty string. For areason?A database is an implementation detail of a data storage, itdoesn't implement business logic, it only provides features,which can be used with more or less success to implement businesslogic. Ever heard of advantages of OO databases over relationalones? That's an illustration of technical details, which don'tprecisely map to business logic.

If you get input from a user a field called "foo" may be:
 - not specified
 - specified

and if specified, may be:
 - empty
 - not empty

If the user doesn't fill a text box, it's both empty and notspecified - there's just no difference. And it doesn't matter howyou store it in the database - as null or as empty string - bothare presented in the same way. Heck, we use these optional textboxes everywhere - can you tell if their content is empty or notspecified?

And what if the value is required? Would you accept an emptyvalue? And if your database treats empty string as not null,would you allow to register a user with an empty login name? Andhow to express this constraint in the database? In SQL "not null"means "required value", but it's not equivalent to the businesslogic'a notion of a required value. I wouldn't be surprised ifOracle did that in order to reject empty strings in not nullfields.

Let's consider a process of specifying user's data. What textfields do we have?1. Login. No difference between null and empty - both invalid -"no data", must enter something.2. First name. No difference between null and empty - both are"no data" and are presented as empty text box.

3. Middle name. ditto.
4. Last name. ditto.
5. Country. ditto.
6. State. ditto.
7. City. ditto.
8. Address. ditto.
9. Building. ditto.
10. Flat. ditto.
11. Zip code. ditto.
12. Phone. ditto.
13. Fax. ditto.
14. E-mail. ditto.
15. Site. ditto.
16. Passport number. ditto.
17. Birth place. ditto.
18. Comment. Hell! Comment!

See? Not a single field in the list requires distinction betweennull and empty. And slices don't differentiate between them. Justas planned.

If we have null, lets use it, if we want to remove null thelets remove it, but can we get out of this horrid middle groundplease.


*sigh* people just don't buy the KISS principle...

Re: Empty VS null array?

Reply via email to