Re: Proposal for std.path replacement

2011-04-13 Thread Bruno Medeiros


On 07/04/2011 09:32, Lars T. Kyllingstad wrote:

On Wed, 06 Apr 2011 15:51:15 +0100, Bruno Medeiros wrote:

Thanks for the feedback, I will read it more thoroughly when I take up
work on std.path again.  Just a general comment, though:  Having the
exact same functionality on Windows and POSIX just doesn't work, if
nothing else simply because c:\dir\file is a valid base name on POSIX.
That is, both ':' and '\' are valid filename characters.  The ONLY
invalid filename characters on POSIX are '/' and '\0'.

Yes, weird file names like that may be uncommon, but the library should
be able to handle them nonetheless.

-Lars


Yeah, that's a good point. I'm not sure yet whether there is a good way to
address both issues; I want to think about it more later.
(In Eclipse's IPath this is less of a problem, because that API works
with a path data type, not with a path string directly.)


--
Bruno Medeiros - Software Engineer

Re: Proposal for std.path replacement

2011-04-07 Thread Lars T. Kyllingstad

On Wed, 06 Apr 2011 15:51:15 +0100, Bruno Medeiros wrote:

 On 03/03/2011 16:29, Lars T. Kyllingstad wrote:
 As mentioned in the "std.path.getName(): Screwy by design?" thread, I
 started working on a rewrite of std.path a long time ago, but I got
 sidetracked by other things.  The recent discussion got me working on
 it again, and it turned out there wasn't that much left to be done.

 So here it is, please comment:

  http://kyllingen.net/code/ltk/doc/path.html
  https://github.com/kyllingstad/ltk/blob/master/ltk/path.d
 
 I hope I'm not too late for the party, especially because I do have a
 bit of criticism for this one...

Not at all.  Reviews of, and further work on, std.path have been put on
hold until I have handed in my PhD thesis (which, if all goes well, 
should be very soon).  I haven't got time to participate in any extensive 
discussions on the NG right now.  So there will be ample opportunity to 
comment on the design yet. :)


 Looking at the DDoc page, this module seems to have very
 platform-dependent behavior. I find this detrimental, even unsavory. I
 think it's best that programs work with internal data structures that
 are as platform-independent as possible, and only convert to
 platform-dependent data or API at the very last possible moment, when so
 required (i.e., when interfacing with the actual OS, or with the user).
 
 So, with that in mind, there is a toCanonical function that converts to
 an OS-specific format, but there's no function to convert to an
 OS/platform-independent format?... :S
 
 Also, what does dirName("d:file") return on POSIX? Is it the same as on
 Windows? I hope so, and that such behavior is explicitly part of the API
 and not just accidental. (I don't have a Linux machine nearby to try it out
 myself.) Because, what if I want to refer to Windows paths from a POSIX
 application? (I'm sure there are scenarios where that makes sense.)
 
 Or what if I just want my application to behave in a pedantically
 platform-identical way, like having it accept backslashes as path
 separators not just on Windows but on POSIX as well? (This makes much
 more sense than is immediately obvious... in many cases it can be argued
 to be the Right Thing.)
 
 
 I'm sorry if I seem a bit agitated :P , it's just that due to some more
 or less recent traumatizing events (a long story relating to Windows 7)
 I have become a Crusader for cross-platformness.
 
 
 The other suggestion I have (mentioned by others as well) is to
 generalize the drive letter to a device symbol/string/identifier. But
 this only makes sense if this device segment works in a
 platform-independent way. This generalization might make the path module
 useful in a few new contexts. Note, I'm not saying it should handle
 URIs, in fact I want to explicitly say it should not handle URIs, as
 URIs have additional semantics (query and fragment parts, the percent
 encoding, etc.) which should not be of concern here.
 
 BTW, I admit I take some inspiration from this API:
 http://help.eclipse.org/helios/index.jsp?topic=/org.eclipse.platform.doc.isv/reference/api/org/eclipse/core/runtime/IPath.html
 Note that here there is only *one* platform dependent function, the
 aptly named toOSString() ...

Thanks for the feedback, I will read it more thoroughly when I take up 
work on std.path again.  Just a general comment, though:  Having the 
exact same functionality on Windows and POSIX just doesn't work, if 
nothing else simply because c:\dir\file is a valid base name on POSIX.  
That is, both ':' and '\' are valid filename characters.  The ONLY 
invalid filename characters on POSIX are '/' and '\0'.

Yes, weird file names like that may be uncommon, but the library should 
be able to handle them nonetheless.
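
To make the point above concrete, here is a minimal sketch in Python rather
than D, chosen only because its standard library ships both rule sets side by
side as `posixpath` and `ntpath`. It says nothing about how std.path or the
proposed module behaves; it only illustrates that the same string splits
differently under the two conventions.

```python
import ntpath      # Windows path rules, importable on any platform
import posixpath   # POSIX path rules, importable on any platform

name = r"c:\dir\file"

# Under POSIX rules, ':' and '\' are ordinary characters, so the whole
# string is a single base name with no directory part.
posix_dir, posix_base = posixpath.split(name)

# Under Windows rules, 'c:' is a drive and '\' separates components.
win_dir, win_base = ntpath.split(name)

print(repr(posix_dir), repr(posix_base))
print(repr(win_dir), repr(win_base))
```

A single set of splitting rules cannot produce both results, which is the
conflict described above.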

-Lars

Re: Proposal for std.path replacement

2011-04-07 Thread Jonathan M Davis

And on some file systems, even / is valid! Though it's not worth it to try and 
get std.path to work with files with / in the name. It's generally a very bad 
idea to create a file with a / in the name - too many programs would choke on 
it or just plain have the wrong behavior. However, there _are_ *nix file 
systems which allow for / in file names.

- Jonathan M Davis

Re: Proposal for std.path replacement

2011-04-07 Thread Lars T. Kyllingstad

On Thu, 07 Apr 2011 03:57:18 -0700, Jonathan M Davis wrote:
 
 And on some file systems, even / is valid! Though it's not worth it to
 try and get std.path to work with files with / in the name. It's
 generally a very bad idea to create a file with a / in the name - too
 many programs would choke on it or just plain have the wrong behavior.
 However, there _are_ *nix file systems which allow for / in file names.


Which filesystems are those?  The POSIX:2008 specification specifically 
states that

    The characters composing the name may be selected from
    the set of all character values excluding the slash
    character and the null byte.

where slash is defined as '/'.

http://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap03.html
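
As a rough illustration of the rule quoted above, the check below tests a
name against exactly those two exclusions. It is a sketch in Python, not D,
and `is_valid_posix_filename` is a hypothetical helper, not part of any
library; it deliberately ignores length limits, the empty name, and any
extra restrictions a particular file system might add.

```python
def is_valid_posix_filename(name: str) -> bool:
    """Return True if `name` avoids the two excluded characters."""
    # Hypothetical helper: only the slash and the null character are
    # rejected, mirroring the sentence quoted from the specification.
    return "/" not in name and "\0" not in name


for candidate in [r"c:\dir\file", "a/b", "a\0b"]:
    print(repr(candidate), is_valid_posix_filename(candidate))
```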

-Lars

Re: Proposal for std.path replacement

2011-04-07 Thread Jonathan M Davis

On 2011-04-07 04:38, Lars T. Kyllingstad wrote:
 On Thu, 07 Apr 2011 03:57:18 -0700, Jonathan M Davis wrote:
  And on some file systems, even / is valid! Though it's not worth it to
  try and get std.path to work with files with / in the name. It's
  generally a very bad idea to create a file with a / in the name - too
  many programs would choke on it or just plain have the wrong behavior.
  However, there _are_ *nix file systems which allow for / in file names.
 
 Which filesystems are those?  The POSIX:2008 specification specifically
 states that
 
     The characters composing the name may be selected from
      the set of all character values excluding the slash
      character and the null byte.
 
 where slash is defined as '/'.
 
 http://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap03.html

I didn't know that POSIX had anything to say on the matter (though it doesn't
hurt my feelings any that it effectively says that / isn't valid in file
names). However, the file systems themselves apparently don't necessarily
stick to that. If you take a look at
http://en.wikipedia.org/wiki/Comparison_of_file_systems you can see which file
systems allow which characters. For instance, the ext file systems disallow NUL
and /. However, ReiserFS, Btrfs, JFS, and XFS allow /. In fact, most of the
Linux file systems seem to allow / (though the ext family is probably the most
used, and it doesn't).

Still, POSIX or no, I would expect that using / in a file name would be just
asking for trouble, and I find no reason to support it in std.path (particularly
when we'd rely on the underlying C calls handling it appropriately, and I
expect that there's a good chance that they don't). But if POSIX disallows it,
then we definitely shouldn't. Still, the file systems themselves aren't
necessarily POSIX-related, and apparently quite a few of the *nix file systems
allow /.

- Jonathan M Davis

Re: Proposal for std.path replacement

2011-04-06 Thread Bruno Medeiros

On 03/03/2011 16:29, Lars T. Kyllingstad wrote:

As mentioned in the "std.path.getName(): Screwy by design?" thread, I
started working on a rewrite of std.path a long time ago, but I got
sidetracked by other things. The recent discussion got me working on it
again, and it turned out there wasn't that much left to be done.

So here it is, please comment:

http://kyllingen.net/code/ltk/doc/path.html
https://github.com/kyllingstad/ltk/blob/master/ltk/path.d

Features:

- Most functions work with all string types, i.e. all permutations of
mutable/const/immutable(char/wchar/dchar)[]. Notable exceptions are
toAbsolute() and toCanonical(), because they rely on std.file.getcwd(),
which returns an immutable(char)[].

- Correct behaviour in corner cases that aren't covered by the current
std.path. See the other thread for some examples, or take a look at the
unittests for a more complete picture.

- Saner naming scheme. (Still not set in stone, of course.)

-Lars

I hope I'm not too late for the party, especially because I do have a
bit of criticism for this one...
Looking at the DDoc page, this module seems to have very
platform-dependent behavior. I find this detrimental, even unsavory. I
think it's best that programs work with internal data structures that
are as platform-independent as possible, and only convert to
platform-dependent data or API at the very last possible moment, when so
required (i.e., when interfacing with the actual OS, or with the user).

So, with that in mind, there is a toCanonical function that converts to
an OS-specific format, but there's no function to convert to an
OS/platform-independent format?... :S

Also, what does dirName("d:file") return on POSIX? Is it the same as on
Windows? I hope so, and that such behavior is explicitly part of the API
and not just accidental. (I don't have a Linux machine nearby to try it out
myself) Because, what if I want to refer to Windows paths from a POSIX
application? (I'm sure there are scenarios where that makes sense)
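
For comparison only, this is how the question plays out in Python, whose
standard library exposes Windows rules (`ntpath`) and POSIX rules
(`posixpath`) as separate modules that can both be imported on any platform.
It is an analogy, not a statement about what dirName does in the proposed D
module.

```python
import ntpath
import posixpath

path = "d:file"

# Windows rules: 'd:' is a drive and 'file' is relative to it.
win_drive, win_rest = ntpath.splitdrive(path)
win_dirname = ntpath.dirname(path)

# POSIX rules: there is no drive concept and the string contains no '/',
# so there is no directory part at all.
posix_drive, posix_rest = posixpath.splitdrive(path)
posix_dirname = posixpath.dirname(path)

print(repr(win_drive), repr(win_rest), repr(win_dirname))
print(repr(posix_drive), repr(posix_rest), repr(posix_dirname))
```

Because the rule set is chosen by module rather than by the host OS, a POSIX
program can still take Windows paths apart by calling `ntpath` explicitly.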

Or what if I just want my application to behave in a pedantically
platform-identical way, like having it accept backslashes as path
separators not just on Windows but on POSIX as well? (This makes much
more sense than is immediately obvious... in many cases it can be argued
to be the Right Thing)

I'm sorry if I seem a bit agitated :P , it's just that due to some more
or less recent traumatizing events (a long story relating to Windows 7)
I have become a Crusader for cross-platformness.

The other suggestion I have (mentioned by others as well) is to
generalize the drive letter to a device symbol/string/identifier. But
this only makes sense if this device segment works in a
platform-independent way. This generalization might make the path module
useful in a few new contexts. Note, I'm not saying it should handle
URIs, in fact I want to explicitly say it should not handle URIs, as
URIs have additional semantics (query and fragment parts, the percent
encoding, etc.) which should not be of concern here.

BTW, I admit I take some inspiration from this API:
http://help.eclipse.org/helios/index.jsp?topic=/org.eclipse.platform.doc.isv/reference/api/org/eclipse/core/runtime/IPath.html
Note that here there is only *one* platform dependent function, the
aptly named toOSString() ...
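
The design argued for in this message might look roughly like the sketch
below: a path value with a device part, a list of segments, and exactly one
platform-dependent conversion. It is written in Python, not D, and
everything in it (`PortablePath`, `parse`, `to_os_string`, the
single-letter device rule) is a hypothetical illustration, not Eclipse's
IPath and not the proposed std.path.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class PortablePath:
    """Hypothetical platform-independent path value."""
    device: Optional[str]       # e.g. "c:"; None when there is no device
    absolute: bool
    segments: Tuple[str, ...]

    @staticmethod
    def parse(text: str) -> "PortablePath":
        # Assumption: '/' and '\' are both separators on every platform,
        # and one ASCII letter followed by ':' is a device.
        device = None
        if (len(text) >= 2 and text[1] == ":"
                and text[0].isascii() and text[0].isalpha()):
            device, text = text[:2], text[2:]
        text = text.replace("\\", "/")
        absolute = text.startswith("/")
        segments = tuple(s for s in text.split("/") if s)
        return PortablePath(device, absolute, segments)

    def to_os_string(self, windows: bool) -> str:
        # The only method whose result depends on the target platform.
        sep = "\\" if windows else "/"
        root = sep if self.absolute else ""
        return (self.device or "") + root + sep.join(self.segments)


p = PortablePath.parse(r"c:\dir/file")
print(p)
print(p.to_os_string(windows=True))
print(p.to_os_string(windows=False))
```

The cost of this uniformity is that a parser like this can never represent a
POSIX file whose name contains ':' or '\', because it always treats those
characters as structure.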
