ID:               43864
 User updated by:  sq6elt at wp dot pl
 Reported By:      sq6elt at wp dot pl
 Status:           Wont fix
 Bug Type:         Performance problem
 Operating System: Linux
 PHP Version:      5.2.5
 New Comment:

>From logical point of view, these files are different.
Programmers should thread this as different files.

Besides, this is broken for hardlinks, only symlinks may be determined
by this call.

Simple test (rp.c)

#include <stdlib.h>
#include <string.h>

char x[4096];

int main(int argc, char* argv[]) {
        if (argc != 2 ) {
                printf("Provide a path %i\n", argc);
                return 0;
        }
        memset(x, 0, 4096);
        realpath(argv[1], x);
        printf("Realpath for %s is %s\n", argv[1], x);
}

$ ./rp /home/ftp/welcome.msg 
Realpath for /home/ftp/welcome.msg is /home/ftp/welcome.msg

$ mkdir /home/t
$ mount /home/ftp /home/t --bind
$ ./rp /home/t/welcome.msg 
Realpath for /home/t/welcome.msg is /home/t/welcome.msg

So, this is partial solution, and in some environment 
have big performance impacts.

If its really required to know if two files are really 
the same file, look at stat.
$ stat /home/ftp/welcome.msg 
  File: `/home/ftp/welcome.msg'
  Size: 166             Blocks: 8          IO Block: 4096   regular
file
Device: 807h/2055d      Inode: 67117468    Links: 1
[...]

$ stat /home/t/welcome.msg 
  File: `/home/t/welcome.msg'
  Size: 166             Blocks: 8          IO Block: 4096   regular
file
Device: 807h/2055d      Inode: 67117468    Links: 1
[...]

One may determine if two files are really the same by comparing
device and inode. 
This compare do a single stat on target file and not stats on each path
element, and solves hard linking.


Previous Comments:
------------------------------------------------------------------------

[2009-06-18 14:17:44] ras...@php.net

We need the realpath call to determine if there are symlinks in the
path.  If /a is a symlink to /b and you do:

require '/a/file.php';
require_once '/b/file.php';

then file.php is actually the same file and the second require_once
should do nothing, but we can only know that with a realpath call. 

In 5.3 we have replaced the system-level realpath call with our own
implementation which does intra-path caching, so this has been addressed
now, but it won't be changed in 5.2.

------------------------------------------------------------------------

[2009-06-18 09:35:34] sq6elt at wp dot pl

From: TSRM/tsrm_virtual_cwd.c

if (!realpath(path, resolved_path)) {  /* Note: Not threadsafe on older
BSD's */
  if (use_realpath == CWD_REALPATH) {
     return 1;
  }
  goto no_realpath;
}
use_realpath = CWD_REALPATH;
CWD_STATE_COPY(&old_state, state);

Manual page says:
BUGS
       Avoid using this function.  It is broken by design...

So please avoid use this function, as stated above it has significant
performance impact and as stated in manual it's simply broken.

For now I have disabled this, by undefining HAVE_REALPATH.

------------------------------------------------------------------------

[2008-01-16 17:18:27] nlgordon at iastate dot edu

I've seen this same issue, and while I agree that it is excessive
considering the realpath cache.  It appears to be a byproduct of using
realpath on pretty much all file accesses for includes.

Theoretically the realpath cache could be extended to cover directories
and the like.  I know I would like it, my servers get hit hard because
I'm serving php out of AFS space.

------------------------------------------------------------------------

[2008-01-16 08:21:25] sq6elt at wp dot pl

Description:
------------
With a lot of includes, located deep in file system,
there may be a performance impact, because of a lot of
unnecessary lstats.
Why, when there is exactly specified path, any lstats are made.
Simple access is sufficient, or I missed something?
If some of these are required, then do it only once.
I have checked php4, it behaves in the same way.
There is no difference, when i use include, include_once, require,
require_once.

Reproduce code:
---------------
Create a directory tree:
mkdir -p /tmp/a/b/c/d/e/f/g/h/i/j

Two empty php scripts:
echo '<? ?>' > /tmp/a/b/c/d/e/f/g/h/i/j/a.php
echo '<? ?>' > /tmp/a/b/c/d/e/f/g/h/i/j/b.php

and a main script /tmp/t.php

<?
  include "/tmp/a/b/c/d/e/f/g/h/i/j/a.php";
  include "/tmp/a/b/c/d/e/f/g/h/i/j/b.php";
?>




Actual result:
--------------
Do a strace on t,php
strace /usr/bin/php5 t.php 2>&1 | grep lstat64

And guess what:
lstat64("/usr", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/usr/bin", {st_mode=S_IFDIR|0755, st_size=49152, ...}) = 0
lstat64("/usr/bin/php5", {st_mode=S_IFREG|0755, st_size=5510176, ...})
= 0
lstat64("/etc", {st_mode=S_IFDIR|0755, st_size=8192, ...}) = 0
lstat64("/etc/php5", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/etc/php5/cli", {st_mode=S_IFDIR|0755, st_size=4096, ...}) =
0
lstat64("/etc/php5/cli/php.ini", {st_mode=S_IFREG|0644, st_size=44278,
...}) = 0
lstat64("/tmp", {st_mode=S_IFDIR|S_ISVTX|0777, st_size=8192, ...}) = 0
lstat64("/tmp/t.php", {st_mode=S_IFREG|0644, st_size=94, ...}) = 0
lstat64("/tmp", {st_mode=S_IFDIR|S_ISVTX|0777, st_size=8192, ...}) = 0
lstat64("/tmp/a", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/tmp/a/b", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/tmp/a/b/c", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/tmp/a/b/c/d", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/tmp/a/b/c/d/e", {st_mode=S_IFDIR|0755, st_size=4096, ...}) =
0
lstat64("/tmp/a/b/c/d/e/f", {st_mode=S_IFDIR|0755, st_size=4096, ...})
= 0
lstat64("/tmp/a/b/c/d/e/f/g", {st_mode=S_IFDIR|0755, st_size=4096,
...}) = 0
lstat64("/tmp/a/b/c/d/e/f/g/h", {st_mode=S_IFDIR|0755, st_size=4096,
...}) = 0
lstat64("/tmp/a/b/c/d/e/f/g/h/i", {st_mode=S_IFDIR|0755, st_size=4096,
...}) = 0
lstat64("/tmp/a/b/c/d/e/f/g/h/i/j", {st_mode=S_IFDIR|0755,
st_size=4096, ...}) = 0
lstat64("/tmp/a/b/c/d/e/f/g/h/i/j/a.php", {st_mode=S_IFREG|0644,
st_size=11, ...}) = 0
lstat64("/tmp", {st_mode=S_IFDIR|S_ISVTX|0777, st_size=8192, ...}) = 0
lstat64("/tmp/a", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/tmp/a/b", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/tmp/a/b/c", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/tmp/a/b/c/d", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/tmp/a/b/c/d/e", {st_mode=S_IFDIR|0755, st_size=4096, ...}) =
0
lstat64("/tmp/a/b/c/d/e/f", {st_mode=S_IFDIR|0755, st_size=4096, ...})
= 0
lstat64("/tmp/a/b/c/d/e/f/g", {st_mode=S_IFDIR|0755, st_size=4096,
...}) = 0
lstat64("/tmp/a/b/c/d/e/f/g/h", {st_mode=S_IFDIR|0755, st_size=4096,
...}) = 0
lstat64("/tmp/a/b/c/d/e/f/g/h/i", {st_mode=S_IFDIR|0755, st_size=4096,
...}) = 0
lstat64("/tmp/a/b/c/d/e/f/g/h/i/j", {st_mode=S_IFDIR|0755,
st_size=4096, ...}) = 0
lstat64("/tmp/a/b/c/d/e/f/g/h/i/j/b.php", {st_mode=S_IFREG|0644,
st_size=7, ...}) = 0



------------------------------------------------------------------------


-- 
Edit this bug report at http://bugs.php.net/?id=43864&edit=1

Reply via email to