[issue11474] url2pathname() handling of '/C|/' on Windows

2011-03-12 Thread Bastian Kleineidam

New submission from Bastian Kleineidam:

Python 2.7 on Windows converts the URL path '/C|/' to a naked drive letter 'C:'.

C:\src>c:\Python27\python.exe -c "import urllib;print urllib.url2pathname('/C|/')"
C:
C:\src>

The expected, and I believe correct, output is C:\, not C:. The reason is that 
a naked drive letter C: means "current directory on drive C:", whereas C:\ 
means "root directory of drive C:". So if you start your application in 
"C:\src", for example, the output "C:" is interpreted as "C:\src", not as 
"C:\".
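
The distinction between "C:" and "C:\" can be checked on any platform with the standard ntpath module, which implements Windows path rules. The following is only a minimal sketch; the helper name is_drive_relative is hypothetical and not part of urllib or of any proposed fix.

```python
import ntpath  # Windows path semantics, importable on every platform


def is_drive_relative(path):
    """Return True for drive-letter paths without a root, e.g. 'C:' or 'C:src'.

    Hypothetical helper for illustration; only drive-letter paths are
    considered, UNC paths are deliberately reported as False.
    """
    drive, rest = ntpath.splitdrive(path)
    has_drive_letter = len(drive) == 2 and drive[1] == ":"
    has_root = rest.startswith(("\\", "/"))
    return has_drive_letter and not has_root


for candidate in ("C:", "C:\\"):
    print(repr(candidate), "drive-relative:", is_drive_relative(candidate))
```

A path for which the helper returns True is resolved against the current directory of that drive, which is why the trailing backslash matters in the report above.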

--
components: Library (Lib)
messages: 130684
nosy: calvin
priority: normal
severity: normal
status: open
title: url2pathname() handling of '/C|/' on Windows
versions: Python 2.7

___
Python tracker 
<http://bugs.python.org/issue11474>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue11467] urlparse.urlsplit() regression for paths consisting of digits

2011-03-11 Thread Bastian Kleineidam

Bastian Kleineidam added the comment:

To make the previous comment more precise: URLs whose
scheme-specific part begins with a digit are affected.

--

___
Python tracker 
<http://bugs.python.org/issue11467>
___



[issue11467] urlparse.urlsplit() regression for paths consisting of digits

2011-03-11 Thread Bastian Kleineidam

Bastian Kleineidam added the comment:

Regarding the correctness of the Python 2.6 implementation: 
http://www.faqs.org/rfcs/rfc1738.html specifies URLs of the form 
<scheme>:<scheme-specific-part>, where the scheme-specific part is allowed to 
consist only of digits.

I agree that the example URL is not a good one; it is artificially 
constructed.

Some better examples demonstrating the same issue might be
clsid:85bbd92o-42a0-1o69-a2e4-08002b30309d
or
mailto:1...@example.org
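
To illustrate the generic split described above, here is a rough sketch that separates a scheme from its scheme-specific part without looking at what follows the colon. The function split_scheme and the regular expression are assumptions made for illustration (a scheme is taken to be a letter followed by letters, digits, "+", "-" or "."); this is not how urlparse is implemented.

```python
import re

# Assumed scheme grammar: a letter, then letters, digits, "+", "-" or ".".
_SCHEME_RE = re.compile(r"([A-Za-z][A-Za-z0-9+.-]*):(.*)", re.DOTALL)


def split_scheme(url):
    """Return (scheme, scheme_specific_part); scheme is '' if none is found."""
    match = _SCHEME_RE.fullmatch(url)
    if match is None:
        return "", url
    # Schemes are compared case-insensitively, so normalise to lower case.
    return match.group(1).lower(), match.group(2)


print(split_scheme("javascript:123"))
print(split_scheme("clsid:85bbd92o-42a0-1o69-a2e4-08002b30309d"))
```

Note that such a purely syntactic split also treats a string like "localhost:8080" as scheme plus digits, so it cannot tell a host:port pair from a URL with a numeric scheme-specific part.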

--

___
Python tracker 
<http://bugs.python.org/issue11467>
___



[issue11467] urlparse.urlsplit() regression for paths consisting of digits

2011-03-11 Thread Bastian Kleineidam

Bastian Kleineidam added the comment:

The behaviour change is caused by the fix for issue #754016.

--

___
Python tracker 
<http://bugs.python.org/issue11467>
___



[issue11467] urlparse.urlsplit() regression for paths consisting of digits

2011-03-11 Thread Bastian Kleineidam

Changes by Bastian Kleineidam:


--
versions: +Python 3.2

___
Python tracker 
<http://bugs.python.org/issue11467>
___



[issue11467] urlparse.urlsplit() regression for paths consisting of digits

2011-03-11 Thread Bastian Kleineidam

New submission from Bastian Kleineidam:

When given a javascript URL whose path consists only of digits, the urlsplit() 
function behaves differently in Python 2.7 than in 2.6:

$ python2.6 -c "import urlparse; print urlparse.urlsplit('javascript:123')"
SplitResult(scheme='javascript', netloc='', path='123', query='', fragment='')

$ python2.7 -c "import urlparse; print urlparse.urlsplit('javascript:123')"
SplitResult(scheme='', netloc='', path='javascript:123', query='', fragment='')

Python 3.2 has the same regression:
$ python3.2 -c "import urllib.parse; print(urllib.parse.urlsplit('javascript:123'))"
SplitResult(scheme='', netloc='', path='javascript:123', query='', fragment='')

I consider the Python 2.6 behaviour to be correct, i.e. the current behaviour is 
buggy.
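
A small check along the following lines can show which of the two behaviours a given Python 3 interpreter exhibits. It is only a sketch: the function name splits_off_scheme is hypothetical, and the result depends on the interpreter version, so no particular output is assumed here.

```python
from urllib.parse import urlsplit


def splits_off_scheme(url):
    """Return True if urlsplit() recognises a scheme in url (2.6-style result)."""
    return urlsplit(url).scheme != ""


url = "javascript:123"
parts = urlsplit(url)
print(parts)
if splits_off_scheme(url):
    print("scheme recognised, path is", repr(parts.path))
else:
    print("no scheme recognised, whole URL ended up in the path")
```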

--
components: Library (Lib)
messages: 130570
nosy: calvin
priority: normal
severity: normal
status: open
title: urlparse.urlsplit() regression for paths consisting of digits
type: behavior
versions: Python 2.7

___
Python tracker 
<http://bugs.python.org/issue11467>
___



[issue9721] urlparse.urljoin() cuts off last base character with semicolon at url start

2010-08-30 Thread Bastian Kleineidam

Bastian Kleineidam added the comment:

Update: the Python 2.5 behaviour is the expected and, I think, the correct 
output.

--
versions:  -Python 2.5

___
Python tracker 
<http://bugs.python.org/issue9721>
___



[issue9721] urlparse.urljoin() cuts off last base character with semicolon at url start

2010-08-30 Thread Bastian Kleineidam

New submission from Bastian Kleineidam:

The urljoin() implementation cuts off the last character of the base URL
if the URL to join starts with a semicolon.
The expected output has no characters cut off.

$ python2.6
Python 2.6.6 (r266:84292, Aug 29 2010, 12:36:23) 
[GCC 4.4.5 20100824 (prerelease)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import urlparse
>>> print urlparse.urljoin('http://localhost:8080/feedback', ';jsessionid=XXX')
http://localhost:8080/feedbac;jsessionid=XXX
>>> 

... same in Python 3.1.2:

$ python3.1
Python 3.1.2 (release31-maint, Aug 29 2010, 18:45:17) 
[GCC 4.4.5 20100824 (prerelease)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import urllib.parse
>>> urllib.parse.urljoin('http://localhost:8080/feedback', ';jsessionid=XXX')
'http://localhost:8080/feedbac;jsessionid=XXX'
>>>

... in Python 2.5 the last path segment is cut off.
$ python2.5
Python 2.5.5 (r255:77872, Aug 23 2010, 02:55:15) 
[GCC 4.4.5 20100816 (prerelease)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import urlparse
>>> print urlparse.urljoin('http://localhost:8080/feedback', ';jsessionid=XXX')
http://localhost:8080/;jsessionid=XXX
>>>
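
The three outcomes seen in the sessions above (last segment kept, last segment dropped, last character truncated) can be told apart with a short script. This is a sketch for Python 3 only; classify_join and its labels are hypothetical names chosen for illustration, and the label printed depends on the interpreter in use.

```python
from urllib.parse import urljoin

BASE = "http://localhost:8080/feedback"
REF = ";jsessionid=XXX"


def classify_join(base, ref):
    """Label how urljoin() treated the last path segment of base."""
    joined = urljoin(base, ref)
    parent = base.rsplit("/", 1)[0] + "/"  # base without its last segment
    if joined == base + ref:
        return "kept"        # .../feedback;jsessionid=XXX
    if joined == parent + ref:
        return "dropped"     # .../;jsessionid=XXX (the Python 2.5 result)
    if joined == base[:-1] + ref:
        return "truncated"   # .../feedbac;jsessionid=XXX (the reported result)
    return "other"


print(urljoin(BASE, REF))
print(classify_join(BASE, REF))
```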

--
components: Library (Lib)
messages: 115252
nosy: calvin
priority: normal
severity: normal
status: open
title: urlparse.urljoin() cuts off last base character with semicolon at url start
type: behavior
versions: Python 2.5, Python 2.6, Python 3.1

___
Python tracker 
<http://bugs.python.org/issue9721>
___