>
>
>tryit_edit.asp?filename=tryhtml_basic&referer=http://www.w3schools.com/html/html_examples.asp
>  
>
>>and just this one is truncated. I think some regexp or pattern or
>>explicit list of where_not_to_break_a_string characters would solve
>>the problem. Or maybe it is already possible, but I don't know how?
>>    
>>
>
>I think that some URL encoding has not happened somewhere. Whether wget or the
>web server is at fault, I don't know, but the solution would be to URL encode
>the slashes.
>  
>
Why should there be any URL encoding?
"/" is a legal character in a URL and in the GET string.
It is used, for example, for path-to-query translation.

The main problem is that wget needs to translate a URL into a
filesystem name.

Filesystem names are path and file names, and I think wget does this
correctly.

example:

http://my.domain/dyn_page.sql/content_id/1891/session/0815

Server:

File: /dyn_page.sql
Query string: /content_id/1891/session/0815

Client:
0. dyn_page.sql/content_id/1891/session/0815    (current behavior, I think)
1. dyn_page.sql_content_id_1891_session_0815
2. 0815
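
The three client-side variants above can be sketched roughly like this.
It is only an illustration in Python, not wget's code; the function
name url_to_local_name and the style names "tree", "flat" and "last"
are made up for this example.

```python
from urllib.parse import urlsplit

def url_to_local_name(url, style):
    """Map the path of a URL to a local name (hypothetical helper, not wget code)."""
    # Drop the leading "/" so the result is relative to the download directory.
    path = urlsplit(url).path.lstrip("/")
    if style == "tree":   # variant 0: keep every "/" as a directory separator
        return path
    if style == "flat":   # variant 1: replace "/" with "_" to get a single file name
        return path.replace("/", "_")
    if style == "last":   # variant 2: keep only the last path component
        return path.rsplit("/", 1)[-1]
    raise ValueError("unknown style: " + style)

url = "http://my.domain/dyn_page.sql/content_id/1891/session/0815"
for style in ("tree", "flat", "last"):
    print(style, url_to_local_name(url, style))
```

Variant 2 shows the downside most clearly: two pages that differ only
in content_id would end up with the same local name.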

Only the author of the web page can tell you what a good translation
from a URL to a filesystem name is when the page uses a query string.
Otherwise ALL solutions have their downsides!
The only way to make more people happy is to add several translation
styles and an option to select one of them.

This is the same problem as with JavaScript URLs. But here a "solution"
for the URL naming could be defined, at least in theory:

- set up a web server
- configure a wildcard domain *.mirror
- put an error handler page in the document root
- keep a table, filled by wget, that maps each original URL to a filename
- the error handler can strip off the .mirror suffix and then return
the proper content

POSITIVE: works for all URL styles
NEGATIVE: new concept, more work, does not meet the original intention.
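
A minimal sketch of the lookup such an error handler would do, assuming
the table is a simple mapping and all original URLs use http. The names
url_to_file, MIRROR_SUFFIX and resolve, and the local file name, are
invented for this example; wget does not write such a table today.

```python
# Hypothetical table that wget would fill while mirroring:
# original URL -> local file name (the file name here is made up).
url_to_file = {
    "http://my.domain/dyn_page.sql/content_id/1891/session/0815": "files/000001.html",
}

MIRROR_SUFFIX = ".mirror"

def resolve(host, request_path):
    """Return the local file for a request to <original-host>.mirror, or None."""
    if not host.endswith(MIRROR_SUFFIX):
        return None
    # Strip the ".mirror" suffix to recover the original host name.
    original_host = host[:-len(MIRROR_SUFFIX)]
    # Assumption: the original site was fetched over plain http.
    original_url = "http://" + original_host + request_path
    return url_to_file.get(original_url)

print(resolve("my.domain.mirror", "/dyn_page.sql/content_id/1891/session/0815"))
```

Because the local file name no longer has to resemble the URL, any URL
style can be stored; the price is the extra server setup listed above.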
