Here is a simple fix to the jsp page from the previous post:

BufferedReader in = new BufferedReader(new
InputStreamReader(content,"ISO-8859-1"));


Thanks Yohann!

On Mon, Jun 8, 2009 at 10:04 PM, Alain de Raynal <aderay...@gmail.com>wrote:

> Below is a short jsp page that illustrates the problem I'm facing
> the result on my local dev environment is Adrián Beltré
> the result on the gae servers is Adri��n Beltr��
>
> can somebody explain the difference ?
> is there a way to decode these string to get the accents back ?
>
> <%@ page contentType="text/html;charset=UTF-8" language="java" %>
> <%@ page import="java.io.BufferedReader" %>
> <%@ page import="java.io.InputStream" %>
> <%@ page import="java.io.InputStreamReader" %>
> <%@ page import="java.net.URL" %>
> <html>
> <body>
> <%
> String urlString = "http://en.wikipedia.org/wiki/Adri%C3%A1n_Beltr%C3%A9";;
> InputStream content = new URL(urlString).openStream();
> BufferedReader in = new BufferedReader(new InputStreamReader(content));
> String line;
> while ((line = in.readLine()) != null) {
>  if (line.contains("<h1 id=\"firstHeading\"")){
> out.println(line);
>  }
> }
> %>
> </body>
> </html>
>
> On Fri, Jun 5, 2009 at 9:41 PM, Alain de Raynal <aderay...@gmail.com>wrote:
>
>> Hello again,
>> one week later, the problem is slightly different:
>> When I tried the same operation from the gae servers, urlFetch returns
>>
>> Adri�n B�ltre
>>
>> instead of the expected "Adrián Béltre"
>> The accentuated characters are no longer recognized...
>>
>> Does anybody know what has changed in the past week ?
>>
>> Thanks in advance,
>> Alain
>>
>>
>>
>>
>> On Sun, May 31, 2009 at 2:41 PM, Alain de R <aderay...@gmail.com> wrote:
>>
>>> Hi everyone,
>>>
>>> I've noticed that when I try to retrieve the content of a web page
>>> (for instance: http://sports.yahoo.com/mlb/players/6039)
>>> the same code returns
>>>     "Adrián Béltre" from my dev/eclipse environment
>>> and
>>>     "Adrian Beltre" from GAE servers
>>>
>>> It's important for me to retrieve the content of a web page with the
>>> accentuated characters.
>>> is there a way around this ?
>>>
>>> Thanks in advance,
>>> Alain
>>
>>
>>
>

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"Google App Engine" group.
To post to this group, send email to google-appengine@googlegroups.com
To unsubscribe from this group, send email to 
google-appengine+unsubscr...@googlegroups.com
For more options, visit this group at 
http://groups.google.com/group/google-appengine?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to