I am having difficulty while parsing some Turkish sites.Here is the
part of the code. The problem is when the title contains some non-UTF
characters like ç,ü,ı,ö,ğ it stops parsing and doesnt read the rest.
For example if the title is "Ebru Gündeş askere gitti" it only reads
until "ş" which is "Ebru G". Or when reading "Serdar Ortaç sünnet
oldu" it only read "Serdar Or"
How can I fix the problem? Any suggestions???

NodeList nl=docEle.getElementsByTagName("item");
                                if(nl!=null && nl.getLength()>0){
                                        for(int i=0;i<nl.getLength();i++){
                                                Element entry = 
(Element)nl.item(i);
                                                Element _title = 
(Element)entry.getElementsByTagName
("title").item(0);
                                                Element _link = 
(Element)entry.getElementsByTagName("link").item
(0);
                                                Element _date = 
(Element)entry.getElementsByTagName
("pubDate").item(0);

                                                String title = 
_title.getFirstChild().getNodeValue();
                                                String link = 
_link.getFirstChild().getNodeValue();

                                                haber yeniHaber = new 
haber(link,title);
                                                haberEkle(yeniHaber);

                                        }
                                }

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google
Groups "Android Developers" group.
To post to this group, send email to android-developers@googlegroups.com
To unsubscribe from this group, send email to
android-developers-unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/android-developers?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to