I'm trying to use the ImageExtractor but it doesn't seem to work with utf8
characters correctly.

For this filename:
!!!!東京大学総合研究博物館小石川分館0001.jpg

I get
http://upload.wikimedia.org/wikipedia/commons/1/1e/!!!!古市公威像0103.JPG

But i should get:
http://upload.wikimedia.org/wikipedia/commons/d/d8/%21%21%21%21%E5%8F%A4%E5%B8%82%E5%85%AC%E5%A8%81%E5%83%8F0103.JPG


Any ideas how to get the right hash prefix(d/d8)? I tried url encoding the
file name first but that did not produce the right hash prefixes.


-- 
@tommychheng
http://tommy.chheng.com
------------------------------------------------------------------------------
The demand for IT networking professionals continues to grow, and the
demand for specialized networking skills is growing even more rapidly.
Take a complimentary Learning@Cisco Self-Assessment and learn 
about Cisco certifications, training, and career opportunities. 
http://p.sf.net/sfu/cisco-dev2dev
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to