iconv .../TRANSLIT merge ok, cand are inputul definit cum trebuie. Nu cred ca acopera nici el toate ciudateniile de caractere, mai ales ca utf8 e f f f extins. Dar caracterele romanesti le-am testat acum 2-3 ani si isi facea treaba bine.
On Fri, Jun 10, 2016 at 5:11 PM, Adrian Popa <adrian.popa...@gmail.com> wrote: > html2text. O sa verific daca pot specifica encoding, mersu! > On 10 Jun 2016 16:08, "Claudiu Nicolaie CISMARU" <clau...@virtuamagic.com> > wrote: > > > > Poate problema e că inputul nu e UTF8? > > > adrianp@frost:~/development/dictionaries$ file wiki_ro_full.txt > > > wiki_ro_full.txt: Non-ISO extended-ASCII text, with LF, NEL line > > terminators > > > > Inputul tau este un amalgam. Cum ai extras textul? > > > > > _______________________________________________ > RLUG mailing list > RLUG@lists.lug.ro > http://lists.lug.ro/mailman/listinfo/rlug > -- -------------------------------------------- ----THE END of this transmission---- _______________________________________________ RLUG mailing list RLUG@lists.lug.ro http://lists.lug.ro/mailman/listinfo/rlug