So if we make the minimum Java 11 for the next release cycle we could use UTF9 for the property files; that sounds like a good idea. -- Jody Garnett
On Mon, 15 Aug 2022 at 09:48, Rahkonen Jukka < jukka.rahko...@maanmittauslaitos.fi> wrote: > Hi, > > > > As a reminder the start of the mailing list thread when we discussed about > starting to translate with Transifex: > https://www.mail-archive.com/geoserver-users@lists.sourceforge.net/msg35112.html. > Title was “Translation workflow improvement” but all mails do not appear in > one thread so best to use also the sort by date view. Before that thread > there was another one “Geoserver 2.20.0, Encoding German Umlaut-Problems”. > Conclusion from that tile > > - The way to use ISO-8859-1 with U-codes is reliable for Java 8 and > above, but it is best to U-encode also some characters which belong to > ISO-8859-1 like “ä” and “ö” to avoid problems with some Java programs. > Weblate seems to do just that. > - UTF-8 is reliable but works only with Java 11 and above without > using some tricks. > > > > For anybody who needs to read or edit the properties file with a pure text > editor the U-coded strings are painful and error prone. Compare > “\u00e4\u00e4r\u00e4” vs. “määrä”. Somehow it would feel ideal to change > into pure UTF-8 when the support for Java 8 is dropped, but a well working > and open-for-all translation tool and automatic workflows are of course > very valuable. > > > > -Jukka Rahkonen- > > > > > > *Lähettäjä:* Jody Garnett <jody.garn...@gmail.com> > *Lähetetty:* maanantai 15. elokuuta 2022 17.10 > *Vastaanottaja:* Alexandre Gacon <alexandre.ga...@gmail.com> > *Kopio:* GeoServer <geoserver-devel@lists.sourceforge.net> > *Aihe:* Re: [Geoserver-devel] Use OSGEO weblate as translation tool - > Development consequences > > > > If you wish to choose weblate I am sure we can work with what it does > > > > On Mon, Aug 15, 2022 at 12:12 AM Alexandre Gacon < > alexandre.ga...@gmail.com> wrote: > > Hi Jody, > > > > I had a look at the git history for the UTF8 file: the file was created by > you by renaming the CN ISO file ( > https://osgeo-org.atlassian.net/browse/GEOS-10282 > <https://eur06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fosgeo-org.atlassian.net%2Fbrowse%2FGEOS-10282&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694649878839%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=%2Bvmo6iinAcc7Mp0oFJYBIekTdA5UOO%2FVE%2BP8TnII6lQ%3D&reserved=0>). > If I understand the JIRA issue correctly, this was related to the encoding > issue in the translation files (the same old story) and the idea was > perhaps to migrate all the translations to UTF-8 encoding. > > > > Weblate UI manages to display the characters in all options: > > - ISO-8859 characters in ISO-8859 encoded file > > - U encoded characters (\u00e9 for example) in ISO-8859 encoded file > > - UTF-8 file > > > > Weblate can read and write either ISO-8859 files or UTF-8 files but inside > for a given component you can have only one of the two options. For > ISO-8859 files, weblate can read either ISO-8859 characters (é) and U > encoded characters (\u00e9) but when you change a translation, it will > always be written in U encoded characters. > > > > Regards > > Alexandre > > > > Le ven. 12 août 2022 à 21:46, Jody Garnett <jody.garn...@gmail.com> a > écrit : > > You may have to search back in the geoserver-devel history or bug tracker > for details. I assume we were provided utf8 translations and wished to > preserve them? > > > > When using the UI in weblate can folks see the native characters > regardless of encoding used? > > > > What I remember is that we needed UTF8 to support some chinese characters; > but the java properties file format is ISO-8859-1 (like the actual format > on disk). > > Wicket had some naming convention where you could append utf8 to the > filename - but no other tools understand this. > > > > If I understand you correctly: > > - Weblate wants to work with UTF8 if it can? > > - Java properties file want ISO-8859-1 > > > > Is there any way to hint to weblate what encoding to use? Looks like yes > https://docs.weblate.org/no/latest/formats.html#java-properties > <https://eur06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdocs.weblate.org%2Fno%2Flatest%2Fformats.html%23java-properties&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694649878839%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=ftVP2gLjiQn9O8Pc4DOLed05qpqPn7s2MH64m1F7JkU%3D&reserved=0> > > I cannot tell from that page if you can choose the encoding of individual > property files (ie do this manually)? > > > > Following links weblate uses > http://docs.translatehouse.org/projects/translate-toolkit/en/latest/formats/properties.html > <https://eur06.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdocs.translatehouse.org%2Fprojects%2Ftranslate-toolkit%2Fen%2Flatest%2Fformats%2Fproperties.html&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694649878839%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=V7ZYl4qJXtpj5X8f9j%2FvTKgW1EWvQGy3xzXwsIsna7g%3D&reserved=0> > > > > Which uses: > > - > http://docs.translatehouse.org/projects/translate-toolkit/en/latest/commands/prop2po.html > <https://eur06.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdocs.translatehouse.org%2Fprojects%2Ftranslate-toolkit%2Fen%2Flatest%2Fcommands%2Fprop2po.html&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694650660032%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=2pl4VTdcIsBo0YSN9QGplgDa0w%2FnEZPLHi9ZRR4X4%2BY%3D&reserved=0> > > > > Reading through those docs there is an option "--personality=mozilla" > which would preserve UTF8 characters rather than force them into Latin1 > encoding. > > -- > > Jody > > > > On Fri, 12 Aug 2022 at 04:25, Alexandre Gacon <alexandre.ga...@gmail.com> > wrote: > > I try to keep only the UTF-8 file as the source for Chinese, along with > the other languages in ISO-8859-1: the display of the file content in > weblate is totally broken. I have to define another fake component > configured to support UTF-8 encoding to be able to manage it correctly. > > > > What is the reason to keep the two encoding? > > > > Alexandre > > > > Le ven. 12 août 2022 à 08:26, Alexandre Gacon <alexandre.ga...@gmail.com> > a écrit : > > Hi Jody, > > > > You are guessing well: it is all about the Chinese language : for weblate > it is the same language defined twice so it cannot cope with it. > > > > I have to check how weblate can work with having one of the languages in > UTF-8 whereas the other ones are in ISO-8859-1 : I fear that it will mean > to have two components side by side. > > > > For your last point, it seems to work well for ages: Romanian is mixing > both characters encoding whereas Japanese and Korean are totally with > unicode characters inside a ISO-8859-1 encoded file. > > > > Alexandre > > > > Le jeu. 11 août 2022 à 20:36, Jody Garnett <jody.garn...@gmail.com> a > écrit : > > Weblate is a good idea; we held back perviously because of lack of > hosting. However if OSGeo is able to host :) > > > > I do not understand about two items for the same language: can you provide > links in the github repo? Do you mean two properties files; or two entries > in the same property file. Or two entries in different property files? > > > > I am guessing you mean two property files for the chinese language; where > we followed some wicket convention for having property files in different > encodings. I think we should just use the utf8 encoding? Which is not a > java standard but wicket supports it. > > > > - > https://github.com/geoserver/geoserver/blob/main/src/web/wms/src/main/resources/GeoServerApplication.properties > <https://eur06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fgeoserver%2Fgeoserver%2Fblob%2Fmain%2Fsrc%2Fweb%2Fwms%2Fsrc%2Fmain%2Fresources%2FGeoServerApplication.properties&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694650660032%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=NCQX3UuzLr7k4dFSKBumjfA2h53%2BGtE6uWzHhjMW1Ow%3D&reserved=0> > > - > https://github.com/geoserver/geoserver/blob/main/src/web/wms/src/main/resources/GeoServerApplication_zh.properties > <https://eur06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fgeoserver%2Fgeoserver%2Fblob%2Fmain%2Fsrc%2Fweb%2Fwms%2Fsrc%2Fmain%2Fresources%2FGeoServerApplication_zh.properties&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694650660032%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=Qn7cySaRTwOI9n7wvrXXpQqOpaUwGmoG6e9xv%2FDCcJg%3D&reserved=0> > > - > https://github.com/geoserver/geoserver/blob/main/src/web/wms/src/main/resources/GeoServerApplication_zh_CN.utf8.properties > <https://eur06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fgeoserver%2Fgeoserver%2Fblob%2Fmain%2Fsrc%2Fweb%2Fwms%2Fsrc%2Fmain%2Fresources%2FGeoServerApplication_zh_CN.utf8.properties&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694650660032%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=WycwdtyR2l2EkxUa6ByE6UNnidQ6jOpOSJ61tqUW6Gw%3D&reserved=0> > > > > My understanding is we should keep the utf8 properties if weblate is > willing to understand utf8 encoding?? > > > > About replacing text with unicode; not sure how that works with java > properties being specified in a in ISO-8859-1 as part of the java api. > > -- > > Jody Garnett > > > > > > On Wed, 10 Aug 2022 at 23:11, Alexandre Gacon <alexandre.ga...@gmail.com> > wrote: > > Hi all, > > > > I am studying if we could migrate the translation tooling from Transifex > to Weblate. I have started this because with the current setup Transifex is > changing a lot of translations when I upload updates of the translation > source, making it difficult to do the synchronization between GitHub and > Transifex. > > > > Weblate is a copyleft libre software and OSGeo is hosting its own > instance, already used by several OSGeo projects (postgis, pgrouting and > grass gis at least). > > > > Thanks to Regina Obe, I have set up a GeoServer project on the OSGeo > instance to study how weblate works and if there is something which can > prevent us from using it. > > > > I have already two points to share with you to get some feedback: > > - First, when you configure a component into weblate, you cannot have > two items for the same language, even if they are in a different encoding. > As a consequence, I cannot directly integrate most of the core components > since they contain 2 files for the Chinese language: is it something which > can be changed? Which one is used by GeoServer? > - Second, when you change the translation of a text in weblate, it > automatically replaces special characters by their equivalent in unicode, > even if the character exists in the ISO-8859-1 encoding. For example: > > org.geoserver.security.GeoServerAuthenticationKeyFilter.name > <https://eur06.safelinks.protection.outlook.com/?url=http%3A%2F%2Forg.geoserver.security.geoserverauthenticationkeyfilter.name%2F&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694650660032%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=SDUCKi1ZyvcTHoRmlKYGZOFnxEGVWCar3Y6F3oEv%2Bw8%3D&reserved=0>=Clé > d'authentification > > is replaced by > > org.geoserver.security.GeoServerAuthenticationKeyFilter.name > <https://eur06.safelinks.protection.outlook.com/?url=http%3A%2F%2Forg.geoserver.security.geoserverauthenticationkeyfilter.name%2F&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694650660032%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=SDUCKi1ZyvcTHoRmlKYGZOFnxEGVWCar3Y6F3oEv%2Bw8%3D&reserved=0>=Cl\u00E9 > d'authentification > > > > (my own change in the translation was to add a space at the end of the > string, to match the original layout of the source string) > > > > From a technical point of view, it does not break anything but it would > make it more difficult to work on a translation without using weblate. > > > > Do you see any problems around these two points? Anything else to check? > > > > -- > > Alexandre Gacon > > _______________________________________________ > Geoserver-devel mailing list > Geoserver-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/geoserver-devel > <https://eur06.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.sourceforge.net%2Flists%2Flistinfo%2Fgeoserver-devel&data=05%7C01%7Cjukka.rahkonen%40maanmittauslaitos.fi%7Cb309015dd8bb4ab101e008da7ec7fb16%7Cc4f8a63255804a1c92371d5a571b71fa%7C0%7C0%7C637961694650660032%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=OkKXIu6NZm%2FoK900xmcqxJ1hTGeVzRJkR2DZVL4mzwY%3D&reserved=0> > > > > -- > > Alexandre Gacon > > > > -- > > Alexandre Gacon > > > > > -- > > Alexandre Gacon > > -- > > -- > > Jody Garnett >
_______________________________________________ Geoserver-devel mailing list Geoserver-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/geoserver-devel