Re: how can I fix my accent issues?
Hello to all, we have found the solution to our accents problem, a colleague of mine got the idea to use xlsx instead of xls and the magic happened, thanks to all for your support best regards El mié, 13 dic 2023 a las 0:19, Adrian Klaver () escribió: > > On 12/12/23 16:09, Igniris Valdivia Baez wrote: > > Hello to all, to clarify the data is moving this way: > > 1. The data is extracted from a database in postgres using Pentaho(Kettle) > > 2. Here is there is a bifurcation some data is loaded into the destiny > > database and behaves fine the other scenario the data is saved in xls > > files to be reviewed > > How is saved to xls files? > > > 3. After the revision the data is loaded to the destiny database and > > here is were I believe the issue is, because the data is reviewed in > > Windows and somehow Pentaho is not understanding correctly the > > interaction between both operating systems. > > Defined reviewed, on particular is the data changed? > > How is transferred from xls to to the database? > > Is the data reviewed in Excel only on one machine or many? > > What the locales/encodings/character sets involved? > > > > > PD: when the hole operation is executed in Windows it never fails > > Define what you mean by whole operation done in Windows. > > > Thank you all > > > -- > Adrian Klaver > adrian.kla...@aklaver.com >
Re: how can I fix my accent issues?
Hello, How is saved to xls files? --- using pentaho there is a tool there to output data in different formats in this case xls Defined reviewed, on particular is the data changed? ---Yes, some descriptions are changed How is transferred from xls to to the database? --- Using pentaho there is a tool there to load the data in different formats in this case xls Is the data reviewed in Excel only on one machine or many? ---only in one machine What the locales/encodings/character sets involved? ---UTF 8 location spanish_Cuba.1252 Define what you mean by whole operation done in Windows.--- When the process is executed in my local machine which is in windows there are no issues, when it move to dev environment which is in linux but the xls is still reviewed in windows the load throws data with the U+FFFD � REPLACEMENT CHARACTER. best regards El mié, 13 dic 2023 a las 0:19, Adrian Klaver () escribió: > > On 12/12/23 16:09, Igniris Valdivia Baez wrote: > > Hello to all, to clarify the data is moving this way: > > 1. The data is extracted from a database in postgres using Pentaho(Kettle) > > 2. Here is there is a bifurcation some data is loaded into the destiny > > database and behaves fine the other scenario the data is saved in xls > > files to be reviewed > > How is saved to xls files? > > > 3. After the revision the data is loaded to the destiny database and > > here is were I believe the issue is, because the data is reviewed in > > Windows and somehow Pentaho is not understanding correctly the > > interaction between both operating systems. > > Defined reviewed, on particular is the data changed? > > How is transferred from xls to to the database? > > Is the data reviewed in Excel only on one machine or many? > > What the locales/encodings/character sets involved? > > > > > PD: when the hole operation is executed in Windows it never fails > > Define what you mean by whole operation done in Windows. > > > Thank you all > > > -- > Adrian Klaver > adrian.kla...@aklaver.com >
Re: how can I fix my accent issues?
Hello to all, to clarify the data is moving this way: 1. The data is extracted from a database in postgres using Pentaho(Kettle) 2. Here is there is a bifurcation some data is loaded into the destiny database and behaves fine the other scenario the data is saved in xls files to be reviewed 3. After the revision the data is loaded to the destiny database and here is were I believe the issue is, because the data is reviewed in Windows and somehow Pentaho is not understanding correctly the interaction between both operating systems. PD: when the hole operation is executed in Windows it never fails Thank you all El mar., 12 de diciembre de 2023 7:00 p. m., Adrian Klaver < adrian.kla...@aklaver.com> escribió: > On 12/12/23 15:54, Igniris Valdivia Baez wrote: > > Please use Reply All to reply to list also > Ccing list > > > PgAdmin 4 but it looks the same in the console and from postman. > > I believe that the problem is the xls that is generated from a postgres > > database opened in Windows to fulfill a review requirement and imported > > again using Pentaho, because I'm moving another data using the same > > environment and it's fine the difference is the review xls > > Huh, where did that come from? > > At no point previously have you indicated xls(Excel?) was involved. > > Provide a more detailed explanation that the route the data is taking to > get to the database. > > > > Thank you > > > > El mar., 12 de diciembre de 2023 6:04 p. m., Adrian Klaver > > mailto:adrian.kla...@aklaver.com>> escribió: > > > > On 12/12/23 12:44, Igniris Valdivia Baez wrote: > > > this is the result I got, now I have to figure it out how to > > solve it, > > > thank you so much > > > > In what client are you viewing the data? > > > > > > -- > > Adrian Klaver > > adrian.kla...@aklaver.com <mailto:adrian.kla...@aklaver.com> > > > > -- > Adrian Klaver > adrian.kla...@aklaver.com > >
Re: how can I fix my accent issues?
this is the result I got, now I have to figure it out how to solve it, thank you so much El mar, 12 dic 2023 a las 14:42, Daniel Verite () escribió: > > Igniris Valdivia Baez wrote: > > > hello, thank you for answering, it's not a typo, in the attachments > > you can see that this is actually my collation, algo a pic of the > > problem for more clarification, > > This character is meant to replace undisplayable characters: > > From https://en.wikipedia.org/wiki/Specials_(Unicode_block): > > U+FFFD � REPLACEMENT CHARACTER used to replace an unknown, > unrecognised, or unrepresentable character > > It would useful to know whether: > > - this code point U+FFFD is in the database contents in places > where accented characters should be. In this case the SQL client is > just faithfully displaying it and the problem is not on its side. > > - or whether the database contains the accented characters normally > encoded in UTF8. In this case there's a configuration mismatch on the > SQL client side when reading. > > To break down a string into code points to examine it, a query like > the following can be used, where you replace SELECT 'somefield' > with a query that selects a suspicious string from your actual table: > > WITH string(x) AS ( >SELECT 'somefield' > ) > SELECT > c, > to_hex(ascii(c)) AS codepoint > FROM > string CROSS JOIN LATERAL regexp_split_to_table(x, '') AS c > ; > > > Best regards, > -- > Daniel Vérité > https://postgresql.verite.pro/ > Twitter: @DanielVerite
Re: how can I fix my accent issues?
hello to all, thanks for your answers i've changed the encoding using this: ALTER DATABASE testdb SET client_encoding = WIN1252; now when we try to select data from a table we get this error: ERROR: character with byte sequence 0xe2 0x80 0x8b in encoding "UTF8" has no equivalent in encoding "WIN1252" SQL state: 22P05ERROR: character with byte sequence 0xe2 0x80 0x8b in encoding "UTF8" has no equivalent in encoding "WIN1252" SQL state: 22P05 i want to clarify that the postgres on dev is in a docker environment that already have databases in it so we can't change encoding for the hole container thanks in advance El sáb, 9 dic 2023 a las 1:01, Laurenz Albe () escribió: > > On Fri, 2023-12-08 at 23:58 -0500, Igniris Valdivia Baez wrote: > > hello, I have an ETL process collecting data from a postgresql > > database and xls files and inserting in a postgresql database that > > process occurs great in a local DB in postgres 14 with UTF8 > > codification and Spanish_Cuba.1952 collation but when I execute that > > process in dev which is in postgres 15 and UTF8 with collation > > en_US.utf8 the words with accents and ñ looks like an interrogation > > symbol, what can I do to fix this? > > If the data you are sending are encoded in WINDOWS-1252 (I assume that > "1952" is just a typo), you should set the client encoding to WIN1252, > so that PostgreSQL knows how to convert the data correctly. > > You can do that in several ways; the simplest might be to set the > environment variable PGCLIENTENCODING to WIN1252. > > Yours, > Laurenz Albe
Re: how can I fix my accent issues?
this is the settings for my local db which I failed to say is also in Postgres 14, the dev db is in Postgres 15.4 has UTF an en_US.utf8 collation, for the ETL process I'm using Pentaho Data Integration tool, also known as kettle, thanks in advance El sáb, 9 dic 2023 a las 10:50, Adrian Klaver () escribió: > > On 12/9/23 07:41, Igniris Valdivia Baez wrote: > > hello, thank you for answering, it's not a typo, in the attachments > > you can see that this is actually my collation, algo a pic of the > > problem for more clarification, > > thank you all > > You picture shows the database collation as Spanish_Cuba.1252 not the > Spanish_Cuba.1952 you originally indicated. > > 1) Which is the above for the production database or the dev one? > > 2) What are the exact settings for the other database? > > > > best regards > > > > El sáb, 9 dic 2023 a las 1:01, Laurenz Albe > > () escribió: > >> > >> On Fri, 2023-12-08 at 23:58 -0500, Igniris Valdivia Baez wrote: > >>> hello, I have an ETL process collecting data from a postgresql > >>> database and xls files and inserting in a postgresql database that > >>> process occurs great in a local DB in postgres 14 with UTF8 > >>> codification and Spanish_Cuba.1952 collation but when I execute that > >>> process in dev which is in postgres 15 and UTF8 with collation > >>> en_US.utf8 the words with accents and ñ looks like an interrogation > >>> symbol, what can I do to fix this? > >> > >> If the data you are sending are encoded in WINDOWS-1252 (I assume that > >> "1952" is just a typo), you should set the client encoding to WIN1252, > >> so that PostgreSQL knows how to convert the data correctly. > >> > >> You can do that in several ways; the simplest might be to set the > >> environment variable PGCLIENTENCODING to WIN1252. > >> > >> Yours, > >> Laurenz Albe > > -- > Adrian Klaver > adrian.kla...@aklaver.com >
Re: how can I fix my accent issues?
hello, thank you for answering, it's not a typo, in the attachments you can see that this is actually my collation, algo a pic of the problem for more clarification, thank you all best regards El sáb, 9 dic 2023 a las 1:01, Laurenz Albe () escribió: > > On Fri, 2023-12-08 at 23:58 -0500, Igniris Valdivia Baez wrote: > > hello, I have an ETL process collecting data from a postgresql > > database and xls files and inserting in a postgresql database that > > process occurs great in a local DB in postgres 14 with UTF8 > > codification and Spanish_Cuba.1952 collation but when I execute that > > process in dev which is in postgres 15 and UTF8 with collation > > en_US.utf8 the words with accents and ñ looks like an interrogation > > symbol, what can I do to fix this? > > If the data you are sending are encoded in WINDOWS-1252 (I assume that > "1952" is just a typo), you should set the client encoding to WIN1252, > so that PostgreSQL knows how to convert the data correctly. > > You can do that in several ways; the simplest might be to set the > environment variable PGCLIENTENCODING to WIN1252. > > Yours, > Laurenz Albe
how can I fix my accent issues?
hello, I have an ETL process collecting data from a postgresql database and xls files and inserting in a postgresql database that process occurs great in a local DB in postgres 14 with UTF8 codification and Spanish_Cuba.1952 collation but when I execute that process in dev which is in postgres 15 and UTF8 with collation en_US.utf8 the words with accents and ñ looks like an interrogation symbol, what can I do to fix this? thanks in advance