Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-19 Thread Albe Laurenz
Rural Hunter wrote: >> Do you get the error if you try Chinese settings without nlpbamboo? > How can I do this? The Chinese processing is provided by nlpbamboo. Er, sorry, forget what I said. I have no experience with Chinese. I guess you should try to ask the nlpbamboo people. Is there anythin

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-18 Thread Rural Hunter
On 2012/4/17 18:06, Albe Laurenz wrote: Rural Hunter wrote: That's probably the problem - it seems to emit something that is not proper UTF-8 sometimes. Do you get the error if you try Chinese settings without nlpbamboo? How can I do this? The Chinese processing is provided by nlpbamboo. -- Sent v

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-17 Thread Albe Laurenz
Rural Hunter wrote: > My db is in utf-8, I have a row in my table say tmp_article and I wanted > to generate ts_vector from the article content: > select to_tsvector(content) from tmp_article; > But I got this error: > ERROR: invalid byte sequence for encoding "UTF8": 0xf481 >

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-16 Thread Rural Hunter
Hi Albe, Yes, I'm using a Chinese tsearch2 plugin called nlpbamboo and the default default_text_search_config is 'chinesecfg'. I tested "select to_tsvector('english',content) from tmp_article" and this works. So it's the problem of nlpbamboo? The result of 'CAST (content AS bytea)' is shown b

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-16 Thread Albe Laurenz
Please don't top post. Rural Hunter wrote: >>> My db is in utf-8, I have a row in my table say tmp_article and I wanted >>> to generate ts_vector from the article content: >>> select to_tsvector(content) from tmp_article; >>> But I got this error: >>> ERROR: invalid byte sequence for encoding "UT

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-16 Thread Rural Hunter
=>SELECT convert_to(content, 'UTF8') FROM tmp_article; This works. My pg is at latest 9.1.3 on ubuntu 10.04 server. We have millions of data in the db but this is the only one we met the problem. The bad data is inserted in recent days and we upgraded to 9.1.3 right after it was released. On 2

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-16 Thread Albe Laurenz
Rural Hunter wrote: > My db is in utf-8, I have a row in my table say tmp_article and I wanted > to generate ts_vector from the article content: > select to_tsvector(content) from tmp_article; > But I got this error: > ERROR: invalid byte sequence for encoding "UTF8": 0xf481 > > I am wondering ho

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-14 Thread Rural Hunter
still doesn't work. db=# set client_encoding='SQL_ASCII' ; SET db=# select to_tsvector(content) from tmp_article; ERROR: invalid byte sequence for encoding "UTF8": 0xf481 On 2012/4/15 1:38, raghu ram wrote: On Sat, Apr 14, 2012 at 9:31 AM, Rur

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-14 Thread raghu ram
On Sat, Apr 14, 2012 at 9:31 AM, Rural Hunter wrote: > doesn't work either. > > db=# show client_encoding; > client_encoding > - > UTF8 > (1 row) > > db=# set client_encoding='LATIN1'; > SET > db=# show client_encoding; > client_encoding > - > LATIN1 > (1 row)

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-13 Thread Rural Hunter
doesn't work either. db=# show client_encoding;  client_encoding -  UTF8 (1 row) db=# set client_encoding='LATIN1'; SET db=# show client_encoding;  client_encoding -  LATIN1 (1 row) db=#

Re: [ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-13 Thread raghu ram
2012/4/14 Rural Hunter > My db is in utf-8, I have a row in my table say tmp_article and I wanted > to generate ts_vector from the article content: > select to_tsvector(content) from tmp_article; > But I got this error: > ERROR: invalid byte sequence for encoding "UTF8": 0xf481 > > I am wonderin

[ADMIN] invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

2012-04-13 Thread Rural Hunter
My db is in utf-8, I have a row in my table say tmp_article and I wanted to generate ts_vector from the article content: select to_tsvector(content) from tmp_article; But I got this error: ERROR: invalid byte sequence for encoding "UTF8": 0xf481 I am wondering how this could happen. I think if
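For readers following the thread, here is a minimal Python sketch of why a byte pair like 0xf4 0x81 gets rejected. In UTF-8, 0xF4 is the lead byte of a four-byte sequence, so 0xf4 0x81 on its own is only the first half of a character; if the following bytes are missing or are not continuation bytes, the sequence is invalid. The helper name `first_invalid_utf8` is hypothetical, and this only illustrates the encoding rule; it is not how PostgreSQL or nlpbamboo validate text, and it does not claim to reproduce what happened in this particular row.

```python
from typing import Optional, Tuple


def first_invalid_utf8(data: bytes) -> Optional[Tuple[int, str]]:
    """Return (offset, hex of offending bytes) for the first invalid
    UTF-8 sequence in data, or None if data decodes cleanly.

    Hypothetical helper for illustration only.
    """
    try:
        data.decode("utf-8", errors="strict")
    except UnicodeDecodeError as exc:
        # exc.start / exc.end delimit the bytes Python's decoder rejected.
        return exc.start, data[exc.start:exc.end].hex()
    return None


# A complete four-byte character that begins with 0xf4 0x81.
whole = "\U00101234".encode("utf-8")

# Cutting it after two bytes leaves exactly the pair from the error message.
truncated = whole[:2]

print(whole.hex())                                   # full four-byte encoding
print(first_invalid_utf8(whole))                     # valid, so None
print(first_invalid_utf8(b"abc" + truncated + b"xyz"))  # invalid at offset 3
```

One way a pair like this can arise is a text-processing step that splits a string at a byte offset rather than at a character boundary, which would fit Albe Laurenz's suggestion elsewhere in the thread that the parser sometimes emits something that is not proper UTF-8.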