Rural Hunter wrote:
>> Do you get the error if you try Chinese settings without nlpbamboo?
> How can I do this? The Chinese processing is provided by nlpbamboo.
Er, sorry, forget what I said. I have no experience with Chinese.
I guess you should try to ask the nlpbamboo people.
Is there anythin
于 2012/4/17 18:06, Albe Laurenz 写道:
Rural Hunter wrote:
That's probably the problem - it seems to emit something that is
not proper UTF-8 sometimes.
Do you get the error if you try Chinese settings without nlpbamboo?
How can I do this? The Chinese processing is provided by nlpbamboo.
--
Sent v
Rural Hunter wrote:
> My db is in utf-8, I have a row in my table say tmp_article and I wanted
> to generate ts_vector from the article content:
> select to_tsvector(content) from tmp_article;
> But I got this error:
> ERROR: invalid byte sequence for encoding "UTF8": 0xf481
>
Hi Albe,
Yes, I'm using a Chinese tsearch2 plugin called nlpbamboo and the
default default_text_search_config is 'chinesecfg'. I tested "select
to_tsvector('english',content) from tmp_article" and this works. So it's
the problem of nlpbamboo? The result of 'CAST (content AS bytea)' is
shown b
Please don't top post.
Rural Hunter wrote:
>>> My db is in utf-8, I have a row in my table say tmp_article and I wanted
>>> to generate ts_vector from the article content:
>>> select to_tsvector(content) from tmp_article;
>>> But I got this error:
>>> ERROR: invalid byte sequence for encoding "UT
=>SELECT convert_to(content, 'UTF8') FROM tmp_article;
This works. My pg is at latest 9.1.3 on ubuntu 10.04 server. We have
millions of data in the db but this is the only one we met the problem.
The bad data is inserted in recent days and we upgraded to 9.1.3 right
after it was released.
于 2
Rural Hunter wrote:
> My db is in utf-8, I have a row in my table say tmp_article and I
wanted
> to generate ts_vector from the article content:
> select to_tsvector(content) from tmp_article;
> But I got this error:
> ERROR: invalid byte sequence for encoding "UTF8": 0xf481
>
> I am wondering ho
still doesn't work.
db=# set client_encoding='SQL_ASCII' ;
SET
db=# select to_tsvector(content) from tmp_article;
ERROR: invalid byte sequence for encoding "UTF8": 0xf481
于 2012/4/15 1:38, raghu ram 写道:
On Sat, Apr 14, 2012 at 9:31 AM, Rur
On Sat, Apr 14, 2012 at 9:31 AM, Rural Hunter wrote:
> doesn't work either.
>
> db=# show client_encoding;
> client_encoding
> -
> UTF8
> (1 row)
>
> db=# set client_encoding='LATIN1';
> SET
> db=# show client_encoding;
> client_encoding
> -
> LATIN1
> (1 row)
doesn't work either.
db=# show client_encoding;
client_encoding
-
UTF8
(1 row)
db=# set client_encoding='LATIN1';
SET
db=# show client_encoding;
client_encoding
-
LATIN1
(1 row)
db=#
2012/4/14 Rural Hunter
> My db is in utf-8, I have a row in my table say tmp_article and I wanted
> to generate ts_vector from the article content:
> select to_tsvector(content) from tmp_article;
> But I got this error:
> ERROR: invalid byte sequence for encoding "UTF8": 0xf481
>
> I am wonderin
My db is in utf-8, I have a row in my table say tmp_article and I wanted
to generate ts_vector from the article content:
select to_tsvector(content) from tmp_article;
But I got this error:
ERROR: invalid byte sequence for encoding "UTF8": 0xf481
I am wondering how this could happen. I think if
12 matches
Mail list logo