Re: [GENERAL] Postgresql as a dictionary coder backend?

2011-01-25 Thread Cédric Villemain
2011/1/23 Attila Nagy b...@fsn.hu: Hello, I'm looking for a database backend for a dictionary coder project. It would have three major tasks: - take a text corpus, extract its words and substitute each word with a 64-bit integer (the word:integer mapping is always constant) and store the result

Re: [GENERAL] Postgresql as a dictionary coder backend?

2011-01-24 Thread Attila Nagy
On 01/24/2011 03:19 AM, Ben Chobot wrote: On Jan 23, 2011, at 3:29 AM, Attila Nagy wrote: Hello, I'm looking for a database backend for a dictionary coder project. It would have three major tasks: - take a text corpus, extract its words and substitute each word with a 64-bit integer (the

Re: [GENERAL] Postgresql as a dictionary coder backend?

2011-01-24 Thread Attila Nagy
On 01/24/2011 05:27 AM, Fredric Fredricson wrote: I can easily do this with any RDBMS, with a table of three columns: auto-incremented ID, word and refcount, with a unique index on word. The challenge could be: - that it should scale to several TB in size and several (hundred) billion of
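
The three-column table described in this message (auto-incremented ID, word, refcount, unique index on word) could be sketched roughly as below. This is only an illustration, not code from the thread: it uses Python's built-in sqlite3 module as a stand-in so it runs without a server, and the names `dictionary`, `open_dictionary` and `acquire` are made up for the example. On PostgreSQL the ID column would more likely be a bigint fed by a sequence, and the statements would need adapting.

```python
import sqlite3


def open_dictionary(path=":memory:"):
    """Open (or create) the word dictionary. SQLite is a stand-in here."""
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS dictionary (
            id       INTEGER PRIMARY KEY AUTOINCREMENT,  -- auto-incremented ID
            word     TEXT    NOT NULL UNIQUE,            -- UNIQUE creates the index on word
            refcount INTEGER NOT NULL DEFAULT 0
        )
        """
    )
    return conn


def acquire(conn, word):
    """Return the ID for `word`, inserting it if new, and bump its refcount."""
    with conn:  # one transaction: commit on success, roll back on error
        # Insert the word only if it is not already present.
        conn.execute(
            "INSERT OR IGNORE INTO dictionary (word) VALUES (?)", (word,)
        )
        conn.execute(
            "UPDATE dictionary SET refcount = refcount + 1 WHERE word = ?",
            (word,),
        )
        row = conn.execute(
            "SELECT id FROM dictionary WHERE word = ?", (word,)
        ).fetchone()
    return row[0]
```

Calling `acquire(conn, "alpha")` twice returns the same ID both times and leaves that row with a refcount of 2. The sketch says nothing about the multi-TB scale raised in the message.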

[GENERAL] Postgresql as a dictionary coder backend?

2011-01-23 Thread Attila Nagy
Hello, I'm looking for a database backend for a dictionary coder project. It would have three major tasks: - take a text corpus, extract its words and substitute each word with a 64-bit integer (the word:integer mapping is always constant) and store the result (encoding) - take the previous result and
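
As a rough illustration of the first task (encoding), the sketch below keeps the word-to-integer mapping in memory and stores the result as packed 64-bit unsigned integers. It is not from the thread: the class name `DictionaryCoder`, whitespace tokenization, little-endian packing and IDs starting at 0 are all assumptions made for the example, and a real backend would keep the mapping in the database.

```python
import struct


class DictionaryCoder:
    """In-memory word -> integer mapping; each word keeps its first-assigned ID."""

    def __init__(self):
        self._ids = {}  # word -> integer code

    def encode(self, text):
        """Replace each whitespace-separated word by its code, packed as uint64."""
        codes = []
        for word in text.split():
            code = self._ids.get(word)
            if code is None:
                # New word: assign the next free integer and remember it.
                code = len(self._ids)
                self._ids[word] = code
            codes.append(code)
        # "<" = little-endian, "Q" = unsigned 64-bit integer, one per word.
        return struct.pack(f"<{len(codes)}Q", *codes)
```

Because the mapping is never rewritten, a word that appears in two different texts is encoded to the same 8 bytes in both.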

Re: [GENERAL] Postgresql as a dictionary coder backend?

2011-01-23 Thread Ben Chobot
On Jan 23, 2011, at 3:29 AM, Attila Nagy wrote: Hello, I'm looking for a database backend for a dictionary coder project. It would have three major tasks: - take a text corpus, extract its words and substitute each word with a 64-bit integer (the word:integer mapping is always constant) and store

Re: [GENERAL] Postgresql as a dictionary coder backend?

2011-01-23 Thread Fredric Fredricson
On 01/23/2011 12:29 PM, Attila Nagy wrote: Hello, I'm looking for a database backend for a dictionary coder project. It would have three major tasks: - take a text corpus, extract its words and substitute each word with a 64-bit integer (the word:integer mapping is always constant) and store the result