Hi Graham,

Warned? About what? If the Wordnet list is open source, then what is the issue?

I understood that you want to start from _scratch_. Wordnet was sponsored by a 20-million USD grant and built by a team of highly qualified linguists, and it is one of the biggest achievements in computational linguistics. Trying to beat that requires a lot of resources, and IT resources are really not the important part.

If you want to extend Wordnet, then it's another story. Of course, it's easier to do so.

I'd recommend searching for local English Wordnets (or similar
linguistic projects), maybe there are Australian versions.

I've already done that. Those that I've seen are small operations, each run by a single enthusiast on a small backroom server that is a single point of failure.

The server is a minor issue. The major issue is how to start a team - single enthusiasts could never achieve that with no remuneration.

Trying to build a new thesaurus from scratch is simply futile.

It's a good thing Mr Roget didn't think that.

But it's not the 19th century anymore. Roget's thesaurus is really worse than Wordnet in linguistic terms. And in linguistics, you try to bootstrap and reuse the data.

If, therefore, OpenThesaurus is a bad option, the assumption I take from what you are saying is that setting up a local Wordnet is the best alternative.

It is not a bad option. I'm using OpenThesaurus myself. But if you want to reuse Wordnet, you need to convert it into OpenThesaurus, and this is a non-trivial task.

You cannot set up a local Wordnet without any software, as Wordnet is only a file. You need an editing environment. You can use some other software (there are many software packages for professional linguists, used for building national wordnets, but they could be far too complicated for an average user).

What is the step from Wordnet Database to installed Thesaurus in OOo?

Conversion of the database. Take a look at scripts at Daniel Naber's site.

But note that this conversion does not allow any direct editing of Wordnet or of the OOo thesaurus.

Where can I find someone who can exchange emails with my client's people to get them under way?

No idea. First you have to find someone who has some natural language processing background, is able to map Wordnet's relations to the MySQL database in OpenThesaurus, and can decide whether each relation is to be discarded or ported into the OpenThesaurus software. I wouldn't start a project without finding a person who is able to do that - these processes are non-trivial.
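To give a rough idea of what that mapping decision involves, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the sample synsets, the relation names, the table layout, and the use of an in-memory SQLite database as a stand-in for MySQL. It is not the real OpenThesaurus schema, and it is not what the scripts on Daniel Naber's site do.

```python
import sqlite3

# Hypothetical sample data in a Wordnet-like shape: synsets (sets of
# synonymous words) and typed relations between synsets.
SYNSETS = {
    "s1": ["car", "automobile"],
    "s2": ["vehicle"],
    "s3": ["wheel"],
}
RELATIONS = [
    ("s1", "hypernym", "s2"),
    ("s1", "part_meronym", "s3"),
]

# The linguistic decision, written down as data: each source relation is
# either ported under a target name or discarded (None). These choices
# are illustrative assumptions only.
RELATION_MAP = {
    "hypernym": "superordinate",
    "part_meronym": None,
}


def convert(synsets, relations, relation_map):
    """Load synsets into a hypothetical relational layout and port the
    relations that the map keeps. Returns (connection, discarded)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE synset (id TEXT PRIMARY KEY);
        CREATE TABLE term (synset_id TEXT, word TEXT);
        CREATE TABLE synset_link (source_id TEXT, link_type TEXT, target_id TEXT);
        """
    )
    for synset_id, words in synsets.items():
        conn.execute("INSERT INTO synset (id) VALUES (?)", (synset_id,))
        conn.executemany(
            "INSERT INTO term (synset_id, word) VALUES (?, ?)",
            [(synset_id, word) for word in words],
        )

    discarded = []
    for source, rel_type, target in relations:
        # Relation types missing from the map are treated as discarded too.
        target_type = relation_map.get(rel_type)
        if target_type is None:
            discarded.append((source, rel_type, target))
            continue
        conn.execute(
            "INSERT INTO synset_link (source_id, link_type, target_id) "
            "VALUES (?, ?, ?)",
            (source, target_type, target),
        )
    conn.commit()
    return conn, discarded


if __name__ == "__main__":
    conn, discarded = convert(SYNSETS, RELATIONS, RELATION_MAP)
    print(conn.execute("SELECT * FROM synset_link").fetchall())
    print("discarded:", discarded)
```

The code itself is the easy part. Filling in the relation map sensibly for every Wordnet relation type is where the linguistic background is needed.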

I would recommend contacting linguistics (NLP) departments at Australian universities. This is a task that would make good postgraduate work.

Regards,
Marcin
