As you say, the main problem is that there are so many different
possible sets. Some will be proprietary, which would limit their
usefulness although there would, I believe, otherwise be no objection
to its inclusion. If you can come up with a reasonably standard set and
reasonably consistent data across several dictionaries referencing it,
I'm sure there'd be no objection to including it.
On Dec 16, 2004, at 2:19 PM, Erik Peterson wrote:
Hello,
I've found many uses for the UniHan data file the past few years.
It's a great source of information.
One potential addition that I've wanted is a field listing the
simplified Chinese radical for at least the simplified Chinese
characters, like what exists for the Xinhua Zidian ("Xinhua
Dictionary") and other mainland Chinese dictionaries. I was wondering
if this has been discussed before?
Some potential difficulties I could see include the fact that
mainland dictionaries use a variety of different radical schemes. The
most standard one that I can find is the Chinese Academy of Social
Sciences (CASS) set with 189 different radicals. Even for dictionaries
that use this set the ordering is often different. Could the radical
set also be proprietary in some way?
Anyway, I was curious. I've been working on something like this
myself that I could also contribute when it's farther along.
Regards,
Erik Peterson