Hi, On Sat, Nov 03, 2007 at 01:41:30AM -0400, Philip Goetz wrote: > Why don't you describe what you've done in more detail, e.g., what > parser you're using, and how you hooked it up to Cyc?
I "randomly" selected the link grammer parser http://www.link.cs.cmu.edu/link/ for the parser, although there are 3 or 4 other free parsers on the net. I am currently using it in its most basic, most fundamental mode -- at this stage, it wouldn't matter which parser one picked. The NL input is converted into parse trees, which are then used to bind unbound variables, for example: "(VP tell (NP me) (PP about (NP ?THING))))" So when you say "tell me about Lincoln", ?THING becomes Lincoln. The knowlegebase is searched for statements of the form "Lincoln is-a X", and the list of possible values for X is converted to a short essay. The value of ?THING is kept in memory as the "topic of conversation", so that if the next sentence is "tell me more about that", it knows that "that==Lincoln", and says more about Lincoln. I am not using either CYC, or opencyc, but rather, just the opencyc knowledgebase. Dumped to an ascii file, its about 750MB in size. I wrote a mini-micro-sized query engine that can read in this file, and then return results for a handful of queries, such as "(CYC-QUERY (#isa #$Lincoln ?WHAT))". The above is a sum-total of about 5KLOC, which is (very) small in the grand scheme of software development. I am hacking around with this thing, in various directions: adding support for more sentence types, being more sophisticated with references such as "it is", "that which", "the previous"... I'm also trying to get it to be inquisitive about the speaker (so that it can "learn" about the speaker, and "remember" things about them.) I'm also trying to validate assertions to see if they are common sense, or not, so that "Abraham Lincoln is a rock" would be rejected as nonsense, or at least, questioned. There are many, many interesting things that can be tried, and they all require "just a little bit more time and effort". --linas ----- This list is sponsored by AGIRI: http://www.agiri.org/email To unsubscribe or change your options, please go to: http://v2.listbox.com/member/?member_id=8660244&id_secret=61301397-dfa120