Hi.
The state of the art in language modeling is at the level of simple sentences, modeling syntax using n-grams (usually trigrams) or hidden Markov models ...
Just a remark: google recently made their up-to-5-grams available through LDC http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html - lk ----- This list is sponsored by AGIRI: http://www.agiri.org/email To unsubscribe or change your options, please go to: http://v2.listbox.com/member/[EMAIL PROTECTED]