is statistical data extracted from web DFSG compliant?

2008-11-09 Thread Kov Chai
Hi everyone, I am working on a Chinese input method engine [1,2]. This input method engine is based on a statistical language model. We extract the language model from a 150MiB corpus collected from some chosen Chinese websites using a training algorithm. The extracted data -- the language mode

Re: is statistical data extracted from web DFSG compliant?

2008-11-10 Thread Neil Williams
On Mon, 2008-11-10 at 10:06 +0800, Kov Chai wrote:
> I am working on a Chinese input method engine [1,2]. This input method
> engine is based on a statistical language model. We extract the
> language model from a 150MiB corpus collected from some chosen
> Chinese websites using a training algor

Re: is statistical data extracted from web DFSG compliant?

2008-11-11 Thread Neil Williams
On Tue, 2008-11-11 at 00:00 +0800, Kov Chai wrote:
> Thanks a lot for your insightful analysis, Neil. =)
> But I am still confused about some problems.
I can't give a definitive answer because I'm not on the ftpmaster team, but I can try to give some idea of how I'd expect it to be handled within

Re: is statistical data extracted from web DFSG compliant?

2008-11-11 Thread Kov Chai
Thanks a lot for your insightful analysis, Neil. =) But I am still confused about some problems.
On Mon, Nov 10, 2008 at 10:34:59AM +, Neil Williams wrote:
> On Mon, 2008-11-10 at 10:06 +0800, Kov Chai wrote:
> > I am working on a Chinese input method engine [1,2]. This input method
> > engine

Re: is statistical data extracted from web DFSG compliant?

2008-11-12 Thread Kov Chai
On Tue, Nov 11, 2008 at 02:30:29PM +, Neil Williams wrote:
> Yes - the tools to generate/merge/query the binary blob should exist in
> Debian *before* or with the tools that require the binary blob (that
> package must depend on the tools to generate the blob).
>
> > I would not advise uplo