Re: document relations

2005-01-10 Thread DES
Hi, I am currently implementing some sort of semantic indexing with Lucene. I'm using lucene index to generate TFIDF matrix and then apply singular value decomposition on it. So I get some kind 2D vector representation for each document. The distance between two vectors is their relevance. If y

Re: Indexing terms only

2004-12-22 Thread DES
stion? -Mike On Wed, 22 Dec 2004 17:23:24 +0100, DES <[EMAIL PROTECTED]> wrote: hi i need to index my text so that index contains only tokenized stemmed words without stopwords etc. The text ist german, so I tried to use GermanAnalyzer, but it stores whole text, not terms. Please give me a tip h

Indexing terms only

2004-12-22 Thread DES
hi i need to index my text so that index contains only tokenized stemmed words without stopwords etc. The text ist german, so I tried to use GermanAnalyzer, but it stores whole text, not terms. Please give me a tip how to index terms only. Thanks! DES

SGML Indexing

2004-12-01 Thread DES
ML? I can create different documents, but how can i relate this index-documents to my articles within SGML files? thanks for help des

multi-dimensional scaling

2004-11-22 Thread DES
Is it possible to combine Lucene and multi-dimensional scaling in some way?

Re: Lucene and SVD

2004-11-18 Thread DES
I don't know. Is there a good SVD package for Java? I think about implementation with Jama, there is a SVD class, but I don't know yet if it's good or not - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-ma

Lucene and SVD

2004-11-17 Thread DES
Hi I need some kind of implementation of SVD (singular value decomposition) or LSI with Lucene engine. Have anyone any ideas how to create a query table for decomposition? The table must have documents as rows and terms as columns, if a term is presented in the docuement, the corresponding field