Hi,
I am currently implementing some sort of semantic indexing with Lucene. I'm
using lucene index to generate TFIDF matrix and then apply singular value
decomposition on it. So I get some kind 2D vector representation for each
document. The distance between two vectors is their relevance.
If y
stion?
-Mike
On Wed, 22 Dec 2004 17:23:24 +0100, DES <[EMAIL PROTECTED]> wrote:
hi
i need to index my text so that index contains only tokenized stemmed
words without stopwords etc. The text ist german, so I tried to use
GermanAnalyzer, but it stores whole text, not terms. Please give me a tip
h
hi
i need to index my text so that index contains only tokenized stemmed words
without stopwords etc. The text ist german, so I tried to use GermanAnalyzer,
but it stores whole text, not terms. Please give me a tip how to index terms
only. Thanks!
DES
ML? I can create
different documents, but how can i relate this index-documents to my articles
within SGML files?
thanks for help
des
Is it possible to combine Lucene and multi-dimensional scaling in some way?
I don't know. Is there a good SVD package for Java?
I think about implementation with Jama, there is a SVD class, but I don't
know yet if it's good or not
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-ma
Hi
I need some kind of implementation of SVD (singular value decomposition) or
LSI with Lucene engine. Have anyone any ideas how to create a query table
for decomposition? The table must have documents as rows and terms as
columns, if a term is presented in the docuement, the corresponding field