to discover topics better
> > > >
> > > > Chirag Nagpal
> > > > Department of Computer Engineering
> > > > Army Institute of Technology, Pune
> > > >
> > > >
> > > > From: He
> > That way you will be able to discover topics better
> > >
> > > Chirag Nagpal
> > > Department of Computer Engineering
> > > Army Institute of Technology, Pune
> > >
> > >
> > > From: Hersheeta Chandankar
> >
Hi Ted,
Thank you for a quick reply.
It would be of great help if you could please explain what kind of 'linking
information between documents' I should look for.
> > Chirag Nagpal
> > Department of Computer Engineering
> > Army Institute of Technology, Pune
> >
> > ____
> > From: Hersheeta Chandankar
> > Sent: Thursday, March 26, 2015 6:25 PM
> > To: user@mahout.apache.org
> >
neering
> Army Institute of Technology, Pune
>
>
> From: Hersheeta Chandankar
> Sent: Thursday, March 26, 2015 6:25 PM
> To: user@mahout.apache.org
> Subject: Latent Semantic Analysis for Document Categorization
>
> Hi,
>
> I'
able to discover topics better
>
> Chirag Nagpal
> Department of Computer Engineering
> Army Institute of Technology, Pune
>
>
> From: Hersheeta Chandankar
> Sent: Thursday, March 26, 2015 6:25 PM
> To: user@mahout.apache.org
> Subject:
topics better
Chirag Nagpal
Department of Computer Engineering
Army Institute of Technology, Pune
From: Hersheeta Chandankar
Sent: Thursday, March 26, 2015 6:25 PM
To: user@mahout.apache.org
Subject: Latent Semantic Analysis for Document Categorization
Hi
in mahout which has given a good accuracy of 70%-75%. But I
would still like to improve the accuracy by retrieving the semantic
dependencies between words of the documents.
I've read about Latent Semantic Analysis (LSA) which creates a term-document
matrix and subjects it to mathematical
> >> >>>> >>> for the third time, in context of lsa, faster and hence
> >> perhaps
> >> >> >> better
> >> >> >> >>>> >>> alternative to lanczos is ssvd. Is there any specific
> reason
bimov <
>>> >> >> dlie...@gmail.com>
>>> >> >> >>>> wrote:
>>> >> >> >>>> >>
>>> >> >> >>>> >>> for the third time, in context of lsa, faster and hence
>>> perhaps
>>>
>> >> >>>> >>> > Hi Guys,
>> >> >> >>>> >>> >
>> >> >> >>>> >>> > Per your advice I did upgrade to Mahout 0.6 and did a bunch
>> of
>> >> API
>> >> >> >>>> >>> > changes and in the meantime realized I had a bug wit
g the below error now, in the
> context
> >> >> of some
> >> >> >>>> >>> > other Mahout algorithm there was a mention of '/tmp' vs
> >> '/_tmp'
> >> >> >>>> >>> >
>> >> >>>> >>> > SEVERE: java.util.NoSuchElementException
>> >> >>>> >>> > at
>> >> >>>> >>>
>> >> >>>>
>> >>
>> com.google.common.c
> >>>>
> >>
> org.apache.mahout.math.decomposer.lanczos.LanczosSolver.solve(LanczosSolver.java:104)
> >> >>>> >>> >at
> >> >>>> >>>
> lsa4solr.mahout_matrix$decompose_svd.invoke(mahout_matrix.clj:165)
> >> >>>>
>> >>>> >>> >
>> >>>> >>> >
>> >>>> >>> > On Mon, Feb 20, 2012 at 10:38 AM, Dmitriy Lyubimov <
>> >>>> dlie...@gmail.com>
>> >>>> >>> wrote:
>> >>
at 10:38 AM, Dmitriy Lyubimov <
>> >>>> dlie...@gmail.com>
>> >>>> >>> wrote:
>> >>>> >>> >> Peyman,
>> >>>> >>> >>
>> >>>> >>> >>
>> >>>> >>> >>
>>>> >>> wrote:
> >>>> >>> >>> Hi Dmitriy & Others,
> >>>> >>> >>>
> >>>> >>> >>> Dmitriy thanks for your previous response.
> >>>> >>> >>> I have a follow up question to my LSA
>>>> However my
>>>> >>> >>> LanczosSolver in Mahout.4 does not find any eigenvalues (there are
>>>> >>> >>> eigenvectors as you see in the follow up logs).
>>>> >>> >>> The only things I'm doing di
y field already removes the noise
>>> and
>>> >>> >>> makes the clustering work and the raw index data does not do that,
>>> am I
>>> >>> >>> correct, or are there other potential explanations? For the desired
>>> >>> >>> rank I'
>> >>> result comes out, no clusters found.
>> >>> >>> If my issue is related to not having summarization done, how can
>> that
>> >>> >>> be done in Solr? I wasn't able to find a Summary field in Solr.
>> >>> >>
Eigenvector 0 found with eigenvalue 0.0
> >>> >>> Feb 19, 2012 3:25:20 AM
> >>> >>> org.apache.mahout.math.decomposer.lanczos.LanczosSolver solve
> >>> >>> INFO: Eigenvector 1 found with eigenvalue 0.0
> >>> >>> Fe
mahout.math.decomposer.lanczos.LanczosSolver solve
>>> >>> INFO: Eigenvector 3 found with eigenvalue 0.0
>>> >>> Feb 19, 2012 3:25:20 AM
>>> >>> org.apache.mahout.math.decomposer.lanczos.LanczosSolver solve
>>> >>> INFO: Eigenvector 4 found wi
0.0
>> >>> Feb 19, 2012 3:25:20 AM
>> >>> org.apache.mahout.math.decomposer.lanczos.LanczosSolver solve
>> >>> INFO: Eigenvector 6 found with eigenvalue 0.0
>> >>> Feb 19, 2012 3:25:20 AM
>> >>> org.apache.mahout.mat
>>> org.apache.mahout.math.decomposer.lanczos.LanczosSolver solve
> >>> INFO: Eigenvector 7 found with eigenvalue 0.0
> >>> Feb 19, 2012 3:25:20 AM
> >>> org.apache.mahout.math.decomposer.lanczos.LanczosSolver solve
> >>> INFO: Eigenvector 8
eb 19, 2012 3:25:20 AM
>>> org.apache.mahout.math.decomposer.lanczos.LanczosSolver solve
>>> INFO: Eigenvector 10 found with eigenvalue 0.0
>>> Feb 19, 2012 3:25:20 AM
>>> org.apache.mahout.math.decomposer.lanczos.LanczosSolver solve
>>> INFO: LanczosSolver
osSolver solve
>> INFO: Eigenvector 10 found with eigenvalue 0.0
>> Feb 19, 2012 3:25:20 AM
>> org.apache.mahout.math.decomposer.lanczos.LanczosSolver solve
>> INFO: LanczosSolver finished.
>>
>>
>> On Sun, Jan 1, 2012 at 10:06 PM, Dmitriy Lyubimov wrote:
ommands. Nuances are understanding dictionary format and llr analysis of
>> n-grams and perhaps use a slightly better lemmatizer than the default one.
>>
>> With indexing part you are on your own at this point.
>> On Jan 1, 2012 2:28 PM, "Peyman Mohajerian" wrote
mat and llr analysis
> of
> > n-grams and perhaps use a slightly better lemmatizer than the default
> one.
> >
> > With indexing part you are on your own at this point.
> > On Jan 1, 2012 2:28 PM, "Peyman Mohajerian" wrote:
> >
> >> Hi Guy
PM, "Peyman Mohajerian" wrote:
>
>> Hi Guys,
>>
>> I'm interested in this work:
>>
>> http://www.ccri.com/blog/2010/4/2/latent-semantic-analysis-in-solr-using-clojure.html
>>
>> I looked at some of the comments and noticed that there wa
PM, "Peyman Mohajerian" wrote:
> Hi Guys,
>
> I'm interested in this work:
>
> http://www.ccri.com/blog/2010/4/2/latent-semantic-analysis-in-solr-using-clojure.html
>
> I looked at some of the comments and noticed that there was interest
> in incorporating
Hi Guys,
I'm interested in this work:
http://www.ccri.com/blog/2010/4/2/latent-semantic-analysis-in-solr-using-clojure.html
I looked at some of the comments and noticed that there was interest
in incorporating it into Mahout, back in 2010. I'm also having issues
running this c