> > > Hello all ; > > Currently i have a project about 'news documents ' from streaming > > sources :the goal is to cluster and analyse these documents; unfortuntly > > i am blocked on the phase of clustering ; is there an algorithme or a > > methode that i can use for text clustering, like kmeans streaming on spark? > > I tried to calculate tfidf for text then to use kmeans with flink ; but no > > success ; any other ideas ? Thankyou all > > > > Bilel SOULI