Hi,

I am trying to include the stop words into hadoop map reduce, and later on,
into hive.
What is the accepted solution regarding the stop words in hadoop?

All I can think is to load all the stop words into an array in the mapper,
and then check each token against the stop words..(this would be O(n^2) )

Regards

Reply via email to