Hi, I am trying to include the stop words into hadoop map reduce, and later on, into hive. What is the accepted solution regarding the stop words in hadoop?
All I can think is to load all the stop words into an array in the mapper, and then check each token against the stop words..(this would be O(n^2) ) Regards