Read each file word by word and insert into a Suffix Tree...
Terminal node of each word contains the FileNo/FileName...

Quite simple

On Dec 14, 5:42 pm, Tuaa <vention.goth...@gmail.com> wrote:
> According to me, the problem is regarding fastest search of
> substrings..
> Hashing is one of the solutions.
> Use Rabin-Karp Search..
> Use wiki 
> at:http://en.wikipedia.org/wiki/Rabin%E2%80%93Karp_string_search_algorit...
>
> On Dec 14, 4:01 pm, sourabh jakhar <sourabhjak...@gmail.com> wrote:
>
> > i have a one idea in my mind is to implement a hash table structure based on
> > 26 alphabets
> > and a data structure of words.
> > struct word
> > {
> > int info;
> > char a[n];};
>
> > structure  contains the info about the word and an array in  which document
> > it is present or not out of n
> > ex if it is word is mnnit and it is in document 1 ,4,6,9
> > the info of the structure would be char[1,4,6,9]=1
> > and the rest are zero.
> > insertion of word would take o(1) time but searching would take o(n) time if
> > list is implemented as an array list
> > and if implemented as an binary search tree would take it log(n) but than
> > insertion would also take log(n) time
>
> > On Mon, Dec 13, 2010 at 9:55 PM, GOBIND KUMAR <gobind....@gmail.com> wrote:
> > > One of my friends attended google interview.This was one the question
> > > asked.
>
> > >      "you are given N documents(possibly in millions) with words in them.
> > > design datastructures such that the following scenarios take optimal time:
>
> > >         a. print all the the docs having a given word
>
> > >         b. print the docs having both of 2 given words"
>
> > > Help me in solving this problem.
>
> > >  --
> > > You received this message because you are subscribed to the Google Groups
> > > "Algorithm Geeks" group.
> > > To post to this group, send email to algoge...@googlegroups.com.
> > > To unsubscribe from this group, send email to
> > > algogeeks+unsubscr...@googlegroups.com<algogeeks%2bunsubscr...@googlegroups­.com>
> > > .
> > > For more options, visit this group at
> > >http://groups.google.com/group/algogeeks?hl=en.
>
> > --
> > SOURABH JAKHAR,(CSE)(3 year)
> > ROOM NO 167 ,
> > TILAK,HOSTEL
> > 'MNNIT ALLAHABAD- Hide quoted text -
>
> > - Show quoted text -

-- 
You received this message because you are subscribed to the Google Groups 
"Algorithm Geeks" group.
To post to this group, send email to algoge...@googlegroups.com.
To unsubscribe from this group, send email to 
algogeeks+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/algogeeks?hl=en.

Reply via email to