Hi Karl, Where is the introduction of below algorithm? Thanks. "Very simple algorithmic solutions usually involve ranking top senstances by looking at distribution of terms in sentances, paragraphs and the whole document. I implemented something like this a couple of years back that worked fairly well."
2008/2/29, Karl Wettin <[EMAIL PROTECTED]>: > > [EMAIL PROTECTED] skrev: > > >> If you want something from an index it has to be IN the > >> index. So, store a > >> summary field in each document and make sure that field is part of the > >> query. > > > > And how could one create automatically such a summary? > > Taking the first 2 lines of a document makes not always much sense. > > How does google this? > > > Google don't summarize, they highlight parts that match the query. See > previous reponses. > > If you really want to summarize there are a number of more and less > scientific ways to figure out what's important and what's not. > > Very simple algorithmic solutions usually involve ranking top senstances > by looking at distribution of terms in sentances, paragraphs and the > whole document. I implemented something like this a couple of years back > that worked fairly well. > > Citeseer is a great source for papers on pretty much any IR related > subject: <http://citeseer.ist.psu.edu/cs?cs=1&q=text+summarization> > > > > karl > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > >