At 04:55 PM 03/26/02 +0800, Stas Bekman wrote:
OK, how about this:
When indexing, strip all the HTML and replace \n with whatever swish-e can store.
You mean replace \n with some flag (e.g. '#$%$#') only in <pre> sections? \n in HTML shouldn't matter.
You suggested %0A in the previous email, but I was actually thinking of having all sentences separated with <br> rather than thrown together, no matter whether it's HTML or <pre>.
When presenting results, display:
- N "short sentences" per hit
- enclosed in <pre></pre>
- and \n restored
This should make it perfect, speed- and usability-wise, if it is doable. It will also improve the speed of highlighting, since it'll be plain text.
It won't gain any speed, since the highlighting currently only works with plain text anyway. The trade-off will be maybe better readability, but a loss of compactness.
It might be fun to try -- wouldn't be that hard...thanks to Perl.
cool ;)
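For illustration, here is a rough sketch of the index-time / display-time idea discussed above. It is written in Python rather than Perl, and it is not swish-e's API: the NEWLINE_FLAG placeholder and the function names prepare_for_index and render_excerpt are made up for this example. It assumes <br> and a few block-level end tags mark sentence breaks, that newlines only matter inside <pre>, and that the placeholder never occurs in the text itself.

```python
import html
import re
from html.parser import HTMLParser

# Hypothetical placeholder for a stored newline; assumed not to occur in the text.
NEWLINE_FLAG = "%0A"
# Assumption: closing one of these tags ends a "sentence" for display purposes.
BLOCK_TAGS = {"p", "div", "li", "h1", "h2", "h3", "h4", "tr"}


class TextExtractor(HTMLParser):
    """Strip tags, keeping line breaks from <br>, block ends, and <pre> content."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.pre_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "pre":
            self.pre_depth += 1
        elif tag == "br":
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag == "pre":
            self.pre_depth = max(0, self.pre_depth - 1)
            self.parts.append("\n")
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        if self.pre_depth:
            # Inside <pre>, newlines are significant, so keep them as they are.
            self.parts.append(data)
        else:
            # Elsewhere, a newline is just whitespace: collapse it to a space.
            self.parts.append(re.sub(r"\s+", " ", data))


def prepare_for_index(html_text):
    """Index time: strip the HTML and encode line breaks with NEWLINE_FLAG."""
    parser = TextExtractor()
    parser.feed(html_text)
    parser.close()
    lines = [line.strip() for line in "".join(parser.parts).split("\n")]
    return NEWLINE_FLAG.join(line for line in lines if line)


def render_excerpt(stored, max_lines=3):
    """Display time: restore newlines and wrap the first lines in <pre>."""
    lines = stored.split(NEWLINE_FLAG)[:max_lines]
    return "<pre>" + html.escape("\n".join(lines)) + "</pre>"


doc = "<p>First sentence.<br>Second\nsentence.</p><pre>line 1\nline 2</pre>"
stored = prepare_for_index(doc)
print(stored)
print(render_excerpt(stored, max_lines=2))
```

Note that this sketch strips leading whitespace from every line, so indentation inside <pre> is lost. A real version would also need a placeholder that cannot collide with the document text.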
__________________________________________________________________ Stas Bekman JAm_pH ------> Just Another mod_perl Hacker http://stason.org/ mod_perl Guide ---> http://perl.apache.org mailto:[EMAIL PROTECTED] http://use.perl.org http://apacheweek.com http://modperlbook.org http://apache.org http://ticketmaster.com
