On Thu, 23 Feb 2006 13:02:32 -0500, Uri Guttman wrote:
> AY> $text =~ s{(
> AY> (\b\w+(?:['-]+\w+)*\b)
>
>why the multiple ['-] inside the words? could those chars ever begin or
>end words? so just [\w'-]+ should be fine there.
That reminds me, only earlier today I looked at the word frequency
counter code in perlfaq6.
<http://perldoc.perl.org/perlfaq6.html#How-can-I-print-out-a-word-frequency-or-line-frequency-summary%3f>
I'm a bit puzzled by the comment:
while ( /(\b[^\W_\d][\w'-]+\b)/g ) { # misses "`sheep'"
$seen{$1}++;
}
I'm wondering why they do it this way...
--
Bart.