Hi,

  When trying to extract table from a HTML file, I got the warning

   Parsing of undecoded UTF-8 will give garbage when decoding entities
at ./parse-html.pl line 14.

  and no output was generated.  So I can go about doing to extract table
from a HTML file.  Please help.  Attached is the code.

Regards

Guan Boon


#!/usr/bin/perl -w -d

use HTML::TableExtract;

my $temp_file = do {
            open my $in, '<','report-recent-c-level-purchases.html'
                or die "Can't open in $!\n";
            local $/ = undef;
            <$in>;
        };
$temp_file =~ /([\w]+\.[\w\d]+)/ms;
$te = HTML::TableExtract->new( headers => [qw('Company Sector)]);

$te->parse($temp_file);

my @all_table_content = $te->tables;

print join @all_table_content,'';
foreach my $ts (@all_table_content){
  print $ts,"\n";
   }




-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>


Reply via email to