Dear Flexwikers,
Below is flexwiki.pm for html2wiki, now with local anchors working
(thanks to James W. Davidson's suggestion). I'll repost it (and add
it to CPAN) once images, and whatever else I find missing, are working.
Peter Schwenn
On Sat, Apr 12, 2008 at 7:02 PM, Peter Schwenn <[EMAIL PROTECTED]> wrote:
> Dear Flexwikers,
>
> Below is a Flexwiki dialect Perl module for the Perl program html2wiki
> (get that program, documentation, and other dialects at cpan.org --
> search for HTML-WikiConverter).
>
> It's not complete but is already useful. I probably still have some
> work to do on images, among other things.
>
> To use this you'll need a Perl package (e.g. ActiveState's) and you'll
> need to know how to run cpan or ppm to install html2wiki.
>
> You'll need to put the following code in a file "flexwiki.pm", which
> must be in the Perl library path (e.g.
> perl/site/lib/HTML/WikiConverter).
>
> An example of how to run it:
>
> html2wiki --dialect flexwiki somefile.html > somefile.wikitext
>
>
------------------------------------------------------------------------------
package HTML::WikiConverter::flexwiki;
use warnings;
use strict;
use base 'HTML::WikiConverter';
our $VERSION = '0.02';
=head1 NAME

HTML::WikiConverter::flexwiki - Convert HTML to flexwiki markup

=head1 SYNOPSIS

  use HTML::WikiConverter;
  my $wc = new HTML::WikiConverter( dialect => 'flexwiki' );
  print $wc->html2wiki( $html );

=head1 DESCRIPTION

This module contains rules for converting HTML into flexwiki markup. See
L<HTML::WikiConverter> for additional usage details.

=cut
sub rules {
my %rules = (
hr => { replace => "\n----\n" },
br => { replace => \&_br },
h1 => { start => '!', block => 1, trim => 'both', line_format => 'single' },
h2 => { start => '!!', block => 1, trim => 'both', line_format => 'single' },
h3 => { start => '!!!', block => 1, trim => 'both', line_format => 'single' },
h4 => { start => '!!!!', block => 1, trim => 'both', line_format => 'single' },
h5 => { start => '!!!!!', block => 1, trim => 'both', line_format => 'single' },
h6 => { start => '!!!!!!', block => 1, trim => 'both', line_format => 'single' },
blockquote => { start => \&_blockquote_start, trim => 'both', block => 1, line_format => 'multi' },
pre => { line_prefix => ' ', block => 1 },
p => { block => 1, trim => 'both', line_format => 'multi' },
b => { start => "'''", end => "'''", line_format => 'single' },
strong => { alias => 'b' },
i => { start => "''", end => "''", line_format => 'single' },
em => { alias => 'i' },
tt => { start => ' ', end => ' ', trim => 'both', line_format => 'single' },
code => { alias => 'tt' },
big => { start => "'+", end => "+'", line_format => 'single' },
small => { start => "'-", end => "-'", line_format => 'single' },
sup => { start => "'^", end => "^'", line_format => 'single' },
sub => { start => "~", end => "~", line_format => 'single' },
ins => { start => '{+', end => '+}', line_format => 'single' },
del => { start => '-', end => '-', line_format => 'single' },
ul => { line_format => 'multi', block => 1 },
ol => { alias => 'ul' },
li => { start => \&_li_start, trim => 'leading' },
dl => { alias => 'ul' },
dt => { start => \&_li_start, line_format => 'single', trim => 'both' },
dd => { start => ': ' },
a => { replace => \&_link },
img => { replace => \&_image },
table => { start => \&_table_start, block => 1 },
tr => { start => "\n||", line_format => 'single' },
td => { start => \&_td_start, end => \&_td_end, trim => 'both' },
th => { alias => 'td' }
);
return \%rules;
}
sub _br {
my( $self, $node, $rules ) = @_;
return " [[<<]] " if $node->look_up( _tag => 'table' );
return " \\\\\n";
}
sub _table_start {
my( $self, $node, $rules ) = @_;
my @attrs = qw/ border cellpadding cellspacing width bgcolor align /;
return '|| '.$self->get_attr_str( $node, @attrs );
}
sub _td_start {
my( $self, $node, $rules ) = @_;
my $prefix = $node->tag eq 'th' ? '!' : '';
my $align = $node->attr('align') || 'left';
$prefix .= ' ' if $align eq 'center' or $align eq 'right';
return $prefix;
}
sub _td_end {
my( $self, $node, $rules ) = @_;
my $colspan = $node->attr('colspan') || 1;
my $suffix = ( '||' ) x $colspan;
my $align = $node->attr('align') || 'left';
$suffix = ' '.$suffix if $align eq 'center' or $align eq 'left';
return $suffix;
}
sub _blockquote_start {
my( $self, $node, $rules ) = @_;
my @parent_bqs = $node->look_up( _tag => 'blockquote' );
my $depth = @parent_bqs;
my $start = ( '-' ) x $depth;
return "\n".$start.'>';
}
sub _li_start {
my( $self, $node, $rules ) = @_;
my @parent_lists = $node->look_up( _tag => qr/ul|ol|dl/ );
my $depth = @parent_lists;
my $prefix = " ";
my $bullet = '';
$bullet = '*' if $node->parent->tag eq 'ul';
$bullet = '#' if $node->parent->tag eq 'ol';
$bullet = ':' if $node->parent->tag eq 'dl';
my $fix = ( $prefix ) x $depth;
return "\n".$fix.$bullet;
}
sub _link {
my( $self, $node, $rules ) = @_;
return $self->_anchor($node, $rules) if $node->attr('name');
my $url = $node->attr('href') || '';
my $text = $self->get_elem_contents($node) || '';
$url =~ s/%20/ /g;
$url =~ s/[.,\+ ]/_/g; #encode (destructive)
$url =~ s/#/#A/; #anchors must start with capital letter
$url =~ s/%..//; #encode (destructive)
return $url if $text eq $url;
return "\"$text\":[$url]";
}
sub _anchor {
my( $self, $node, $rules ) = @_;
my $name = $node->attr('name') || '';
$name =~ s/[.,\+ ]/_/g; #encode (destructive)
# \n's are ineffective here; \r's are effective but may cause problems
# when regexing the underlying \n's. (pts: was "[[#$name]]")
return ":A$name: ";
}
sub _image {
my( $self, $node, $rules ) = @_;
return $node->attr('src') || '';
}
sub preprocess_node {
my( $self, $node ) = @_;
my $tag = $node->tag || '';
$self->_move_aname($node) if $tag eq 'a' and $node->attr('name');
$self->caption2para($node) if $tag eq 'caption';
if( $tag eq '~text' and $node->left and $node->left->tag and
$node->left->tag eq 'br' and !$node->look_up(_tag => 'pre') ) {
( my $text = $node->attr('text') ) =~ s/^\s+//;
$node->attr( text => $text );
}
}
sub postprocess_output{
my( $self, $outref ) = @_;
# Move the anchor reference out of the [link] brackets.
$$outref =~ s/(:\[)#A([^\]\n]*)]/$1]#A$2/g;
my $anchor= "A$2";
# Further, if [] is empty (=> local anchor), we must generate a special
# WikiTalk form to produce the anchor reference:
#   @@Presentations.Link(["#", "<anchorname>"].ToOneString, "<anchorname>")@@
# Here is the concrete s/// for it:
$$outref =~ s/"([^"]*)":\[]#(A[^\n]*)/ \@\@Presentations.Link(["#", "$2"].ToOneString, "$1")\@\@/g;
# The anchor definition must PRECEDE any <Hn>.
$$outref =~ s/\n(!+)('*)(:[^:\n]*: )([^\n]*)\n/$3\n$1$2$4/g;
}
sub _move_aname {
my( $self, $node ) = @_;
my $name = $node->attr('name') || '';
$node->attr( name => undef );
my $aname = new HTML::Element( 'a', name => $name );
$node->preinsert($aname);
# Keep 'a href's around
$node->replace_with_content->delete unless $node->attr('href');
}
=head1 AUTHOR

David J. Iberri, C<< <diberri at cpan.org> >>

=head1 BUGS

Please report any bugs or feature requests to
C<bug-html-wikiconverter-flexwiki at rt.cpan.org>, or through the web
interface at
L<http://rt.cpan.org/NoAuth/ReportBug.html?Queue=HTML-WikiConverter-flexwiki>.
I will be notified, and then you'll automatically be notified of
progress on your bug as I make changes.

=head1 SUPPORT

You can find documentation for this module with the perldoc command.

  perldoc HTML::WikiConverter::flexwiki

You can also look for information at:

=over 4

=item * AnnoCPAN: Annotated CPAN documentation

L<http://annocpan.org/dist/HTML-WikiConverter-flexwiki>

=item * CPAN Ratings

L<http://cpanratings.perl.org/d/HTML-WikiConverter-flexwiki>

=item * RT: CPAN's request tracker

L<http://rt.cpan.org/NoAuth/Bugs.html?Dist=HTML-WikiConverter-flexwiki>

=item * Search CPAN

L<http://search.cpan.org/dist/HTML-WikiConverter-flexwiki>

=back

=head1 COPYRIGHT & LICENSE

Copyright 2006 David J. Iberri, all rights reserved.

This program is free software; you can redistribute it and/or modify
it under the same terms as Perl itself.

=cut
1;
-------------------------------------------------------------------------------
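To see what the local-anchor handling in postprocess_output does without installing anything, here is a small standalone sketch using only core Perl. It applies the same three substitutions to a hand-written sample string. The helper name fix_anchors and the sample text are made up for illustration, and the @@...@@ replacement follows the form given in the module's comment, so treat it as an assumption about what the mangled line in the mail originally said.

```perl
use strict;
use warnings;

# Hypothetical helper: applies the three substitutions from
# postprocess_output to a plain string instead of a reference.
sub fix_anchors {
    my ($text) = @_;

    # 1. Move the anchor reference out of the [link] brackets:
    #    "Top":[#Atop]  becomes  "Top":[]#Atop
    $text =~ s/(:\[)#A([^\]\n]*)]/$1]#A$2/g;

    # 2. An empty [] means a local anchor, so emit the WikiTalk form.
    #    Note that (A[^\n]*) runs to the end of the line.
    $text =~ s/"([^"]*)":\[]#(A[^\n]*)/ \@\@Presentations.Link(["#", "$2"].ToOneString, "$1")\@\@/g;

    # 3. Pull an anchor definition out of a heading line and place it
    #    on its own line before the heading.
    $text =~ s/\n(!+)('*)(:[^:\n]*: )([^\n]*)\n/$3\n$1$2$4/g;

    return $text;
}

# Sample text shaped like the converter's intermediate output (assumed):
# a heading carrying an anchor definition, then a link to a local anchor.
my $sample = "\n\n!!:Aintro: Introduction\n\nSee \"Top\":[#Atop]\n";
print fix_anchors($sample);
```

Step 3 consumes one newline on each side of the heading, so it relies on headings being surrounded by blank lines, as block elements normally are in the converter's output.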