Re: Was `readdir [...]' Comments on JK's script

John W . Krahn Mon, 03 Dec 2007 13:24:40 -0800

On Monday 03 December 2007 10:34, [EMAIL PROTECTED] wrote:
>
> John W.Krahn <[EMAIL PROTECTED]> writes:
> >
> > If you want to incorporate the grep into the perl program then this
> > may work (UNTESTED):
>
> It works with 1 change and one caveat,  The cavaet is that the file
> names in $ARGV must be absolute format or the program fails.


The following program does not use the $ARGV variable, nor does Perl 
use it.


> That
> has something to do with File::Finds builtin of cd ing to the source
> directory I think.

???


> Johns program; with a few questions and annotations:
> (stripping the normal reply (>) indicators:
>
>   #!/usr/bin/perl
>   use warnings;
>   use strict;
>
>   use File::Find;
>
>   ( my $myscript = $0 ) =~ s!\A.*/!!;
>
>   @ARGV == 3 or die "usage: $myscript 'REGEX' SOMEDIRECTORY
>   TARGETDIRECTORY\n";
>
>   my $regex = qr/$ARGV[0]/;
>
>   my ( $SrcDir, $TrgDir ) = @ARGV[ 1, 2 ];
>
> ## [HP I would have thought it would cause perl to still treat ARGV
> as ## file names since its value is not zeroed out by the above ##
> operations, but apparently that is not a problem
> ## ]

@ARGV is just an array, you can process its contents in whatever way 
you like.  The contents are assumed to be file names when you use the 
special <> readline operator.


>   my ( $OldCnt, $Num );
>
> ## [HP $Num needs to be set to zero because this section
> ## will eventually throw this error:
> ## Use of uninitialized value in numeric lt (<) at ./Krahn.pl line 20
> ## if it isn't

OK, change that line to:

my ( $Num, $OldCnt ) = 0;


> $Num = 0;
>
> ## ]
>
>   {   opendir my $dh, $TrgDir or die "Cannot open '$TrgDir' $!";
>       while ( my $file = readdir $dh ) {
>           next unless -f "$TrgDir/$file" and $file =~ /\A\d+\z/;
>           $Num = $file if $Num < $file;
>           $OldCnt++;
>           }
>   }
>
> ## [ I don't really understand the format of having the `open' and
> ## processing enclosed in {}.  Don't think I've noticed that before..
> ## But the clause is nice and tidy.. thanks
> ## ]

It is used to limit the scope of the lexical variables, particulary the 
directory handle $dh which will be closed automatically when it goes 
out of scope.


>   my $CopyCnt;
>   find sub {
>       return unless -f and /\A\d+\z/;
>       open my $fh, '<:raw', $_
>           or die "Cannot open '$_' $!";
>       my $size = -s $fh;
>       $size == read $fh, my $data, $size
>           or die "Cannot read '$_' $!";
>       return unless $data =~ $regex;
>
> ## [HP I don't really understand how `return' operates here.  Is it
> ## doing the same job as `next'?
> ## ]

'next' works inside loops.  Because this is inside a subroutine you 
have to use 'return'.


>       $Num++;
>       open my $out, '>:raw', "$TrgDir/$Num"
>           or die "Cannot open '$TrgDir/$Num' $!";
>       print $out $data
>           or die "Cannot print to '$TrgDir/$Num' $!";
>       $CopyCnt++;
>       }, $SrcDir;
>
> ## [ I don't really understand why some of the processing is done.  I
> ## hadn't seen `:raw' used before but apparently the :raw part is
> ## there to handle the possiblitiy of different line endings in the
> ## files.  Then size is checked; apparently to ensure the size
> reported ## in -s is the same when `read'

Correct.


> ## And then the data is `printed' to its new home instead of just
> ## being copied there.... why is that being done?

It is being copied, since all the data from the old file is in the 
$data variable it is copied to the new file.


> ## ]


HTH

John
-- 
use Perl;
program
fulfillment

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/

Re: Was `readdir [...]' Comments on JK's script

Reply via email to