subject:"Re\: regular expression help"

Re: Regular Expression Help.

2015-03-25 Thread Shlomi Fish

Hi Frank,

On Wed, 25 Mar 2015 10:31:40 +0530
Frank Vino vinofra...@gmail.com wrote:

 Hi Team,
 
 How to understand Regular Expression in a easy way?
 

This page has links to some recommended tutorials about learning regular
expressions:

http://perl-begin.org/topics/regular-expressions/

*NOTE*: I originated perl-begin.org and still maintain it, but everyone is
welcome to contribute to it.

Regards,

Shlomi Fish

-- 
-
Shlomi Fish   http://www.shlomifish.org/
List of Text Processing Tools - http://shlom.in/text-proc

When it comes to terminators, you have a better shot at Arnold Schwarzenegger
than at Summer Glau.  (Via The Big Bang Theory)
— http://www.shlomifish.org/humour/bits/facts/Summer-Glau/

Please reply to list if it's a mailing list post - http://shlom.in/reply .

--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular Expression Help.

2015-03-25 Thread Simon Reinhardt

Hi Frank,

when first learning regexps I read the section In the World of Regular
Expressions in the Lama-Book [1]. If you find this introduction to
slow, you might also take a look at chromatic's Modern Perl, which is
available for free [2].

Regards, Simon

Am 25.03.2015 um 06:01 schrieb Frank Vino:
 Hi Team,
 
 How to understand Regular Expression in a easy way?
 
 Thanks,
 Frank

[1] Learning Perl, Sixth Edition by Randal L. Schwartz, brian d foy, and
Tom Phoenix

[2] http://onyxneon.com/books/modern_perl/

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular Expression Help.

2015-03-25 Thread Shawn H Corey

On Wed, 25 Mar 2015 10:31:40 +0530
Frank Vino vinofra...@gmail.com wrote:

 Hi Team,
 
 How to understand Regular Expression in a easy way?
 
 Thanks,
 Frank

Sorry Frank but there's no easy way. ☹

Some things to remember:

Some punctuation marks have special meaning, like periods, question
marks, and asterisks. But if there is a backslash before them, then
they match themselves. This is true for all punctuation marks.

And the opposite is true for ASCII letters and numbers. A backslash
before them may give them special meaning but if there is none, then
they match themselves.


-- 
Don't stop where the ink does.
Shawn

--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular Expression Help.

2015-03-25 Thread Frank Vino

Thanks a lot Simon

-Frank

On Wed, Mar 25, 2015 at 5:44 PM, Simon Reinhardt si...@keinstein.org
wrote:

 Hi Frank,

 when first learning regexps I read the section In the World of Regular
 Expressions in the Lama-Book [1]. If you find this introduction to
 slow, you might also take a look at chromatic's Modern Perl, which is
 available for free [2].

 Regards, Simon

 Am 25.03.2015 um 06:01 schrieb Frank Vino:
  Hi Team,
 
  How to understand Regular Expression in a easy way?
 
  Thanks,
  Frank

 [1] Learning Perl, Sixth Edition by Randal L. Schwartz, brian d foy, and
 Tom Phoenix

 [2] http://onyxneon.com/books/modern_perl/

 --
 To unsubscribe, e-mail: beginners-unsubscr...@perl.org
 For additional commands, e-mail: beginners-h...@perl.org
 http://learn.perl.org/

Re: Regular Expression Help.

2015-03-24 Thread Rahul Gojame

Frank,

Just go through below site, it helps to build regex and test same easily.
http://www.regexr.com/

~Rahul

On Wed, Mar 25, 2015 at 10:42 AM, Akshay Mohit akshaymohit2...@gmail.com
wrote:

 Just start using it and you will find it very easy to understand.

 -Akshay

 On Wed, Mar 25, 2015 at 10:31 AM, Frank Vino vinofra...@gmail.com wrote:

 Hi Team,

 How to understand Regular Expression in a easy way?

 Thanks,
 Frank

Re: Regular Expression Help.

2015-03-24 Thread Akshay Mohit

Just start using it and you will find it very easy to understand.

-Akshay

On Wed, Mar 25, 2015 at 10:31 AM, Frank Vino vinofra...@gmail.com wrote:

 Hi Team,

 How to understand Regular Expression in a easy way?

 Thanks,
 Frank

Re: Regular expression help pls !

2013-08-23 Thread Gianrossi, Paolo

Not sure I get it, but would

/^fc3\/2\b/ 

(assuming you're looking for fc3/2 and not fc3/23) work?

hth
paolino

On 23 Aug 2013, at 17:06, jet speed speedj...@googlemail.com wrote:

 Chaps,
 
 Please i need help on the regular expression, i have the sample code below.
 I only want to match the entries from the array to the file and print the 
 matching line
 
 for example if i only want to match fc3/23, in my code it prints both the 
 lines fc3/2 and fc3/23. How to restrict to exact matches and print only ex: 
 fc3/23
 
 
 
 out.txt 
 --
 
 fc3/2  10:00:00:00  host1
 fc3/23 10:00:00:00  host2
 fc10/1 10:00:00:00  host3
 fc10/11 10:00:00:00  host4
 fc3/1 10:00:00:00  host5
 fc3/14 10:00:00:00  host6
 fc12/1 10:00:00:00  host7
 fc12/1210:00:00:00  host8
 
 
 sample code 
 -
 
 my @check = (fc3/23, fc10/1, fc3/14, fc12/12);
 
 my $f2 = 'out.txt';
 for my $element(@check) {
 open my $fh2, '', $f2 or die could not open $f2: $!;
 while (my $line = $fh2) {
 chomp $line;
 if ($line = / ^(fc\d+\/\d+/) {
 $match=$;
 }
 if  ($element =~ $match) {
 print $line \n;
 }
 }
 }
 
 
 required output
 
 
 fc3/23 10:00:0:00  host2
 fc10/1 10:00:0:00  host3
 fc3/14 10:00:00:00  host6
 fc12/1210:00:00:00  host8
 
 Thanks
 Sj
 


--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help pls !

2013-08-23 Thread Shawn H Corey

On Fri, 23 Aug 2013 17:06:41 +0100
jet speed speedj...@googlemail.com wrote:

 my @check = (fc3/23, fc10/1, fc3/14, fc12/12);

my @check = qw( fc3/23 fc10/1 fc3/14 fc12/12 );

 
 my $f2 = 'out.txt';
 for my $element(@check) {
 open my $fh2, '', $f2 or die could not open $f2: $!;
 while (my $line = $fh2) {
 chomp $line;

# put the for loop inside the while
open my $fh2, '', $f2 or die could not open $f2: $!;
while (my $line = $fh2) {
chomp $line;
for my $element(@check) {
if( $line =~ /^\Q$element\E\s/ ){
print $line \n;
last;
}
}
}



-- 
Don't stop where the ink does.
Shawn

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help pls !

2013-08-23 Thread David Precious

On Fri, 23 Aug 2013 17:06:41 +0100
jet speed speedj...@googlemail.com wrote:

 Chaps,
 
 Please i need help on the regular expression, i have the sample code
 below. I only want to match the entries from the array to the file
 and print the matching line
 
 for example if i only want to match fc3/23, in my code it prints both
 the lines fc3/2 and fc3/23. How to restrict to exact matches and
 print only ex: fc3/23

[...]
 if ($line = / ^(fc\d+\/\d+/) {

First off, you mean =~ to match against a regex - that's a fairly
meaningless assignment there.  Was that a typo or is that how it
actually is in your code?

Secondly, that regex shouldn't even compile - you've missed a closing
parenthesis.

Perl should have told you something like:

  Unmatched ( in regex; marked by  

You meant:

if ($line =~ /^(fc\d+\/\d+)/) {

Although I would write that a little more clearly - using qr lets you
pick delimiters that avoid you having to escape the slash in the
middle, and using the x modifier makes the regex whitespace
insensitive, so you can space things out for readability - and also,
assign the match to a meaningful var immediately:

if (my ($port) = $line =~ qr{^ (fc \d+ / \d+ ) }x) {
# matched, $port contains e.g. fc3/23
}


Also, iterating through the file once for each port you want to look
for is a rather inefficient approach - reverse the logic, and loop
through each line in the file, checking it against each of your ports
of interest within the loop - for example:

open my $fh, '', $filename or die Failed to open $filename - $!;
while (my $line = $fh) {
chomp $line;
if (my ($port) = $line =~ qr{^ (fc \d+ / \d+ ) }x) {
if (grep { $port eq $_ } @check) {
# $port matched one of the ones you wanted.
print $line\n;
}
}
}


Hope that helps?


-- 
David Precious (bigpresh) dav...@preshweb.co.uk
http://www.preshweb.co.uk/ www.preshweb.co.uk/twitter
www.preshweb.co.uk/linkedinwww.preshweb.co.uk/facebook
www.preshweb.co.uk/cpanwww.preshweb.co.uk/github



-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help pls !

2013-08-23 Thread Nathan Hilterbrand


See sample code below


 Chaps,

 Please i need help on the regular expression, i have the sample code
 below.
 I only want to match the entries from the array to the file and print
 the
 matching line

 for example if i only want to match fc3/23, in my code it prints both
 the
 lines fc3/2 and fc3/23. How to restrict to exact matches and print
 only ex:
 fc3/23



 out.txt
 --

 fc3/2  10:00:00:00  host1
 fc3/23 10:00:00:00  host2
 fc10/1 10:00:00:00  host3
 fc10/11 10:00:00:00  host4
 fc3/1 10:00:00:00  host5
 fc3/14 10:00:00:00  host6
 fc12/1 10:00:00:00  host7
 fc12/1210:00:00:00  host8


 sample code
 -

 my @check = (fc3/23, fc10/1, fc3/14, fc12/12);

 my $f2 = 'out.txt';
 for my $element(@check) {
 open my $fh2, '', $f2 or die could not open $f2: $!;
 while (my $line = $fh2) {
 chomp $line;
 if ($line = / ^(fc\d+\/\d+/) {
 $match=$;
 }
 if  ($element =~ $match) {
 print $line \n;
 }
 }
 }


 required output
 

 fc3/23 10:00:0:00  host2
 fc10/1 10:00:0:00  host3
 fc3/14 10:00:00:00  host6
 fc12/1210:00:00:00  host8

 Thanks
 Sj


This worked for me:

#!/usr/bin/perl

  use strict;  # Always a good idea!
  use warnings;# likewise!

  # You need quotes around your literals in the line that creates
  # and assigns @check:
  my @check = (fc3/23, fc10/1, fc3/14, fc12/12);
  my $f2 = 'out.txt';

  # The following lines create a reqexp variable that will match any
  # of the strings in @check.  Note in the map that I placed a space
  # after $_.  This takes care of the problem where fc3/2 matched
  # fc3/23.  When this is done, for this sample, the generated regular
  # expression looks like:
  #(?:fc3/23 )|(?:fc10/1 )|(?:fc3/14 )|(?:fc12/12 )
  # This attempts to match against all of the strings that were in
@check.
  # The use of (?: instead of ( to start each group will make the
  # parenthesis non-capturing, which is a bit quicker when you don't
need
  # to capture the actual matches.
  #

  my @tmpcheck = map {(?:$_ )} @check;
  my $regexp_str = join |, @tmpcheck;
  my $regexp = qr/$regexp_str/;

  open my $fh2, '', $f2 or die could not open $f2: $!;

  while (my $line = $fh2) {
print $line if $line =~ $regexp;# Print only lines that match
  }

  # Note that I did not chomp the newline and put it back on when
printing.
  # The presence of the newline does not effect the match in this
case, so
  # that was not necessary.

Best of luck.

Nathan
-- 



-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help pls !

2013-08-23 Thread Jim Gibson


On Aug 23, 2013, at 9:06 AM, jet speed wrote:

 Chaps,
 
 Please i need help on the regular expression, i have the sample code below.
 I only want to match the entries from the array to the file and print the 
 matching line
 
 for example if i only want to match fc3/23, in my code it prints both the 
 lines fc3/2 and fc3/23. How to restrict to exact matches and print only ex: 
 fc3/23
 
 
 
 out.txt 
 --
 
 fc3/2  10:00:00:00  host1
 fc3/23 10:00:00:00  host2
 fc10/1 10:00:00:00  host3
 fc10/11 10:00:00:00  host4
 fc3/1 10:00:00:00  host5
 fc3/14 10:00:00:00  host6
 fc12/1 10:00:00:00  host7
 fc12/1210:00:00:00  host8
 
 
 sample code 
 -

The code below does not compile. Please make sure you post valid programs if 
you want help.

 
 my @check = (fc3/23, fc10/1, fc3/14, fc12/12);

You need to put quotes around strings:

my @check = ('fc3/23', 'fc10/1', 'fc3/14', 'fc12/12');


 my $f2 = 'out.txt';
 for my $element(@check) {

You should indent blocks (for, while) for readability.


 open my $fh2, '', $f2 or die could not open $f2: $!;

Here, you are opening the file for each iteration. Reading files is much slower 
than iterating over an array. Therefore, if you have to 1) read a file and 2) 
iterate over an array, you should reverse the order of operations: Read a line 
from the file and then iterate over the array for each line in the file.

 while (my $line = $fh2) {
 chomp $line;
 if ($line = / ^(fc\d+\/\d+/) {

This line has three syntax errors:
1. You want the binding operator =~ instead of assignment =
2. You don't want a space between / and ^
3. The final / and closing ) should be swapped.

I am assuming you mean:

  if ($line =~ /^(fc\d+\/\d+)/ {


 $match=$;

Since your regular expression captures the entire match, you should use the 
capturing variable $1 instead of the whatever-matched variable $. It is 
actually faster.

Every line of your input will match the above regular expression, so the only 
effect is to extract the first field. You can do that by splitting the line on 
whitespace:

  my( $first, @rest ) = split(/ /,$line);

 }
 if  ($element =~ $match) {

You need slashes or the m operator to define a regular expression:

  if( $element =~ /$match/ ) {

If you want to match the entire string, you need to add beginning and ending 
anchors:

  if( $element =~ /^$match$/ ) {

But if that is what you want, just use eq:

  if( $element eq $match ) {


 print $line \n;
 }
 }
 }
 
 
 required output
 
 
 fc3/23 10:00:0:00  host2
 fc10/1 10:00:0:00  host3
 fc3/14 10:00:00:00  host6
 fc12/1210:00:00:00  host8
 
 Thanks
 Sj
 


--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help pls !

2013-08-23 Thread jet speed

Chaps,
I am testing all your code one by one, Appreciate your time and detailed
inputs.

Many Thanks
Sj




On Fri, Aug 23, 2013 at 6:01 PM, Jim Gibson jimsgib...@gmail.com wrote:


 On Aug 23, 2013, at 9:06 AM, jet speed wrote:

  Chaps,
 
  Please i need help on the regular expression, i have the sample code
 below.
  I only want to match the entries from the array to the file and print
 the matching line
 
  for example if i only want to match fc3/23, in my code it prints both
 the lines fc3/2 and fc3/23. How to restrict to exact matches and print only
 ex: fc3/23
 
 
 
  out.txt
  --
 
  fc3/2  10:00:00:00  host1
  fc3/23 10:00:00:00  host2
  fc10/1 10:00:00:00  host3
  fc10/11 10:00:00:00  host4
  fc3/1 10:00:00:00  host5
  fc3/14 10:00:00:00  host6
  fc12/1 10:00:00:00  host7
  fc12/1210:00:00:00  host8
 
 
  sample code
  -

 The code below does not compile. Please make sure you post valid programs
 if you want help.

 
  my @check = (fc3/23, fc10/1, fc3/14, fc12/12);

 You need to put quotes around strings:

 my @check = ('fc3/23', 'fc10/1', 'fc3/14', 'fc12/12');


  my $f2 = 'out.txt';
  for my $element(@check) {

 You should indent blocks (for, while) for readability.


  open my $fh2, '', $f2 or die could not open $f2: $!;

 Here, you are opening the file for each iteration. Reading files is much
 slower than iterating over an array. Therefore, if you have to 1) read a
 file and 2) iterate over an array, you should reverse the order of
 operations: Read a line from the file and then iterate over the array for
 each line in the file.

  while (my $line = $fh2) {
  chomp $line;
  if ($line = / ^(fc\d+\/\d+/) {

 This line has three syntax errors:
 1. You want the binding operator =~ instead of assignment =
 2. You don't want a space between / and ^
 3. The final / and closing ) should be swapped.

 I am assuming you mean:

   if ($line =~ /^(fc\d+\/\d+)/ {


  $match=$;

 Since your regular expression captures the entire match, you should use
 the capturing variable $1 instead of the whatever-matched variable $. It
 is actually faster.

 Every line of your input will match the above regular expression, so the
 only effect is to extract the first field. You can do that by splitting the
 line on whitespace:

   my( $first, @rest ) = split(/ /,$line);

  }
  if  ($element =~ $match) {

 You need slashes or the m operator to define a regular expression:

   if( $element =~ /$match/ ) {

 If you want to match the entire string, you need to add beginning and
 ending anchors:

   if( $element =~ /^$match$/ ) {

 But if that is what you want, just use eq:

   if( $element eq $match ) {


  print $line \n;
  }
  }
  }
 
 
  required output
  
 
  fc3/23 10:00:0:00  host2
  fc10/1 10:00:0:00  host3
  fc3/14 10:00:00:00  host6
  fc12/1210:00:00:00  host8
 
  Thanks
  Sj
 


 --
 To unsubscribe, e-mail: beginners-unsubscr...@perl.org
 For additional commands, e-mail: beginners-h...@perl.org
 http://learn.perl.org/

Re: regular expression help

2012-09-21 Thread Octavian Rasnita


From: Dr.Ruud rvtol+use...@isolution.nl


On 2012-09-20 09:08, Octavian Rasnita wrote:


my ( $file_name ) = $data =~ /([^\\]+)$/g;


No need for that g-modifier.

--
Ruud




Yes, you are right. I added it by mistake.

Octavian


--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: regular expression help

2012-09-20 Thread Octavian Rasnita


From: Irfan Sayed irfan_sayed2...@yahoo.com

i have string 'c:\p4\car\abc\xyz.csproj'

i just need to match the xyz.csproj

i tried few option but does not help.

can someone please suggest

regards
irfan




my $data = 'c:\p4\car\abc\xyz.csproj';

my ( $file_name ) = $data =~ /([^\\]+)$/g;

print $file_name;

It will print:
xyz.csproj

( and ) captures the matched string but because in this case they capture 
the whole regular expression, you can omit using them like:


my ( $file_name ) = $data =~ /[^\\]+$/g;

[^\\] means that it matches everything which is not a \ char, and because \ 
is a special char, it should be escaped with another \ before it (this is 
why there are 2 \ chars).


There is a + char after the [^\\] meaning that [^\\] doesn't match just a 
single char, but one or more.


And at the end there is a $ sign meaning that after this string that mached, 
it is the end of the string, so this regex will match any substring which 
doesn't contain a \ char which appear at the end of the string.


HTH, and of course, use strict and warnings.

Octavian



--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: regular expression help

2012-09-20 Thread Irfan Sayed

got it myself :)
thanks a lot 
 $line_to_add =~ m/([a-zA-Z]+\.csproj)/;

regards




 From: Irfan Sayed irfan_sayed2...@yahoo.com
To: Perl Beginners beginners@perl.org 
Sent: Thursday, September 20, 2012 12:07 PM
Subject: regular expression help 
 
i have string 'c:\p4\car\abc\xyz.csproj'

i just need to match the xyz.csproj

i tried few option but does not help. 

can someone please suggest

regards
irfan

Re: regular expression help

2012-09-20 Thread Michael Brader



On 09/20/2012 04:39 PM, Irfan Sayed wrote:

got it myself :)
thanks a lot
  $line_to_add =~ m/([a-zA-Z]+\.csproj)/;



Hi Irfan,

Your solution will only match files that consist of ASCII alphabetic 
characters followed by '.csproj'. It will also match these:


* 'c:\p4\car\abc\foo.csproj\file.txt'
* 'c:\p4\car\abc\123xyx.csproj'
* 'c:\p4\car\abc\xyz.csprojabc'

Octavian's solution is much more general. Run this program to see why:

#!/usr/bin/perl

use strict;
use warnings;

my @paths = (
'c:\p4\car\abc\xyz.csproj',
'c:\p4\car\abc\foo.csproj\file.txt',
'c:\p4\car\abc\123xyx.csproj',
'c:\p4\car\abc\xyz.csprojabc',
);

foreach my $path ( @paths ) {
my ($irfan) = $path =~ m/([a-zA-Z]+\.csproj)/;
$irfan ||= 'no match';
my ($octavian) = $path =~ /([^\\]+)$/;
$octavian ||= 'no match';
print EOINFO;
Path: $path
Irfan: $irfan
Octavian: $octavian

EOINFO
}
__END__

Path: c:\p4\car\abc\xyz.csproj
Irfan: xyz.csproj
Octavian: xyz.csproj

Path: c:\p4\car\abc\foo.csproj\file.txt
Irfan: foo.csproj
Octavian: file.txt

Path: c:\p4\car\abc\123xyx.csproj
Irfan: xyx.csproj
Octavian: 123xyx.csproj

Path: c:\p4\car\abc\xyz.csprojabc
Irfan: xyz.csproj
Octavian: xyz.csprojabc

A more idiomatic way to do this is to use the File::Spec module. Inspect 
the output of this program for inspiration:


#!/usr/bin/perl

use strict;
use warnings;

use File::Spec; # for splitpath

my $path = 'c:\p4\car\abc\xyz.csproj';

my ( $volume, $directories, $file) = File::Spec-splitpath( $path );

print EOINFO;
Volume: $volume
Directories: $directories
File: $file
EOINFO

exit 0;
__END__

Cheers,
Michael


--
Michael BraderSenior Software Engineer and Perl Person
  Technology/Softdev/ME Small Change Team
Internode   http://internode.on.net/  mbra...@internode.com.au
iiNet http://iinet.net.au/ m.bra...@staff.iinet.net.au


--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: regular expression help

2012-09-20 Thread Shlomi Fish

On Thu, 20 Sep 2012 17:13:07 +0930
Michael Brader mbra...@internode.com.au wrote:

 A more idiomatic way to do this is to use the File::Spec module.
 Inspect the output of this program for inspiration:
 

There's also File::Basename:

http://perldoc.perl.org/File/Basename.html

Regards,

Shlomi Fish

-- 
-
Shlomi Fish   http://www.shlomifish.org/
Optimising Code for Speed - http://shlom.in/optimise

Chuck Norris was never a newbie! He will kill anyone who implies otherwise. In
a very not newbie-like manner.

Please reply to list if it's a mailing list post - http://shlom.in/reply .

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: regular expression help

2012-09-20 Thread Irfan Sayed

thanks a lot for all the responses :)

regards

 From: Shlomi Fish shlo...@shlomifish.org
To: Michael Brader mbra...@internode.com.au 
Cc: beginners@perl.org 
Sent: Thursday, September 20, 2012 2:53 PM
Subject: Re: regular expression help

On Thu, 20 Sep 2012 17:13:07 +0930
Michael Brader mbra...@internode.com.au wrote:

 A more idiomatic way to do this is to use the File::Spec module.
 Inspect the output of this program for inspiration:

There's also File::Basename:

http://perldoc.perl.org/File/Basename.html

Regards,

    Shlomi Fish

-- 
-
Shlomi Fish       http://www.shlomifish.org/
Optimising Code for Speed - http://shlom.in/optimise

Chuck Norris was never a newbie! He will kill anyone who implies otherwise. In
a very not newbie-like manner.

Please reply to list if it's a mailing list post - http://shlom.in/reply .

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: regular expression help

2012-09-20 Thread Dr.Ruud


On 2012-09-20 09:08, Octavian Rasnita wrote:


my ( $file_name ) = $data =~ /([^\\]+)$/g;


No need for that g-modifier.

--
Ruud



--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular Expression help!

2011-05-11 Thread Leo Susanto

I ended up confused after reading your email.

Please specify INPUT + OUTPUT/condition.

You have already specify INPUT which is:
LOGICAL UNIT NUMBER 587
UID:60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11

LOGICAL UNIT NUMBER 128
UID:60:06:01:60:50:40:21:00:D0:23:D5:C2:BA:D9:DE:11

LOGICAL UNIT NUMBER 763
UID:60:06:01:60:50:40:21:00:25:C6:3F:A7:CA:2D:DF:11

What is the desired OUTPUT/condition?

Is it something like this?
587 wwn is 60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11
128 wwn is 60:06:01:60:50:40:21:00:D0:23:D5:C2:BA:D9:DE:11
763 wwn is 60:06:01:60:50:40:21:00:25:C6:3F:A7:CA:2D:DF:11

On Wed, May 11, 2011 at 8:38 AM, jet speed speedj...@googlemail.com wrote:
 Hi All,

 I need help in matching the regular expression, the file is as below.

 I am trying to match number followed by Number ex 587, 128 in $1 and
 60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11 in $2

 the $1 match works find with regulare expression  #if ($_=~
 /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})/i) { #workts for 1st line

 However i am not sure how to get both $1 and $2 togather.

 Ideally i would like to have an output printed

 print $1 wwn is $2 \n;

 Any help on this would be much appreciated.


 FILE
 ---


 LOGICAL UNIT NUMBER 587
 UID:                        60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11

 LOGICAL UNIT NUMBER 128
 UID:                        60:06:01:60:50:40:21:00:D0:23:D5:C2:BA:D9:DE:11

 LOGICAL UNIT NUMBER 763
 UID:                        60:06:01:60:50:40:21:00:25:C6:3F:A7:CA:2D:DF:11

 ---


 #!/usr/bin/perl
 open(FILE, clrlist) || die Can't open clrlist : $!\n;

 while (FILE) {
 #if ($_=~ /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})/i) { #workts for 1st line
 if ($_=~ 
 /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})\n[a-z0-9+]\s*(\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*)/i)
 {
 print $1 ;
 print $2;
        }
 }

 -

 Sj

 --
 To unsubscribe, e-mail: beginners-unsubscr...@perl.org
 For additional commands, e-mail: beginners-h...@perl.org
 http://learn.perl.org/




--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular Expression help!

2011-05-11 Thread Shawn H Corey


On 11-05-11 11:38 AM, jet speed wrote:

I need help in matching the regular expression, the file is as below.

I am trying to match number followed by Number ex 587, 128 in $1 and
60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11 in $2

the $1 match works find with regulare expression  #if ($_=~
/\w{7}\s\w{4}\s\w{6}\s(\d{1,4})/i) { #workts for 1st line

However i am not sure how to get both $1 and $2 togather.

Ideally i would like to have an output printed

print $1 wwn is $2 \n;

Any help on this would be much appreciated.


You are trying to do too much in one regular expression.  Process the 
file one line at a time.


#!/usr/bin/env perl

use strict;
use warnings;

my $lun = 0;
while( DATA ){
  if( /^\s*LOGICAL\s+UNIT\s+NUMBER\s+(\d+)/ ){
$lun = $1;
  }elsif( /^UID:\s*(\S+)/ ){
print $lun wwn is $1 \n;
  }
}

__DATA__


LOGICAL UNIT NUMBER 587
UID:60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11

LOGICAL UNIT NUMBER 128
UID:60:06:01:60:50:40:21:00:D0:23:D5:C2:BA:D9:DE:11

LOGICAL UNIT NUMBER 763
UID:60:06:01:60:50:40:21:00:25:C6:3F:A7:CA:2D:DF:11




--
Just my 0.0002 million dollars worth,
  Shawn

Confusion is the first step of understanding.

Programming is as much about organization and communication
as it is about coding.

The secret to great software:  Fail early  often.

Eliminate software piracy:  use only FLOSS.

--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular Expression help!

2011-05-11 Thread Rob Dixon

On 11/05/2011 16:38, jet speed wrote:
 Hi All,
 
 I need help in matching the regular expression, the file is as below.
 
 I am trying to match number followed by Number ex 587, 128 in $1 and
 60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11 in $2
 
 the $1 match works find with regulare expression  #if ($_=~ 
 /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})/i) { #workts for 1st line
 
 However i am not sure how to get both $1 and $2 togather.
 
 Ideally i would like to have an output printed
 
 print $1 wwn is $2 \n;
 
 Any help on this would be much appreciated.
 
 
 FILE
 ---
 
 
 LOGICAL UNIT NUMBER 587
 UID:60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11
 
 LOGICAL UNIT NUMBER 128
 UID:60:06:01:60:50:40:21:00:D0:23:D5:C2:BA:D9:DE:11
 
 LOGICAL UNIT NUMBER 763
 UID:60:06:01:60:50:40:21:00:25:C6:3F:A7:CA:2D:DF:11
 
 ---
 
 
 #!/usr/bin/perl
 open(FILE, clrlist) || die Can't open clrlist : $!\n;
 
 while (FILE) {
 #if ($_=~ /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})/i) { #workts for 1st line
 if ($_=~ 
 /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})\n[a-z0-9+]\s*(\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*)/i)
 {
 print $1 ;
 print $2;
  }
 }
 
 -

I think I understand what you hope to achieve, but your code will not
compile as it stands.

First of all, instead of

  /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})/i

you should write simply

  /LOGICAL UNIT NUMBER (\d{1,4})/i

Your second regex is similar, but anything like

  /\d{2}*/

gives the error 'Nested quantifiers in regex'. You can specify either
{2} (exactly two occurrences) or * (zero or more occurrences) but not both!

Also, you are trying to match a string of 16 hex bytes using /\d/. That
matches only 0..9 in ASCII (and a lot more in Unicode, but that isn't a
problem here). For hex, use either /[0-9A-X]/i or /[[:xdigit:]]/.

But the main problem is that you are trying to match two lines
simultaneously. Your regex contains /\n/, but you are reading from the
file only one line at a time. Each line will contain *either*

  LOGICAL UNIT NUMBER 587\n

*or*

  UID:
60:06:01:60:50:40:21:00:25:C6:3F:A7:CA:2D:DF:11\n

so you need to code accordingly.

On top of that, your second regex includes /[a-z0-9+]/i when I think you
meant /[a-z0-9]+/i, and it doesn't allow for the colon after 'UID'. Once
more, this should be /UID:\s*/ or something similar. Without seeing your
data I cannot help further.

For those who would help, I believe the program looks as below.

I hope this helps,

Rob



use strict;
use warnings;

while (DATA) {

  #if ($_=~ /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})/i) { #workts for 1st line

  if ($_=~ 
/\w{7}\s\w{4}\s\w{6}\s(\d{1,4})\n[a-z0-9+]\s*(\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*\d{2}*)/i)
 {
print $1 ;
print $2;
  }
}

__DATA__
LOGICAL UNIT NUMBER 587
UID:60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11

LOGICAL UNIT NUMBER 128
UID:60:06:01:60:50:40:21:00:D0:23:D5:C2:BA:D9:DE:11

LOGICAL UNIT NUMBER 763
UID:60:06:01:60:50:40:21:00:25:C6:3F:A7:CA:2D:DF:11

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular Expression help!

2011-05-11 Thread C.DeRykus

On May 11, 8:38 am, speedj...@googlemail.com (jet speed) wrote:
 Hi All,

 I need help in matching the regular expression, the file is as below.

 I am trying to match number followed by Number ex 587, 128 in $1 and
 60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11 in $2

 the $1 match works find with regulare expression  #if ($_=~
 /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})/i) { #workts for 1st line

 However i am not sure how to get both $1 and $2 togather.

 Ideally i would like to have an output printed

 print $1 wwn is $2 \n;

 Any help on this would be much appreciated.

 FILE
 ---

 LOGICAL UNIT NUMBER 587
 UID:                        60:06:01:60:42:40:21:00:3A:AA:55:37:91:8A:DF:11

 LOGICAL UNIT NUMBER 128
 UID:                        60:06:01:60:50:40:21:00:D0:23:D5:C2:BA:D9:DE:11

 LOGICAL UNIT NUMBER 763
 UID:                        60:06:01:60:50:40:21:00:25:C6:3F:A7:CA:2D:DF:11

 ---

 #!/usr/bin/perl
 open(FILE, clrlist) || die Can't open clrlist : $!\n;

 while (FILE) {
 #if ($_=~ /\w{7}\s\w{4}\s\w{6}\s(\d{1,4})/i) { #workts for 1st line
 ...

Also, you may want to consider the /x modifier to avoid
regex blindness, [a visual Bermuda Triangle that will
cause anyone viewing it to avert their eyes and quickly
paddle the other way].

Even with an easy regex,  the view improves considerably
with /x :

if  (  /  \w{7}  \s \w{4} \s# insert comment here...
  \d{1,4}   # more comments...
  
   /ix  )

Or just:
  / \w{7} \s  \w{4}  \s   /ix;  # whitespace enhanced

--
Charles DeRykus


--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular Expression help!

2011-05-11 Thread jet speed

Hi All,

Thanks for your time and valuable inputs, Appreciate it.

I will try your suggestions and test it in my program.

Sj

Re: Regular expression help !!

2011-04-27 Thread Paolo Gianrossi

2011/4/27 jet speed speedj...@googlemail.com

 Hi,

 Please could you advice, how can i write a regular expression for the
 line below to capture 0079 and 69729260057253303030373


 0079 Not Visible 69729260057253303030373

 i tried this one, no luck

 /(^\d{4})\s\w+\s\w+\s+\d+/ig)



What about
 /^(\d+)\D+(\d+)$/ ?



 Appreciate your help with this.


Not sure wether this is actually what you need, though


 Sj


cheers
paolino


 --
 To unsubscribe, e-mail: beginners-unsubscr...@perl.org
 For additional commands, e-mail: beginners-h...@perl.org
 http://learn.perl.org/

Re: Regular expression help !!

2011-04-27 Thread Jeff Pang

2011/4/27 jet speed speedj...@googlemail.com:
 Hi,

 Please could you advice, how can i write a regular expression for the
 line below to capture 0079 and 69729260057253303030373


 0079 Not Visible             69729260057253303030373



This might help?

$ perl -le '
$str=0079 Not Visible 69729260057253303030373;
@re = $str =~ /^(\d+).*?(\d+)$/;
print @re'

0079 69729260057253303030373

--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help !!

2011-04-27 Thread Rob Dixon


On 27/04/2011 11:47, jet speed wrote:


Please could you advice, how can i write a regular expression for the
line below to capture 0079 and 69729260057253303030373


0079 Not Visible 69729260057253303030373

i tried this one, no luck

/(^\d{4})\s\w+\s\w+\s+\d+/ig)


It works fine! You are simply not capturing the second sequence of 
digits. This


use strict;
use warnings;

for ('0079 Not Visible 69729260057253303030373') {
  print /(^\d{4})\s\w+\s\w+\s+(\d+)/ig ? $1 $2 : 'no match';
}

prints '0079 69729260057253303030373'.

But, depending on your data, there may be no need to match the line so 
precisely. This does the same thing


use strict;
use warnings;

for ('0079 Not Visible 69729260057253303030373') {
  my @n = (split)[0,-1];
  print @n;
}


HTH,

Rob

--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help !!

2011-04-27 Thread jet speed

Hi all,

Thanks for all our inputs,

The regular expression below works fine if do it for single line, i am
trying to caputre the match $1, and $2 into array. only the first line
is pushed to the array. what am i doing wrong ?
how to get all the $1 and $2 match values for each line into arrary ?
Kindly advice.


# more wwnlist
0079 Not Visible 69729260057253303030373
007A Not Visible 69729260057253303030374
007B Not Visible 69729260057253303030374
007C Not Visible 69729260057253303030374
007D Not Visible 69729260057253303030374
007E Not Visible 69729260057253303030374
 more wwnmod.pl
#!/usr/bin/perl

#0079 Not Visible 69729260057253303030373


open(FILE, wwnlist) || die Can't open wwnlist : $!\n;
while (FILE) {
if ($_=~/\b(\d{4})\s[a-z]{1,3}\s[a-z]{1,7}\s*(\d{1,32})\b/i) {
push (@dev, $1);
push (@wwn, $2);
}
}

print @dev \n;
print @wwn \n;

./wwnmod.pl
0079
69729260057253303030373

Regards

Sj


On 4/27/11, Rob Dixon rob.di...@gmx.com wrote:
 On 27/04/2011 11:47, jet speed wrote:

 Please could you advice, how can i write a regular expression for the
 line below to capture 0079 and 69729260057253303030373


 0079 Not Visible 69729260057253303030373

 i tried this one, no luck

 /(^\d{4})\s\w+\s\w+\s+\d+/ig)

 It works fine! You are simply not capturing the second sequence of
 digits. This

 use strict;
 use warnings;

 for ('0079 Not Visible 69729260057253303030373') {
print /(^\d{4})\s\w+\s\w+\s+(\d+)/ig ? $1 $2 : 'no match';
 }

 prints '0079 69729260057253303030373'.

 But, depending on your data, there may be no need to match the line so
 precisely. This does the same thing

 use strict;
 use warnings;

 for ('0079 Not Visible 69729260057253303030373') {
my @n = (split)[0,-1];
print @n;
 }


 HTH,

 Rob


-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help !!

2011-04-27 Thread Jim Gibson

On 4/27/11 Wed  Apr 27, 2011  8:32 AM, jet speed
speedj...@googlemail.com scribbled:

 Hi all,
 
 Thanks for all our inputs,
 
 The regular expression below works fine if do it for single line, i am
 trying to caputre the match $1, and $2 into array. only the first line
 is pushed to the array. what am i doing wrong ?
 how to get all the $1 and $2 match values for each line into arrary ?
 Kindly advice.
 
 
 # more wwnlist
 0079 Not Visible 69729260057253303030373
 007A Not Visible 69729260057253303030374
 007B Not Visible 69729260057253303030374
 007C Not Visible 69729260057253303030374
 007D Not Visible 69729260057253303030374
 007E Not Visible 69729260057253303030374
  more wwnmod.pl
 #!/usr/bin/perl
 
 #0079 Not Visible 69729260057253303030373
 
 
 open(FILE, wwnlist) || die Can't open wwnlist : $!\n;
 while (FILE) {
 if ($_=~/\b(\d{4})\s[a-z]{1,3}\s[a-z]{1,7}\s*(\d{1,32})\b/i) {
 push (@dev, $1);
 push (@wwn, $2);
 }
 }
 
 print @dev \n;
 print @wwn \n;
 
 ./wwnmod.pl
 0079
 69729260057253303030373
[
The metasymbol \d matches the characters [0-9], not the extended hexadecimal
set that includes A-Z. To match those, construct your own character class:

[0-9A-Z]

So your regular expression test will become:

if ($_=~/\b([0-9A-Z]{4})\s[a-z]{1,3}\s[a-z]{1,7}\s*(\d{1,32})\b/i) {




-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help !!

2011-04-27 Thread Paul Johnson

On Wed, Apr 27, 2011 at 04:32:57PM +0100, jet speed wrote:
 Hi all,
 
 Thanks for all our inputs,
 
 The regular expression below works fine if do it for single line, i am
 trying to caputre the match $1, and $2 into array. only the first line
 is pushed to the array. what am i doing wrong ?
 how to get all the $1 and $2 match values for each line into arrary ?
 Kindly advice.
 
 
 # more wwnlist
 0079 Not Visible 69729260057253303030373
 007A Not Visible 69729260057253303030374
 007B Not Visible 69729260057253303030374
 007C Not Visible 69729260057253303030374
 007D Not Visible 69729260057253303030374
 007E Not Visible 69729260057253303030374

 if ($_=~/\b(\d{4})\s[a-z]{1,3}\s[a-z]{1,7}\s*(\d{1,32})\b/i) {

\d doesn't match hex digits or, at least not all of them.

Instead of your first \d you could use [0-9A-F] which actually matches exactly
the characters you want.  \d will also match many characters you probably
don't want to match, unfortunately.

However, I might be tempted to go with Rob's second solution, unless you are
sure each line matches that regexp exactly.  And if you are, is it always Not
Visible in there, or can it really be any three and seven letter words?  If
not, why not use Not Visible in your regexp?

-- 
Paul Johnson - p...@pjcj.net
http://www.pjcj.net

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help !!

2011-04-27 Thread Shawn H Corey


On 11-04-27 12:47 PM, Jim Gibson wrote:

The metasymbol \d matches the characters [0-9], not the extended hexadecimal
set that includes A-Z. To match those, construct your own character class:

[0-9A-Z]


You can use the POSIX xdigit character class instead:

#!/usr/bin/env perl

use strict;
use warnings;

use Data::Dumper;

# Make Data::Dumper pretty
$Data::Dumper::Sortkeys = 1;
$Data::Dumper::Indent   = 1;

# Set maximum depth for Data::Dumper, zero means unlimited
local $Data::Dumper::Maxdepth = 0;

while( DATA ){
  my @hex_numbers = ( m{ ( [[:xdigit:]]+ ) }gmsx )[ 0, -1 ];
  print '@hex_numbers: ', Dumper \@hex_numbers;
}

__DATA__
0079 Not Visible 69729260057253303030373
007A Not Visible 69729260057253303030374
007B Not Visible 69729260057253303030374
007C Not Visible 69729260057253303030374
007D Not Visible 69729260057253303030374
007E Not Visible 69729260057253303030374


See `perldoc perlreref` and search for /xdigit/


--
Just my 0.0002 million dollars worth,
  Shawn

Confusion is the first step of understanding.

Programming is as much about organization and communication
as it is about coding.

The secret to great software:  Fail early  often.

Eliminate software piracy:  use only FLOSS.

--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help !!

2011-04-27 Thread Brian Fraser

On Wed, Apr 27, 2011 at 2:48 PM, Shawn H Corey shawnhco...@ncf.ca wrote:

 On 11-04-27 12:47 PM, Jim Gibson wrote:

 The metasymbol \d matches the characters [0-9], not the extended
 hexadecimal
 set that includes A-Z. To match those, construct your own character class:

 [0-9A-Z]


 You can use the POSIX xdigit character class instead:

Or the Unicode character propriety[0] \p{Hex} (or \p{Hex_Digit} or whichever
form you prefer the most).

Or, since the data seems to always come in the same form, instead of using a
regex, you could pull out unpack [1]:
use 5.010;

while (DATA) {
say join ',', (unpack A5 A24 A*)[0,-1];
}

You can learn more about pack/unpack in perlpacktut[2]

Or you could use substr[3]:
while (DATA) {
chomp;
say join ',', substr($_, 0, 4), substr($_, 29);
}

Brian.

[0] http://perldoc.perl.org/perluniprops.html
[1] http://perldoc.perl.org/functions/unpack.html
[2] http://perldoc.perl.org/perlpacktut.html
[3] http://perldoc.perl.org/functions/substr.html

Re: Regular expression help !!

2011-04-27 Thread Dr.Ruud


On 2011-04-27 18:47, Jim Gibson wrote:


The metasymbol \d matches the characters [0-9],


Beware: the \d matches 250+ code points. So don't use \d if you only 
mean [0-9].




not the extended hexadecimal
set that includes A-Z. To match those, construct your own character class:

[0-9A-Z]


Or use [[:xdigit:]], though that also matches the lowercase a-f.

(I assume that you meant A-F)

--
Ruud

--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help !!

2011-04-27 Thread jet speed

Excellent Guys, I would like thank each one of you for inputs. Much
appreciated.

i got blinded by just the numbers 0079, i didn't cater for the next line
which is hex 007A, as one of you rightly pointed out [ 0-9A-Z] , does the
trick.  its amazing to see different technique to achieve the same result.
Thanks again.

Sj

Re: Regular expression help

2009-08-26 Thread Uri Guttman

 DT == Dave Tang d.t...@imb.uq.edu.au writes:


  DT a,b,c,d,e,f1,f2,g1,g2 which spoil my split(/,/).

  DT Could someone provide some guidance?

use a CSV module. parsing csv files is a pain with regexes (even if
doable). there are very stable and fast csv modules on cpan so get one
and use it.

uri

-- 
Uri Guttman  --  u...@stemsystems.com    http://www.sysarch.com --
-  Perl Code Review , Architecture, Development, Training, Support --
-  Gourmet Hot Cocoa Mix    http://bestfriendscocoa.com -

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help

2009-08-26 Thread Chas. Owens

On Wed, Aug 26, 2009 at 02:23, Dave Tangd.t...@imb.uq.edu.au wrote:
 Dear list,

 I am trying to import entries in a csv file into a relational database,
 however there are entries such as:

 a,b,c,d,e,f1,f2,g1,g2 which spoil my split(/,/).
snip

Sounds like a job for [Text::CSV][1].  Of course, you an always write
a quick parser:

#!/usr/bin/perl

use strict;
use warnings;

my $line = q/a,b,c,d,e,f1,f2,g1,g2/;

my $in_string;
my @rec = ();
for my $token ($line =~ /([,]|[^,]+)/g) {
if ($in_string) {
if ($token eq q//) {
$in_string = 0;
push @rec, ;
next;
}
} elsif ($token eq q/,/) {
push @rec, ;
next;
} elsif ($token eq q//) {
$in_string = 1;
next;
}
$rec[-1] .= $token;
}

print join(|, @rec), \n;


[1] : http://search.cpan.org/dist/Text-CSV/lib/Text/CSV.pm

-- 
Chas. Owens
wonkden.net
The most important skill a programmer can have is the ability to read.

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help

2009-08-26 Thread Dave Tang

On Wed, 26 Aug 2009 16:41:39 +1000, Chas. Owens chas.ow...@gmail.com  
wrote:



On Wed, Aug 26, 2009 at 02:23, Dave Tangd.t...@imb.uq.edu.au wrote:

Dear list,

I am trying to import entries in a csv file into a relational database,
however there are entries such as:

a,b,c,d,e,f1,f2,g1,g2 which spoil my split(/,/).

snip

Sounds like a job for [Text::CSV][1].  Of course, you an always write
a quick parser:


I did some searching on cpan as Uri suggested and decided on this module.  
Thank you very much for the code! I have some questions on the code.




#!/usr/bin/perl

use strict;
use warnings;

my $line = q/a,b,c,d,e,f1,f2,g1,g2/;

my $in_string;
my @rec = ();
for my $token ($line =~ /([,]|[^,]+)/g) {


I changed the single pipe (|) to double pipes (||) and $token also  
contained empty strings. Could you explain the difference between the  
pipes?



if ($in_string) {
if ($token eq q//) {
$in_string = 0;
push @rec, ;
next;
}
} elsif ($token eq q/,/) {
push @rec, ;
next;
} elsif ($token eq q//) {
$in_string = 1;
next;
}
$rec[-1] .= $token;


Is this a commonly used method where you push empty values into an array  
(if $token is a , or ) and append stuff to the last array element (which  
is an empty string)?



}

print join(|, @rec), \n;


[1] : http://search.cpan.org/dist/Text-CSV/lib/Text/CSV.pm



Thanks again Chas.

--
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/


--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular expression help

2009-08-26 Thread Chas. Owens

On Wed, Aug 26, 2009 at 03:46, Dave Tangd.t...@imb.uq.edu.au wrote:
snip
 for my $token ($line =~ /([,]|[^,]+)/g) {

 I changed the single pipe (|) to double pipes (||) and $token also contained
 empty strings. Could you explain the difference between the pipes?
snip

The pipe character in regexes creates an alternation.  An alternation
matches if one of the expressions matches.  By adding another pipe,
you told the regex engine that it should match [,] or nothing or
[^,].  The empty strings you saw were the nothings being matched.  I
can only assume you changed it to || out of some mistaken belief that
the pipe character in a regex is an or statement (like the ||
operator).  While it does operator in a similar fashion, it is not an
or operator.

        if ($in_string) {
                if ($token eq q//) {
                        $in_string = 0;
                        push @rec, ;
                        next;
                }
        } elsif ($token eq q/,/) {
                push @rec, ;
                next;
        } elsif ($token eq q//) {
                $in_string = 1;
                next;
        }
        $rec[-1] .= $token;

 Is this a commonly used method where you push empty values into an array (if
 $token is a , or ) and append stuff to the last array element (which is an
 empty string)?

It is commonly used by me, I don't know about others. The string at
the end of the record array is not necessarily empty.  In the case of
'foo,bar', the tokens are ('', foo, ,, bar, '').  This means
$rec[-1] starts empty (my @rec = ();), then foo is concatenated
onto it, then ,, then bar, finally it sees the second '' token
and pushes a new empty string onto the array.  It looks like there is
a bug though.  It should only push a new string onto the array when it
sees a comma.


-- 
Chas. Owens
wonkden.net
The most important skill a programmer can have is the ability to read.

--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

RE: regular expression help

2009-06-17 Thread Ajay Kumar

Hi Irfan
This code solve your problem

my $p=\ProductName\ = \8:EXFO RTU System 1.2.42\;
my ($val)=$p=~ m/\d+.\d+.(\d+)\/;
my $inval=$val+1;
$p=~s/$val/$inval/;
print===$p\n;
thanks
Ajay


-Original Message-
From: Irfan Sayed [mailto:irfan_sayed2...@yahoo.com]
Sent: Wednesday, June 17, 2009 4:10 PM
To: beginners@perl.org
Subject: regular expression help

Hi All,

need help on regular expression.
i have string like this

 ProductName = 8:EXFO RTU System 1.2.42

now i want regular expression in such a way that it will change the line to :

 ProductName = 8:EXFO RTU System 1.2.43

i tried in the following way.

$_ =~ s/:(.*)\s(.*)\s(.*)\s\d*.\d*.\d*/:(.*)\s(.*)\s(.*)\s$chver)/;
where $chver is variable which contains value as 1.2.43

it is not giving result as exprected.

plz advice

regards
irf




--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: regular expression help

2009-06-17 Thread Irfan Sayed

it is not just 1.2.43 it may be anything
it may be like 2.3.56 or 2.0.12 and so on...


plz advice





From: Ajay Kumar ajay.kum...@synopsys.com
To: Irfan Sayed irfan_sayed2...@yahoo.com
Cc: beginners@perl.org beginners@perl.org
Sent: Wednesday, June 17, 2009 5:01:41 PM
Subject: RE: regular expression help

Hi Irfan
This code solve your problem

my $p=\ProductName\ = \8:EXFO RTU System 1.2.42\;
my ($val)=$p=~ m/\d+.\d+.(\d+)\/;
my $inval=$val+1;
$p=~s/$val/$inval/;
print===$p\n;
thanks
Ajay


-Original Message-
From: Irfan Sayed [mailto:irfan_sayed2...@yahoo.com]
Sent: Wednesday, June 17, 2009 4:10 PM
To: beginners@perl.org
Subject: regular expression help

Hi All,

need help on regular expression.
i have string like this

ProductName = 8:EXFO RTU System 1.2.42

now i want regular expression in such a way that it will change the line to :

ProductName = 8:EXFO RTU System 1.2.43

i tried in the following way.

    $_ =~ s/:(.*)\s(.*)\s(.*)\s\d*.\d*.\d*/:(.*)\s(.*)\s(.*)\s$chver)/;
where $chver is variable which contains value as 1.2.43

it is not giving result as exprected.

plz advice

regards
irf




--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: regular expression help

2009-06-17 Thread John W. Krahn


Irfan Sayed wrote:

Hi All,


Hello,


need help on regular expression.
i have string like this

 ProductName = 8:EXFO RTU System 1.2.42
 
now i want regular expression in such a way that it will change the line to :


 ProductName = 8:EXFO RTU System 1.2.43


$ perl -le'
$_ = q[ProductName = 8:EXFO RTU System 1.2.42];
print;
s/(\d+)(\D*)$/ $1 + 1 . $2 /e;
print;
'
ProductName = 8:EXFO RTU System 1.2.42
ProductName = 8:EXFO RTU System 1.2.43



John
--
Those people who think they know everything are a great
annoyance to those of us who do.-- Isaac Asimov

--
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/

Re: Regular Expression Help

2008-04-16 Thread Rob Dixon

Ley, Chung wrote:
 Hi,
 
  
 
 I have a program that will take in a string that will resolve to a path
 where the output is going to store.
 
  
 
 The path can includes variables in this format %VariableName%.
 The acceptable variableNames that the program will support are fixed to
 a list such as Person, Class, Dept.  This list may grow in the
 future; but for now it is fixed.
 
  
 
 So, some of the acceptable strings are:
 
 C:\Windows\%Person%\%Class%
 
 C:\MyHomeDir\%Dept%\%Person%
 
 C:\MyHomeDir\%Dept%_%Person%
 
 and etc...
 
  
 
 I like to develop a validate method to return false when the string in
 between the % pair don't belong to the acceptable list, and was trying
 to do that with an expression.
 
 - Look for the string that is in between the % by the closest pair.
 
 - The % pair must start at the odd occurrence and finish on the
 next even occurrence...  Don't know how to describe this exactly, but
 I don't want it to pack this up (InBetween) from this
 (C:\MyHomeDir\%Dept%InBetween%Person) even though it is in between a %
 pair.
 
 - Then compare the string within the % pair to see if it belongs to
 the acceptable list or not.
 
  
 
 Is using a regular expression the right approach here?  or I should just
 iterate thru?

If the names are environment variable names, which is implied by your
syntax, then you can substitute the values of those variables as shown
in the program below.

HTH,

Rob


use strict;
use warnings;

my $str = '%windir%\system32';

$str =~ s/%(.*?)%/$ENV{uc $1}/g;

print $str;

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/

Re: Regular Expression Help

2008-04-15 Thread Jialin Li

my $input =q(C:\Windows\%Person%\%Class%);

my @vars = $input =~ /%([^%]+)%/g;

local $, = $/;
print @vars;




On Tue, Apr 15, 2008 at 10:20 PM, Ley, Chung [EMAIL PROTECTED] wrote:

 Hi,



 I have a program that will take in a string that will resolve to a path
 where the output is going to store.



 The path can includes variables in this format %VariableName%.
 The acceptable variableNames that the program will support are fixed to
 a list such as Person, Class, Dept.  This list may grow in the
 future; but for now it is fixed.



 So, some of the acceptable strings are:

 C:\Windows\%Person%\%Class%

 C:\MyHomeDir\%Dept%\%Person%

 C:\MyHomeDir\%Dept%_%Person%

 and etc...



 I like to develop a validate method to return false when the string in
 between the % pair don't belong to the acceptable list, and was trying
 to do that with an expression.

 - Look for the string that is in between the % by the closest pair.

 - The % pair must start at the odd occurrence and finish on the
 next even occurrence...  Don't know how to describe this exactly, but
 I don't want it to pack this up (InBetween) from this
 (C:\MyHomeDir\%Dept%InBetween%Person) even though it is in between a %
 pair.

 - Then compare the string within the % pair to see if it belongs to
 the acceptable list or not.



 Is using a regular expression the right approach here?  or I should just
 iterate thru?



 Thanks...



 --Chung Ley

RE: Regular Expression Help

2008-04-15 Thread Ley, Chung

Thank you!

 

This looks much cleaner.

 



From: Jialin Li [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, April 15, 2008 8:56 PM
To: Ley, Chung
Cc: beginners@perl.org
Subject: Re: Regular Expression Help

 


my $input =q(C:\Windows\%Person%\%Class%);

my @vars = $input =~ /%([^%]+)%/g;

local $, = $/;
print @vars;





On Tue, Apr 15, 2008 at 10:20 PM, Ley, Chung [EMAIL PROTECTED] wrote:

Hi,



I have a program that will take in a string that will resolve to a path
where the output is going to store.



The path can includes variables in this format %VariableName%.
The acceptable variableNames that the program will support are fixed to
a list such as Person, Class, Dept.  This list may grow in the
future; but for now it is fixed.



So, some of the acceptable strings are:

C:\Windows\%Person%\%Class%

C:\MyHomeDir\%Dept%\%Person%

C:\MyHomeDir\%Dept%_%Person%

and etc...



I like to develop a validate method to return false when the string in
between the % pair don't belong to the acceptable list, and was trying
to do that with an expression.

- Look for the string that is in between the % by the closest pair.

- The % pair must start at the odd occurrence and finish on the
next even occurrence...  Don't know how to describe this exactly, but
I don't want it to pack this up (InBetween) from this
(C:\MyHomeDir\%Dept%InBetween%Person) even though it is in between a %
pair.

- Then compare the string within the % pair to see if it belongs to
the acceptable list or not.



Is using a regular expression the right approach here?  or I should just
iterate thru?



Thanks...



--Chung Ley

Re: Regular Expression Help

2008-04-15 Thread John W. Krahn


Ley, Chung wrote:

Hi,


Hello,


I have a program that will take in a string that will resolve to a path
where the output is going to store.

The path can includes variables in this format %VariableName%.
The acceptable variableNames that the program will support are fixed to
a list such as Person, Class, Dept.  This list may grow in the
future; but for now it is fixed.


If you know what the variable names will be in advance then you could 
use a regular expression like /%(?:Person|Class|Dept)%/.



John
--
Perl isn't a toolbox, but a small machine shop where you
can special-order certain sorts of tools at low cost and
in short order.-- Larry Wall

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/

Re: Regular Expression Help

2006-11-01 Thread John W. Krahn

Ashish Srivastava wrote:
 I have a string which have multiple placeholder, for example: 
 
 $str = 'Hello [[Name]], Your login id is  [[Login]] !!!;
 
 I want to replace all placeholder with some identifier. In above example: 
 identifier are Name and Login.
 The regular expression for this requirment is pretty staightward:
 
 $str =~ s/\[\[([\d\w\-]+)\]\]/$1/g;
 
 Now the problem is there could be  Nested Placeholders. For example:
 
 $str = 'Error is: id]]-[[message'
 
 so in above example if idenfiers are defined as:
 
 id = 1001
 Message = Crash
 1001-Crash = System overloaded
 
 so output of my required regular expression for subsitution should be:
 
 Error is: System overloaded

$ perl -le'
my %t = (
id   = 1001,
message  = Crash,
1001-Crash = System overloaded,
);

my $str = q/Error is: id]]-[[message/;

print $str;

1 while $str =~ s/\[\[([\w-]+)]]/$t{$1}/g;

print $str;
'
Error is: id]]-[[message
Error is: System overloaded




John
-- 
Perl isn't a toolbox, but a small machine shop where you can special-order
certain sorts of tools at low cost and in short order.   -- Larry Wall

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: Regular Expression Help

2006-11-01 Thread Anshul Saxena


Ashish ,
you might have the [ in a recurring sequence using [*

On 11/1/06, John W. Krahn [EMAIL PROTECTED] wrote:


Ashish Srivastava wrote:
 I have a string which have multiple placeholder, for example:

 $str = 'Hello [[Name]], Your login id is  [[Login]] !!!;

 I want to replace all placeholder with some identifier. In above
example: identifier are Name and Login.
 The regular expression for this requirment is pretty staightward:

 $str =~ s/\[\[([\d\w\-]+)\]\]/$1/g;

 Now the problem is there could be  Nested Placeholders. For example:

 $str = 'Error is: id]]-[[message'

 so in above example if idenfiers are defined as:

 id = 1001
 Message = Crash
 1001-Crash = System overloaded

 so output of my required regular expression for subsitution should be:

 Error is: System overloaded

$ perl -le'
my %t = (
id   = 1001,
message  = Crash,
1001-Crash = System overloaded,
);

my $str = q/Error is: id]]-[[message/;

print $str;

1 while $str =~ s/\[\[([\w-]+)]]/$t{$1}/g;

print $str;
'
Error is: id]]-[[message
Error is: System overloaded




John
--
Perl isn't a toolbox, but a small machine shop where you can special-order
certain sorts of tools at low cost and in short order.   -- Larry Wall

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: Regular Expression Help

2006-11-01 Thread Ashish Srivastava

Thank You John and Anshul for response. It was great help.

Regards, 
Ashish Srivastava 

 



- Original Message 
From: Anshul Saxena [EMAIL PROTECTED]
To: John W. Krahn [EMAIL PROTECTED]
Cc: Perl Beginners beginners@perl.org
Sent: Wednesday, 1 November, 2006 11:01:38 PM
Subject: Re: Regular Expression Help


Ashish ,
you might have the [ in a recurring sequence using [*

On 11/1/06, John W. Krahn [EMAIL PROTECTED] wrote:

 Ashish Srivastava wrote:
  I have a string which have multiple placeholder, for example:
 
  $str = 'Hello [[Name]], Your login id is  [[Login]] !!!;
 
  I want to replace all placeholder with some identifier. In above
 example: identifier are Name and Login.
  The regular expression for this requirment is pretty staightward:
 
  $str =~ s/\[\[([\d\w\-]+)\]\]/$1/g;
 
  Now the problem is there could be  Nested Placeholders. For example:
 
  $str = 'Error is: id]]-[[message'
 
  so in above example if idenfiers are defined as:
 
  id = 1001
  Message = Crash
  1001-Crash = System overloaded
 
  so output of my required regular expression for subsitution should be:
 
  Error is: System overloaded

 $ perl -le'
 my %t = (
 id   = 1001,
 message  = Crash,
 1001-Crash = System overloaded,
 );

 my $str = q/Error is: id]]-[[message/;

 print $str;

 1 while $str =~ s/\[\[([\w-]+)]]/$t{$1}/g;

 print $str;
 '
 Error is: id]]-[[message
 Error is: System overloaded




 John
 --
 Perl isn't a toolbox, but a small machine shop where you can special-order
 certain sorts of tools at low cost and in short order.   -- Larry Wall

 --
 To unsubscribe, e-mail: [EMAIL PROTECTED]
 For additional commands, e-mail: [EMAIL PROTECTED]
 http://learn.perl.org/ http://learn.perl.org/first-response






__
Yahoo! India Answers: Share what you know. Learn something new
http://in.answers.yahoo.com/

Re: regular expression help

2006-07-24 Thread Rob Dixon


Jonathan Weber wrote:

 Hi. I have some HTML files with lines like the following:

 a name=w12234 /a h2A Title/h2

 I'm using a regular expression to find these and capture the name
 attribute (w12234 in the example) and the contents of the h2 tag (A
 Title).

 $_ =~ /a name=(w\d+)\s*\/a\s*h2(+)\/h2/

 That's my regex, except I'm having trouble with the _ part. No
 matter what I seem to try, it won't match incidences where there's a
 newline somewhere in the string. I tried all manner of things,
 including [.\n], which if I understand correctly should match
 *everything*.

 I'm doing this on Windows; does the carriage return/line feed business
 have anything to do with this?

Hi Jonathan.

Some points:

- The character wildcard '.' is just a dot within a character class, so [.\n]
will match only a dot or a newline

- The /s modifier will force '.' to match absolutely anything, including a
newline. So you could write:

  $_ =~ /a name=(w\d+)\s*\/a\s*h2(.+)\/h2/s;

but that isn't what you want as /.+/ will eat up all of the rest of the string
until the last /h2 it finds. You could get away with /.+?/ but nicer is
/[^]+/ which will match any number of any character except for an open angle
bracket

- If you're matching against $_ then you can omit it altogether:

  /a name=(w\d+)\s*\/a\s*h2(.+)\/h2/;

does the same thing

- Enclosing a regex in slashes allows you to omit an implied m// operator, which
you have (i.e. /regex/ is the same as m/regex/). Putting the m back lets you use
whatever delimiters you want, so you don't have to escape the contained slashes
and can make it more readable:

  m#a name=(w\d+)\s*/a\s*h2([^]+)/h2#;

- Regexes aren't the best way of parsing HTML, unless the document is very
simple and predictable. Take a look at somthing like HTML::TreeBuilder if you're
doing this a lot on varying or non-trivial documents.

- This program does what you want:

  use strict;
  use warnings;

  my $string = HTML;
  a name=w12234 /a h2A
  Title/h2
  HTML

  $string =~ m#a name=(w\d+)\s*/a\s*h2([^]+)/h2#;

  print $1, \n;
  print $2, \n;

OUTPUT

  w12234
  A
  Title


I hope this helps.

Rob

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2006-07-24 Thread Dr.Ruud

Jonathan Weber schreef:

 a name=w12234 /a h2A Title/h2

 I'm using a regular expression to find these and capture the name
 attribute (w12234 in the example) and the contents of the h2 tag (A
 Title).

 $_ =~ /a name=(w\d+)\s*\/a\s*h2(+)\/h2/

 That's my regex, except I'm having trouble with the _ part. No
 matter what I seem to try, it won't match incidences where there's a
 newline somewhere in the string. I tried all manner of things,
 including [.\n], which if I understand correctly should match
 *everything*.

Not [.\n] (because that contains a literal dot), but (?:.|\n).
Or do a s/\n/ /g first.

But you don't need all that, see `perldoc perlre` about the s-modifier.

And much better: use a proper HTML parser, see CPAN.

-- 
Affijn, Ruud

Gewoon is een tijger.



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2006-07-24 Thread Jonathan Weber


On 24 Jul 2006, at 5:48 PM, Rob Dixon wrote:

- The character wildcard '.' is just a dot within a character  
class, so [.\n]

will match only a dot or a newline


Ah, I hadn't realized that characters in [ ] are literals. That  
clears up a lot of the problem.


- Regexes aren't the best way of parsing HTML, unless the document  
is very
simple and predictable. Take a look at somthing like  
HTML::TreeBuilder if you're

doing this a lot on varying or non-trivial documents.


Yeah, I realize that it's not the best way to go about it, but I had  
many documents that were all the same, plus I figured this was a good  
excuse to learn regexes.


Thanks for the help!

Jonathan

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: Regular expression help

2005-04-25 Thread Jay Savage

On 4/25/05, John W. Krahn [EMAIL PROTECTED] wrote:
 Owen wrote:
  I found a message from Randal Schwartz, Message-ID:
  [EMAIL PROTECTED]#1/1
  which gave a regular expression for a valid Unix name,
 
/^(?=.*?\D)[a-z\d]+$/
 
  That works but why does it work?
 
  /
^   # Start of a string
 (?=# 0 or 1 instance of
.*? # anything but a newline
\D  # Non digit
 )  #
 
[a-z\d]+# All match a-z and any digit at
$   # End of a string
  /
 
 
  I tried breaking it down like above but it still doesn't say Must not be
  all numbers and letters must be all lowercase
 
  Any help in turning that re into plain words would be appreciated
 
 (?=.*?\D) is a zero-width assertion so it doesn't affect the match which is
 /^[a-z\d]+$/ -- match a string whose only characters are a-z0-9.  When you add
 the zero-width assertion it ensures that there is at least one \D character in
 the match.
 
 John

I think this may be part of the confusion, too:

 (?=# 0 or 1 instance of
.*? # anything but a newline

/.*?/ is not that same as /.?/.  /.?/ matches zero or 1 of any
character but the newline.  /.*?/, however, matches zero or more, just
like /.*/.  It's non-greedy, though, so it will stop matching at the
first occurance of \D instead of the last.  In this case it boils down
roughly to /(?=[^\D]{0,}\D/.

HTH,

--jay

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

RE: Regular expression help

2005-04-24 Thread Charles K. Clarkson

Owen mailto:[EMAIL PROTECTED] wrote:

: I found a message from Randal Schwartz, Message-ID:
: [EMAIL PROTECTED]#1/1
: which gave a regular expression for a valid Unix name,
:
:   /^(?=.*?\D)[a-z\d]+$/
:
: That works but why does it work?
:
: /
:   ^   # Start of a string
:(?=# 0 or 1 instance of

 (?=# Zero-width positive look ahead assertion.


:   .*? # anything but a newline
:)  #
:
:   [a-z\d]+# All match a-z and any digit at

[a-z\d]+# Match a-z (lowercase only) and any digit


:   $   # End of a string
: /
:
: I tried breaking it down like above but it still doesn't say
: Must not be all numbers and letters must be all lowercase
:
: Any help in turning that re into plain words would be
: appreciated

AFAIK, a (?= ... ) construct affects the contents of $, $`,
and $'. Can you show any code immediately following this regular
expression and the whole line with it?


HTH,

Charles K. Clarkson
-- 
Mobile Homes Specialist
254 968-8328


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: Regular expression help

2005-04-24 Thread John W. Krahn

Owen wrote:
I found a message from Randal Schwartz, Message-ID:
[EMAIL PROTECTED]#1/1
which gave a regular expression for a valid Unix name, 

/^(?=.*?\D)[a-z\d]+$/
That works but why does it work?
/
	^		# Start of a string 
	 (?=		# 0 or 1 instance of 
	.*?		# anything but a newline
	 	\D	# Non digit
	 )		#
	
	[a-z\d]+	# All match a-z and any digit at
		$	# End of a string
/

I tried breaking it down like above but it still doesn't say Must not be
all numbers and letters must be all lowercase
Any help in turning that re into plain words would be appreciated
(?=.*?\D) is a zero-width assertion so it doesn't affect the match which is
/^[a-z\d]+$/ -- match a string whose only characters are a-z0-9.  When you add
the zero-width assertion it ensures that there is at least one \D character in
the match.
John
--
use Perl;
program
fulfillment
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2005-03-11 Thread Ing. Branislav Gerzo

Stone [S], on Thursday, March 10, 2005 at 16:46 (-0800) typed:

S That won't work either.  When you say ([1-9]\d?) you're telling it
S (If there is a match) capture the stuff in parentheses and store it.
S  When you say \1 you're telling the script you know that first
S batch of stuff you captured?  Well I want that here.  But it's not
S the pattern, it's the literal substring of the match.

When I read your first line of reply, I realise, that you have right
again. Tomorrow was hard day for me, and also after some glassess of
wine I simply forget to some small details. And also I have to thank
you for nice and long explanation :) Ok, so I'm sorry, if I
confused someone, here is my explanation:

I don't like simple regular expressions, as was posted in first
message:

$field =~ /^[1-9]\d?\.[1-9]\d?\.[1-9]\d?$/;

that's too clear. I wanted change it (short), so I post 2 bad
examples. Clear shorting is:

$field =~ /^([1-9]\d?\.){2}[1-9]\d?$/;

but that's too much simple too :)

So let's continue how we can represent those rulez, this is a bit
tricky:

$field =~ /^([1-9]\d(?:\.[1-9]\d){2})$/;

and this is not shorter, but this way works too :):

$field =~ /^(?:(\d)(??{ $1  0  $1 = 9 ?  : (?!) })\d\.){2}
   (\d)(??{ $2  0  $2 = 9 ?  : (?!) })\d/x;

Ok, that's enough, now we can benchmark all those:

use strict;
use warnings;
use Benchmark 'cmpthese';

my $field = '19.12.12';

cmpthese -5, {
simple = sub {
if ( $field =~ /^[1-9]\d?\.[1-9]\d?\.[1-9]\d?$/ ) {}
},
short = sub {
if ( $field =~ /^([1-9]\d\.){2}[1-9]\d$/ ) {}
},
shortwoc = sub {
if ( $field =~ /^(?:[1-9]\d\.){2}[1-9]\d$/ ) {}
},
tricky = sub {
if ( $field =~ /^([1-9]\d(?:\.[1-9]\d){2})$/ ) {}
},
trickywoc = sub {
if ( $field =~ /^(?:[1-9]\d(?:\.[1-9]\d){2})$/ ) {}
},
long = sub {
if  ( $field =~ /^(?:(\d)(??{ $1  0  $1 = 9 ?  : (?!) 
})\d\.){2}
 (\d)(??{ $2  0  $2 = 9 ?  : (?!) 
})\d/x ) {}
},
using_var = sub {
my $reg = '[1-9]\d';
if ( $field =~ /^(?:$reg\.){2}$reg$/ ) {}
}
};

__END__

   Ratelong using_var   short  tricky  simple shortwoc trickywoc
long30584/s  --  -95%-95%-95%-98% -98%  -98%
using_var  582064/s   1803%---11%-14%-56% -60%  -62%
short  650382/s   2027%   12%  -- -3%-51% -56%  -58%
tricky 673134/s   2101%   16%  3%  ---49% -54%  -57%
simple1328337/s   4243%  128%104% 97%  -- -10%  -14%
shortwoc  1470620/s   4708%  153%126%118% 11%   --   -5%
trickywoc 1550263/s   4969%  166%138%130% 17%   5%--

second benchmark:
my $field = '19.12.02';

   Ratelong using_var shortwoc   short  simple  tricky trickywoc
long28809/s  --  -95% -98%-98%-98%-98%  -98%
using_var  606756/s   2006%-- -63%-63%-64%-64%  -66%
shortwoc  1627591/s   5550%  168%   -- -1% -3% -5%   -9%
short 1651154/s   5631%  172%   1%  -- -2% -3%   -8%
simple1681601/s   5737%  177%   3%  2%  -- -2%   -6%
tricky1707274/s   5826%  181%   5%  3%  2%  --   -5%
trickywoc 1790738/s   6116%  195%  10%  8%  6%  5%--

so conlusion is: I definitely would pick up
tricky WithOut Capturing: $field =~ /^(?:[1-9]\d(?:\.[1-9]\d){2})$/

I hope I don't have any bugs here :)

Have a nice day.

-- 

 ...m8s, cu l8r, Brano.

[Real knowledge is to know the extent of one's ignorance. - Confucius]



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2005-03-11 Thread Stone

 I hope I don't have any bugs here :)

Just one.  :)

Your expressions all say \d instead of \d? for the second digit in
each set, while the simple one correctly has \d?.   So your
expressions had an unfair advantage and as a result finish faster.

Add \d? to your expressions, and you should find that simple,
besides probably being the easiest to read, is also the most efficient
of the expressions listed.  (Note that I removed the obviously
inefficient ones.)

use strict;
use warnings;
use Benchmark 'cmpthese';

my $field = '19.12.12';

cmpthese -5, {
   simple = sub {
   if ( $field =~ /^[1-9]\d?\.[1-9]\d?\.[1-9]\d?$/ ) {}
   },
   short = sub {
   if ( $field =~ /^([1-9]\d?\.){2}[1-9]\d?$/ ) {}
   },
   shortwoc = sub {
   if ( $field =~ /^(?:[1-9]\d?\.){2}[1-9]\d?$/ ) {}
   },
   trickywoc = sub {
   if ( $field =~ /^(?:[1-9]\d?(?:\.[1-9]\d?){2})$/ ) {}
   },
};

__END__   Rate short  shortwoc trickywocsimple
short  677138/s--  -23%  -25%  -40%
shortwoc   880536/s   30%--   -2%  -22%
trickywoc  901095/s   33%2%--  -20%
simple1127127/s   66%   28%   25%--

second benchmark:
my $field = '19.12.02';

Rate short  shortwoc trickywocsimple
short  927396/s--  -10%  -12%  -29%
shortwoc  1027728/s   11%--   -3%  -22%
trickywoc 1055892/s   14%3%--  -19%
simple1310837/s   41%   28%   24%--

 Have a nice day.

You too.  I learned a lot from this, so thanks.

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2005-03-11 Thread Ing. Branislav Gerzo

Stone [S], on Friday, March 11, 2005 at 01:52 (-0800) made these
points:

S Just one.  :)
S Your expressions all say \d instead of \d? for the second digit in
S each set, while the simple one correctly has \d?.   So your
S expressions had an unfair advantage and as a result finish faster.

right, forget about 1.1.1 rule :)

S Add \d? to your expressions, and you should find that simple,
S besides probably being the easiest to read, is also the most efficient
S of the expressions listed.  (Note that I removed the obviously
S inefficient ones.)

yes, and it is logical at end. The regexp using parentheses is always
slower, than longer, which doesn't. It have to remember values
inside them.

thanks to you, I discover some new things too :)

-- 

 ...m8s, cu l8r, Brano.

['It takes a giant to fight a giant' - H. Prym]



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

RE: regular expression help

2005-03-10 Thread Wagner, David --- Senior Programmer Analyst --- WGO

CM Analyst wrote:
 I am quite new to programming and Perl, so please bear
 with me.
 
 I have a requirement to check the format of a field
 before a record can be saved. The format of the field
 needs to be double digit value separated by a .
 (period) like 00.00.00. This I managed to do using
 the following:
 
 use strict;
 use warnings;
 
 my $field;
 
 print This program will verify field value.\n;
 
 print Enter field value:  ;
 
 $field = STDIN;
 
 chomp ($field);
 
 if ( $field =~ /^[0-9]{2}\.[0-9]{2}\.[0-9]{2}$/ )
Simplest is change the {2} to {1,2} for all your entries. Now you mush 
have from 1 to 2 digits.
Wags ;)
 
 {
 print Value is OK;
 }
 else
 {
 print Value is Wrong. Please enter a double digit
 value like 00.00.00;
 }
 ###
 
 However, the requirement has changed in that we while
 we want to allow the 00.00.00 format we do not a 0
 (zero) to be a leading value.
 
 How can this be accomplished?
 
 Please advise.
 
 Regards.
 
 Amad
 
 
 
 __
 Do you Yahoo!?
 Yahoo! Small Business - Try our new resources site!
 http://smallbusiness.yahoo.com/resources/



***
This message contains information that is confidential
and proprietary to FedEx Freight or its affiliates.
It is intended only for the recipient named and for
the express purpose(s) described therein.
Any other use is prohibited.
***


--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2005-03-10 Thread Peter Rabbitson

 Simplest is change the {2} to {1,2} for all your entries. Now you 
 mush have from 1 to 2 digits.
 Wags ;)


I think what he really wants is to throw a fit when there is a leading zero 
for which your solution won't cut it. Here is how I see it:

$field =~ /^[1-9]\d?\.[1-9]\d?\.[1-9]\d?$/



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

RE: regular expression help

2005-03-10 Thread Wagner, David --- Senior Programmer Analyst --- WGO

Peter Rabbitson wrote:
 Simplest is change the {2} to {1,2} for all your entries. Now you
 mush have from 1 to 2 digits. Wags ;)
 
 
 I think what he really wants is to throw a fit when there is a
 leading zero for which your solution won't cut it. Here is how I see
 it: 
 
 $field =~ /^[1-9]\d?\.[1-9]\d?\.[1-9]\d?$/

Correct. I missed read the msg.
Wags ;)


***
This message contains information that is confidential
and proprietary to FedEx Freight or its affiliates.
It is intended only for the recipient named and for
the express purpose(s) described therein.
Any other use is prohibited.
***


--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2005-03-10 Thread Ing. Branislav Gerzo

Peter Rabbitson [PR], on Thursday, March 10, 2005 at 14:00 (-0500)
wrote:

PR I think what he really wants is to throw a fit when there is a leading zero
PR for which your solution won't cut it. Here is how I see it:

PR $field =~ /^[1-9]\d?\.[1-9]\d?\.[1-9]\d?$/

yes, and for complexity:

$field =~ /^([1-9]\d?)\.{2}\1$/;

untested :)

-- 

 ...m8s, cu l8r, Brano.

[And don't call me `Chum'. -- Richard Sloat]



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2005-03-10 Thread Stone

 yes, and for complexity:
 
 $field =~ /^([1-9]\d?)\.{2}\1$/;

I know you said that's untested, but I don't think it's correct.

You're saying:
1.  ^ - Start
2.  ([1-9]\d?) -Any character 1-9 followed by zero or one digit characters.
3.  \.{2} - Two periods.
4.  \1 - The same sequence of characters as in #2.  (\1 references the
actual parenthetical match, not the expression.)
5.  $ - The end.

So your expression would match things like:
1..1
11..11

-- 
http://xstonedogx.heroesmarket.net

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2005-03-10 Thread Ing. Branislav Gerzo

Stone [S], on Thursday, March 10, 2005 at 12:51 (-0800) wrote the
following:

 yes, and for complexity:
 
 $field =~ /^([1-9]\d?)\.{2}\1$/;

S I know you said that's untested, but I don't think it's correct.

yes, I'm sorry for that, should be this correct:

$field =~ /^(?:([1-9]\d?)\.){2}\1$/;

untested too :)


-- 

 ...m8s, cu l8r, Brano.

[Virus related taglines.]



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: regular expression help

2005-03-10 Thread Stone

 $field =~ /^(?:([1-9]\d?)\.){2}\1$/;

That won't work either.  When you say ([1-9]\d?) you're telling it
(If there is a match) capture the stuff in parentheses and store it.
 When you say \1 you're telling the script you know that first
batch of stuff you captured?  Well I want that here.  But it's not
the pattern, it's the literal substring of the match.

Let's simplify it a little bit and consider this example:

10.15 =~ /^([1-9]\d?)\.\1$/;

In this case it would not match.  It works like this:

1.  [1-9]\d? matches 10.
2.  () stores that value 10 as \1 (or actually as $1, but within the
pattern you dereference it with \1).  Note that it doesn't store the
pattern [1-9]\d?, but the actual substring 10.  So at this point,
the \1 becomes 10 and looks like this:

/^[1-9]\d?\.10$/

Obviously 15 !~ 10, so there is no match.

You would use that if you wanted to find things like 1.1 or 10.10.

With your expression:
10.15.20 =~ /^(?:([1-9]\d?)\.){2}\1$/;

1.  [1-9]\d? - Matches 10.
2.  () - Stores 10 as $1.  
3.  Pattern now looks like /^(?:([1-9]\d?)\.){2}10$/
4.  {2} - Requires a second match of (?:([1-9]\d?)\.).
5.  [1-9]\d? - Matches 15.  
6.  () - Stores 15 as $1 (overwriting the previous value).  (I may
be mistaken on what exactly happens here, but this is effectively the
case.  If so, someone more knowledgable will hopefully correct me.)
7.  Pattern now looks like /^(?:([1-9]\d?)\.){2}15$/
8.  20 !~ 15, so pattern does not match.

So your expression will match 10.10.10 or even 10.15.15, but it won't
match 10.15.20.

HTH

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: Regular expression help

2004-10-22 Thread Bee

snip
 __DATA__
 M:356 358 386 R:#132 W1:319 NRT:32 R:#132


snip

 but I would really like it to capture the first part of the element so
that the result would be;

 M:356 358 386 R:#132 W1:319 NRT:32 R:#132
 $y[0]
 $y[1] M 356 358 386
 $y[2] R #132
 $y[3] W1 319
 $y[4] NRT 32
 $y[5] R #132


I'd do a little add on for the string, so it much easier for split.

# usuw;
my $line = M:356 358 386 R:#132 W1:319 NRT:32 R:#132;
print $line . \n;
$line =~ s/(\s)(\w{1,}:)/\x00$2/g; # So I add extra delimiters here.
print $line . \n;

my @d = split /\x00/, $line;
print $_  for @d;

But only if you sure that char is not exist with the data string,
you can use. Otherwise, use other approach, or other delimiter.


 Also, why is $y[0] undefined or null, scalar (@y) is 6.

I don't know how to tell, but I think you want \s , but not \b.

HTH,
Bee



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

Re: Regular expression help

2004-10-22 Thread Gunnar Hjalmarsson

Bee wrote:
I'd do a little add on for the string, so it much easier for split.
# usuw;
my $line = M:356 358 386 R:#132 W1:319 NRT:32 R:#132;
print $line . \n;
$line =~ s/(\s)(\w{1,}:)/\x00$2/g; # So I add extra delimiters here.
print $line . \n;
my @d = split /\x00/, $line;
print $_  for @d;
But only if you sure that char is not exist with the data string,
you can use. Otherwise, use other approach, or other delimiter.
Other approaches:
my @y;
push @y, $1 $2 while $line =~ /(\w+):([^A-Z]+)(?: |$)/g;
or
my @y = map { tr/:/ /; $_ } $line =~ /(\w+:[^A-Z]+)(?: |$)/g;
Also, why is $y[0] undefined or null, scalar (@y) is 6.
That's explained in perldoc -f split.
--
Gunnar Hjalmarsson
Email: http://www.gunnar.cc/cgi-bin/contact.pl
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/ http://learn.perl.org/first-response

RE: Regular expression help

2003-08-19 Thread EUROSPACE SZARINDAR


Hi Rob, 


You are totally right your exemple (' data11 '' data12' 'data13') should
raise an error and this is normal. I will nethertheless look carefully for
the Text::CSV module.

 Thanks 

Michel




-Message d'origine-
De: Rob Anderson [mailto:[EMAIL PROTECTED]
Date: lundi 18 août 2003 17:34
À: [EMAIL PROTECTED]
Objet: Re: Regular expression help



Eurospace Szarindar [EMAIL PROTECTED] wrote in
message news:[EMAIL PROTECTED]

Thanks you, it works fine. Could you explain me why have you added the \1 ?

Hi,

A quick breakdown...


/(['])(((''|)|[^'])*?)\1(\s|$)/g
 ^^
1  
  2
  ^^
  3


1) This is picking up either a single or double quote

2) This is scoping up any quotes pairs, and is being carefull not
   to match quotes on thier own.

3) In a regex \1, \2 etc.. will match the same sequence of characters
matched
   earlier in the same-numbered pair of parentheses, in this case, it means
   that were making sure the opening and closing quotes are the same by
mathching
   that same character as in part 1.

Explaining it like this has made me spot a problem. The regex you really
want is

m/(['])(((''|)|[^\1])*?)\1(\s|$)/g

This way our 'scoping up' part matches any paired quotes or anything other
than
our closing quote.


The following test data wouldn't have worked...

' data11 '' data12' 'data13'


I will have a look at Text::CSV


Please do, my mistake above shows you why you shouldn't reinvent this stuff
if you can
get away with it. :-)

Ta

Rob



Michel

-Message d'origine-
De: Rob Anderson [mailto:[EMAIL PROTECTED]






-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Regular expression help

2003-08-18 Thread Rob Anderson


Eurospace Szarindar [EMAIL PROTECTED] wrote in
message news:[EMAIL PROTECTED]
 Hi,

 I tried to write a script to extrat data from the given DATA but I can
find
 the right regular expression to do that.

Hi Szarindar,

Firstly why are you using this format and trying to parse it yourself? If
you've got control of your data, use something like Text::CSV. The guy who
maintains your code will thank you for it.

otherwise, this seems to work (you'll have to print with $2 though)...

/(['])(((''|)|[^'])*?)\1(\s|$)/g

HTH

Rob Anderson

 RULE: I need to catch everything between quotes (single or double) and if
 inside exists a repeated quote (single or double) it is not seen as end of
 the match.

 #!/usr/bin/perl -w

 use strict;

 my $begin = '[\,\']'; #
 my $end = '[\,\']'; #
 my $pattern = ${begin}(.*?)${end}

 while (DATA) {
 print line content $_ \n
 while ( /$pattern/g ){
 print line $. found : $1\n
 }

 }


 __DATA__
 ' data11 '' data12' 'data13'
  data21  data22 data23
  data31 '' data32  data34
 data41 data42 data43 __--data44
   '' '' 



 Result should be
 1:( data11 '' data12)
 2:(data13)
 3:( data21  data22)
 4:(data23)
 5:(data31 '' data32)
 6:()
 7:(data34)
 8:(data41)
 9:(data42)
 10:(data43)
 11:(__--data44)
 12:()
 13:('')
 14:()
 15:('')


 It could be resolved by replacing the \{2} or \'{2} by a [EMAIL PROTECTED] and
 alterwards relacing it back to the previous value but it is not very
smart.



 great thanks in advance

 Michel



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: Regular expression help

2003-08-18 Thread EUROSPACE SZARINDAR


Thanks you, it works fine. Could you explain me why have you added the \1 ?
I will have a look at Text::CSV

Michel

-Message d'origine-
De: Rob Anderson [mailto:[EMAIL PROTECTED]
Date: lundi 18 août 2003 16:39
À: [EMAIL PROTECTED]
Objet: Re: Regular expression help



Eurospace Szarindar [EMAIL PROTECTED] wrote in
message news:[EMAIL PROTECTED]
 Hi,

 I tried to write a script to extrat data from the given DATA but I can
find
 the right regular expression to do that.

Hi Szarindar,

Firstly why are you using this format and trying to parse it yourself? If
you've got control of your data, use something like Text::CSV. The guy who
maintains your code will thank you for it.

otherwise, this seems to work (you'll have to print with $2 though)...

/(['])(((''|)|[^'])*?)\1(\s|$)/g

HTH

Rob Anderson

 RULE: I need to catch everything between quotes (single or double) and if
 inside exists a repeated quote (single or double) it is not seen as end of
 the match.

 #!/usr/bin/perl -w

 use strict;

 my $begin = '[\,\']'; #
 my $end = '[\,\']'; #
 my $pattern = ${begin}(.*?)${end}

 while (DATA) {
 print line content $_ \n
 while ( /$pattern/g ){
 print line $. found : $1\n
 }

 }


 __DATA__
 ' data11 '' data12' 'data13'
  data21  data22 data23
  data31 '' data32  data34
 data41 data42 data43 __--data44
   '' '' 



 Result should be
 1:( data11 '' data12)
 2:(data13)
 3:( data21  data22)
 4:(data23)
 5:(data31 '' data32)
 6:()
 7:(data34)
 8:(data41)
 9:(data42)
 10:(data43)
 11:(__--data44)
 12:()
 13:('')
 14:()
 15:('')


 It could be resolved by replacing the \{2} or \'{2} by a [EMAIL PROTECTED] and
 alterwards relacing it back to the previous value but it is not very
smart.



 great thanks in advance

 Michel



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Regular expression help

2003-08-18 Thread Rob Anderson


Eurospace Szarindar [EMAIL PROTECTED] wrote in
message news:[EMAIL PROTECTED]

Thanks you, it works fine. Could you explain me why have you added the \1 ?

Hi,

A quick breakdown...


/(['])(((''|)|[^'])*?)\1(\s|$)/g
 ^^
1  
  2
  ^^
  3


1) This is picking up either a single or double quote

2) This is scoping up any quotes pairs, and is being carefull not
   to match quotes on thier own.

3) In a regex \1, \2 etc.. will match the same sequence of characters
matched
   earlier in the same-numbered pair of parentheses, in this case, it means
   that were making sure the opening and closing quotes are the same by
mathching
   that same character as in part 1.

Explaining it like this has made me spot a problem. The regex you really
want is

m/(['])(((''|)|[^\1])*?)\1(\s|$)/g

This way our 'scoping up' part matches any paired quotes or anything other
than
our closing quote.


The following test data wouldn't have worked...

' data11 '' data12' 'data13'


I will have a look at Text::CSV


Please do, my mistake above shows you why you shouldn't reinvent this stuff
if you can
get away with it. :-)

Ta

Rob



Michel

-Message d'origine-
De: Rob Anderson [mailto:[EMAIL PROTECTED]






-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: Regular Expression help

2002-09-24 Thread nkuipers


if($array[$x] =~ /\d{1,3}?/)
{
...do something
}

Unfortunately this if statement never appears to come true.

That's because you have two quantifiers specified, the {1,3} and the ?.

for (@array) { 
if ( m/^\d+$/ ) { #regex breaks on decimal numbers
do something... 
}
}


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: Regular Expression help

2002-09-24 Thread Mark Anderson

see my comments at bottom...

-Original Message-
From: Shaun Bramley [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, September 24, 2002 4:39 PM
To: [EMAIL PROTECTED]
Subject: Regular Expression help

Hi all.

I am hoping that someone can help me determine what is wronf with my regualr
expression.

background info: @array contains 'text', numbers, INT, or TINYINT.

I am trying to identify if the array element is a number.

What I have right now is:

if($array[$x] =~ /\d{1,3}?/)
{
.do something
}

Unfortunately this if statement never appears to come true.

thank you

Shaun

-Response Follows--

I'm not clear on what your problem is.  Could you give some examples of what
in the array you want to do something, and some examples that you don't?
Perhaps show us more of the code than just the regexp?

I quickly wrote up the following code (using your regexp):

push @array,'1';
push @array,'xyzzy';
push @array,'1.234';
push @array,0x1234;
push @array,3;

for ($x=-1;$x++$#array;) {
if ($array[$x] =~ /\d{1,3}?/) {
print Yes: 
} else {
print No : 
}
print $array[$x].\n;
}

which produces:
Yes: 1
No : xyzzy
Yes: 1.234
Yes: 0x1234
Yes: 3

which appears to me to be correct.

/\/\ark

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: regular expression help....

2002-09-12 Thread david


David --- Senior Programmer Analyst --- Wgo Wagner wrote:

 Your log shows a space between the time and hyphen and hyphen and
 Micro. You dont' have that in the regex and even more so, there is no
 hyphen before Adapter log.
 
 You might want:
 /^(\d\d-\d\d-\d{4})\s+(\d\d:\d\d:\d\d\.\d\d).+Adapter
 log\s+(opened|closed)/i
 
 Anchor with the ^ and you will have date in $1, time in $2 and
 opened or closed in $3.
 A shot.
 Wags ;)

you can avoid putting tons of \d in your reg by using:

my($d,$t) = (split(/\s+/))[0,1];
print $d,$t\n;

__END__

but a reg. expression should be much faster.

david

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: regular expression help....

2002-09-11 Thread Wagner, David --- Senior Programmer Analyst --- WGO


Your log shows a space between the time and hyphen and hyphen and
Micro. You dont' have that in the regex and even more so, there is no hyphen
before Adapter log.

You might want:
/^(\d\d-\d\d-\d{4})\s+(\d\d:\d\d:\d\d\.\d\d).+Adapter
log\s+(opened|closed)/i

Anchor with the ^ and you will have date in $1, time in $2 and
opened or closed in $3.
A shot.
Wags ;)

-Original Message-
From: Steve [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, September 11, 2002 08:12
To: [EMAIL PROTECTED]
Subject: regular expression help


I have the following line in a log file:

09-07-2002 11:39:25.95 - Microsoft Dial Up Adapter log opened.

The date and time will always change but the after that it's always 
consistent (well opened will sometimes be closed)  I need to get the entire 
date in a variable and the entire time in a variable.  I will compare the 
difference between open and close and place THAT into a cumulative variable.

Anyway this the rexexp I have to get the date and time but it's not
matching:

/(\d\d-\d\d-\d{4}) (\d\d:\d\d:\d\d\.\d\d)-(Adapter log)/

Now if I just do it like /(Adapter log)/ I get a match on Adapter log, 
which leads me to think that the problem is somewhere with my digit
matching.

Thanks



-

The three most dangerous things are a programmer with a soldering iron, a 
manager who codes, and a user who gets ideas.



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


**
This message contains information that is confidential
and proprietary to FedEx Freight or its affiliates.
It is intended only for the recipient named and for
the express purpose(s) described therein.
Any other use is prohibited.



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Regular Expression Help sought

2002-07-04 Thread Nigel Peck


s/((^|\n\n).+?)\n/$1 /g;

s/   substitute
( put this is $1
(^  the start of the file
|  or
\n\n)  two newlines
..+?  followed by 1 or more characters up until a
) stop putting it in $1
\n  newline
/ with
$1 /the text captured above followed by a space
g;keep doing it until you can't match anymore

 Sunish Kapoor [EMAIL PROTECTED] 07/04/02 04:27pm 
Dear Katy,

It works great after the changed the regular expression to as given by
u
below..!
Thanks a ton for the help.but may I request for a small explanation
of the
line
$file =~ s/((^|\n\n).+?)\n/$1 /g;

Sunish

Katy Brownfield wrote:

 Or, instead of changing the input file, change the regular expression
to
 also detect the beginning of the string.

 $file =~ s/((^|\n\n).+?)\n/$1 /g;

 Katy

 Timothy Johnson wrote:
 
 
  I think you'll need 2 blank lines.
 
  -Original Message-
  From: Sunish Kapoor
  To: Nigel Peck
  Cc: [EMAIL PROTECTED] 
  Sent: 7/3/02 6:51 PM
  Subject: Re: Regular Expression Help sought
 
  Dear Nigel,
 
  Thanks a ton for the script..It works fine though it skips the
first
  record
  even though I make
  the file begin with a blank line by simply hitting enter !
 
  Regards
 
  Sunish
  Nigel Peck wrote:
 
   My first attempt, which may be a bit simplified, would be to
  substitute
   any newline, which occured at the end of a line which was
preceeded by
  2
   consecutive newlines, with a space. This relies on the format of
the
   file being as shown for every company. It also may miss the
first
   company in the file if the file doesn't start with a blank line.
  
   So
  
   undef $/;
   $file = FILE;
   $file =~ s/(\n\n.+?)\n/$1 /g;
   print $file;
  
   [untested]
  
   HTH
   Nigel
  
[EMAIL PROTECTED] 07/03/02 10:52pm 
   Hi  All,
  
   I am new to Perl and need help to solve this one !
  
   I have a txt file and the contents of the file are as below  :
  
   ---CONTENTS   OF   TEXT  
FILE--
   Abdullah Ahmed Hassan
   Trading
   Location Ruwi Souk St, Ruwi
   Bus Hrs 0930-1300:1630-2200
   P.O. Box 197 Ruwi Code 112
   Phone 708940
   Fax 794156
   Key Staff Abdullah Ahmed Hassan
   Mohammad Hussain Ahmed Hassan
   Irfan Ahmed
   Mohammad Asim
   Activities :3410
   Email : [EMAIL PROTECTED] 
  
   Abu Ali Trading  Cont
   Est
   Location Azaiba
   B us H rs 0700-1200; 1500-1900
   P.O. Box 1695 C. P. 0. Code 111
   Phone 595324
   Key Staff K. Sasi, Prop
   Ali Suleiman Al Ghubshi, Sponsor
   Activities 2700
  
   Abu Al Dahab Trading  Contg
   Est
   Location 23rd July St, Opp Al Ghobish
   Furniture Showroom
   Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
   Fri)
   P.O. Box 1254 Salalah Code 211
   Phone 297879
   Fax 297879
   Key Staff Syed Tajudeen Madani, Gen Mgr
   Activities 2660
   ---CONTENTS  OF TEXT FILE--
  
   The names of three companies as above are:
  
   Abdullah Ahmed Hassan
   Trading
  
   Abu Ali Trading  Cont
   Est
  
   Abu Al Dahab Trading  Contg
   Est
  
   I want the data this way .
  
   ---DATA WANTED THIS WAY
   Abdullah Ahmed Hassan Trading
   Location Ruwi Souk St, Ruwi
   Bus Hrs 0930-1300:1630-2200
   P.O. Box 197 Ruwi Code 112
   Phone 708940
   Fax 794156
   Key Staff Abdullah Ahmed Hassan
   Mohammad Hussain Ahmed Hassan
   Irfan Ahmed
   Mohammad Asim
   Activities :3410
   Email : [EMAIL PROTECTED] 
  
   Abu Ali Trading  Cont Est
   Location Azaiba
   B us H rs 0700-1200; 1500-1900
   P.O. Box 1695 C. P. 0. Code 111
   Phone 595324
   Key Staff K. Sasi, Prop
   Ali Suleiman Al Ghubshi, Sponsor
   Activities 2700
  
   Abu Al Dahab Trading  Contg Est
   Location 23rd July St, Opp Al Ghobish
   Furniture Showroom
   Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
   Fri)
   P.O. Box 1254 Salalah Code 211
   Phone 297879
   Fax 297879
   Key Staff Syed Tajudeen Madani, Gen Mgr
   Activities 2660
   ---DATA WANTED THIS WAY
  
   I want PERL to make the name of the company to come in
   one line as it has done for all three records.
  
   I request your help solving this.
  
   Regards
  
   Sunish Kapoor
  
   ITM Business Solutions
   Unit 4
   Nine Trees Trading Estate
   Morthen Road
   Rotherham
   S66 9JG
  
   Reception
   Tel: 01709 703288
   Fax: 01709 701549
  
   Help Desk
   Tel:01709 530424
   Fax: 01709 702159
  
   CONFIDENTIALITY NOTICE: This message is intended only for the use
of
   the individual or entity to which it is addressed, and may
contain
   information that is privileged, confidential and exempt from
  disclosure
   under applicable law.
  
   --
   To unsubscribe, e-mail: [EMAIL PROTECTED] 
   For additional commands, e-mail: [EMAIL PROTECTED] 
 
  --
  To unsubscribe, e-mail: [EMAIL PROTECTED] 
  For additional commands, e-mail: [EMAIL PROTECTED] 
 
  --
  To unsubscribe, e-mail: [EMAIL PROTECTED] 
  For additional commands, e-mail: [EMAIL PROTECTED

Re: Regular Expression Help sought

2002-07-04 Thread Katy Brownfield


One thing I would add is that the inner pair of parentheses is used to
isolate ^|\n\n from the rest of the pattern so it is clear what the | is
operating on. /cat|blue fish/ matches cat or blue fish but
/(cat|blue) fish/ matches cat fish or blue fish. (The parentheses
also capture the thing that was matched in $2, but we don't need it.
(?:^|\n\n) would prevent it from capturing the match, but capturing
doesn't hurt anything in this case and the pattern is easier to read.)

Katy


Nigel Peck wrote:
 
 s/((^|\n\n).+?)\n/$1 /g;
 
 s/   substitute
 ( put this is $1
 (^  the start of the file
 |  or
 \n\n)  two newlines
 ..+?  followed by 1 or more characters up until a
 ) stop putting it in $1
 \n  newline
 / with
 $1 /the text captured above followed by a space
 g;keep doing it until you can't match anymore
 
  Sunish Kapoor [EMAIL PROTECTED] 07/04/02 04:27pm 
 Dear Katy,
 
 It works great after the changed the regular expression to as given by
 u
 below..!
 Thanks a ton for the help.but may I request for a small explanation
 of the
 line
 $file =~ s/((^|\n\n).+?)\n/$1 /g;
 
 Sunish
 
 Katy Brownfield wrote:
 
  Or, instead of changing the input file, change the regular expression
 to
  also detect the beginning of the string.
 
  $file =~ s/((^|\n\n).+?)\n/$1 /g;
 
  Katy
 
  Timothy Johnson wrote:
  
  
   I think you'll need 2 blank lines.
  
   -Original Message-
   From: Sunish Kapoor
   To: Nigel Peck
   Cc: [EMAIL PROTECTED]
   Sent: 7/3/02 6:51 PM
   Subject: Re: Regular Expression Help sought
  
   Dear Nigel,
  
   Thanks a ton for the script..It works fine though it skips the
 first
   record
   even though I make
   the file begin with a blank line by simply hitting enter !
  
   Regards
  
   Sunish
   Nigel Peck wrote:
  
My first attempt, which may be a bit simplified, would be to
   substitute
any newline, which occured at the end of a line which was
 preceeded by
   2
consecutive newlines, with a space. This relies on the format of
 the
file being as shown for every company. It also may miss the
 first
company in the file if the file doesn't start with a blank line.
   
So
   
undef $/;
$file = FILE;
$file =~ s/(\n\n.+?)\n/$1 /g;
print $file;
   
[untested]
   
HTH
Nigel
   
 [EMAIL PROTECTED] 07/03/02 10:52pm 
Hi  All,
   
I am new to Perl and need help to solve this one !
   
I have a txt file and the contents of the file are as below  :
   
---CONTENTS   OF   TEXT
 FILE--
Abdullah Ahmed Hassan
Trading
Location Ruwi Souk St, Ruwi
Bus Hrs 0930-1300:1630-2200
P.O. Box 197 Ruwi Code 112
Phone 708940
Fax 794156
Key Staff Abdullah Ahmed Hassan
Mohammad Hussain Ahmed Hassan
Irfan Ahmed
Mohammad Asim
Activities :3410
Email : [EMAIL PROTECTED]
   
Abu Ali Trading  Cont
Est
Location Azaiba
B us H rs 0700-1200; 1500-1900
P.O. Box 1695 C. P. 0. Code 111
Phone 595324
Key Staff K. Sasi, Prop
Ali Suleiman Al Ghubshi, Sponsor
Activities 2700
   
Abu Al Dahab Trading  Contg
Est
Location 23rd July St, Opp Al Ghobish
Furniture Showroom
Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
Fri)
P.O. Box 1254 Salalah Code 211
Phone 297879
Fax 297879
Key Staff Syed Tajudeen Madani, Gen Mgr
Activities 2660
---CONTENTS  OF TEXT FILE--
   
The names of three companies as above are:
   
Abdullah Ahmed Hassan
Trading
   
Abu Ali Trading  Cont
Est
   
Abu Al Dahab Trading  Contg
Est
   
I want the data this way .
   
---DATA WANTED THIS WAY
Abdullah Ahmed Hassan Trading
Location Ruwi Souk St, Ruwi
Bus Hrs 0930-1300:1630-2200
P.O. Box 197 Ruwi Code 112
Phone 708940
Fax 794156
Key Staff Abdullah Ahmed Hassan
Mohammad Hussain Ahmed Hassan
Irfan Ahmed
Mohammad Asim
Activities :3410
Email : [EMAIL PROTECTED]
   
Abu Ali Trading  Cont Est
Location Azaiba
B us H rs 0700-1200; 1500-1900
P.O. Box 1695 C. P. 0. Code 111
Phone 595324
Key Staff K. Sasi, Prop
Ali Suleiman Al Ghubshi, Sponsor
Activities 2700
   
Abu Al Dahab Trading  Contg Est
Location 23rd July St, Opp Al Ghobish
Furniture Showroom
Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
Fri)
P.O. Box 1254 Salalah Code 211
Phone 297879
Fax 297879
Key Staff Syed Tajudeen Madani, Gen Mgr
Activities 2660
---DATA WANTED THIS WAY
   
I want PERL to make the name of the company to come in
one line as it has done for all three records.
   
I request your help solving this.
   
Regards
   
Sunish Kapoor
   
ITM Business Solutions
Unit 4
Nine Trees Trading Estate
Morthen Road
Rotherham
S66 9JG

Re: Regular Expression Help sought

2002-07-03 Thread Sunish Kapoor



Dear Nigel,

Thanks a ton for the script..It works fine though it skips the first record
even though I make
the file begin with a blank line by simply hitting enter !

Regards

Sunish
Nigel Peck wrote:

 My first attempt, which may be a bit simplified, would be to substitute
 any newline, which occured at the end of a line which was preceeded by 2
 consecutive newlines, with a space. This relies on the format of the
 file being as shown for every company. It also may miss the first
 company in the file if the file doesn't start with a blank line.

 So

 undef $/;
 $file = FILE;
 $file =~ s/(\n\n.+?)\n/$1 /g;
 print $file;

 [untested]

 HTH
 Nigel

  [EMAIL PROTECTED] 07/03/02 10:52pm 
 Hi  All,

 I am new to Perl and need help to solve this one !

 I have a txt file and the contents of the file are as below  :

 ---CONTENTS   OF   TEXT   FILE--
 Abdullah Ahmed Hassan
 Trading
 Location Ruwi Souk St, Ruwi
 Bus Hrs 0930-1300:1630-2200
 P.O. Box 197 Ruwi Code 112
 Phone 708940
 Fax 794156
 Key Staff Abdullah Ahmed Hassan
 Mohammad Hussain Ahmed Hassan
 Irfan Ahmed
 Mohammad Asim
 Activities :3410
 Email : [EMAIL PROTECTED]

 Abu Ali Trading  Cont
 Est
 Location Azaiba
 B us H rs 0700-1200; 1500-1900
 P.O. Box 1695 C. P. 0. Code 111
 Phone 595324
 Key Staff K. Sasi, Prop
 Ali Suleiman Al Ghubshi, Sponsor
 Activities 2700

 Abu Al Dahab Trading  Contg
 Est
 Location 23rd July St, Opp Al Ghobish
 Furniture Showroom
 Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
 Fri)
 P.O. Box 1254 Salalah Code 211
 Phone 297879
 Fax 297879
 Key Staff Syed Tajudeen Madani, Gen Mgr
 Activities 2660
 ---CONTENTS  OF TEXT FILE--

 The names of three companies as above are:

 Abdullah Ahmed Hassan
 Trading

 Abu Ali Trading  Cont
 Est

 Abu Al Dahab Trading  Contg
 Est

 I want the data this way .

 ---DATA WANTED THIS WAY
 Abdullah Ahmed Hassan Trading
 Location Ruwi Souk St, Ruwi
 Bus Hrs 0930-1300:1630-2200
 P.O. Box 197 Ruwi Code 112
 Phone 708940
 Fax 794156
 Key Staff Abdullah Ahmed Hassan
 Mohammad Hussain Ahmed Hassan
 Irfan Ahmed
 Mohammad Asim
 Activities :3410
 Email : [EMAIL PROTECTED]

 Abu Ali Trading  Cont Est
 Location Azaiba
 B us H rs 0700-1200; 1500-1900
 P.O. Box 1695 C. P. 0. Code 111
 Phone 595324
 Key Staff K. Sasi, Prop
 Ali Suleiman Al Ghubshi, Sponsor
 Activities 2700

 Abu Al Dahab Trading  Contg Est
 Location 23rd July St, Opp Al Ghobish
 Furniture Showroom
 Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
 Fri)
 P.O. Box 1254 Salalah Code 211
 Phone 297879
 Fax 297879
 Key Staff Syed Tajudeen Madani, Gen Mgr
 Activities 2660
 ---DATA WANTED THIS WAY

 I want PERL to make the name of the company to come in
 one line as it has done for all three records.

 I request your help solving this.

 Regards

 Sunish Kapoor

 ITM Business Solutions
 Unit 4
 Nine Trees Trading Estate
 Morthen Road
 Rotherham
 S66 9JG

 Reception
 Tel: 01709 703288
 Fax: 01709 701549

 Help Desk
 Tel:01709 530424
 Fax: 01709 702159

 CONFIDENTIALITY NOTICE: This message is intended only for the use of
 the individual or entity to which it is addressed, and may contain
 information that is privileged, confidential and exempt from disclosure
 under applicable law.

 --
 To unsubscribe, e-mail: [EMAIL PROTECTED]
 For additional commands, e-mail: [EMAIL PROTECTED]


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: Regular Expression Help sought

2002-07-03 Thread Timothy Johnson


 
I think you'll need 2 blank lines.

-Original Message-
From: Sunish Kapoor
To: Nigel Peck
Cc: [EMAIL PROTECTED]
Sent: 7/3/02 6:51 PM
Subject: Re: Regular Expression Help sought


Dear Nigel,

Thanks a ton for the script..It works fine though it skips the first
record
even though I make
the file begin with a blank line by simply hitting enter !

Regards

Sunish
Nigel Peck wrote:

 My first attempt, which may be a bit simplified, would be to
substitute
 any newline, which occured at the end of a line which was preceeded by
2
 consecutive newlines, with a space. This relies on the format of the
 file being as shown for every company. It also may miss the first
 company in the file if the file doesn't start with a blank line.

 So

 undef $/;
 $file = FILE;
 $file =~ s/(\n\n.+?)\n/$1 /g;
 print $file;

 [untested]

 HTH
 Nigel

  [EMAIL PROTECTED] 07/03/02 10:52pm 
 Hi  All,

 I am new to Perl and need help to solve this one !

 I have a txt file and the contents of the file are as below  :

 ---CONTENTS   OF   TEXT   FILE--
 Abdullah Ahmed Hassan
 Trading
 Location Ruwi Souk St, Ruwi
 Bus Hrs 0930-1300:1630-2200
 P.O. Box 197 Ruwi Code 112
 Phone 708940
 Fax 794156
 Key Staff Abdullah Ahmed Hassan
 Mohammad Hussain Ahmed Hassan
 Irfan Ahmed
 Mohammad Asim
 Activities :3410
 Email : [EMAIL PROTECTED]

 Abu Ali Trading  Cont
 Est
 Location Azaiba
 B us H rs 0700-1200; 1500-1900
 P.O. Box 1695 C. P. 0. Code 111
 Phone 595324
 Key Staff K. Sasi, Prop
 Ali Suleiman Al Ghubshi, Sponsor
 Activities 2700

 Abu Al Dahab Trading  Contg
 Est
 Location 23rd July St, Opp Al Ghobish
 Furniture Showroom
 Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
 Fri)
 P.O. Box 1254 Salalah Code 211
 Phone 297879
 Fax 297879
 Key Staff Syed Tajudeen Madani, Gen Mgr
 Activities 2660
 ---CONTENTS  OF TEXT FILE--

 The names of three companies as above are:

 Abdullah Ahmed Hassan
 Trading

 Abu Ali Trading  Cont
 Est

 Abu Al Dahab Trading  Contg
 Est

 I want the data this way .

 ---DATA WANTED THIS WAY
 Abdullah Ahmed Hassan Trading
 Location Ruwi Souk St, Ruwi
 Bus Hrs 0930-1300:1630-2200
 P.O. Box 197 Ruwi Code 112
 Phone 708940
 Fax 794156
 Key Staff Abdullah Ahmed Hassan
 Mohammad Hussain Ahmed Hassan
 Irfan Ahmed
 Mohammad Asim
 Activities :3410
 Email : [EMAIL PROTECTED]

 Abu Ali Trading  Cont Est
 Location Azaiba
 B us H rs 0700-1200; 1500-1900
 P.O. Box 1695 C. P. 0. Code 111
 Phone 595324
 Key Staff K. Sasi, Prop
 Ali Suleiman Al Ghubshi, Sponsor
 Activities 2700

 Abu Al Dahab Trading  Contg Est
 Location 23rd July St, Opp Al Ghobish
 Furniture Showroom
 Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
 Fri)
 P.O. Box 1254 Salalah Code 211
 Phone 297879
 Fax 297879
 Key Staff Syed Tajudeen Madani, Gen Mgr
 Activities 2660
 ---DATA WANTED THIS WAY

 I want PERL to make the name of the company to come in
 one line as it has done for all three records.

 I request your help solving this.

 Regards

 Sunish Kapoor

 ITM Business Solutions
 Unit 4
 Nine Trees Trading Estate
 Morthen Road
 Rotherham
 S66 9JG

 Reception
 Tel: 01709 703288
 Fax: 01709 701549

 Help Desk
 Tel:01709 530424
 Fax: 01709 702159

 CONFIDENTIALITY NOTICE: This message is intended only for the use of
 the individual or entity to which it is addressed, and may contain
 information that is privileged, confidential and exempt from
disclosure
 under applicable law.

 --
 To unsubscribe, e-mail: [EMAIL PROTECTED]
 For additional commands, e-mail: [EMAIL PROTECTED]


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Regular Expression Help sought

2002-07-03 Thread Katy Brownfield


Or, instead of changing the input file, change the regular expression to
also detect the beginning of the string.

$file =~ s/((^|\n\n).+?)\n/$1 /g;

Katy

Timothy Johnson wrote:
 
 
 I think you'll need 2 blank lines.
 
 -Original Message-
 From: Sunish Kapoor
 To: Nigel Peck
 Cc: [EMAIL PROTECTED]
 Sent: 7/3/02 6:51 PM
 Subject: Re: Regular Expression Help sought
 
 Dear Nigel,
 
 Thanks a ton for the script..It works fine though it skips the first
 record
 even though I make
 the file begin with a blank line by simply hitting enter !
 
 Regards
 
 Sunish
 Nigel Peck wrote:
 
  My first attempt, which may be a bit simplified, would be to
 substitute
  any newline, which occured at the end of a line which was preceeded by
 2
  consecutive newlines, with a space. This relies on the format of the
  file being as shown for every company. It also may miss the first
  company in the file if the file doesn't start with a blank line.
 
  So
 
  undef $/;
  $file = FILE;
  $file =~ s/(\n\n.+?)\n/$1 /g;
  print $file;
 
  [untested]
 
  HTH
  Nigel
 
   [EMAIL PROTECTED] 07/03/02 10:52pm 
  Hi  All,
 
  I am new to Perl and need help to solve this one !
 
  I have a txt file and the contents of the file are as below  :
 
  ---CONTENTS   OF   TEXT   FILE--
  Abdullah Ahmed Hassan
  Trading
  Location Ruwi Souk St, Ruwi
  Bus Hrs 0930-1300:1630-2200
  P.O. Box 197 Ruwi Code 112
  Phone 708940
  Fax 794156
  Key Staff Abdullah Ahmed Hassan
  Mohammad Hussain Ahmed Hassan
  Irfan Ahmed
  Mohammad Asim
  Activities :3410
  Email : [EMAIL PROTECTED]
 
  Abu Ali Trading  Cont
  Est
  Location Azaiba
  B us H rs 0700-1200; 1500-1900
  P.O. Box 1695 C. P. 0. Code 111
  Phone 595324
  Key Staff K. Sasi, Prop
  Ali Suleiman Al Ghubshi, Sponsor
  Activities 2700
 
  Abu Al Dahab Trading  Contg
  Est
  Location 23rd July St, Opp Al Ghobish
  Furniture Showroom
  Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
  Fri)
  P.O. Box 1254 Salalah Code 211
  Phone 297879
  Fax 297879
  Key Staff Syed Tajudeen Madani, Gen Mgr
  Activities 2660
  ---CONTENTS  OF TEXT FILE--
 
  The names of three companies as above are:
 
  Abdullah Ahmed Hassan
  Trading
 
  Abu Ali Trading  Cont
  Est
 
  Abu Al Dahab Trading  Contg
  Est
 
  I want the data this way .
 
  ---DATA WANTED THIS WAY
  Abdullah Ahmed Hassan Trading
  Location Ruwi Souk St, Ruwi
  Bus Hrs 0930-1300:1630-2200
  P.O. Box 197 Ruwi Code 112
  Phone 708940
  Fax 794156
  Key Staff Abdullah Ahmed Hassan
  Mohammad Hussain Ahmed Hassan
  Irfan Ahmed
  Mohammad Asim
  Activities :3410
  Email : [EMAIL PROTECTED]
 
  Abu Ali Trading  Cont Est
  Location Azaiba
  B us H rs 0700-1200; 1500-1900
  P.O. Box 1695 C. P. 0. Code 111
  Phone 595324
  Key Staff K. Sasi, Prop
  Ali Suleiman Al Ghubshi, Sponsor
  Activities 2700
 
  Abu Al Dahab Trading  Contg Est
  Location 23rd July St, Opp Al Ghobish
  Furniture Showroom
  Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
  Fri)
  P.O. Box 1254 Salalah Code 211
  Phone 297879
  Fax 297879
  Key Staff Syed Tajudeen Madani, Gen Mgr
  Activities 2660
  ---DATA WANTED THIS WAY
 
  I want PERL to make the name of the company to come in
  one line as it has done for all three records.
 
  I request your help solving this.
 
  Regards
 
  Sunish Kapoor
 
  ITM Business Solutions
  Unit 4
  Nine Trees Trading Estate
  Morthen Road
  Rotherham
  S66 9JG
 
  Reception
  Tel: 01709 703288
  Fax: 01709 701549
 
  Help Desk
  Tel:01709 530424
  Fax: 01709 702159
 
  CONFIDENTIALITY NOTICE: This message is intended only for the use of
  the individual or entity to which it is addressed, and may contain
  information that is privileged, confidential and exempt from
 disclosure
  under applicable law.
 
  --
  To unsubscribe, e-mail: [EMAIL PROTECTED]
  For additional commands, e-mail: [EMAIL PROTECTED]
 
 --
 To unsubscribe, e-mail: [EMAIL PROTECTED]
 For additional commands, e-mail: [EMAIL PROTECTED]
 
 --
 To unsubscribe, e-mail: [EMAIL PROTECTED]
 For additional commands, e-mail: [EMAIL PROTECTED]


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Regular Expression Help sought

2002-07-03 Thread Sunish Kapoor


Dear Katy,

It works great after the changed the regular expression to as given by u
below..!
Thanks a ton for the help.but may I request for a small explanation of the
line
$file =~ s/((^|\n\n).+?)\n/$1 /g;

Sunish

Katy Brownfield wrote:

 Or, instead of changing the input file, change the regular expression to
 also detect the beginning of the string.

 $file =~ s/((^|\n\n).+?)\n/$1 /g;

 Katy

 Timothy Johnson wrote:
 
 
  I think you'll need 2 blank lines.
 
  -Original Message-
  From: Sunish Kapoor
  To: Nigel Peck
  Cc: [EMAIL PROTECTED]
  Sent: 7/3/02 6:51 PM
  Subject: Re: Regular Expression Help sought
 
  Dear Nigel,
 
  Thanks a ton for the script..It works fine though it skips the first
  record
  even though I make
  the file begin with a blank line by simply hitting enter !
 
  Regards
 
  Sunish
  Nigel Peck wrote:
 
   My first attempt, which may be a bit simplified, would be to
  substitute
   any newline, which occured at the end of a line which was preceeded by
  2
   consecutive newlines, with a space. This relies on the format of the
   file being as shown for every company. It also may miss the first
   company in the file if the file doesn't start with a blank line.
  
   So
  
   undef $/;
   $file = FILE;
   $file =~ s/(\n\n.+?)\n/$1 /g;
   print $file;
  
   [untested]
  
   HTH
   Nigel
  
[EMAIL PROTECTED] 07/03/02 10:52pm 
   Hi  All,
  
   I am new to Perl and need help to solve this one !
  
   I have a txt file and the contents of the file are as below  :
  
   ---CONTENTS   OF   TEXT   FILE--
   Abdullah Ahmed Hassan
   Trading
   Location Ruwi Souk St, Ruwi
   Bus Hrs 0930-1300:1630-2200
   P.O. Box 197 Ruwi Code 112
   Phone 708940
   Fax 794156
   Key Staff Abdullah Ahmed Hassan
   Mohammad Hussain Ahmed Hassan
   Irfan Ahmed
   Mohammad Asim
   Activities :3410
   Email : [EMAIL PROTECTED]
  
   Abu Ali Trading  Cont
   Est
   Location Azaiba
   B us H rs 0700-1200; 1500-1900
   P.O. Box 1695 C. P. 0. Code 111
   Phone 595324
   Key Staff K. Sasi, Prop
   Ali Suleiman Al Ghubshi, Sponsor
   Activities 2700
  
   Abu Al Dahab Trading  Contg
   Est
   Location 23rd July St, Opp Al Ghobish
   Furniture Showroom
   Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
   Fri)
   P.O. Box 1254 Salalah Code 211
   Phone 297879
   Fax 297879
   Key Staff Syed Tajudeen Madani, Gen Mgr
   Activities 2660
   ---CONTENTS  OF TEXT FILE--
  
   The names of three companies as above are:
  
   Abdullah Ahmed Hassan
   Trading
  
   Abu Ali Trading  Cont
   Est
  
   Abu Al Dahab Trading  Contg
   Est
  
   I want the data this way .
  
   ---DATA WANTED THIS WAY
   Abdullah Ahmed Hassan Trading
   Location Ruwi Souk St, Ruwi
   Bus Hrs 0930-1300:1630-2200
   P.O. Box 197 Ruwi Code 112
   Phone 708940
   Fax 794156
   Key Staff Abdullah Ahmed Hassan
   Mohammad Hussain Ahmed Hassan
   Irfan Ahmed
   Mohammad Asim
   Activities :3410
   Email : [EMAIL PROTECTED]
  
   Abu Ali Trading  Cont Est
   Location Azaiba
   B us H rs 0700-1200; 1500-1900
   P.O. Box 1695 C. P. 0. Code 111
   Phone 595324
   Key Staff K. Sasi, Prop
   Ali Suleiman Al Ghubshi, Sponsor
   Activities 2700
  
   Abu Al Dahab Trading  Contg Est
   Location 23rd July St, Opp Al Ghobish
   Furniture Showroom
   Bus Hrs 0800-1300 (Sat-Thu); 1600-1900 (Sat-
   Fri)
   P.O. Box 1254 Salalah Code 211
   Phone 297879
   Fax 297879
   Key Staff Syed Tajudeen Madani, Gen Mgr
   Activities 2660
   ---DATA WANTED THIS WAY
  
   I want PERL to make the name of the company to come in
   one line as it has done for all three records.
  
   I request your help solving this.
  
   Regards
  
   Sunish Kapoor
  
   ITM Business Solutions
   Unit 4
   Nine Trees Trading Estate
   Morthen Road
   Rotherham
   S66 9JG
  
   Reception
   Tel: 01709 703288
   Fax: 01709 701549
  
   Help Desk
   Tel:01709 530424
   Fax: 01709 702159
  
   CONFIDENTIALITY NOTICE: This message is intended only for the use of
   the individual or entity to which it is addressed, and may contain
   information that is privileged, confidential and exempt from
  disclosure
   under applicable law.
  
   --
   To unsubscribe, e-mail: [EMAIL PROTECTED]
   For additional commands, e-mail: [EMAIL PROTECTED]
 
  --
  To unsubscribe, e-mail: [EMAIL PROTECTED]
  For additional commands, e-mail: [EMAIL PROTECTED]
 
  --
  To unsubscribe, e-mail: [EMAIL PROTECTED]
  For additional commands, e-mail: [EMAIL PROTECTED]

 --
 To unsubscribe, e-mail: [EMAIL PROTECTED]
 For additional commands, e-mail: [EMAIL PROTECTED]


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Regular Expression Help

2002-02-05 Thread Jeff 'japhy' Pinyan


On Feb 5, David Mamanakis said:

I am building a parsing routine, and am using a regular expression, which
works, EXCEPT when I need to EXCLUDE certain things...

$right =~ s/A/X/g;

However, I may need to exclude this replacement in some of the values of
$right...

Anything found between  and  SHOULD NOT be replaced.  \.*\
Anything found between  and ; SHOULD NOT be replaced.  \.*\;
Anything found between {$ and } SHOULD NOT be replaced.  \{$.*\}
Anything found between  and = SHOULD NOT be replaced.  \.*\=

Here's a crafty trick (from DALnet #perl a couple of minutes ago)...

  $text =~ m{(.*?|.*?;|{\$.*?}|.*?=)|A}{$1 || X}seg;

Basically, if it's a special case, it's matched and put in $1, and if it's
A, it's matched but $1 is undefined.  Then, on the right-hand side, we
use $1 if it has a value, and X otherwise.

The /s is so that . matches newlines, the /e is so that $1 || X is
evaluated as code, and the /g is for all matches.

It kinda sounds like you're working with HTML and some template thing,
though... you might want to use a full HTML parser instead.

-- 
Jeff japhy Pinyan  [EMAIL PROTECTED]  http://www.pobox.com/~japhy/
RPI Acacia brother #734   http://www.perlmonks.org/   http://www.cpan.org/
** Look for Regular Expressions in Perl published by Manning, in 2002 **
stu what does y/// stand for?  tenderpuss why, yansliterate of course.


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Regular Expression Help

2002-02-05 Thread John W. Krahn


Jeff 'Japhy' Pinyan wrote:
 
 On Feb 5, David Mamanakis said:
 
 I am building a parsing routine, and am using a regular expression, which
 works, EXCEPT when I need to EXCLUDE certain things...
 
 $right =~ s/A/X/g;
 
 However, I may need to exclude this replacement in some of the values of
 $right...
 
 Anything found between  and  SHOULD NOT be replaced.  \.*\
 Anything found between  and ; SHOULD NOT be replaced.  \.*\;
 Anything found between {$ and } SHOULD NOT be replaced.  \{$.*\}
 Anything found between  and = SHOULD NOT be replaced.  \.*\=
 
 Here's a crafty trick (from DALnet #perl a couple of minutes ago)...
 
   $text =~ m{(.*?|.*?;|{\$.*?}|.*?=)|A}{$1 || X}seg;
 ^
 ^
 s


 Basically, if it's a special case, it's matched and put in $1, and if it's
 A, it's matched but $1 is undefined.  Then, on the right-hand side, we
 use $1 if it has a value, and X otherwise.
 
 The /s is so that . matches newlines, the /e is so that $1 || X is
 evaluated as code, and the /g is for all matches.
 
 It kinda sounds like you're working with HTML and some template thing,
 though... you might want to use a full HTML parser instead.



John
-- 
use Perl;
program
fulfillment

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Regular Expression Help

2002-02-05 Thread Jeff 'japhy' Pinyan


On Feb 5, John W. Krahn said:

Jeff 'Japhy' Pinyan wrote:
 
   $text =~ m{(.*?|.*?;|{\$.*?}|.*?=)|A}{$1 || X}seg;
 ^
 ^
 s

Sorry, thanks for the correction. :)

-- 
Jeff japhy Pinyan  [EMAIL PROTECTED]  http://www.pobox.com/~japhy/
RPI Acacia brother #734   http://www.perlmonks.org/   http://www.cpan.org/
** Look for Regular Expressions in Perl published by Manning, in 2002 **
stu what does y/// stand for?  tenderpuss why, yansliterate of course.


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: regular expression help

2002-01-22 Thread John Edwards


Escape the brackets like so.

s/\(locked\)//;

John

-Original Message-
From: David Samuelsson (PAC) [mailto:[EMAIL PROTECTED]]
Sent: 22 January 2002 09:37
To: '[EMAIL PROTECTED]'
Subject: regular expression help


Hello!

if i have this line ROXETTE_PC_SW_R1D08 (locked)

and just want to remove the (locked) part from it with an regexp how would
that look?

i can do: s/(locked)// that leaves the  pesky () how can i get rid off
those?

//Dave




-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


--Confidentiality--.
This E-mail is confidential.  It should not be read, copied, disclosed or
used by any person other than the intended recipient.  Unauthorised use,
disclosure or copying by whatever medium is strictly prohibited and may be
unlawful.  If you have received this E-mail in error please contact the
sender immediately and delete the E-mail from your system.



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: regular expression help

2002-01-22 Thread marcus_holland-moritz


Try:

  s/\s+\(locked\)//

The parens are meta-symbols in regexes and need to be escaped
if you want to match them. The above will additionally make
sure that the blanks before (locked) are also removed.

HTH,
Marcus

| -Original Message-
| From: David Samuelsson (PAC) [mailto:[EMAIL PROTECTED]]
| Sent: Tuesday, January 22, 2002 10:37 AM
| To: '[EMAIL PROTECTED]'
| Subject: regular expression help
| 
| 
| Hello!
| 
| if i have this line ROXETTE_PC_SW_R1D08 (locked)
| 
| and just want to remove the (locked) part from it with an 
| regexp how would that look?
| 
| i can do: s/(locked)// that leaves the  pesky () how can i 
| get rid off those?
| 
| //Dave
| 
| 
| 
| 
| -- 
| To unsubscribe, e-mail: [EMAIL PROTECTED]
| For additional commands, e-mail: [EMAIL PROTECTED]
| 

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: regular expression help

2002-01-22 Thread Jonathan E. Paton



 | Hello!
 | 
 | If I have this line ROXETTE_PC_SW_R1D08 (locked)
 | and just want to remove the (locked) part from it
 | with an regexp how would that look?
 | 
 | I can do: s/(locked)// that leaves the  pesky () how
 | can I get rid off those?

 Try:
 
   s/\s+\(locked\)//
 
 The parens are meta-symbols in regexes and need to be
 escaped if you want to match them. The above will
 additionally make sure that the blanks before (locked)
 are also removed.

Hi,

The input looks something like:

ROXETTE_PC_SW_R1D08 (locked)

you *might* consider just taking the leftmost characters. 
In a regex that'd look like:

my ($symbol) = /^(\w*)/;

Jonathan Paton 

__
Do You Yahoo!?
Everything you'll ever need on one web page
from News and Sport to Email and Music Charts
http://uk.my.yahoo.com

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: Regular expression help!!

2001-10-25 Thread Robert Graham


Hi

You can try the following:

$line = insert_job: DUKS_rtcf_daily_log_purge job_type: c;
($rest) = $line =~ m/(\W\w.*)/;


Regards
Robert

-Original Message-
From: Woz [mailto:[EMAIL PROTECTED]]
Sent: 25 October 2001 11:17
To: [EMAIL PROTECTED]
Subject: Regular expression help!!


Hi,
 
I'm relatively new to the wonders of Perl programming and I've yet to
quite get my head around regular expressions.
I'm attempting to generate a search and replace expression that will
turn the following string
 
insert_job: DUKS_rtcf_daily_log_purge job_type: c
 
into
 
DUKS_rtcf_daily_log_purge job_type: c
 
i.e. remove from the beginning of the line up the space following the
first :
All my efforts so far remove upto the second : though and leave just
'c'.
 
Any ideas?
 
Any help much appreciated!
 
Thanks,
 
Warren
 



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: Regular expression help!!

2001-10-25 Thread Woz


Wow, that was fast!
 
Thanks very much, that works well.
For my next question (:-)) how can I make it strip the leading space
that it leaves on the resulting string. i.e. I want it to strip
'insert_job: ' from the string - note the trailing space.
 
At the moment I have: -
 
  ($Value)=$_=~m/(\W\w\S.*)/; 
  ($Value)=$Value=~m/(\S.*)/;
 
but I'm sure there's a far more elegant solution that combines it all
into one.
 
Many many thanks for your help.
 
Warren
note to self - buy O'Reillys book on regular expressions!


-Original Message- 
From: Robert Graham 
Sent: Thu 25/10/2001 10:27 
To: Woz; [EMAIL PROTECTED] 
Cc: 
Subject: RE: Regular expression help!!



Hi 

You can try the following: 

$line = insert_job: DUKS_rtcf_daily_log_purge job_type: c; 
($rest) = $line =~ m/(\W\w.*)/; 


Regards 
Robert 

-Original Message- 
From: Woz [mailto:[EMAIL PROTECTED]] 
Sent: 25 October 2001 11:17 
To: [EMAIL PROTECTED] 
Subject: Regular expression help!! 


Hi, 
  
I'm relatively new to the wonders of Perl programming and I've
yet to 
quite get my head around regular expressions. 
I'm attempting to generate a search and replace expression that
will 
turn the following string 
  
insert_job: DUKS_rtcf_daily_log_purge job_type: c 
  
into 
  
DUKS_rtcf_daily_log_purge job_type: c 
  
i.e. remove from the beginning of the line up the space
following the 
first : 
All my efforts so far remove upto the second : though and leave
just 
'c'. 
  
Any ideas? 
  
Any help much appreciated! 
  
Thanks, 
  
Warren 
  



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: Regular expression help!!

2001-10-25 Thread Robert Graham


You can use the following to remove the space in front
$Value =~ /s/^\s+//;
 
And for the sake of interest $Value =~ s/\s+$//; will remove any trailing
spaces from a string
 
Regards
Robert Graham

-Original Message-
From: Woz [mailto:[EMAIL PROTECTED]]
Sent: 25 October 2001 12:04
To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
Subject: RE: Regular expression help!!


Wow, that was fast!
 
Thanks very much, that works well.
For my next question (:-)) how can I make it strip the leading space that it
leaves on the resulting string. i.e. I want it to strip 'insert_job: ' from
the string - note the trailing space.
 
At the moment I have: -
 
  ($Value)=$_=~m/(\W\w\S.*)/; 
  ($Value)=$Value=~m/(\S.*)/;
 
but I'm sure there's a far more elegant solution that combines it all into
one.
 
Many many thanks for your help.
 
Warren
note to self - buy O'Reillys book on regular expressions!


-Original Message- 
From: Robert Graham 
Sent: Thu 25/10/2001 10:27 
To: Woz; [EMAIL PROTECTED] 
Cc: 
Subject: RE: Regular expression help!!



Hi 

You can try the following: 

$line = insert_job: DUKS_rtcf_daily_log_purge job_type: c; 
($rest) = $line =~ m/(\W\w.*)/; 


Regards 
Robert 

-Original Message- 
From: Woz [ mailto:[EMAIL PROTECTED] mailto:[EMAIL PROTECTED] ] 
Sent: 25 October 2001 11:17 
To: [EMAIL PROTECTED] 
Subject: Regular expression help!! 


Hi, 
  
I'm relatively new to the wonders of Perl programming and I've yet to 
quite get my head around regular expressions. 
I'm attempting to generate a search and replace expression that will 
turn the following string 
  
insert_job: DUKS_rtcf_daily_log_purge job_type: c 
  
into 
  
DUKS_rtcf_daily_log_purge job_type: c 
  
i.e. remove from the beginning of the line up the space following the 
first : 
All my efforts so far remove upto the second : though and leave just 
'c'. 
  
Any ideas? 
  
Any help much appreciated! 
  
Thanks, 
  
Warren 
  




-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Regular Expression Help

2001-10-19 Thread Randal L. Schwartz


 Stephan == Stephan Gross [EMAIL PROTECTED] writes:

Stephan I'm matching a SQL date like this:
Stephan$ftime =~ /(\d+)-(\d+)-(\d+)\s+(\d+):(\d+):(\d+)/;
 
Stephan If $ftime is 2001-05-13 11:53:00, then $1 is 2001, $2 is 05, etc.
 
Stephan However, I also want to match if the user types in a partial string.  For
Stephan example, if $ftime is 2001-12-18, I want $1, $2 and $3 to be 2001, 12
Stephan and 18 but $4 to be null.  In reality, my regex fails entirely and $1 is
Stephan null.
 
Stephan How can I get $1, $2, $3, $4, $5 and $6 to contain as many matches as my
Stephan input string has?  Do I have to use 6 separate regexes?


if (my @matches = grep defined, $ftime =~ 
/(\d+)(?:-(\d+)(?:-(\d+)(?:\s+(\d+)(?::(\d+)(?::(\d+))?)?)?)?)?/) {

   ... $matches[0]..$matches[6] are now set as many as are defined...
}

Your first example was broken in that you weren't testing the overall
match, and if that had failed, you would have gotten the *previous* match.
Bad News.


-- 
Randal L. Schwartz - Stonehenge Consulting Services, Inc. - +1 503 777 0095
[EMAIL PROTECTED] URL:http://www.stonehenge.com/merlyn/
Perl/Unix/security consulting, Technical writing, Comedy, etc. etc.
See PerlTraining.Stonehenge.com for onsite and open-enrollment Perl training!

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

90 matches

Mail list logo