On 12Nov2018 07:24, Asad <asad.hasan2...@gmail.com> wrote:
Thanks for the reply . I am building a framework for the two
error
conditions, therefore I need to read and readlines because in one only
regex is required and in other regex+ n-1 line is required to process :
#Here we are opening the file and substituting space " " for each \n
encountered
f3 = open (r"D:\QI\log.log", 'r')
string = f3.read()
string1 = f3.readlines()
My first remark is that both these lines read _and_ _consume_ the file
content. So "string" gets the entire file content, and "string1" gets an
empty array of lines, because the file is already at the end, where
there is no more data.
It is also better to use this idiom to read and then close a file:
with open(r"D:\QI\log.log", 'r') as f3:
string = f3.read()
This reliably closes f3 once the "with" suite completes, even if there's
some kind of exception.
You need 2 copies of the file data. You can do this 2 ways. The first
way is to read the file twice:
with open(r"D:\QI\log.log", 'r') as f3:
string = f3.read()
with open(r"D:\QI\log.log", 'r') as f3:
string1 = f3.readlines()
The efficient way is to read the file once, then make string from
string1, or string1 from string. For example:
with open(r"D:\QI\log.log", 'r') as f3:
string1 = f3.readlines()
string = ''.join(string1)
regex = re.compile ( "\n" )
st = regex.sub ( " ", string )
Using a regular expression to replace a fixed string such as "\n" is
overkill. Consider:
st = string.replace("\n", " ")
Python strings have a bunch of handy methods for common simple things.
Have a read of the docs for further detail.
if re.search('ERR1',st):
y=re.findall("[A-Z][a-z][a-z] [ 123][0-9]
[012][0-9]:[0-5][0-9]:[0-5][0-9] [0-9][0-9][0-9][0-9]",st)
print y
On the other hand, a regexp is a good tool for something like the above.
patchnumber = re.compile(r'(\d+)\/(\d+)') ======> doesnot
work it only works if I use #string = f3.read()
This may be because "string" is a single string (the whole file text as
one string). "string1" is a _list_ of individual strings, one for each
line. Personally, i would call this "strings" or "lines" or some other
plural word; your code will be easier to read, and easier to debug.
Conversely, a misleading name makes debugging harder because you expect
the variable to contain what its name suggests, and if it doesn't this
will impede you in finding problems, because you will be thinking the
whrong thing about what your program is doing.
for j in range(len(string1)):
if re.search ( r'ERR2', string1[j] ):
print "Error line \n", string1[j - 1]
mo = patchnumber.search (string1[j-1])
a = mo.group()
print a
print os.getcwd()
break
Please advice how to proceed.
mo.group() returns the whole match. The above seems to look for the
string 'ERR2' in a line, and look for a patch number in the previous
line. Is that what is it supposed to do?
If the above isn't working, it would help to see the failing output and
a description of what good output is meant to look like.
Finally, please consider turning off "digest mode" in your list
subscription. It will make things easier for everyone.
Cheers,
Cameron Simpson <c...@cskk.id.au>
_______________________________________________
Tutor maillist - Tutor@python.org
To unsubscribe or change subscription options:
https://mail.python.org/mailman/listinfo/tutor