On 9/13/2010 2:21 PM, Roelof Wobben wrote:
----------------------------------------
From: rwob...@hotmail.com
To: bgai...@gmail.com
Subject: RE: [Tutor] wierd replace problem
Date: Mon, 13 Sep 2010 18:19:43 +0000
I suggest you give a clear, complete and correct problem statement.
Right now we are shooting in the dark at a moving target.
Something like.
Given the file alice_in_wonderland.txt, copied from url so-and-so
Remove these characters ...
Split into words (not letters?) where word is defined as
Count the frequency of each word. =
Hello,
The problem as stated in the book is :
3.Write a program called alice_words.py that creates a text file named
alice_words.txt containing an alphabetical listing of all the words found in
alice_in_wonderland.txt together with the number of times each word occurs. The
first 10 lines of your output file should look something like this:
Word Count
=======================
a 631
a-piece 1
abide 1
able 1
about 94
above 3
absence 1
absurd 2
How many times does the word, alice, occur in the book?
We still do not have a definition of "word". Only some examples.
The text can be found here :
http://openbookproject.net/thinkcs/python/english2e/resources/ch10/alice_in_wonderland.txt
So I open the file.
Read the first rule.
This is no problem for me.
Then I want to remove some characters like ' , " when the word in the text
begins with these characters.
And there is the problem. The ' and " can't be removed with replace.
Not true. replace() will replace any character. You wrote in your other post
letter2 = letter.strip('`")
SyntaxError: EOL while scanning string literal
Change it to (''`"") do not help either.
Do you understand the error? strip expects a string.
'`" and ''`"" are NOT strings. Please review Python syntax for string literals.
Here again we bump into a fundamental problem - your not understanding some of
the basics of Python.
So in the output you will see something like this "dark instead of dark
word is the words of the sentence which is read in from the text-file.
Am i now clear what the problem is Im facing.
Somewhat clearer. We need a definition of "word". Examples help but are
not definitions.
Example - word is a string of characters including a-z and -. The first
and last characters must be in a-z. Your definition may be different.
BTW see http://dictionary.reference.com/browse/%27tis where 'tis IS a word.
Your original program (what DID become of the backslash???) is WAY off
the mark. You must process one character at a time, decide whether it is
the beginning of a word, the end of a word, within a word, or outside
any word.
Take the beginning of the alice file, and BY HAND decide which category
the first character is in. Then the 2nd. etc. That gives you the
algorithm, Then translate that to Python.
Keep fishing. One day the struggle will be over.
HTH
--
Bob Gailer
919-636-4239
Chapel Hill NC
_______________________________________________
Tutor maillist - Tutor@python.org
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor