On 9/13/2010 2:21 PM, Roelof Wobben wrote:
----------------------------------------From: [email protected] To: [email protected] Subject: RE: [Tutor] wierd replace problem Date: Mon, 13 Sep 2010 18:19:43 +0000I suggest you give a clear, complete and correct problem statement. Right now we are shooting in the dark at a moving target. Something like. Given the file alice_in_wonderland.txt, copied from url so-and-so Remove these characters ... Split into words (not letters?) where word is defined as Count the frequency of each word. =Hello, The problem as stated in the book is :3.Write a program called alice_words.py that creates a text file named alice_words.txt containing an alphabetical listing of all the words found in alice_in_wonderland.txt together with the number of times each word occurs. The first 10 lines of your output file should look something like this: Word Count ======================= a 631 a-piece 1 abide 1 able 1 about 94 above 3 absence 1 absurd 2 How many times does the word, alice, occur in the book?
We still do not have a definition of "word". Only some examples.
The text can be found here : http://openbookproject.net/thinkcs/python/english2e/resources/ch10/alice_in_wonderland.txt So I open the file. Read the first rule. This is no problem for me. Then I want to remove some characters like ' , " when the word in the text begins with these characters. And there is the problem. The ' and " can't be removed with replace.
Not true. replace() will replace any character. You wrote in your other post
letter2 = letter.strip('`")
SyntaxError: EOL while scanning string literal
Change it to (''`"") do not help either.
Do you understand the error? strip expects a string. '`" and ''`"" are NOT strings. Please review Python syntax for string literals. Here again we bump into a fundamental problem - your not understanding some of the basics of Python.
Somewhat clearer. We need a definition of "word". Examples help but are not definitions.So in the output you will see something like this "dark instead of dark word is the words of the sentence which is read in from the text-file. Am i now clear what the problem is Im facing.
Example - word is a string of characters including a-z and -. The first and last characters must be in a-z. Your definition may be different.
BTW see http://dictionary.reference.com/browse/%27tis where 'tis IS a word.Your original program (what DID become of the backslash???) is WAY off the mark. You must process one character at a time, decide whether it is the beginning of a word, the end of a word, within a word, or outside any word.
Take the beginning of the alice file, and BY HAND decide which category the first character is in. Then the 2nd. etc. That gives you the algorithm, Then translate that to Python.
Keep fishing. One day the struggle will be over. HTH -- Bob Gailer 919-636-4239 Chapel Hill NC _______________________________________________ Tutor maillist - [email protected] To unsubscribe or change subscription options: http://mail.python.org/mailman/listinfo/tutor
