Re: What's the best way to write this regular expression?

2012-03-09 Thread jkn
Also unrelated to the OP, but a far superior Commnad line interface to Windows is the (unhelpfully-named) 'console' program: http://sourceforge.net/projects/console/ This has tabbed windows, preset directory navigation, good copy/paste facilities, the ability to configure different shells, et

Re: What's the best way to write this regular expression?

2012-03-08 Thread John Salerno
On Thursday, March 8, 2012 9:38:51 PM UTC-6, alex23 wrote: > John Salerno wrote: > > So much work just to get a 3rd party module installed! > > "New! Try out the beta release of Beautiful Soup 4. (Last updated > February 28, 2012) > easy_install beautifulsoup4 or pip install beautifulsoup4 or dow

Re: What's the best way to write this regular expression?

2012-03-08 Thread alex23
John Salerno wrote: > So much work just to get a 3rd party module installed! "New! Try out the beta release of Beautiful Soup 4. (Last updated February 28, 2012) easy_install beautifulsoup4 or pip install beautifulsoup4 or download a tarball." http://www.crummy.com/software/BeautifulSoup/ Worke

Re: What's the best way to write this regular expression?

2012-03-08 Thread Ethan Furman
Dave Angel wrote: On 03/08/2012 04:40 PM, John Salerno wrote: http://i271.photobucket.com/albums/jj138/JohnJSal/lxml_error.png Nothing to do with Python, but you'd save us all a lot of space and bandwidth if you learned how to copy/paste from a Windows cmd window. On Windows XP it is: Mo

Re: What's the best way to write this regular expression?

2012-03-08 Thread Dave Angel
On 03/08/2012 06:02 PM, Prasad, Ramit wrote: Actually in quick-edit mode (XP and higher) you just select with left click and then hit enter which copies it to the clipboard. If you also enable insert mode (not sure if this is Win7 specific) you can even right click to paste into the console, just

RE: What's the best way to write this regular expression?

2012-03-08 Thread Prasad, Ramit
> > Alternatively, you can put the console in quick-edit mode (I think it's > > called, it's been a long time since I ran Windows). That's an option you > > set on one cmd window, and it sticks for future windows. > > > > In quick-edit, you just right-click-drag on the cmd window to select a > > r

Re: What's the best way to write this regular expression?

2012-03-08 Thread John Salerno
Thanks, I had no idea about either option, since I don't use the command prompt very much. Needless to say, the Linux console is much nicer :) On Thu, Mar 8, 2012 at 4:19 PM, Dave Angel wrote: > On 03/08/2012 04:40 PM, John Salerno wrote: >> >> >> http://i271.photobucket.com/albums/jj138/JohnJ

Re: What's the best way to write this regular expression?

2012-03-08 Thread Dave Angel
On 03/08/2012 04:40 PM, John Salerno wrote: http://i271.photobucket.com/albums/jj138/JohnJSal/lxml_error.png Nothing to do with Python, but you'd save us all a lot of space and bandwidth if you learned how to copy/paste from a Windows cmd window. If you're just doing it rarely, you can rig

Re: What's the best way to write this regular expression?

2012-03-08 Thread John Salerno
On Mar 8, 3:40 pm, John Salerno wrote: > Now I have no idea what to do. Hmph, I suppose I should have more patience. I realized that the easy_install for lxml only tried to install a binary version, which doesn't exist for the version it found (the latest, 2.3.3). I just had to look through the

Re: What's the best way to write this regular expression?

2012-03-08 Thread John Gordon
In <21519dbf-4097-4780-874d-41d76f645...@x17g2000yqj.googlegroups.com> John Salerno writes: > Well, after a bit of experimentation, I got it to run, but I seem to > have run into the same error as when I used setup.py: > http://i271.photobucket.com/albums/jj138/JohnJSal/lxml_error.png > Now I

Re: What's the best way to write this regular expression?

2012-03-08 Thread John Salerno
On Mar 8, 3:33 pm, John Salerno wrote: > Alright, I'm simply lost about how to install these modules. I > extracted the folders from the .tar.gz files and then went into those > folders in my command prompt. I typed: > > C:\Python32\python setup.py install > > and for a while something was happeni

Re: What's the best way to write this regular expression?

2012-03-08 Thread John Salerno
Alright, I'm simply lost about how to install these modules. I extracted the folders from the .tar.gz files and then went into those folders in my command prompt. I typed: C:\Python32\python setup.py install and for a while something was happening (I was doing the lxml one) and then it stopped wi

Re: What's the best way to write this regular expression?

2012-03-07 Thread John Salerno
On Mar 7, 4:02 pm, Evan Driscoll wrote: > On 01/-10/-28163 01:59 PM, Prasad, Ramit wrote: > > > gz stands for gzip and is a form of compression (like rar/zip ). > > tar stands for a tape archive. It is basically a box that holds the > > files. So you need to "unzip" and then "open the box". > > >

Re: What's the best way to write this regular expression?

2012-03-07 Thread John Salerno
On Mar 7, 11:03 pm, Chris Angelico wrote: > On Thu, Mar 8, 2012 at 7:39 AM, John Salerno wrote: > > it only > > seemed to support Python 2.7. I'm using 3.2. Is 2.7 just the minimum > > version it requires? It didn't say something like "2.7+", so I wasn't > > sure, and I don't want to start instal

Re: What's the best way to write this regular expression?

2012-03-07 Thread Chris Angelico
On Thu, Mar 8, 2012 at 7:39 AM, John Salerno wrote: > it only > seemed to support Python 2.7. I'm using 3.2. Is 2.7 just the minimum > version it requires? It didn't say something like "2.7+", so I wasn't > sure, and I don't want to start installing a bunch of stuff that will > clog up my director

Re: RE: What's the best way to write this regular expression?

2012-03-07 Thread Evan Driscoll
On 01/-10/-28163 01:59 PM, Prasad, Ramit wrote: gz stands for gzip and is a form of compression (like rar/zip ). tar stands for a tape archive. It is basically a box that holds the files. So you need to "unzip" and then "open the box". Normally programs like WinZip / WinRar / 7-zip will do both

Re: What's the best way to write this regular expression?

2012-03-07 Thread John Salerno
On Wed, Mar 7, 2012 at 3:34 PM, Ian Kelly wrote: > The setup.py file (as well as the other files) would be inside the > .tar file.  Unlike a Windows zip file, which does both archival and > compression, Unix files are typically archived and compressed in two > separate steps: "tar" denotes the ar

Re: What's the best way to write this regular expression?

2012-03-07 Thread Ian Kelly
On Wed, Mar 7, 2012 at 2:11 PM, John Salerno wrote: > The only files included in the .tar.gz file is a .tar file of the same > name. So I guess the setup option doesn't exist for these particular > packages. The setup.py file (as well as the other files) would be inside the .tar file. Unlike a W

RE: What's the best way to write this regular expression?

2012-03-07 Thread Prasad, Ramit
> The only files included in the .tar.gz file is a .tar file of the same > name. gz stands for gzip and is a form of compression (like rar/zip ). tar stands for a tape archive. It is basically a box that holds the files. So you need to "unzip" and then "open the box". Normally programs like WinZ

Re: What's the best way to write this regular expression?

2012-03-07 Thread Benjamin Kaplan
On Wed, Mar 7, 2012 at 4:11 PM, John Salerno wrote: > > On Wed, Mar 7, 2012 at 3:01 PM, Ian Kelly wrote: > > > There is a fork of setuptools called "distribute" that supports Python > > 3. > > Thanks, I guess I'll give this a try tonight! > > > setup.py is a file that should be included at the to

Re: What's the best way to write this regular expression?

2012-03-07 Thread John Salerno
On Wed, Mar 7, 2012 at 3:01 PM, Ian Kelly wrote: > There is a fork of setuptools called "distribute" that supports Python 3. Thanks, I guess I'll give this a try tonight! > setup.py is a file that should be included at the top-level of the > .tar files you downloaded.  Generally, to install som

Re: What's the best way to write this regular expression?

2012-03-07 Thread Ian Kelly
On Wed, Mar 7, 2012 at 1:39 PM, John Salerno wrote: > Ok, first major roadblock. I have no idea how to install Beautiful > Soup or lxml on Windows! All I can find are .tar files. Based on what > I've read, I can use the easy_setup module to install these types of > files, but when I went to downlo

Re: What's the best way to write this regular expression?

2012-03-07 Thread John Salerno
Ok, first major roadblock. I have no idea how to install Beautiful Soup or lxml on Windows! All I can find are .tar files. Based on what I've read, I can use the easy_setup module to install these types of files, but when I went to download the setuptools package, it only seemed to support Python 2

Re: What's the best way to write this regular expression?

2012-03-07 Thread Paul Rubin
John Salerno writes: > The Beautiful Soup 4 documentation was very clear, and BS4 itself is > so simple and Pythonic. And best of all, since version 4 no longer > does the parsing itself, you can choose your own parser, and it works > with lxml, so I'll still be using lxml, but with a nice, clean

Re: What's the best way to write this regular expression?

2012-03-06 Thread John Salerno
After a bit of reading, I've decided to use Beautiful Soup 4, with lxml as the parser. I considered simply using lxml to do all the work, but I just got lost in the documentation and tutorials. I couldn't find a clear explanation of how to parse an HTML file and then navigate its structure. The Be

Re: What's the best way to write this regular expression?

2012-03-06 Thread Roy Smith
In article <12783654.1174.1331073814011.JavaMail.geo-discussion-forums@yner4>, John Salerno wrote: > I sort of have to work with what the website gives me (as you'll see below), > but today I encountered an exception to my RE. Let me just give all the > specific information first. The point o

Re: What's the best way to write this regular expression?

2012-03-06 Thread Terry Reedy
On 3/6/2012 6:57 PM, John Salerno wrote: Also, you're still double-posting. Grr. I just reported it to Google, but I think if I start to frequent the newsgroup again I'll have to switch to Thunderbird, or perhaps I'll just try switching back to the old Google Groups interface. I think the issue

Re: What's the best way to write this regular expression?

2012-03-06 Thread Terry Reedy
On 3/6/2012 6:05 PM, John Salerno wrote: Anything that allows me NOT to use REs is welcome news, so I look forward to learning about something new! :) I should ask though...are there alternatives already bundled with Python that I could use? lxml is +- upward compatible with xml.etree in the

RE: What's the best way to write this regular expression?

2012-03-06 Thread Prasad, Ramit
> > > Also, you're still double-posting. > > Grr. I just reported it to Google, but I think if I start to frequent the > newsgroup again I'll have to switch to Thunderbird, or perhaps I'll just > try switching back to the old Google Groups interface. I think the issue is > the new interface. > >

Re: What's the best way to write this regular expression?

2012-03-06 Thread John Salerno
> Also, you're still double-posting. Grr. I just reported it to Google, but I think if I start to frequent the newsgroup again I'll have to switch to Thunderbird, or perhaps I'll just try switching back to the old Google Groups interface. I think the issue is the new interface. Sorry. -- http

Re: What's the best way to write this regular expression?

2012-03-06 Thread Steven D'Aprano
On Tue, 06 Mar 2012 15:05:39 -0800, John Salerno wrote: >> Anything that allows me NOT to use REs is welcome news, so I look >> forward to learning about something new! :) > > I should ask though...are there alternatives already bundled with Python > that I could use? Now that you mention it, I r

Re: What's the best way to write this regular expression?

2012-03-06 Thread John Salerno
Thanks. I'm thinking the choice might be between lxml and Beautiful Soup, but since BS uses lxml as a parser, I'm trying to figure out the difference between them. I don't necessarily need the simplest (html.parser), but I want to choose one that is simple enough yet powerful enough that I won't ha

Re: What's the best way to write this regular expression?

2012-03-06 Thread John Salerno
On Tuesday, March 6, 2012 5:05:39 PM UTC-6, John Salerno wrote: > > Anything that allows me NOT to use REs is welcome news, so I look forward > > to learning about something new! :) > > I should ask though...are there alternatives already bundled with Python that > I could use? Now that you ment

Re: What's the best way to write this regular expression?

2012-03-06 Thread Ian Kelly
On Tue, Mar 6, 2012 at 4:05 PM, John Salerno wrote: >> Anything that allows me NOT to use REs is welcome news, so I look forward to >> learning about something new! :) > > I should ask though...are there alternatives already bundled with Python that > I could use? Now that you mention it, I reme

Re: What's the best way to write this regular expression?

2012-03-06 Thread John Salerno
On Tuesday, March 6, 2012 5:05:39 PM UTC-6, John Salerno wrote: > > Anything that allows me NOT to use REs is welcome news, so I look forward > > to learning about something new! :) > > I should ask though...are there alternatives already bundled with Python that > I could use? Now that you ment

Re: What's the best way to write this regular expression?

2012-03-06 Thread John Salerno
> Anything that allows me NOT to use REs is welcome news, so I look forward to > learning about something new! :) I should ask though...are there alternatives already bundled with Python that I could use? Now that you mention it, I remember something called HTMLParser (or something like that) a

Re: What's the best way to write this regular expression?

2012-03-06 Thread John Salerno
On Tuesday, March 6, 2012 4:52:10 PM UTC-6, Chris Rebert wrote: > On Tue, Mar 6, 2012 at 2:43 PM, John Salerno wrote: > > I sort of have to work with what the website gives me (as you'll see > > below), but today I encountered an exception to my RE. Let me just give all > > the specific informat

Re: What's the best way to write this regular expression?

2012-03-06 Thread Chris Rebert
On Tue, Mar 6, 2012 at 2:43 PM, John Salerno wrote: > I sort of have to work with what the website gives me (as you'll see below), > but today I encountered an exception to my RE. Let me just give all the > specific information first. The point of my script is to go to the specified > URL and e