Re: [SLUG] shell script to parse html and date comp ?

2014-02-13 Thread Rick Welykochy
li...@sbt.net.au wrote: I would like to fetch date/time from html file, and use date comparison and make an ics/vcal file eventually I found a few pages on the web that discuss what you are doing: "web scraping", using python as it happens. http://www.rexx.com/~dkuhlman/quixote_htmlscraping.htm

Re: [SLUG] shell script to parse html and date comp ?

2014-02-13 Thread Rick Welykochy
li...@sbt.net.au wrote: that works well, the other item I need to grab is 'Duration', which can be 2 or 3 digits as below; is there a way to have egrep get such 2 or 3 digits ? Duration: 60 or ...120... Use the ? operator, which means that a match is optional, i.e. egrep -i 'duration' | egr

Re: [SLUG] shell script to parse html and date comp ?

2014-02-13 Thread lists
On Thu, February 13, 2014 10:54 pm, Michael Chesterton wrote: Michael, thanks > try > > sudo grep -r en_US /etc > > also the locale command on systems prints what everything is set for > > $ locale > LANG=en_AU.UTF-8 locale was OK # locale LANG=en_AU.UTF-8 LC_CTYPE="en_AU.UTF-8" LC_NUMERIC="en_

Re: [SLUG] shell script to parse html and date comp ?

2014-02-13 Thread lists
On Thu, February 13, 2014 11:55 am, Rick Welykochy wrote: >> Start Date Time: > rowspan="1">20/03/2014 1400 Thursday >> >> is 'grep -o' the way to go ? what regex do I need where I put ? >> >> grep -o 'Start Date Time: > colspan="1" rowspan="1">' > I would recommend egrep and use the foll

Re: [SLUG] shell script to parse html and date comp ?

2014-02-13 Thread Michael Chesterton
On 13/02/14 22:47, li...@sbt.net.au wrote: > Jiri, thanks that didn't seem to stick, I've now edited > "/etc/sysconfig/i18n" ,that seems to return correct LANG # set | grep > LC_ # set | grep LANG LANG=en_AU.UTF-8 BUT, using 'date' it still > wants m/d/y # date --date='20/03/2014' +"%s" date: inval

Re: [SLUG] shell script to parse html and date comp ?

2014-02-13 Thread lists
On Thu, February 13, 2014 4:55 pm, Jiří Baum wrote: >> where should I change it to en_au ? > > LANG=en_AU.UTF-8; export LANG Jiri, thanks that didn't seem to stick, I've now edited "/etc/sysconfig/i18n" ,that seems to return correct LANG # set | grep LC_ # set | grep LANG LANG=en_AU.UTF-8 BU

Re: [SLUG] shell script to parse html and date comp ?

2014-02-12 Thread Jiří Baum
Hi, > Rick, Jiri, > many thanks, I'm trying to get this done; > egrep works good, I was trying to shortcut date comparison thing by > converting to seconds, BUT, hit a snag as my system is set to: > # set | grep LANG > LANG=en_US.UTF-8 check also the LC_* variables: set | grep LC_ > # da

Re: [SLUG] shell script to parse html and date comp ?

2014-02-12 Thread lists
> I would recommend egrep and use the following extended regular > expression: > > > egrep -o '[0-9]{2}/[0-9]{2}/[0-9]{4}[[:space:]][0-9]{4}' Rick, Jiri, many thanks, I'm trying to get this done; egrep works good, I was trying to shortcut date comparison thing by converting to seconds, BUT, hi

Re: [SLUG] shell script to parse html and date comp ?

2014-02-12 Thread Rick Welykochy
li...@sbt.net.au wrote: I would like to fetch date/time from html file, and use date comparison and make an ics/vcal file eventually the date comes as so: Start Date Time: 20/03/2014 1400 Thursday is 'grep -o' the way to go ? what regex do I need where I put ? grep -o 'Start Date Time: ?

Re: [SLUG] shell script to parse html and date comp ?

2014-02-12 Thread Jiří Baum
Hi, Voytek: > I would like to fetch date/time from html file, and use date comparison > and make an ics/vcal file eventually Hmm, sounds like you might want to use a different programming language, like python, perl or something, rather than just a shell script. It can be done in shell script, bu

[SLUG] shell script to parse html and date comp ?

2014-02-12 Thread lists
I would like to fetch date/time from html file, and use date comparison and make an ics/vcal file eventually the date comes as so: Start Date Time: 20/03/2014 1400 Thursday is 'grep -o' the way to go ? what regex do I need where I put ? grep -o 'Start Date Time: ' ? what do I need to