Re: I'm new at this

2010-01-17 Thread James Taylor
cool

fortunately i don't have to deal much with Word html files these days :)

On Sun, Jan 17, 2010 at 7:22 PM, Kerri Hicks  wrote:
> On Sun, Jan 17, 2010 at 7:05 PM, James Taylor  wrote:
>> Dreamweaver does a great job of cleaning up Word HTML & Inline CSS
>>
>> see
>>
>> http://stackoverflow.com/questions/1929011/what-is-the-best-way-to-get-clean-semantic-xhtml-from-ms-word-documents
>
>
> I also wrote a tutorial on migrating from Word HTML to valid HTML.
> It's a few years old, but the basics are the same.
>
> http://www.stg.brown.edu/edu/tips/word_to_html_with_bbedit_1.html
>
> --Kerri
-- 
You received this message because you are subscribed to the 
"BBEdit Talk" discussion group on Google Groups.
To post to this group, send email to bbedit@googlegroups.com
To unsubscribe from this group, send email to
bbedit+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/bbedit?hl=en
If you have a feature request or would like to report a problem, 
please email "supp...@barebones.com" rather than posting to the group.

Re: I'm new at this

2010-01-17 Thread Kerri Hicks
On Sun, Jan 17, 2010 at 7:05 PM, James Taylor  wrote:
> Dreamweaver does a great job of cleaning up Word HTML & Inline CSS
>
> see
>
> http://stackoverflow.com/questions/1929011/what-is-the-best-way-to-get-clean-semantic-xhtml-from-ms-word-documents


I also wrote a tutorial on migrating from Word HTML to valid HTML.
It's a few years old, but the basics are the same.

http://www.stg.brown.edu/edu/tips/word_to_html_with_bbedit_1.html

--Kerri

Re: I'm new at this

2010-01-17 Thread James Taylor
Dreamweaver does a great job of cleaning up Word HTML & Inline CSS

see

http://stackoverflow.com/questions/1929011/what-is-the-best-way-to-get-clean-semantic-xhtml-from-ms-word-documents

On Thu, Jan 14, 2010 at 11:56 PM, Robert A. Rosenberg  wrote:
> At 19:48 +0100 on 01/14/2010, Bucky Junior wrote about Re: I'm new at this:
>
>> First of all, I've seen huge blocks of code unnecessarily filled with CSS
>> style definitions where it was completely insane. Then, the code is
>> generally non-compliant with W3C specifications. Lastly, by knowing some
>> html coding and cascading style sheets, you can probably do the same page
>> with 1/20th the volume of code.
>
> The problem is twofold. First, no version of Office since Office 97
> (Windows)/98 (Macintosh) has output pure HTML (i.e., no CSS). Every version on
> both Windows and Macintosh has put out HTML+CSS (not a problem if the CSS is
> good; see the second point below). The junk CSS that you see is there so that
> Office can read the HTML as if it were a DOC version of the file (it
> contains CSS that allows the file to be round-tripped, so that when it is read
> back you end up with the same content/formatting as you would have had if you
> had saved as .doc/.docx instead of .html). There is a setting that lets you
> save the HTML WITHOUT the round-trip crud. You still, however, run into the
> second problem: the CSS is verbose and badly formed. For example, instead of
> defining the default paragraph with a rule for the P tag, this
> information is stored as a CLASS definition, and EVERY P tag carries the
> class= reference.
> --
>
> Robert A. Rosenberg
> RAR Programming Systems Ltd.
> Home: 845-357-0931
> Cell: 646-479-1984
> Fax:  646-349-4025

Re: I'm new at this

2010-01-14 Thread Robert A. Rosenberg

At 13:59 -0500 on 01/14/2010, Rich Siegel wrote about Re: I'm new at this:

> On 1/14/10 at 1:48 PM, buckyjunior...@googlemail.com (Bucky Junior)
> wrote:
>
>> Easily? BBEdit will open it easily but my experience with getting
>> documents that Word had made into html has been abysmal and
>> frustrating.
>
> Try this: <http://www.stg.brown.edu/edu/tips/word_to_html_with_bbedit_1.html>.


While this helps, there are still problems. Tidy, while it does remove
the Office crud, does not put out correct CSS and HTML. First, it tags
every paragraph as <p class="style1"> instead of defining a P rule
with the settings that are assigned to style1. Worse, in the tests I
ran, it does not even define style1. The correct method is to have
the .style1 and p rules the same. Each paragraph is then just <p>. If
there needs to be a style change in the middle of a paragraph (other
than  or ) you use a SPAN assigning a CSS class for bold,
italic, whatever, and then declare a  for the rest
of that paragraph if needed.
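
The cleanup described above (give the style1 settings to a plain p rule and
drop the class attribute from each paragraph) could be sketched roughly like
this in Python. This is only an illustration, not something Tidy or BBEdit
does: the function name hoist_paragraph_class is made up, the regular
expressions assume double-quoted class attributes and simple one-selector CSS
rules, and real Word or Tidy output will likely need more care.

```python
import re
from collections import Counter

# Matches <p class="something"> (assumes double quotes and no other attributes).
P_CLASS = re.compile(r'<p\s+class="([^"]+)"\s*>', re.IGNORECASE)


def hoist_paragraph_class(html, css):
    """Move the most common paragraph class onto a plain `p` rule.

    Returns (new_html, new_css). Hypothetical helper for illustration only.
    """
    classes = Counter(P_CLASS.findall(html))
    if not classes:
        return html, css
    default, _count = classes.most_common(1)[0]

    # Rewrite the ".style1 {" or "p.style1 {" selector as a bare "p" selector.
    selector = re.compile(
        r'(?<![\w-])(?:p)?\.' + re.escape(default) + r'(?=\s*\{)'
    )
    new_css = selector.sub("p", css)

    # Every paragraph that used the default class becomes a bare <p>.
    tag = re.compile(
        r'<p\s+class="' + re.escape(default) + r'"\s*>', re.IGNORECASE
    )
    new_html = tag.sub("<p>", html)
    return new_html, new_css
```

Inline changes keep their own class on a SPAN, so something like
<span class="style2"> inside a paragraph is left untouched by this sketch.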

--

Robert A. Rosenberg
RAR Programming Systems Ltd.
Home: 845-357-0931
Cell: 646-479-1984
Fax:  646-349-4025

Re: I'm new at this

2010-01-14 Thread Robert A. Rosenberg

At 19:48 +0100 on 01/14/2010, Bucky Junior wrote about Re: I'm new at this:

> First of all, I've seen huge blocks of code unnecessarily filled
> with CSS style definitions where it was completely insane. Then, the
> code is generally non-compliant with W3C specifications. Lastly, by
> knowing some html coding and cascading style sheets, you can
> probably do the same page with 1/20th the volume of code.


The problem is twofold. First, no version of Office since
Office 97 (Windows)/98 (Macintosh) has output pure HTML (i.e., no CSS).
Every version on both Windows and Macintosh has put out HTML+CSS (not
a problem if the CSS is good; see the second point below). The junk CSS
that you see is there so that Office can read the HTML as if it were
a DOC version of the file (it contains CSS that allows the file to be
round-tripped, so that when it is read back you end up with the same
content/formatting as you would have had if you had saved as
.doc/.docx instead of .html). There is a setting that lets you save
the HTML WITHOUT the round-trip crud. You still, however, run into the
second problem: the CSS is verbose and badly formed. For example,
instead of defining the default paragraph with a rule for the P tag,
this information is stored as a CLASS definition, and EVERY
P tag carries the class= reference.

--

Robert A. Rosenberg
RAR Programming Systems Ltd.
Home: 845-357-0931
Cell: 646-479-1984
Fax:  646-349-4025

Re: i'm new at this

2010-01-14 Thread Jack Web
At 7:41 AM -0800 1/14/10, shimon apisdorf wrote:
>i'm building my first site using a template at godaddy.
>
>i have lots of documents in word for mac and want to copy them on to
>pages for the site.
>
>will BBEdit allow me to copy from word to BBE to webpages easily?
>retain formatting?
>
>thanks
>any guidance much appreciated
>shimon

My way of going at this is to use TextEdit:

1. Open the Word document in TextEdit.

2. "Save As", choosing HTML as your output format.

3. Copy the text (encoded HTML) from the resulting document into your page.

4. Clean with tidy if necessary (usually not needed).

If I did it a lot I would build an AppleScript using "textutil" to perform the
conversions, pipe the output to tidy, and save it.

See "man textutil" in Terminal.
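
A rough sketch of that textutil-to-tidy pipeline, written in Python rather
than AppleScript. It assumes macOS (for textutil) and a tidy binary on the
PATH; the function names and the choice of tidy options are illustrative
assumptions, so check "man textutil" and "tidy -h" before relying on them.

```python
import subprocess
from typing import List


def textutil_command(doc_path: str) -> List[str]:
    # -convert html: target format; -stdout: write the result to standard output
    return ["textutil", "-convert", "html", "-stdout", doc_path]


def tidy_command() -> List[str]:
    # -q: suppress non-essential output; -asxhtml: emit XHTML
    return ["tidy", "-q", "-asxhtml"]


def word_to_html(doc_path: str, out_path: str) -> None:
    """Convert doc_path with textutil, clean it with tidy, save to out_path."""
    raw = subprocess.run(
        textutil_command(doc_path), check=True, capture_output=True
    ).stdout
    # tidy exits with status 1 when it only has warnings, so check=True
    # would be too strict here; treat only status 2 and above as failure.
    tidied = subprocess.run(tidy_command(), input=raw, capture_output=True)
    if tidied.returncode > 1:
        raise RuntimeError(tidied.stderr.decode(errors="replace"))
    with open(out_path, "wb") as fh:
        fh.write(tidied.stdout)
```

With a hypothetical file name, the call would look like
word_to_html("chapter1.doc", "chapter1.html").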

HTH
JW

Re: i'm new at this

2010-01-14 Thread klanxner
If you merely want to copy the Word text and then tag it with
your own HTML, you can just copy and paste to BBEdit. Then,
from the Text menu, apply Convert to ASCII and then Straighten
Quotes. Then use Zap Gremlins to make sure the previous commands
caught everything. That will give you clean
plain text ready for HTML, in most cases. :-)
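
For anyone doing the same cleanup outside BBEdit, here is a small Python
sketch of the idea. It only approximates what those three menu commands do
and is not BBEdit's implementation; the character tables and function names
are assumptions chosen for illustration.

```python
# Curly quotes as typically pasted from Word, mapped to straight quotes.
QUOTES = {"\u2018": "'", "\u2019": "'", "\u201c": '"', "\u201d": '"'}

# A few other typographic characters with plain ASCII stand-ins.
PUNCT = {"\u2013": "-", "\u2014": "--", "\u2026": "...", "\u00a0": " "}


def straighten_quotes(text):
    """Replace curly single and double quotes with straight ones."""
    return text.translate(str.maketrans(QUOTES))


def to_ascii(text):
    """Replace common typographic punctuation with ASCII equivalents."""
    return text.translate(str.maketrans(PUNCT))


def find_gremlins(text):
    """List (index, character) for anything that is not printable ASCII,
    a newline, or a tab, so leftovers can be reviewed by hand."""
    return [
        (i, ch)
        for i, ch in enumerate(text)
        if not (ch in "\n\t" or 32 <= ord(ch) < 127)
    ]
```

Running find_gremlins last mirrors the "make sure" step: an empty list means
the text is plain ASCII and ready to be tagged.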


Ken

sjd...@gmail.com (shimon apisdorf) wrote on 1/14/10 7:41 AM:

> i'm building my first site using a template at godaddy.
>
> i have lots of documents in word for mac and want to copy them on to
> pages for the site.
>
> will BBEdit allow me to copy from word to BBE to webpages easily?
> retain formatting?
>
> thanks
> any guidance much appreciated
> shimon



--
Simple Lives Web Design
http://simplelives.com


Re: i'm new at this

2010-01-14 Thread Rich Siegel

On 1/14/10 at 1:48 PM, buckyjunior...@googlemail.com (Bucky Junior)
wrote:

> Easily? BBEdit will open it easily but my experience with getting
> documents that Word had made into html has been abysmal and
> frustrating.

Try this: <http://www.stg.brown.edu/edu/tips/word_to_html_with_bbedit_1.html>.

R.
--
Rich Siegel Bare Bones Software, Inc.
  

Someday I'll look back on all this and laugh... until they 
sedate me.



Re: i'm new at this

2010-01-14 Thread Bucky Junior
If you "Export" or "Save As" an html file (I think that Word does
that) you will get an html file that BBEdit can open and work with.
When displayed, your new html file will kinda look like how you
designed it.


Easily? BBEdit will open it easily but my experience with getting  
documents that Word had made into html has been abysmal and frustrating.


First of all, I've seen huge blocks of code unnecessarily filled with
CSS style definitions where it was completely insane. Then, the code
is generally non-compliant with W3C specifications. Lastly, by knowing
some html coding and cascading style sheets, you can probably do the
same page with 1/20th the volume of code.


That said, give it a try. Then take some time to learn some html and  
css if you haven't already. Then return to your pages after a year or  
so and see what you think.


Bucky

On Jan 14, 2010, at 4:41 PM, shimon apisdorf wrote:

> i'm building my first site using a template at godaddy.
>
> i have lots of documents in word for mac and want to copy them on to
> pages for the site.
>
> will BBEdit allow me to copy from word to BBE to webpages easily?
> retain formatting?
>
> thanks
> any guidance much appreciated
> shimon



i'm new at this

2010-01-14 Thread shimon apisdorf
i'm building my first site using a template at godaddy.

i have lots of documents in word for mac and want to copy them on to
pages for the site.

will BBEdit allow me to copy from word to BBE to webpages easily?
retain formatting?

thanks
any guidance much appreciated
shimon