You get the HTML as an object first, then you can pull what you want from it.
If you want to parse it as a string, just use CFHTTP. On Wed, Nov 16, 2011 at 2:04 PM, Jason King <[email protected]> wrote: > doc = html( FileRead("http://www.openbd.org/") > > Is the function pulling the html serviced by openbd.org and parsing it? > > I just want to parse a string, not an entire html document. Or, is it getting > the 'string' by 'fileread' so what's captured by fileread IS A STRING of html > characters, not any different than the string I'm providing? > > > On Wed, Nov 16, 2011 at 4:01 PM, Jason King <[email protected]>wrote: > >> I'm looking at it now.. >> >> do i have to use CFSCRIPT? >> >> >> On Wed, Nov 16, 2011 at 4:00 PM, Alex Skinner <[email protected]> wrote: >> >>> Did you have a look at the example docs I sent >>> >>> So the example code on the manual page would be pretty close ? >>> >>> <cfscript> >>> doc = html( FileRead("http://www.openbd.org/") ); >>> >>> allLinks = doc.select("a[href]"); >>> for ( x = 1; x <= ArrayLen(allLinks); x++ ){ >>> WriteOutput("<li>" & allLinks[x].attr("href") & " / " & >>> allLinks[x].text() & "</li>" ); >>> } >>> </cfscript> >>> >>> >>> Or the example on jsoup ? >>> >>> http://jsoup.org/cookbook/extracting-data/attributes-text-html >>> >>> Alex >>> >>> >>> >>> On 16 November 2011 21:56, Jason King <[email protected]> wrote: >>> >>>> ahh.. i never really work with J-Query >>>> >>>> How would I use it to get 'http://google.com' out of the link ref >>>> below? >>>> >>>> <a href="http://google.com"></a> >>>> >>>> Assume >>>> >>>> <CFSET var = '<a href="http://google.com"></a>'> >>>> >>>> How would I apply html() to 'var' to strip that out? >>>> >>>> >>>> >>>> On Wed, Nov 16, 2011 at 3:48 PM, Alex Skinner <[email protected]> wrote: >>>> >>>>> Now you can do JQuery-like selectors on it >>>>> >>>>> You've just been given a document object >>>>> >>>>> http://www.openbluedragon.org/manual/?/function/html >>>>> >>>>> http://jsoup.org/ >>>>> >>>>> For some samples >>>>> >>>>> Alex >>>>> >>>>> On 16 November 2011 21:44, Jason King <[email protected]> wrote: >>>>> >>>>>> Upgraded.... >>>>>> >>>>>> I have a string called 'body' and I'm applying it to the function. >>>>>> >>>>>> <cfset demovarA = "#html(body)#"> >>>>>> >>>>>> What's going on here? debugging is giving me this entire array as the >>>>>> value of demovarA >>>>>> >>>>>> *object of org.jsoup.nodes.Document* >>>>>> Methods clone (returns org.jsoup.nodes.Node) >>>>>> >>>>>> clone (returns org.jsoup.nodes.Element) >>>>>> >>>>>> clone (returns org.jsoup.nodes.Document) >>>>>> clone (returns java.lang.Object) >>>>>> head (returns org.jsoup.nodes.Element) >>>>>> body (returns org.jsoup.nodes.Element) >>>>>> text (returns org.jsoup.nodes.Element) >>>>>> nodeName (returns java.lang.String) >>>>>> createElement (returns org.jsoup.nodes.Element) >>>>>> normalise (returns org.jsoup.nodes.Document) >>>>>> title (returns void) >>>>>> title (returns java.lang.String) >>>>>> createShell (returns org.jsoup.nodes.Document) >>>>>> quirksMode (returns org.jsoup.nodes.Document) >>>>>> quirksMode (returns org.jsoup.nodes.Document$QuirksMode) >>>>>> outerHtml (returns java.lang.String) >>>>>> outputSettings (returns org.jsoup.nodes.Document$OutputSettings) >>>>>> parent (returns org.jsoup.nodes.Element) >>>>>> parent (returns org.jsoup.nodes.Node) >>>>>> addClass (returns org.jsoup.nodes.Element) >>>>>> equals (returns boolean) >>>>>> toString (returns java.lang.String) >>>>>> append (returns org.jsoup.nodes.Element) >>>>>> hashCode (returns int) >>>>>> wrap (returns org.jsoup.nodes.Node) >>>>>> wrap (returns org.jsoup.nodes.Element) >>>>>> empty (returns org.jsoup.nodes.Element) >>>>>> val (returns java.lang.String) >>>>>> val (returns org.jsoup.nodes.Element) >>>>>> after (returns org.jsoup.nodes.Node) >>>>>> after (returns org.jsoup.nodes.Node) >>>>>> after (returns org.jsoup.nodes.Element) >>>>>> after (returns org.jsoup.nodes.Element) >>>>>> before (returns org.jsoup.nodes.Element) >>>>>> before (returns org.jsoup.nodes.Node) >>>>>> before (returns org.jsoup.nodes.Element) >>>>>> before (returns org.jsoup.nodes.Node) >>>>>> id (returns java.lang.String) >>>>>> attr (returns org.jsoup.nodes.Element) >>>>>> attr (returns org.jsoup.nodes.Node) >>>>>> prepend (returns org.jsoup.nodes.Element) >>>>>> children (returns org.jsoup.select.Elements) >>>>>> className (returns java.lang.String) >>>>>> text (returns java.lang.String) >>>>>> data (returns java.lang.String) >>>>>> child (returns org.jsoup.nodes.Element) >>>>>> appendChild (returns org.jsoup.nodes.Element) >>>>>> select (returns org.jsoup.select.Elements) >>>>>> html (returns java.lang.String) >>>>>> html (returns org.jsoup.nodes.Element) >>>>>> getElementById (returns org.jsoup.nodes.Element) >>>>>> tagName (returns org.jsoup.nodes.Element) >>>>>> tagName (returns java.lang.String) >>>>>> tag (returns org.jsoup.parser.Tag) >>>>>> dataset (returns java.util.Map) >>>>>> getAllElements (returns org.jsoup.select.Elements) >>>>>> classNames (returns org.jsoup.nodes.Element) >>>>>> classNames (returns java.util.Set) >>>>>> isBlock (returns boolean) >>>>>> parents (returns org.jsoup.select.Elements) >>>>>> prependChild (returns org.jsoup.nodes.Element) >>>>>> appendElement (returns org.jsoup.nodes.Element) >>>>>> prependElement (returns org.jsoup.nodes.Element) >>>>>> appendText (returns org.jsoup.nodes.Element) >>>>>> prependText (returns org.jsoup.nodes.Element) >>>>>> siblingElements (returns org.jsoup.select.Elements) >>>>>> nextElementSibling (returns org.jsoup.nodes.Element) >>>>>> previousElementSibling (returns org.jsoup.nodes.Element) >>>>>> firstElementSibling (returns org.jsoup.nodes.Element) >>>>>> elementSiblingIndex (returns java.lang.Integer) >>>>>> lastElementSibling (returns org.jsoup.nodes.Element) >>>>>> getElementsByTag (returns org.jsoup.select.Elements) >>>>>> getElementsByClass (returns org.jsoup.select.Elements) >>>>>> getElementsByAttribute (returns org.jsoup.select.Elements) >>>>>> getElementsByAttributeStarting (returns org.jsoup.select.Elements) >>>>>> getElementsByAttributeValue (returns org.jsoup.select.Elements) >>>>>> getElementsByAttributeValueNot (returns org.jsoup.select.Elements) >>>>>> getElementsByAttributeValueStarting (returns >>>>>> org.jsoup.select.Elements) >>>>>> getElementsByAttributeValueEnding (returns org.jsoup.select.Elements) >>>>>> getElementsByAttributeValueContaining (returns >>>>>> org.jsoup.select.Elements) >>>>>> getElementsByAttributeValueMatching (returns >>>>>> org.jsoup.select.Elements) >>>>>> getElementsByAttributeValueMatching (returns >>>>>> org.jsoup.select.Elements) >>>>>> getElementsByIndexLessThan (returns org.jsoup.select.Elements) >>>>>> getElementsByIndexGreaterThan (returns org.jsoup.select.Elements) >>>>>> getElementsByIndexEquals (returns org.jsoup.select.Elements) >>>>>> getElementsContainingText (returns org.jsoup.select.Elements) >>>>>> getElementsContainingOwnText (returns org.jsoup.select.Elements) >>>>>> getElementsMatchingText (returns org.jsoup.select.Elements) >>>>>> getElementsMatchingText (returns org.jsoup.select.Elements) >>>>>> getElementsMatchingOwnText (returns org.jsoup.select.Elements) >>>>>> getElementsMatchingOwnText (returns org.jsoup.select.Elements) >>>>>> ownText (returns java.lang.String) >>>>>> hasText (returns boolean) >>>>>> hasClass (returns boolean) >>>>>> removeClass (returns org.jsoup.nodes.Element) >>>>>> toggleClass (returns org.jsoup.nodes.Element) >>>>>> remove (returns void) >>>>>> replaceWith (returns void) >>>>>> attr (returns java.lang.String) >>>>>> childNode (returns org.jsoup.nodes.Node) >>>>>> attributes (returns org.jsoup.nodes.Attributes) >>>>>> nextSibling (returns org.jsoup.nodes.Node) >>>>>> previousSibling (returns org.jsoup.nodes.Node) >>>>>> ownerDocument (returns org.jsoup.nodes.Document) >>>>>> unwrap (returns org.jsoup.nodes.Node) >>>>>> baseUri (returns java.lang.String) >>>>>> setBaseUri (returns void) >>>>>> childNodes (returns java.util.List) >>>>>> absUrl (returns java.lang.String) >>>>>> siblingIndex (returns int) >>>>>> hasAttr (returns boolean) >>>>>> removeAttr (returns org.jsoup.nodes.Node) >>>>>> siblingNodes (returns java.util.List) >>>>>> wait (returns void) >>>>>> wait (returns void) >>>>>> wait (returns void) >>>>>> getClass (returns java.lang.Class) >>>>>> notify (returns void) >>>>>> notifyAll (returns void) >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> official tag/function reference: http://openbd.org/manual/ >>>>>> mailing list - http://groups.google.com/group/openbd?hl=en >>>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> Alex Skinner >>>>> Managing Director >>>>> Pixl8 Interactive >>>>> >>>>> Tel: +448452600726 >>>>> Email: [email protected] >>>>> Web: pixl8.co.uk >>>>> >>>>> -- >>>>> official tag/function reference: http://openbd.org/manual/ >>>>> mailing list - http://groups.google.com/group/openbd?hl=en >>>>> >>>> >>>> -- >>>> official tag/function reference: http://openbd.org/manual/ >>>> mailing list - http://groups.google.com/group/openbd?hl=en >>>> >>> >>> >>> >>> -- >>> Alex Skinner >>> Managing Director >>> Pixl8 Interactive >>> >>> Tel: +448452600726 >>> Email: [email protected] >>> Web: pixl8.co.uk >>> >>> -- >>> official tag/function reference: http://openbd.org/manual/ >>> mailing list - http://groups.google.com/group/openbd?hl=en >>> >> >> > -- > official tag/function reference: http://openbd.org/manual/ > mailing list - http://groups.google.com/group/openbd?hl=en > -- Matthew Woodward [email protected] http://blog.mattwoodward.com identi.ca / Twitter: @mpwoodward Please do not send me proprietary file formats such as Word, PowerPoint, etc. as attachments. http://www.gnu.org/philosophy/no-word-attachments.html -- official tag/function reference: http://openbd.org/manual/ mailing list - http://groups.google.com/group/openbd?hl=en
