Re: How to pull webpage into batch job

Barkow, Eileen Thu, 27 Apr 2017 06:51:54 -0700

Thank you, Andrew, for the info about Jsoup; I had never heard of it.

The jar files needed to compile and run can be downloaded from:

https://jsoup.org/download

The API documentation is at:

https://jsoup.org/apidocs/

-----Original Message-----
From: IBM Mainframe Discussion List [mailto:IBM-MAIN@LISTSERV.UA.EDU] On Behalf 
Of Andrew Rowley
Sent: Thursday, April 27, 2017 3:19 AM
To: IBM-MAIN@LISTSERV.UA.EDU
Subject: Re: How to pull webpage into batch job


I would suggest Java as well. There are open source libraries that can
do the HTML parsing too, e.g. Jsoup.

I just tested this example on z/OS and it worked (it fetches the Wikipedia
home page and lists the items from the "In the news" section):

import java.io.IOException;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

public class JsoupTest {
    public static void main(String[] args) throws IOException {
        // Fetch the page and parse it into a DOM.
        Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
        // Select every list item inside the element with id "mp-itn".
        Elements newsHeadlines = doc.select("#mp-itn li");
        for (Element e : newsHeadlines) {
            System.out.println(e.text());
        }
    }
}
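
If the Jsoup jar cannot be added to the job's classpath, the fetch step on its
own can probably be done with the standard library. The following is only a
minimal sketch and is not part of the tested example above: the class name
PageFetch and its methods readAll and fetch are hypothetical, the 10-second
timeouts are arbitrary, and the page is assumed to be encoded as UTF-8. It is
written to compile on Java 8.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class; not part of Jsoup or of the original example.
public class PageFetch {

    // Read a stream to the end and decode the bytes as UTF-8 (assumed encoding).
    static String readAll(InputStream in) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        byte[] chunk = new byte[8192];
        int n;
        while ((n = in.read(chunk)) != -1) {
            buf.write(chunk, 0, n);
        }
        return new String(buf.toByteArray(), StandardCharsets.UTF_8);
    }

    // Fetch a page over HTTP(S) and return the response body as text.
    static String fetch(String address) throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) new URL(address).openConnection();
        conn.setConnectTimeout(10000); // milliseconds; arbitrary choice
        conn.setReadTimeout(10000);    // milliseconds; arbitrary choice
        try (InputStream in = conn.getInputStream()) {
            return readAll(in);
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        // The URL is taken from the first argument so the job can supply it.
        if (args.length == 0) {
            System.err.println("usage: java PageFetch <url>");
            return;
        }
        System.out.println(fetch(args[0]));
    }
}
```

It would be run as: java PageFetch http://en.wikipedia.org/
Decoding explicitly as UTF-8 matters on z/OS, where the default charset is
typically an EBCDIC code page. The output is raw HTML, so picking out
individual elements would still need a parser such as Jsoup.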



--
Andrew Rowley
Black Hill Software
+61 413 302 386



----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN

