Thank you Andrew for the info about Jsoup - I had never heard of it. the jar files to compile and run can be downloaded from:
https://jsoup.org/download api is at: https://jsoup.org/apidocs/ -----Original Message----- From: IBM Mainframe Discussion List [mailto:IBM-MAIN@LISTSERV.UA.EDU] On Behalf Of Andrew Rowley Sent: Thursday, April 27, 2017 3:19 AM To: IBM-MAIN@LISTSERV.UA.EDU Subject: Re: How to pull webpage into batch job I would suggest Java as well. There are open source libraries that can do the HTML parsing too e.g. Jsoup. I just tested this example on z/OS, it worked (fetch the Wikipedia home page and list items from the In the news section): import java.io.IOException; import org.jsoup.Jsoup; import org.jsoup.nodes.Document; import org.jsoup.nodes.Element; import org.jsoup.select.Elements; public class JsoupTest { public static void main(String[] args) throws IOException { Document doc = Jsoup.connect("http://en.wikipedia.org/").get(); Elements newsHeadlines = doc.select("#mp-itn li"); for (Element e : newsHeadlines) { System.out.println(e.text()); } } } -- Andrew Rowley Black Hill Software +61 413 302 386 ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to lists...@listserv.ua.edu<mailto:lists...@listserv.ua.edu> with the message: INFO IBM-MAIN ________________________________ This e-mail, including any attachments, may be confidential, privileged or otherwise legally protected. It is intended only for the addressee. If you received this e-mail in error or from someone who was not authorized to send it to you, do not disseminate, copy or otherwise use this e-mail or its attachments. Please notify the sender immediately by reply e-mail and delete the e-mail from your system. ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN