Re: GitHub could be acquired by Microsoft

Nick Sabalausky (Abscissa) via Digitalmars-d-announce Sat, 09 Jun 2018 00:11:36 -0700

On 06/08/2018 06:02 PM, Brad Roberts wrote:

Essentially (if not actually) everything on github is available throughtheir api's. No need for scraping or other heroics to gather it.

That does make things a little bit simpler, but web scraping reallyisn't all that much more complicated.

Whether web API or web scraping: Either way, you still have to submit anHTTP request, parse the results according to the format the server haschosen to spit out, and possibly follow up with additional HTTPrequests. The main differences are just: Web scraping can occasionallyget thwarted by changes in the webapp's presentation layer. Whereas webAPI can occasionally get thwarted by business rules changing whatis/isn't accessible via API (this has been known to happen).

Ie, scraping needs to deal with UI changes, but unlike API, it cannot beselectively hindered/disabled (unless the primary website itself ishindered/disabled, too).

Thus, a robust tool will support both published web API and webscraping, and select the answers from whichever one works.

Re: GitHub could be acquired by Microsoft

Reply via email to