Re: url no longer working as expected

2024-06-28 Thread Richard Gaskin via use-livecode
Likely the case. The expense of collating all that data and presenting it to 
their site visitors is considerable.  They use advertising to cover those 
costs.  If the data were easily scrapable, scrapers would diminish revenue, 
putting the resource itself at risk.

Some data providers offer APIs.  When you see anti-scraping measures, look for 
API options (I saw none there, but I didn't look deeply).  APIs take fewer 
resources to deliver, and may have strategic benefit for some data brokers.

But if they have scrape-prevention and no API, they're sending a clear signal: 
"We need to pay our bills, please send your traffic to our page so we can do 
that."

That said, I've come across stock APIs before, and while I don't recall many 
free ones, there likely are some.

Richard Gaskin
FourthWorld.com



Paul Dupuis wrote:

> I get a response from Yahoo that is an HTML page with 404
> information as part of it.
...
> I think this is Yahoo Finance not being able to detect the
> browser type and intentionally returning a 404 as a method
> of deterring screen scraping.
>
>
On 6/28/2024 1:04 PM, Hugh Senior via use-livecode wrote:
...
>> Problem:
>> Enter "https://uk.finance.yahoo.com/quote/SHEL.L/history/" into
>> any web browser and the page is displayed as expected.
>>
>> Using LC's URL command to access the same page directly returns a
>> 404:
>> put url "https://uk.finance.yahoo.com/quote/SHEL.L/history/"
>>
>> Anyone got any insights?

___
use-livecode mailing list
use-livecode@lists.runrev.com
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-livecode


Re: url no longer working as expected

2024-06-28 Thread Paul Dupuis via use-livecode

Yes.

put url "https://uk.finance.yahoo.com/quote/SHEL.L/history/"

In the message box on 9.6.11 and 9.6.12 under Windows 11, both return a 
pile of HTML text: all the formatting and CSS-linked markup needed to 
show a "404" page.


This suggests that put URL is working and that it is the Yahoo server that 
is returning a different page of HTML/CSS for the put than when you enter 
the URL in a browser (Firefox in my case, where I get the Yahoo Finance 
data for Shell, although I did have to respond to a cookies dialog first).
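
One way to probe the browser-detection theory is to repeat the request with a browser-like User-Agent header and compare what comes back. The following is only a sketch: the handler name checkUserAgentEffect and the User-Agent string are made up for illustration, and whether the server keys on that header at all (rather than on cookies or JavaScript) is an assumption, so the result may well be the same 404 page.

```livecode
on checkUserAgentEffect
   local tURL, tPage, tError, tStatusLine
   put "https://uk.finance.yahoo.com/quote/SHEL.L/history/" into tURL

   -- Assumption: an illustrative browser-like User-Agent string
   set the httpHeaders to "User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64)"
   put url tURL into tPage
   -- the result is empty on success, otherwise it describes the error
   put the result into tError

   -- First line of the headers the server sent back for the last request
   put line 1 of libURLLastRHHeaders() into tStatusLine

   -- Clear the custom headers so later requests are unaffected
   set the httpHeaders to empty

   -- Report to the message box
   put "Error:" && tError & return & \
         "Status:" && tStatusLine & return & \
         "Length:" && the number of chars in tPage
end checkUserAgentEffect
```

Running it once as written and once with the "set the httpHeaders" line commented out should show whether the header makes any difference to the status line or to the amount of HTML returned.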



On 6/28/2024 2:03 PM, Bob Sneidar via use-livecode wrote:

Did you try that in the message box?

Bob S






Re: url no longer working as expected

2024-06-28 Thread Bob Sneidar via use-livecode
Did you try that in the message box? 

Bob S


> On Jun 28, 2024, at 10:50 AM, Paul Dupuis via use-livecode wrote:
> 
> I get a response from Yahoo that is an HTML page with 404 information as 
> part of it.
> 
> This happens under LC 9.6.12 and 9.6.11
> 
> I think this is Yahoo Finance not being able to detect the browser type and 
> intentionally returning a 404 as a method of deterring screen scraping.




Re: url no longer working as expected

2024-06-28 Thread Paul Dupuis via use-livecode
I get a response from Yahoo that is an HTML page with 404 information 
as part of it.


This happens under LC 9.6.12 and 9.6.11

I think this is Yahoo Finance not being able to detect the browser type 
and intentionally returning a 404 as a method of deterring screen scraping.



On 6/28/2024 1:04 PM, Hugh Senior via use-livecode wrote:

Platform: Windows 11, LC 9.6.12
Query: Using URL to access a web page

Problem:
Enter "https://uk.finance.yahoo.com/quote/SHEL.L/history/" into any web
browser and the page is displayed as expected.

Using LC's URL command to access the same page directly returns a 404:
put url "https://uk.finance.yahoo.com/quote/SHEL.L/history/"

Anyone got any insights?

Hugh Senior




Re: url no longer working as expected

2024-06-28 Thread Bob Sneidar via use-livecode
I get the HTML of the page. Are you trying to open the page in a browser? 

Bob S


> On Jun 28, 2024, at 10:04 AM, Hugh Senior via use-livecode wrote:
> 
> 
> Platform: Windows 11, LC 9.6.12
> Query: Using URL to access a web page
> 
> Problem:
> Enter "https://uk.finance.yahoo.com/quote/SHEL.L/history/" into any web
> browser and the page is displayed as expected.
> 
> Using LC's URL command to access the same page directly returns a 404:
> put url "https://uk.finance.yahoo.com/quote/SHEL.L/history/"
> 
> Anyone got any insights?
> 
> Hugh Senior


url no longer working as expected

2024-06-28 Thread Hugh Senior via use-livecode


Platform: Windows 11, LC 9.6.12
Query: Using URL to access a web page

Problem:
Enter "https://uk.finance.yahoo.com/quote/SHEL.L/history/" into any web
browser and the page is displayed as expected.

Using LC's URL command to access the same page directly returns a 404:
put url "https://uk.finance.yahoo.com/quote/SHEL.L/history/"

Anyone got any insights?

Hugh Senior

