Re: [Ur] Drop of several orders of magnitude in Techempower benchmarks

2019-08-06 Thread Oisín Mac Fhearaí
On Tue, 6 Aug 2019 at 19:00, Adam Chlipala wrote:

> On 8/5/19 5:17 PM, Oisín Mac Fhearaí wrote:
> > [...]
> > It would seem that Unicode characters are being stripped from the
> > output, causing the test to fail. I'm not familiar with exactly what
> > the test is trying to do, and I don't know much about how Ur handles
> > UTF-8.
> That's odd.  I see the Unicode characters when I run that benchmark
> locally with a recent Git checkout of Ur/Web.  Are you sure you ran the
> database-setup scripts properly?  What happens when you query the
> database manually?  Are the right characters there?
>

I didn't run the database-setup scripts manually; the "tfb" script at the
repo root does that. I also tested one of the Go frameworks the same way:
"/tfb --mode benchmark --test fasthttp-postgresql --type fortune", which
seems to pass the test. When I ran the benchmark with the Ubuntu package
version of urweb, it also passed the test (albeit many, many times more
slowly).

To double-check though, I built an image from
toolset/databases/postgres/postgres.dockerfile and saw that there are
actually two duplicate tables: "fortune" and "Fortune". That's curious, but
they contain the same 12 rows (including the non-ASCII UTF-8 characters)
anyway.

It is a bit puzzling, because my local Ur/Web version seems to have no
problem showing UTF-8 text from a table.

___
Ur mailing list
Ur@impredicative.com
http://www.impredicative.com/cgi-bin/mailman/listinfo/ur


Re: [Ur] Drop of several orders of magnitude in Techempower benchmarks

2019-08-06 Thread Adam Chlipala

On 8/5/19 5:17 PM, Oisín Mac Fhearaí wrote:

> Update! The good news:
> I was able to update the Dockerfile to build Ur/Web from the latest
> release tarball (basically, using the old round 16 Dockerfile with a
> couple of small fixes like installing libicu-dev) and compare the
> benchmarks with the version installed with apt from the Ubuntu repo.
> The version built from the latest release was over ten times faster,
> even running on my old laptop.

Very interesting finding!  I've asked the Debian packager if he can
think of some build-process change there that would have introduced a
slowdown.

> The bad news:
> The latest version of Ur appears to fail the "fortunes" test with the
> following diff (there is more, but this seems to explain it):
>
> fortune: -6Emacs is a nice operating system, but I prefer UNIX. — Tom Christaensen
> fortune: +6Emacs is a nice operating system, but I prefer UNIX.  Tom Christaensen
>
> fortune: @@ -17 +17 @@
> fortune: -12フレームワークのベンチマーク
> fortune: +12
>
> It would seem that Unicode characters are being stripped from the
> output, causing the test to fail. I'm not familiar with exactly what
> the test is trying to do, and I don't know much about how Ur handles
> UTF-8.

That's odd.  I see the Unicode characters when I run that benchmark
locally with a recent Git checkout of Ur/Web.  Are you sure you ran the
database-setup scripts properly?  What happens when you query the
database manually?  Are the right characters there?
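
For anyone trying to narrow this down, one way to see exactly which
characters go missing is to compare an expected and an actual line from
the diff character by character. What follows is only a minimal Python
sketch and not part of the TechEmpower tooling or of Ur/Web; the helper
names are made up for illustration, and the sample strings are copied
from the diff quoted above.

```python
from collections import Counter
import unicodedata


def dropped_chars(expected, actual):
    """Return {char: count} for characters that occur more often
    in `expected` than in `actual` (hypothetical helper)."""
    return dict(Counter(expected) - Counter(actual))


def only_non_ascii_dropped(expected, actual):
    """True if at least one character was dropped and every dropped
    character lies outside the ASCII range (hypothetical helper)."""
    missing = dropped_chars(expected, actual)
    return bool(missing) and all(ord(ch) > 127 for ch in missing)


# Sample lines taken from the "fortunes" diff in this thread.
expected = ("Emacs is a nice operating system, but I prefer UNIX. "
            "\u2014 Tom Christaensen")
actual = ("Emacs is a nice operating system, but I prefer UNIX. "
          " Tom Christaensen")

if __name__ == "__main__":
    for ch, count in dropped_chars(expected, actual).items():
        # Print the code point and Unicode name of each missing character.
        print(f"U+{ord(ch):04X} {unicodedata.name(ch, 'UNKNOWN')} x{count}")
    print("only non-ASCII dropped:", only_non_ascii_dropped(expected, actual))
```

If only non-ASCII characters show up as dropped, that would be consistent
with the output path losing multi-byte UTF-8 sequences rather than with
bad data in the database, though it does not prove where the loss
happens.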

