Re: [fpc-devel] The 15k bounty: Optimizing executable speed for Linux x86 / LLVM

2018-12-04 Thread Sven Barth via fpc-devel
On Wed., 5 Dec. 2018, 00:06, J. Gareth Moreton wrote: > The more you learn! What is TLS, curiously? Given that I do a lot of my > work at the assembler level, I figure this is something I should know! > Thread Local Storage. For ELF they are pseudo instructions that the assembler needs

Re: [fpc-devel] Optimization theory

2018-12-04 Thread J. Gareth Moreton
Not sure if this was intended for the group mailing list or not, but it was only e-mailed to me. One question though... in your example, what optimisation level are you compiling under? I thought the peephole optimizer already changes things like "lea (%ecx,%eax),%eax" to "add %ecx,%eax".  N

Re: [fpc-devel] The 15k bounty: Optimizing executable speed for Linux x86 / LLVM

2018-12-04 Thread J. Gareth Moreton
The more you learn!  What is TLS, curiously? Given that I do a lot of my work at the assembler level, I figure this is something I should know! Gareth On Tue 04/12/18 22:48 , Simon Kissel simon.kis...@nerdherrschaft.com sent: Hi Florian, > Do you compile with -Aas? The internal assemblers

Re: [fpc-devel] The 15k bounty: Optimizing executable speed for Linux x86 / LLVM

2018-12-04 Thread Simon Kissel
Hi Florian, > Do you compile with -Aas? The internal assemblers do not support TLS yet, > this is WIP. Ah wow! -Aas does indeed help. Both the assembler errors and the internal error are gone, both in Linux i386 and ARM. And the created binaries even work. Nice! Thank you! Cheers, Simon

Re: [fpc-devel] The 15k bounty: Optimizing executable speed for Linux x86 / LLVM

2018-12-04 Thread Florian Klämpfl
On 04.12.2018 at 02:16, Simon Kissel wrote: > Hi Florian, > > > > we are currently trying to do some real-life benchmarks with our > products, however with rev. 40346 compilation fails with the two following > showstoppers: Do you compile with -Aas? The internal assemblers do not support TLS yet

Re: [fpc-devel] The 15k bounty: Optimizing executable speed for Linux x86 / LLVM

2018-12-04 Thread J. Gareth Moreton
At the moment I'm trying to fix some Linux bugs.  When I compile Lazarus with it, I get about a 5% speed increase for -O1 and a 15% speed increase for -O3, but someone else reported a 2% slowdown for -O2 on their own test project.  Either way, I hope my fundamental theory is sound... reducing the

Re: [fpc-devel] The 15k bounty: Optimizing executable speed for Linux x86 / LLVM

2018-12-04 Thread Simon Kissel
Hi Gareth, > A regression like this is quite serious. I'd recommend opening a > bug report with a reproducible case so we can investigate and hopefully fix > it within the day. created a test project, and opened two tickets: https://bugs.freepascal.org/view.php?id=34646 https://bugs.freepascal

Re: [fpc-devel] wrong step-over with fpc debug info / how to do objcopy on Mac, or strip .debug_frame

2018-12-04 Thread Jonas Maebe
On 03/12/18 13:58, Martin wrote: Posts by "bigDan": http://forum.lazarus-ide.org/index.php/topic,42869.msg303599.html#msg303599 The log he provided shows that - lldb got a "thread step-over" - lldb believed to have stopped at the end of step-over (not any other reason): "stop reason = step over