On 05/04/17 13:29, Wilco Dijkstra wrote:
> Code scheduling for Cortex-A53 isn't as good as it could be. It turns out
> code runs faster overall if we place loads and stores with a dependency
> closer together. To achieve this effect, this patch adds a bypass between
> cortex_a53_load1 and cortex_
ping
From: Wilco Dijkstra
Sent: 05 April 2017 13:29
To: GCC Patches
Cc: nd; James Greenhalgh
Subject: [PATCH][AArch64] Model Cortex-A53 load forwarding
Code scheduling for Cortex-A53 isn't as good as it could be. It turns out
code runs faster overall if we place loads and stores w
Code scheduling for Cortex-A53 isn't as good as it could be. It turns out
code runs faster overall if we place loads and stores with a dependency
closer together. To achieve this effect, this patch adds a bypass between
cortex_a53_load1 and cortex_a53_load*/cortex_a53_store* if the result of an
e