On 6/6/19 3:41 AM, Bas Nieuwenhuizen wrote:
On Wed, Jun 5, 2019 at 12:04 PM Samuel Pitoiset
wrote:
On 6/5/19 2:51 AM, Bas Nieuwenhuizen wrote:
On Thu, May 30, 2019 at 4:02 PM Samuel Pitoiset
wrote:
From the Vulkan spec 1.1.109:
"Some implementations may need to evaluate
vkrunner recently added a new option (--device-id=) for selecting
the physical device. This is quite useful in a multi-GPU scenario.
This allows to set some vkrunner options via vkrunner:extra_args
in the configuration file or with PIGLIT_VKRUNNER_EXTRA_ARGS.
Signed-off-by: Samuel Pitoiset
Reviewed-by: Samuel Pitoiset
On 6/5/19 5:09 PM, Connor Abbott wrote:
While we're here, copy the comment explaining this from radeonsi.
---
src/amd/common/ac_nir_to_llvm.c | 8 +++-
1 file changed, 7 insertions(+), 1 deletion(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd
On 6/5/19 2:51 AM, Bas Nieuwenhuizen wrote:
On Thu, May 30, 2019 at 4:02 PM Samuel Pitoiset
wrote:
From the Vulkan spec 1.1.109:
"Some implementations may need to evaluate depth image values
while performing image layout transitions. To accommodate this,
inst
On 6/5/19 2:37 AM, Bas Nieuwenhuizen wrote:
On Thu, May 30, 2019 at 4:02 PM Samuel Pitoiset
wrote:
This will be used for the depth decompress pass that might need
to emit variable sample locations during layout transitions.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta.c
For some reasons, this actually introduced rendering issues with
Far Cry 3 (and probably Far Cry 4). I'm reverting it for now
until I figure out the right fix.
See the following link for reference:
https://github.com/ValveSoftware/Proton/issues/727#issuecomment-498638025
Cc: 19.0 19.1
This
Patches 1-2:
Reviewed-by: Samuel Pitoiset
On 6/4/19 2:02 AM, Marek Olšák wrote:
From: Marek Olšák
---
src/amd/common/ac_llvm_build.c | 74 +-
1 file changed, 37 insertions(+), 37 deletions(-)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common
It's unsupported, only load/store format with vec3 are supported.
v2: - allow to use load/store format with vec3
Fixes: 6970a9a6ca9 ("ac,radv: remove the vec3 restriction with LLVM 9+")"
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c| 12 ++--
On 6/3/19 9:33 PM, Marek Olšák wrote:
SI doesn't support buffer_load_dwordx3 and buffer_store_dwordx3, but
it supports buffer_load_format_xyz and buffer_store_format_xyz.
OK, I will update.
Marek
On Mon, Jun 3, 2019 at 9:09 AM Samuel Pitoiset
mailto:samuel.pitoi...@gmail.com>>
The driver should only fast depth clears with the graphics path
when the view covers all image layers, otherwise this might
corrupt layers when HTILE is enabled.
Cc: 19.0 19.1 mesa-sta...@lists.freedesktop.org
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_clear.c | 1 +
1 file
On 6/3/19 3:48 PM, Matt Arsenault wrote:
On Jun 3, 2019, at 9:13 AM, Samuel Pitoiset wrote:
I thought LLVM was able to handle that itself but actually it
does not. That means we shouldn't try to emit vec3 on SI because
it's unsupported.
It should. Can you file a bug with an example
I thought LLVM was able to handle that itself but actually it
does not. That means we shouldn't try to emit vec3 on SI because
it's unsupported.
Fixes: 6970a9a6ca9 ("ac,radv: remove the vec3 restriction with LLVM 9+")"
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_
On 6/3/19 9:46 AM, Alex Smith wrote:
On Sun, 2 Jun 2019 at 11:59, Bas Nieuwenhuizen
mailto:b...@basnieuwenhuizen.nl>> wrote:
On Sun, Jun 2, 2019 at 12:32 PM Alex Smith
mailto:asm...@feralinteractive.com>>
wrote:
>
> Put the uncached GTT type at a higher index than the
Reviewed-by: Samuel Pitoiset
On 6/2/19 12:32 PM, Alex Smith wrote:
Put the uncached GTT type at a higher index than the visible VRAM type,
rather than having GTT first.
When we don't have dedicated VRAM, we don't have a non-visible VRAM
type, and the property flags for GTT and visible VRAM
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_device.c | 8
src/amd/vulkan/radv_extensions.py | 2 +-
2 files changed, 1 insertion(+), 9 deletions(-)
diff --git a/src/amd/vulkan/radv_device.c b/src/amd/vulkan/radv_device.c
index 3cf050ed220..16efc6134f1 100644
--- a/src
If VK_EXT_sample_locations is used, the driver might need to emit
the sample locations specified during layout transitions.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_decompress.c | 9 -
1 file changed, 8 insertions(+), 1 deletion(-)
diff --git a/src/amd/vulkan
This is now supported.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_image.c | 7 ---
1 file changed, 7 deletions(-)
diff --git a/src/amd/vulkan/radv_image.c b/src/amd/vulkan/radv_image.c
index b69dda39ce1..161997ae196 100644
--- a/src/amd/vulkan/radv_image.c
+++ b/src/amd/vulkan
matic
layout transition has to take place. [...] and
VkRenderPassSampleLocationsBeginInfoEXT can be chained from
VkRenderPassBeginInfo to provide sample locations for layout
transitions performed implicitly by a render pass instance."
Signed-off-by: Samuel Pitoiset
---
src/
ransitions performed
with CmdWaitEvents() or CmdPipelineBarrier().
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 35 +---
1 file changed, 28 insertions(+), 7 deletions(-)
diff --git a/src/amd/vulkan/radv_cmd_buffer.c b/src/amd/vulkan/radv_cmd_buff
of this series enables VK_EXT_sample_locations.
Please review,
thanks!
Samuel Pitoiset (8):
radv: allow to save/restore sample locations during meta operations
radv: allow to set dynamic sample locations to the depth decompress
pass
radv: allow the depth decompress pass to emit dynamic
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_pass.c| 18 ++
src/amd/vulkan/radv_private.h | 3 ++-
2 files changed, 20 insertions(+), 1 deletion(-)
diff --git a/src/amd/vulkan/radv_pass.c b/src/amd/vulkan/radv_pass.c
index 4d1e38a780e..8fa098eaac8 100644
--- a/src
This will be used for the depth decompress pass that might need
to emit variable sample locations during layout transitions.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta.c | 20
src/amd/vulkan/radv_meta.h | 2 ++
2 files changed, 22 insertions(+)
diff --git
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 3 ++-
src/amd/vulkan/radv_meta.h| 6 --
src/amd/vulkan/radv_meta_decompress.c | 29 +++
3 files changed, 31 insertions(+), 7 deletions(-)
diff --git a/src/amd/vulkan
This might fix initial subpass transitions when multiview is used.
Noticed while implementing sample locations during layout transitions.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 5 +++--
1 file changed, 3 insertions(+), 2 deletions(-)
diff --git a/src/amd/vulkan
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta.c | 14 --
1 file changed, 8 insertions(+), 6 deletions(-)
diff --git a/src/amd/vulkan/radv_meta.c b/src/amd/vulkan/radv_meta.c
index ec4fc4a6d4b..721c7039830 100644
--- a/src/amd/vulkan/radv_meta.c
+++ b/src/amd/vulkan
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta.c | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/src/amd/vulkan/radv_meta.c b/src/amd/vulkan/radv_meta.c
index 721c7039830..3bfe49b4dab 100644
--- a/src/amd/vulkan/radv_meta.c
+++ b/src/amd/vulkan/radv_meta.c
The driver should already support this without any changes.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_device.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/src/amd/vulkan/radv_device.c b/src/amd/vulkan/radv_device.c
index d04dcbed89e..16efc6134f1 100644
AM Samuel Pitoiset
wrote:
It makes sense to use the image view formats when resolving
inside subpasses, while we have to use the image formats for
normal resolves.
Original patch by Philip Rebohle.
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110348
Signed-off-by: Samuel Pitoiset
If the driver waits for CP DMA to be idle and emit an EOP event
we need more space.
This fixes a crash with Quake Champions.
Cc:
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/src/amd/vulkan
Make sure to sync all previous work if the given command buffer
has pending active queries. Otherwise the GPU might write queries
data after the reset operation.
This fixes a bunch of new dEQP-VK.query_pool.* CTS failures.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c
It makes sense to use the image view formats when resolving
inside subpasses, while we have to use the image formats for
normal resolves.
Original patch by Philip Rebohle.
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110348
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve.c | 21 +
1 file changed, 13 insertions(+), 8 deletions(-)
diff --git a/src/amd/vulkan/radv_meta_resolve.c
b/src/amd/vulkan/radv_meta_resolve.c
index b4551a98637..2b97c42fc69 100644
--- a/src/amd/vulkan
On 5/27/19 6:48 PM, Bas Nieuwenhuizen wrote:
On Mon, May 27, 2019 at 5:38 PM Samuel Pitoiset
wrote:
Only supported with vkCreateRenderPass2().
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_pass.c| 30 +-
src/amd/vulkan/radv_private.h | 3 +++
2
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve_cs.c | 56 +--
1 file changed, 36 insertions(+), 20 deletions(-)
diff --git a/src/amd/vulkan/radv_meta_resolve_cs.c
b/src/amd/vulkan/radv_meta_resolve_cs.c
index 76612672b1c..67df4800023 100644
This makes use of radv_meta_resolve_compute_image() by filling
a VkImageResolve region instead of duplicating code.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve_cs.c | 85 +--
1 file changed, 29 insertions(+), 56 deletions(-)
diff --git a/src/amd
This path supports layers but it requires to decompress HTILE
before resolving. The driver also needs to fixup HTILE after
the resolve. This path is probably slower than the graphics one.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve_cs.c | 491
The driver might need to clear one aspect of the depth/stencil
resolve attachment before performing the resolve itself.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_clear.c | 73
1 file changed, 55 insertions(+), 18 deletions(-)
diff --git a/src
Only supported with vkCreateRenderPass2().
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_pass.c| 30 +-
src/amd/vulkan/radv_private.h | 3 +++
2 files changed, 32 insertions(+), 1 deletion(-)
diff --git a/src/amd/vulkan/radv_pass.c b/src/amd/vulkan
It's required to decompress HTILE before resolving with the
compute path.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve.c | 26 ++
1 file changed, 26 insertions(+)
diff --git a/src/amd/vulkan/radv_meta_resolve.c
b/src/amd/vulkan
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_device.c | 21 +
src/amd/vulkan/radv_extensions.py | 1 +
2 files changed, 22 insertions(+)
diff --git a/src/amd/vulkan/radv_device.c b/src/amd/vulkan/radv_device.c
index 4203074a4cd..ead6d63f1c5 100644
--- a/src
When using graphics, the driver doesn't need to decompress HTILE
before resolving. This path currently doesn't support layers
so we have to fallback to the compute path.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve_fs.c | 597 +-
src/amd/vulkan
Only fallback to the compute path for layers.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve.c | 77 +-
1 file changed, 64 insertions(+), 13 deletions(-)
diff --git a/src/amd/vulkan/radv_meta_resolve.c
b/src/amd/vulkan/radv_meta_resolve.c
index
but it requires a decompression pass.
All CTS tests pass on Polaris10 and Vega10.
Please review,
Thanks!
Samuel Pitoiset (7):
radv: record if a render pass has depth/stencil resolve attachments
radv: implement all depth/stencil resolve modes using graphics
radv: implement all depth/stencil
Based on ANV.
v2: - remove the if statement
- update the comment
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve.c | 12 +++-
1 file changed, 3 insertions(+), 9 deletions(-)
diff --git a/src/amd/vulkan/radv_meta_resolve.c
b/src/amd/vulkan/radv_meta_resolve.c
On 5/26/19 2:50 PM, Bas Nieuwenhuizen wrote:
On Wed, May 22, 2019 at 11:20 AM Samuel Pitoiset
wrote:
Based on ANV.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve.c | 21 -
1 file changed, 12 insertions(+), 9 deletions(-)
diff --git a/src/amd
From the Vulkan spec 1.1.108:
"vkCmdCopyQueryPoolResults is guaranteed to see the effect of
previous uses of vkCmdResetQueryPool in the same queue, without any
additional synchronization."
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_qu
during layout
transitions?
AFAIK those are needed for e.g. HTILE decompression as it is based on
equations somehow.
On Thu, May 16, 2019 at 11:51 AM Samuel Pitoiset
mailto:samuel.pitoi...@gmail.com>> wrote:
>
> Basically, this extension allows applicat
: - change sampleLocationCoordinateRange[1] to 0.9375
- compute and emit PA_SC_CENTROID_PRIORITY_{0,1}
- rebased on top of master
- some cleanups
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 223 ++
src/amd/vulkan/radv_device.c
The old code was not wrong because the transitions performed
after the resolves should re-emit the framebuffer if needed.
This change is mostly a no-op but it improves consistency
regarding other meta operations that need to save/restore subpasses.
Signed-off-by: Samuel Pitoiset
---
src/amd
11:51 AM Samuel Pitoiset
mailto:samuel.pitoi...@gmail.com>> wrote:
>
> Basically, this extension allows applications to use custom
> sample locations. It doesn't support variable sample locations
> during subpass. Note that we don't have to upload the use
Just move the block that checks the availability bit into the
switch like other query types.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_query.c | 12 +---
1 file changed, 5 insertions(+), 7 deletions(-)
diff --git a/src/amd/vulkan/radv_query.c b/src/amd/vulkan/radv_query.c
This helper will be useful for clearing HTILE after some
depth/stencil resolves.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 7 +--
src/amd/vulkan/radv_meta.h | 3 +++
src/amd/vulkan/radv_meta_clear.c | 12
3 files changed, 16 insertions(+), 6
Based on ANV.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_resolve.c | 21 -
1 file changed, 12 insertions(+), 9 deletions(-)
diff --git a/src/amd/vulkan/radv_meta_resolve.c
b/src/amd/vulkan/radv_meta_resolve.c
index ade5d438438..6a8abce1ddb 100644
--- a/src
to zero, and it uses a COND_WRITE to packet to
set the bit if the timestamp is ready.
This fixes
dEQP-VK.pipeline.timestamp.misc_tests.reset_query_before_copy.
Signed-off-by: Samuel Pitoiset
---
src/amd/common/sid.h| 3 +++
src/amd/vulkan/radv_query.c | 31 +--
2
From the Vulkan spec 1.1.108:
"After query pool creation, each query must be reset before
it is used."
So, the driver doesn't need to do this at creation time.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_query.c | 3 ---
1 file changed, 3 deletions(-)
diff --git
This removes one useless SMEM load operations which pointed to
the same descriptor anyway.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_fmask_expand.c | 27 +++--
1 file changed, 3 insertions(+), 24 deletions(-)
diff --git a/src/amd/vulkan
The driver will emit GLC=1.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_fmask_expand.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/src/amd/vulkan/radv_meta_fmask_expand.c
b/src/amd/vulkan/radv_meta_fmask_expand.c
index 12533935348..44068edc89e 100644
--- a/src/amd
Both input and output images use the same type.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_meta_fmask_expand.c | 9 +++--
1 file changed, 3 insertions(+), 6 deletions(-)
diff --git a/src/amd/vulkan/radv_meta_fmask_expand.c
b/src/amd/vulkan/radv_meta_fmask_expand.c
index
: - change sampleLocationCoordinateRange[1] to 0.9375
- compute and emit PA_SC_CENTROID_PRIORITY_{0,1}
- rebased on top of master
- some cleanups
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 223 ++
src/amd/vulkan/radv_device.c
It should be 7, not 8.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/si_cmd_buffer.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/src/amd/vulkan/si_cmd_buffer.c b/src/amd/vulkan/si_cmd_buffer.c
index 9f6f92a42be..c73c6ecd65c 100644
--- a/src/amd/vulkan/si_cmd_buffer.c
The driver only supports up to 8 samples.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_private.h | 1 -
src/amd/vulkan/si_cmd_buffer.c | 50 --
2 files changed, 51 deletions(-)
diff --git a/src/amd/vulkan/radv_private.h b/src/amd/vulkan
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/si_cmd_buffer.c | 19 ---
1 file changed, 16 insertions(+), 3 deletions(-)
diff --git a/src/amd/vulkan/si_cmd_buffer.c b/src/amd/vulkan/si_cmd_buffer.c
index 56d7a9a3682..9f6f92a42be 100644
--- a/src/amd/vulkan/si_cmd_buffer.c
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 2 +-
src/amd/vulkan/radv_pipeline.c | 2 +-
src/amd/vulkan/radv_private.h| 4 +-
src/amd/vulkan/si_cmd_buffer.c | 166 ++-
4 files changed, 76 insertions(+), 98 deletions(-)
diff
This fixes some CTS failures related to VK_EXT_sample_locations.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 15 +--
1 file changed, 13 insertions(+), 2 deletions(-)
diff --git a/src/amd/vulkan/radv_cmd_buffer.c b/src/amd/vulkan/radv_cmd_buffer.c
index
On 5/15/19 9:26 AM, Dave Airlie wrote:
From: Marek Olšák
It would be nice to have a reasoning why, I assume that is what they
are called internally and in the kernel, but it would be good to
mention something to justify it in the commit msg.
Probably because Navi will be GFX10? :-)
VK_AMD_draw_indirect_count has been promoted with the suffix
changed to KHR.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 49 ---
src/amd/vulkan/radv_extensions.py | 2 +-
2 files changed, 1 insertion(+), 50 deletions(-)
diff --git a/src
Nothing to do.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_device.c | 6 ++
src/amd/vulkan/radv_extensions.py | 1 +
2 files changed, 7 insertions(+)
diff --git a/src/amd/vulkan/radv_device.c b/src/amd/vulkan/radv_device.c
index 6daf9f48182..b197d7f9ec8 100644
--- a/src/amd
We need to know the number of rectangles.
This fixes new CTS dEQP-VK.draw.discard_rectangles.dynamic_*.
Fixes: 5db0bf99944 ("radv: Implement VK_EXT_discard_rectangles.")
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_pipeline.c | 10 ++
1 file changed, 6 insert
("ac/nir: only use the new raw/struct image atomic
intrinsics with LLVM 9+")
Signed-off-by: Samuel Pitoiset
Reviewed-by: Bas Nieuwenhuizen
---
src/amd/common/ac_nir_to_llvm.c | 12 +++-
src/amd/common/ac_shader_abi.h| 1 +
src/amd/vulkan/radv_nir_to_llvm.c | 6 +
Err no, my mistake.
I will write a backport.
On 5/8/19 10:54 AM, Samuel Pitoiset wrote:
You mean 19.1?
On 5/7/19 8:29 PM, Dylan Baker wrote:
Hi Samuel,
This doesn't apply cleanly on 19.0, and I'm not sure how to resolve
the diff.
Could you provide a packport please?
Thanks,
Dylan
You mean 19.1?
On 5/7/19 8:29 PM, Dylan Baker wrote:
Hi Samuel,
This doesn't apply cleanly on 19.0, and I'm not sure how to resolve the diff.
Could you provide a packport please?
Thanks,
Dylan
Quoting Samuel Pitoiset (2019-05-03 02:45:34)
Because the new raw/struct intrinsics are buggy
LGTM.
Thanks for double checking Tim!
On 5/8/19 1:27 AM, Timothy Arceri wrote:
On 8/5/19 1:51 am, Samuel Pitoiset wrote:
What games are affected btw?
PERCENTAGE DELTAS Shaders SGPRs VGPRs SpillSGPR CodeSize
MaxWaves
batman-arkham-city 2581
What games are affected btw?
Can you please double check before pushing because of the flrp changes
that landed around?
On 5/7/19 7:14 AM, Timothy Arceri wrote:
ping!
On 2/5/19 1:38 pm, Timothy Arceri wrote:
The pattern of calling opt algebraic first seems to have originated
in i965. The
On 5/7/19 5:30 PM, Ian Romanick wrote:
On 5/7/19 8:20 AM, Samuel Pitoiset wrote:
This introduces glitches with Talos and Serious Sam 2017 with RADV...
Are you able to reproduce the problem with ANV?
Probably not very easily. If you can figure out which shader it is, it
should be easy
This introduces glitches with Talos and Serious Sam 2017 with RADV...
Are you able to reproduce the problem with ANV?
On 5/7/19 8:01 AM, GitLab Mirror wrote:
Module: Mesa
Branch: master
Commit: 5b908db604b2f47bb8382047533e556db8d5f52b
URL:
://reviews.llvm.org/D61313.
Cc: "19.0" "19.1"
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_util.c | 7 ---
src/amd/common/ac_llvm_util.h | 1 +
src/amd/vulkan/radv_debug.h | 1 +
src/amd/vulkan/radv_device.c | 8
src/amd/vulkan/radv_shader.c | 2 ++
Seems fine to,
Reviewed-by: Samuel Pitoiset
Bas, any comments?
On 5/7/19 7:14 AM, Timothy Arceri wrote:
ping!
On 2/5/19 1:38 pm, Timothy Arceri wrote:
The pattern of calling opt algebraic first seems to have originated
in i965. The order in OpenGL drivers generally doesn't matter
because
On 5/3/19 12:39 PM, Nicolai Hähnle wrote:
On 30.04.19 21:20, Marek Olšák wrote:
Why can we not use tbuffer loads?
tbuffer_load_format has the exact same limitations as
buffer_load_format. They both use the same hardware path, the only
difference is that tbuffer_load_format gets the format
nsics with LLVM 9+")
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 12 +++-
src/amd/common/ac_shader_abi.h| 1 +
src/amd/vulkan/radv_nir_to_llvm.c | 6 ++
3 files changed, 14 insertions(+), 5 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/
This restores the previous behaviour before YCBCR landed. For D+S
formats, it returns the depth format.
This fixes an assertion with Thrones of Britannia.
Fixes: 66507cc6563 ("radv: Add single plane image views & meta operations")
Signed-off-by: Samuel Pitoiset
---
src/amd/vulka
Other GFX9 chips aren't affected.
Cc: "19.0" "19.1"
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_nir_to_llvm.c | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/src/amd/vulkan/radv_nir_to_llvm.c
b/src/amd/vulkan/radv_nir_to_llvm.c
index b4a1
On 5/2/19 5:35 PM, Marek Olšák wrote:
This bug is only on Vega10 and Raven1.
Yep, I figured after pushing...
I will update the workaround.
Thanks Marek!
Marek
On Wed, May 1, 2019 at 10:10 AM Samuel Pitoiset
mailto:samuel.pitoi...@gmail.com>> wrote:
This chip doesn'
653664 -> 8630316 (-0.27 %) bytes
Max Waves: 60811 -> 60737 (-0.12 %)
v3: - rebase on top of master
- remove the restriction for SSBO stores as well
v2: - fix llvm 8
Signed-off-by: Samuel Pitoiset
---
I plan to run benchmarks with that change.
src/amd/common/ac_llvm_build.c
According to RadeonSI, this seems to be required by the hardware
to avoid GPU hangs. I think I just forgot to set that bit when I
implemented VK_EXT_transform_feedback.
This fixes a GPU hang with Space Engineers and DXVK.
Cc: "19.0" "19.1"
Signed-off-by: Samuel Pitoiset
that
028ce527395642b68612d10c6030be5d4706a65e also was, and it looks like I
did not CC that to 19.0, partially because of a lot of dependencies.
Might make more sense to disable the ext on the 19.0 branch.
Yes, that seems better to disable it. Can you send a patch?
On Wed, May 1, 2019 at 9:56 PM Samuel
wait what? Are we backporting a new feature into a stable branch? Do we
really need that?
On 5/1/19 6:35 PM, GitLab Mirror wrote:
Module: Mesa
Branch: staging/19.0
Commit: a1cdab7bd4ecad7c1c518fb8430613a7559d4f9d
URL:
This chip doesn't need the fixup. This fixes a bunch of
dEQP-VK.tessellation tests and avoid random GPU hangs.
Cc: "19.0"
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_nir_to_llvm.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/src/amd/vulkan/radv_nir_to_llvm.c
b/src/
: e45ba51ea45 ("radv: add support for VK_EXT_conditional_rendering")
Reported-by: Philip Rebohle
Signed-off-by: Samuel Pitoiset
Reviewed-by: Bas Nieuwenhuizen (v1)
---
src/amd/vulkan/radv_cmd_buffer.c | 47 ++--
1 file changed, 44 insertions(+), 3 deletions(-)
diff -
Reviewed-by: Samuel Pitoiset
On 4/27/19 11:47 AM, Rhys Perry wrote:
It previously used var->type instead of deref_instr->type and didn't
handle 64-bit outputs.
This fixes lots of transform feedback CTS tests involving transform
feedback and geometry shaders (mostl
Otherwise it returns "AMD RADV unknown".
Cc: 19.0
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_device.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/src/amd/vulkan/radv_device.c b/src/amd/vulkan/radv_device.c
index b7f991fce72..8340bf52c11 100644
--- a/src/
Thanks for your investigations Rhys. As discussed over IRC, this makes
sense to me.
Reviewed-by: Samuel Pitoiset
On 4/26/19 3:48 PM, Rhys Perry wrote:
https://reviews.llvm.org/rL356946 (present in LLVM 9 and later) changed
the meaning of the "system" sync scope, making it
On 4/26/19 1:58 PM, Samuel Pitoiset wrote:
Per the Vulkan spec 1.1.107, the predicate is a 32-bit value. Though
the AMD hardware treats it as a 64-bit value which means it might
fail to discard.
I don't know why this extension has been drafted like that but this
definitely not fit with AMD
value for the predicate, so we need to implement a workaround.
This fixes an issue when DXVK enables conditional rendering with RADV.
Fixes: e45ba51ea45 ("radv: add support for VK_EXT_conditional_rendering")
Reported-by: Philip Rebohle
Signed-off-by: Samuel Pitoiset
Reviewed-by: Bas Nie
For consistency with ac_build_llvm8_buffer_{load,store}_common
helpers and that will help a bit for removing the vec3 restriction.
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 26 +-
1 file changed, 13 insertions(+), 13 deletions(-)
diff --git
Reviewed-by: Samuel Pitoiset
Thanks fo the fix Tim!
On 4/25/19 3:17 AM, Timothy Arceri wrote:
We were only setting the used mask for the first component of a
varying. Since the linking opts split vectors into scalars this
has mostly worked ok.
However this causes an issue where for example
Reviewed-by: Samuel Pitoiset
On 4/24/19 2:54 PM, Bas Nieuwenhuizen wrote:
Reviewed-by: Bas Nieuwenhuizen
On Wed, Apr 24, 2019 at 2:40 PM Eleni Maria Stea wrote:
Before setting the physical device API version, we should check if the
MESA_VK_VERSION_OVERRIDE environment variable is set
looks okay?
Thanks,
Dylan
Quoting Samuel Pitoiset (2019-04-16 08:24:01)
I don't have much context for that issue, so:
Acked-by: Samuel Pitoiset
On 4/12/19 10:15 PM, Marek Ol\u0161 k wrote:
Done locally.
Marek
On Fri, Apr 12, 2019 at 12:20 PM Samuel Pitoiset wrote:
I
On 4/23/19 5:16 PM, Jason Ekstrand wrote:
On Tue, Apr 23, 2019 at 7:46 AM Samuel Pitoiset
mailto:samuel.pitoi...@gmail.com>> wrote:
On 4/23/19 10:45 AM, Bas Nieuwenhuizen wrote:
> On Tue, Apr 23, 2019 at 9:35 AM Samuel Pitoiset
> mailto:samuel.pitoi...@gmail.com
On 4/23/19 10:45 AM, Bas Nieuwenhuizen wrote:
On Tue, Apr 23, 2019 at 9:35 AM Samuel Pitoiset
wrote:
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_shader.c | 2 +-
src/compiler/nir/nir.h | 3 ++-
src/compiler/nir/nir_opt_if.c
Per the Vulkan spec this is definitely invalid but X4 Foundations
does that and it ends up by hanging the GPU.
Found while enabling validation layers with the game. The issue
will be reported to the developers.
Cc: "19.0" mesa-sta...@lists.freedesktop.org
Signed-off-by: Samue
301 - 400 of 7132 matches
Mail list logo