Re: Blur effect on live scene?

Kevin Rushforth Thu, 13 Aug 2015 12:34:40 -0700

Seems there are a few different issues here. I'll weigh in on a coupleof them. To summarize what Jim has already said:


* We always do GPU hardware rendering wherever possible

* Node caching was intended as a hint to the system to cache the resultof potentially expensive effects into a texture (which is rendered intoby the GPU) and re-render that texture when needed. Jim was suggestingthat an application might be able to use that mechanism to do somethingother than its original purpose.

* Snapshot was intended to allow an application to take (as the namesuggests) a snapshot of a scene or node rendered into an image andreturn that image to the application. We don't use snapshot internally,rather various "render and then get the results" operations likesnapshot, node caching, and rendering to a JFXPanel all use similarmechanisms which boil down to "render to texture" via OpenGL or D3D. Wehave considered providing new API to allow more of a "liverender-to-texture" operation in the future, but haven't thought about itrecently.

Much of the discussion has been centered on what is possible with thecurrent JavaFX API, but there has also been some discussion on whatmight be done with a new API.

Anyway, to comment on a couple specific points, I have some inlinecomments below.



Jim Graham wrote:

On 8/13/2015 1:29 AM, Matthias Hänel wrote:
I'd argue that we sort of do have something like that - it is thecache flag. If a Node is cached then we do have a copy of it in atexture and that can help make the Blur Effect work moreefficiently, but there may be some additional copies betweentextures if everything isn't set up right. Still, that is anavenue for someone to check to see if there isn't a better way toachieve this effect in the short term...
I am not pretty sure what cache does. Probably some hasmap thatholds objects and they are not instantly destroyed in the graphicsRAM?
It is not a hashmap.

It is a hint to save the rendering of that node in a buffer:
https://docs.oracle.com/javase/8/javafx/api/javafx/scene/Node.html#cacheProperty
I don't like the way that this doc comment is worded as it impliesthat using it on a node that is blurred is unwise, but if the nodeis animated over even GPU acceleration of the rendering and blurringoperations" are going to have some cost that it could save.
That documentation says nodes are cached as Bitmaps. In GPU or in CPUspace? It is not clear here. I suspect it is in CPU RAM space.That will just lead to more copy-tasks from CPU to GPU or not? Ifoptimized there is no benefit at all as stated in the docu "note thaton some platforms such as GPU accelerated platforms there is littlebenefit".
As I said, the doc comments there are confusing. "bitmap" here,despite any capitalization, is a general term for "some type of pixelstore". On GPU accelerated platforms, this is a vram texture/FBO.
Actually, I would expect that renered Nodes are textures in GPU-VRAMto get the most performance out of it.
Yes, nodes with the cache hint are rendered into VRAM textures andreused for subsequent frames if the cache hint is set - subject to thepolicies of the cacheHint property if any (mainly transform)properties of the node are changed in the meantime.
From my current point the major problem with JavaFX is still thesame.
1. Has a good API
2. renders most of its stuff in software, hence does not runperformant3. Has good approaches, but the overall sight on the technology isbroken somewhere.
I am not sure how you come to the conclusion that it renders most ofits stuff in software. It renders quite a lot in hardware. Eventhe example here of using snapshot to optimize a blurred background- the rendering of the scene is done in hw. It is only copied tomain memory because the API requires a persistent image. If yourender that image to the screen it is copied back into a texture andreused from that texture unless we run low on vram. There is norendering in software there, only use of a heap buffer forpersistent storage...
Your point is that it uses hardware to render and it just usessnapshot to satisfy the API?
What is "it"?
JavaFX doesn't "use" snapshot anywhere. It provides the snapshot APIso the developer can use snapshot where the developer wants to, butJavaFX doesn't invoke the snapshot API on its own.
In particular, the node cache hint does not use the snapshot API underthe covers.
My current understaning is ... what happens in our blurred-effect case?
To be clear, you are describing what happens in a particularimplementation of the blurred-effect case. The implementation thatuses snapshot, which may be the popular technique in use at this time.
1. the application constructs a JavaFX node tree.
2. the node tree is rendered mostly in hardware (shader effects andso on on top) to the main framebuffer
Correct
3. Snapshot calls ReadPixels (or whatever it is called on theparticular platform)
4. JavaFX encapsulates this "new" image with Object
Technically, snapshot does its own (hardware accelerated) renderinginto a separate GPU texture/FBO and does a read pixels on thattexture. It then encapsulates those pixels into an Image object.
This snapshot operation need only be done once if the underlying nodetree is static.
5. We draw the "new" image with effects to the OGL context with thesame node-tree API as we did before on top of the first node-tree.
Correct. Note that when you render that Image object its pixels arecached in vram and that vram copy is reused from frame to frame. So,if you reuse the one snapshot then there was only one trip from vramto memory and back to vram on the first frame you did this and theneverything should be done in vram for subsequent frames reusing thesame snapshot.
That works, but there is too much CPU and memcpy involved for mybelieve. Furthermore ReadPixels takes forever in
an OpenGL perspective.
If the underlying tree is static, though, that readpixels operationonly happens once at the start of the operation, but you are correctthat it would be much better if it didn't need to happen at all.
Since this is just one very simple effect, it is actually not good tospend more than approx. 20% CPU (i7) load on it.
I expect 0% (not noticable) for this blurry effect.
That would be ideal. We may need new API to get there, but there arealso options to consider that may get us there in the short term. Inparticular, cached nodes - which are already present in the API - mayget us closer to that goal.
The ideal implementation from my perspective would be:
1. the application constructs a JavaFX node tree.
2. the node tree is rendered mostly in hardware (shader effects andso on on top) to a virtual framebuffer in the GPU space3. The virtual framebuffer is drawn by a simple drawVert-call for thebackground4. The virtual framebuffer is drawn once again shaped and shaded(blurry filter) by another drawVert-call
If you set the cache hint to true on the underlying tree then this maybe approximately what happens in our current implementation. The partI would still need to investigate would be how well the effectsmachinery (called Decora) can reuse the cached version of the nodes.If it doesn't attempt to reuse the cached version of the nodes then itmight end up re-rendering the tree.

This would an interesting experiment, and would let us know how easilyapplications could do what they need with the existing API. If there arelimitations or performance bottlenecks we can look into fixing them.

I have to correct my assumption that JavaFX renders most of its stuffin software a bit. The wording "rendering" was not correct.This feeling comes probaly from the massive use of Snapshot. I havenot understood the entire core of Prism until now.In some implementation of snapshot it seems to do the renderingentirely in software. That might only the fallback, but the
ReadPixels-Stuff is still valid.
To be clear, snapshot was mainly created for its namesake - to producestatic copies of the scene data to be saved as images for putting intodocumentation. Alternate uses that were considered were forgenerating thumbnails of scenes for an application that couldopen/close various scene graph panes. One wouldn't use the Windows"print screen" API in a performance intensive part of one'sapplication either. It is great that it can be used for thisparticular result, but it was not designed to be performant in thatrespect. Note that the primary snapshot API is the asynchronousversion with the callback to deliver the data. The non-asynchronousversion is mostly just a helper around that, but it stalls therendering pipeline to complete its work. None of that was designedfor insertion into a running animation technique.


Exactly.

Tiny-Offtopic: I already meantioned on this list that JFXPanel(Swing) is way below it's expectations. The main reason there is alsothe useof the snapshot function instead of letting render JavaFX in it's ownheavy weight window. In our small test case a simple Listdisplayed via JFXPanel was not even to render more than 1 fps and itslowed down the entire Swing-UI. Since I know jogle and it's canvasimplementation, would it be good to have a similar heavy weight JFXcanvas in Swing? This would let JFX use it's full hardware renderedpower.
It doesn't use snapshot per-se, but it does do something similar.Sharing contexts and GPU resources with AWT/Java2D is on a wish list,but architecturally we aren't there. Kevin would know more about ourchallenges on that front than me...

I haven't looked recently at what it would take to share resourcesbetween Java2D and AWT, but that seems separate from the question ofwhether we could use a heavy-weight JFXCanvas as opposed to the currentlightweight JFXPanel. We haven't considered that since there arelimitations with heavyweight components. I am not aware of anyperformance problems that are as serious as "simple List displayed viaJFXPanel was not even to render more than 1 fps". Do you have a testcase? I do note that we did have a bug on Mac retina displays that wasrecently fixed which killed performance. Was that what you were runningyour 1fps program on?


-- Kevin


            ...jim

Re: Blur effect on live scene?

Reply via email to