Re: Discussion starters for ION session at Linux Plumbers Android+Graphics microconf

2013-09-06 Thread Rob Clark
On Fri, Sep 6, 2013 at 5:16 AM, Daniel Vetter  wrote:
> On Thu, Sep 05, 2013 at 10:06:52PM -0700, John Stultz wrote:
>> On 09/05/2013 08:26 PM, Rob Clark wrote:
>> > On Thu, Sep 5, 2013 at 8:49 PM, John Stultz  wrote:
>> >> Hey everyone,
>> >>In preparation for the Plumbers Android+Graphics microconf, I wanted to
>> >> send out some background documentation to try to get all the context we 
>> >> can
>> >> out there prior to the discussion, as time will be limited and it would be
>> >> best to spend it discussing solutions rather then re-hashing problems and
>> >> requirements.
>> >>
>> >> I'm sure many folks on this list could probably do a better job 
>> >> summarizing
>> >> the issues, but I wanted to get this out there to try to enumerate the
>> >> problems and the different perspectives on the issues that I'm aware of.
>> >>
>> >> The document is on LWN here:
>> >> http://lwn.net/SubscriberLink/565469/9d88daa2282ef6c2/
>> > oh, I had missed that article.. fwiw
>>
>> It was published just moments before I sent out this thread, so I
>> wouldn't have expected anyone to have seen it yet. :)
>>
>>
>> > "Another possible solution is to allow dma-buf exporters to not
>> > allocate the backing buffers immediately. This would allow multiple
>> > drivers to attach to a dma-buf before the allocation occurs. Then,
>> > when the buffer is first used, the allocation is done; at that time,
>> > the allocator could scan the list of attached drivers and be able to
>> > determine the constraints of the attached devices and allocate memory
>> > accordingly. This would allow user space to not have to deal with any
>> > constraint solving. "
>> >
>> > That is actually how dma-buf works today.  And at least with GEM
>> > buffers exported as dma-buf's, the allocation is deferred.  It does
>> > require attaching the buffers in all the devices that will be sharing
>> > the buffer up front (but I suppose you need to know the involved
>> > devices one way or another with any solution, so this approach seems
>> > as good as any).  We *do* still need to spiff up dev->dma_parms a bit
>> > more, and there might be some room for some helpers to figure out the
>> > union of all attached devices constraints, and allocate suitable
>> > backing pages... so perhaps this is one thing we should be talking
>> > about.
>>
>> Ok. I had gone looking for an example of the deferred allocation, but
>> didn't find it.  I'll go look again, but if you have a pointer, that
>> could be useful.
>>
>> So yea, I do think this is the most promising approach, but sorting out
>> the next steps for doing a proof of concept is one thing I'd like to
>> discuss (as mentioned in the article, via a ion-like generic allocator,
>> or trying to wire in the constraint solving to some limited set of
>> drivers via generic helper functions). As well as getting a better
>> understanding the Android developers concern about any non-deterministic
>> cost of allocating at mmap time.
>>
>>
>> Thanks for the feedback and thoughts! I'm hopeful some approach to
>> resolving the various issues can be found, but I suspect it will have a
>> few different parts.
>
> My main gripe with ION is that it creates a parallel infrastructure for
> figuring out allocation constraints of devices. Upstream already has all
> the knowledge (or at least most of it) for cache flushing, mapping into
> iommus and allocating from special pools stored in association with the
> device structure. So imo an upstream ION thing should reuse the
> information each device and its driver already has available.

yeah, we want to make sure that dma-mapping is up to snuff for
handling allocations of backing pages meeting the constraints of a set
of devices (spiffing up dma_parms, etc, as I mentioned in my first
reply).  I see a potential upstream ION as just be a sort of
convenience wrapper for android userspace rather than an actual
allocator of backing pages, etc.  Well, maybe some of this is easier
to do in userspace/gralloc, but for example to ease "jank" fears, it
could pre-attach to all the involved devices for the use-case, and
then do a dummy map_attachment to the ION device to force backing page
allocation.

BR,
-R

> Now I also see that a central allocator has upsides since reinventing this
> wheel for every device driver is not a great idea. One idea to get there
> and keep the benefits of ION with up-front allocations would be.
> 1) Allcoate the dma-buf handle at the central allocator. No backing
> storage gets allocated.
> 2) Import that dma-buf everywhere you want it to be used. That way
> userspace doesn't need to deal with whatever hw madness is actually used
> to implement the drm/v4l/whatever devices nodes internally.
> 3) Ask the central allocator to finalize the buffer allocation placement
> and grab backing storage.
>
> If any further attaching happens that doesn't work out it would simply
> fail, and userspace gets to keep the pieces. Which is already the case in
> today's upstream whe

Re: Discussion starters for ION session at Linux Plumbers Android+Graphics microconf

2013-09-06 Thread Rob Clark
On Fri, Sep 6, 2013 at 1:06 AM, John Stultz  wrote:
> On 09/05/2013 08:26 PM, Rob Clark wrote:
>> On Thu, Sep 5, 2013 at 8:49 PM, John Stultz  wrote:
>>> Hey everyone,
>>>In preparation for the Plumbers Android+Graphics microconf, I wanted to
>>> send out some background documentation to try to get all the context we can
>>> out there prior to the discussion, as time will be limited and it would be
>>> best to spend it discussing solutions rather then re-hashing problems and
>>> requirements.
>>>
>>> I'm sure many folks on this list could probably do a better job summarizing
>>> the issues, but I wanted to get this out there to try to enumerate the
>>> problems and the different perspectives on the issues that I'm aware of.
>>>
>>> The document is on LWN here:
>>> http://lwn.net/SubscriberLink/565469/9d88daa2282ef6c2/
>> oh, I had missed that article.. fwiw
>
> It was published just moments before I sent out this thread, so I
> wouldn't have expected anyone to have seen it yet. :)
>

ahh, ok, that would explain it ;-)

>
>> "Another possible solution is to allow dma-buf exporters to not
>> allocate the backing buffers immediately. This would allow multiple
>> drivers to attach to a dma-buf before the allocation occurs. Then,
>> when the buffer is first used, the allocation is done; at that time,
>> the allocator could scan the list of attached drivers and be able to
>> determine the constraints of the attached devices and allocate memory
>> accordingly. This would allow user space to not have to deal with any
>> constraint solving. "
>>
>> That is actually how dma-buf works today.  And at least with GEM
>> buffers exported as dma-buf's, the allocation is deferred.  It does
>> require attaching the buffers in all the devices that will be sharing
>> the buffer up front (but I suppose you need to know the involved
>> devices one way or another with any solution, so this approach seems
>> as good as any).  We *do* still need to spiff up dev->dma_parms a bit
>> more, and there might be some room for some helpers to figure out the
>> union of all attached devices constraints, and allocate suitable
>> backing pages... so perhaps this is one thing we should be talking
>> about.
>
> Ok. I had gone looking for an example of the deferred allocation, but
> didn't find it.  I'll go look again, but if you have a pointer, that
> could be useful.

for a "pure GEM" (ie. not TTM) driver, drm_gem_get_pages() is where
actual pages get allocated.  This is triggered by various things
(faulting in page for userspace access, dmabuf map_attachment, etc)

BR,
-R

> So yea, I do think this is the most promising approach, but sorting out
> the next steps for doing a proof of concept is one thing I'd like to
> discuss (as mentioned in the article, via a ion-like generic allocator,
> or trying to wire in the constraint solving to some limited set of
> drivers via generic helper functions). As well as getting a better
> understanding the Android developers concern about any non-deterministic
> cost of allocating at mmap time.
>
>
> Thanks for the feedback and thoughts! I'm hopeful some approach to
> resolving the various issues can be found, but I suspect it will have a
> few different parts.
>
> thanks
> -john
>
>
>
___
dri-devel mailing list
dri-devel@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/dri-devel


Re: Discussion starters for ION session at Linux Plumbers Android+Graphics microconf

2013-09-06 Thread Daniel Vetter
On Thu, Sep 05, 2013 at 10:06:52PM -0700, John Stultz wrote:
> On 09/05/2013 08:26 PM, Rob Clark wrote:
> > On Thu, Sep 5, 2013 at 8:49 PM, John Stultz  wrote:
> >> Hey everyone,
> >>In preparation for the Plumbers Android+Graphics microconf, I wanted to
> >> send out some background documentation to try to get all the context we can
> >> out there prior to the discussion, as time will be limited and it would be
> >> best to spend it discussing solutions rather then re-hashing problems and
> >> requirements.
> >>
> >> I'm sure many folks on this list could probably do a better job summarizing
> >> the issues, but I wanted to get this out there to try to enumerate the
> >> problems and the different perspectives on the issues that I'm aware of.
> >>
> >> The document is on LWN here:
> >> http://lwn.net/SubscriberLink/565469/9d88daa2282ef6c2/
> > oh, I had missed that article.. fwiw
> 
> It was published just moments before I sent out this thread, so I
> wouldn't have expected anyone to have seen it yet. :)
> 
> 
> > "Another possible solution is to allow dma-buf exporters to not
> > allocate the backing buffers immediately. This would allow multiple
> > drivers to attach to a dma-buf before the allocation occurs. Then,
> > when the buffer is first used, the allocation is done; at that time,
> > the allocator could scan the list of attached drivers and be able to
> > determine the constraints of the attached devices and allocate memory
> > accordingly. This would allow user space to not have to deal with any
> > constraint solving. "
> >
> > That is actually how dma-buf works today.  And at least with GEM
> > buffers exported as dma-buf's, the allocation is deferred.  It does
> > require attaching the buffers in all the devices that will be sharing
> > the buffer up front (but I suppose you need to know the involved
> > devices one way or another with any solution, so this approach seems
> > as good as any).  We *do* still need to spiff up dev->dma_parms a bit
> > more, and there might be some room for some helpers to figure out the
> > union of all attached devices constraints, and allocate suitable
> > backing pages... so perhaps this is one thing we should be talking
> > about.
> 
> Ok. I had gone looking for an example of the deferred allocation, but
> didn't find it.  I'll go look again, but if you have a pointer, that
> could be useful.
> 
> So yea, I do think this is the most promising approach, but sorting out
> the next steps for doing a proof of concept is one thing I'd like to
> discuss (as mentioned in the article, via a ion-like generic allocator,
> or trying to wire in the constraint solving to some limited set of
> drivers via generic helper functions). As well as getting a better
> understanding the Android developers concern about any non-deterministic
> cost of allocating at mmap time.
> 
> 
> Thanks for the feedback and thoughts! I'm hopeful some approach to
> resolving the various issues can be found, but I suspect it will have a
> few different parts.

My main gripe with ION is that it creates a parallel infrastructure for
figuring out allocation constraints of devices. Upstream already has all
the knowledge (or at least most of it) for cache flushing, mapping into
iommus and allocating from special pools stored in association with the
device structure. So imo an upstream ION thing should reuse the
information each device and its driver already has available.

Now I also see that a central allocator has upsides since reinventing this
wheel for every device driver is not a great idea. One idea to get there
and keep the benefits of ION with up-front allocations would be.
1) Allcoate the dma-buf handle at the central allocator. No backing
storage gets allocated.
2) Import that dma-buf everywhere you want it to be used. That way
userspace doesn't need to deal with whatever hw madness is actually used
to implement the drm/v4l/whatever devices nodes internally.
3) Ask the central allocator to finalize the buffer allocation placement
and grab backing storage.

If any further attaching happens that doesn't work out it would simply
fail, and userspace gets to keep the pieces. Which is already the case in
today's upstream when userspace is unlucky and doesn't pick the most
constrained device.

This only tackles the "make memory allocation predictable" issue ION
solves, which leaves the optimized cache flushing. We can add caches for
pre-flushed objects for that (not rocket science, most of the drm drivers
have that wheel reinvented, too). That leaves us with optimizing cache
flushes (i.e. leaving them out when switching between devices without cpu
accesss in-between). The current linux dma api doesn't really support
this, so we need to add a bit of interfaces there to be able to do
device-to-device cache flushing (which safe for maybe iommu flushes I
expect to be noops). And the central allocator obviously needs to keep
track of where the current cache domain is.

Aside: Intel Atom SoCs have t

Re: Discussion starters for ION session at Linux Plumbers Android+Graphics microconf

2013-09-06 Thread John Stultz
On 09/05/2013 08:26 PM, Rob Clark wrote:
> On Thu, Sep 5, 2013 at 8:49 PM, John Stultz  wrote:
>> Hey everyone,
>>In preparation for the Plumbers Android+Graphics microconf, I wanted to
>> send out some background documentation to try to get all the context we can
>> out there prior to the discussion, as time will be limited and it would be
>> best to spend it discussing solutions rather then re-hashing problems and
>> requirements.
>>
>> I'm sure many folks on this list could probably do a better job summarizing
>> the issues, but I wanted to get this out there to try to enumerate the
>> problems and the different perspectives on the issues that I'm aware of.
>>
>> The document is on LWN here:
>> http://lwn.net/SubscriberLink/565469/9d88daa2282ef6c2/
> oh, I had missed that article.. fwiw

It was published just moments before I sent out this thread, so I
wouldn't have expected anyone to have seen it yet. :)


> "Another possible solution is to allow dma-buf exporters to not
> allocate the backing buffers immediately. This would allow multiple
> drivers to attach to a dma-buf before the allocation occurs. Then,
> when the buffer is first used, the allocation is done; at that time,
> the allocator could scan the list of attached drivers and be able to
> determine the constraints of the attached devices and allocate memory
> accordingly. This would allow user space to not have to deal with any
> constraint solving. "
>
> That is actually how dma-buf works today.  And at least with GEM
> buffers exported as dma-buf's, the allocation is deferred.  It does
> require attaching the buffers in all the devices that will be sharing
> the buffer up front (but I suppose you need to know the involved
> devices one way or another with any solution, so this approach seems
> as good as any).  We *do* still need to spiff up dev->dma_parms a bit
> more, and there might be some room for some helpers to figure out the
> union of all attached devices constraints, and allocate suitable
> backing pages... so perhaps this is one thing we should be talking
> about.

Ok. I had gone looking for an example of the deferred allocation, but
didn't find it.  I'll go look again, but if you have a pointer, that
could be useful.

So yea, I do think this is the most promising approach, but sorting out
the next steps for doing a proof of concept is one thing I'd like to
discuss (as mentioned in the article, via a ion-like generic allocator,
or trying to wire in the constraint solving to some limited set of
drivers via generic helper functions). As well as getting a better
understanding the Android developers concern about any non-deterministic
cost of allocating at mmap time.


Thanks for the feedback and thoughts! I'm hopeful some approach to
resolving the various issues can be found, but I suspect it will have a
few different parts.

thanks
-john



___
dri-devel mailing list
dri-devel@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/dri-devel


Re: Discussion starters for ION session at Linux Plumbers Android+Graphics microconf

2013-09-06 Thread Rob Clark
On Thu, Sep 5, 2013 at 8:49 PM, John Stultz  wrote:
> Hey everyone,
>In preparation for the Plumbers Android+Graphics microconf, I wanted to
> send out some background documentation to try to get all the context we can
> out there prior to the discussion, as time will be limited and it would be
> best to spend it discussing solutions rather then re-hashing problems and
> requirements.
>
> I'm sure many folks on this list could probably do a better job summarizing
> the issues, but I wanted to get this out there to try to enumerate the
> problems and the different perspectives on the issues that I'm aware of.
>
> The document is on LWN here:
> http://lwn.net/SubscriberLink/565469/9d88daa2282ef6c2/

oh, I had missed that article.. fwiw

"Another possible solution is to allow dma-buf exporters to not
allocate the backing buffers immediately. This would allow multiple
drivers to attach to a dma-buf before the allocation occurs. Then,
when the buffer is first used, the allocation is done; at that time,
the allocator could scan the list of attached drivers and be able to
determine the constraints of the attached devices and allocate memory
accordingly. This would allow user space to not have to deal with any
constraint solving. "

That is actually how dma-buf works today.  And at least with GEM
buffers exported as dma-buf's, the allocation is deferred.  It does
require attaching the buffers in all the devices that will be sharing
the buffer up front (but I suppose you need to know the involved
devices one way or another with any solution, so this approach seems
as good as any).  We *do* still need to spiff up dev->dma_parms a bit
more, and there might be some room for some helpers to figure out the
union of all attached devices constraints, and allocate suitable
backing pages... so perhaps this is one thing we should be talking
about.

At any rate, it might not matter if ION cannot import dma-buf's (as
long as every other device importing does not have to differentiate
between importing dma-buf's that are also ION buffers vs dma-buf's
allocated in some other way).  But to be useful upstream, we'd have to
ensure that existing drm drivers can somehow plug-in their existing
allocation mechanisms in to ION.

BR,
-R

> But I wanted to start a discussion thread here, since the LWN comment
> threads (while *much* better then most comment sections) aren't really the
> right place for this sort of thing.
>
> So please feel free to reply to correct any inaccuracies in my summary,
> provide your thoughts on the various proposed solutions, or suggest new
> solutions that we should also discuss at the micro-conference!
>
> thanks
> -john
>
> ___
> dri-devel mailing list
> dri-devel@lists.freedesktop.org
> http://lists.freedesktop.org/mailman/listinfo/dri-devel
>
___
dri-devel mailing list
dri-devel@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/dri-devel


Discussion starters for ION session at Linux Plumbers Android+Graphics microconf

2013-09-05 Thread John Stultz
Hey everyone,
   In preparation for the Plumbers Android+Graphics microconf, I wanted to
send out some background documentation to try to get all the context we can
out there prior to the discussion, as time will be limited and it would be
best to spend it discussing solutions rather then re-hashing problems and
requirements.

I'm sure many folks on this list could probably do a better job summarizing
the issues, but I wanted to get this out there to try to enumerate the
problems and the different perspectives on the issues that I'm aware of.

The document is on LWN here:
http://lwn.net/SubscriberLink/565469/9d88daa2282ef6c2/

But I wanted to start a discussion thread here, since the LWN comment
threads (while *much* better then most comment sections) aren't really the
right place for this sort of thing.

So please feel free to reply to correct any inaccuracies in my summary,
provide your thoughts on the various proposed solutions, or suggest new
solutions that we should also discuss at the micro-conference!

thanks
-john
___
dri-devel mailing list
dri-devel@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/dri-devel