Pawel,

On 20/02/2019 13:18, Pawel Laszczak wrote:
> Hi Roger.
>>
>> On 14/02/2019 21:45, Pawel Laszczak wrote:
>>> Controller for OUT endpoints has shared on-chip buffers for all incoming
>>> packets, including ep0out. It's FIFO buffer, so packets must be handle
>>> by DMA in correct order. If the first packet in the buffer will not be
>>> handled, then the following packets directed for other endpoints and
>>> functions will be blocked.
>>>
>>> Additionally the packets directed to one endpoint can block entire on-chip
>>> buffers. In this case transfer to other endpoints also will blocked.
>>>
>>> To resolve this issue after raising the descriptor missing interrupt
>>> driver prepares internal usb_request object and use it to arm DMA
>>> transfer.
>>>
>>> The problematic situation was observed in case when endpoint has
>>> been enabled but no usb_request were queued. Driver try detects
>>> such endpoints and will use this workaround only for these endpoint.
>>>
>>> Driver use limited number of buffer. This number can be set by macro
>>> CDNS_WA2_NUM_BUFFERS.
>>>
>>> Such blocking situation was observed on ACM gadget. For this function
>>> host send OUT data packet but ACM function is not prepared for
>>> this packet. It's cause that buffer placed in on chip memory block
>>> transfer to other endpoints.
>>>
>>> It's limitation of controller but maybe this issues should be fixed in
>>> function driver.
>>>
>>> This work around can be disabled/enabled by means of quirk_internal_buffer
>>> module parameter. By default feature is enabled. It can has impact to
>>> transfer performance and in most case this feature can be disabled.
>>
>> How much is the performance impact?
> 
> I didn't check this, but performance will be decreased because driver has to 
> copy data from internally allocated buffer to usb_request->buff.
> 
>> You mentioned that the issue was observed only in the ACM case and
>> could be fixed in the function driver?
>> It seems pointless to enable an endpoint and not have any requests queued 
>> for it.
> 
> Yes, it’s true. The request in ACM class is send to Controller Driver when 
> user read file associated 
> with ACM gadget. Problem exist because host send data to controller but USB 
> controller 
> has not prepared buffer for this data by ACM class.
> 
>> Isn't fixing the ACM driver (if there is a real issue) a better approach 
>> than having
>> a workaround that affects performance of all other use cases?
> 
> Yes it should be better. But what ACM driver should do with unexpected data. 
> I'm not sure if we 
> can simply delete them. 
> 
> The best solution would be to make modification in controller. In such case 
> Controller shouldn't accept 
> packet but should send NAK. 

Yes, that should be standard behaviour. No?

>  
> One more thing. Workaround has implemented algorithm that decide for which 
> endpoint it should be enabled.
> e.g for composite device MSC+NCM+ACM it should work only for ACM OUT endpoint.
> 

If ACM driver didn't queue the request for ACM OUT endpoint, why does the 
controller accept the data at all?
I didn't understand why we need a workaround for this. It should be standard 
behaviour to NAK data if
function driver didn't request for all endpoints.

Also I didn't understand why performance should be impacted to just NAK data.

cheers,
-roger

> Cheers,
> Pawel
>>
>>>
>>> Signed-off-by: Pawel Laszczak <paw...@cadence.com>
>>> ---
>>>  drivers/usb/cdns3/gadget.c | 273 ++++++++++++++++++++++++++++++++++++-
>>>  drivers/usb/cdns3/gadget.h |   7 +
>>>  2 files changed, 278 insertions(+), 2 deletions(-)
>>>
>>> diff --git a/drivers/usb/cdns3/gadget.c b/drivers/usb/cdns3/gadget.c
>>> index 7f7f24ee3c4b..5dfbe6e1421c 100644
>>> --- a/drivers/usb/cdns3/gadget.c
>>> +++ b/drivers/usb/cdns3/gadget.c
>>> @@ -27,6 +27,37 @@
>>>   * If (((Dequeue Ptr (i.e. EP_TRADDR) == Enqueue Ptr-1) or
>>>   * (Dequeue Ptr (i.e. EP_TRADDR) == Enqueue Ptr))
>>>   *         and (DRBL==1 and (not EP0)))
>>> + *
>>> + * Work around 2:
>>> + * Controller for OUT endpoints has shared on-chip buffers for all incoming
>>> + * packets, including ep0out. It's FIFO buffer, so packets must be handle 
>>> by DMA
>>> + * in correct order. If the first packet in the buffer will not be handled,
>>> + * then the following packets directed for other endpoints and  functions
>>> + * will be blocked.
>>> + * Additionally the packets directed to one endpoint can block entire 
>>> on-chip
>>> + * buffers. In this case transfer to other endpoints also will blocked.
>>> + *
>>> + * To resolve this issue after raising the descriptor missing interrupt
>>> + * driver prepares internal usb_request object and use it to arm DMA 
>>> transfer.
>>> + *
>>> + * The problematic situation was observed in case when endpoint has been 
>>> enabled
>>> + * but no usb_request were queued. Driver try detects such endpoints and 
>>> will
>>> + * use this workaround only for these endpoint.
>>> + *
>>> + * Driver use limited number of buffer. This number can be set by macro
>>> + * CDNS_WA2_NUM_BUFFERS.
>>> + *
>>> + * Such blocking situation was observed on ACM gadget. For this function
>>> + * host send OUT data packet but ACM function is not prepared for this 
>>> packet.
>>> + * It's cause that buffer placed in on chip memory block transfer to other
>>> + * endpoints.
>>> + *
>>> + * It's limitation of controller but maybe this issues should be fixed in
>>> + * function driver.
>>> + *
>>> + * This work around can be disabled/enabled by means of 
>>> quirk_internal_buffer
>>> + * module parameter. By default feature is enabled. It can has impact to
>>> + * transfer performance and in most case this feature can be disabled.
>>>   */
>>>
>>>  #include <linux/dma-mapping.h>
>>> @@ -42,6 +73,14 @@ static int __cdns3_gadget_ep_queue(struct usb_ep *ep,
>>>                                struct usb_request *request,
>>>                                gfp_t gfp_flags);
>>>
>>> +/*
>>> + * Parameter allows to disable/enable handling of work around 2 feature.
>>> + * By default this value is enabled.
>>> + */
>>> +static bool quirk_internal_buffer = 1;
>>> +module_param(quirk_internal_buffer, bool, 0644);
>>> +MODULE_PARM_DESC(quirk_internal_buffer, "Disable/enable WA2 algorithm");
>>> +
>>>  /**
>>>   * cdns3_handshake - spin reading  until handshake completes or fails
>>>   * @ptr: address of device controller register to be read
>>> @@ -105,6 +144,17 @@ struct usb_request *cdns3_next_request(struct 
>>> list_head *list)
>>>     return list_first_entry_or_null(list, struct usb_request, list);
>>>  }
>>>
>>> +/**
>>> + * cdns3_next_priv_request - returns next request from list
>>> + * @list: list containing requests
>>> + *
>>> + * Returns request or NULL if no requests in list
>>> + */
>>> +struct cdns3_request *cdns3_next_priv_request(struct list_head *list)
>>> +{
>>> +   return list_first_entry_or_null(list, struct cdns3_request, list);
>>> +}
>>> +
>>>  /**
>>>   * select_ep - selects endpoint
>>>   * @priv_dev:  extended gadget object
>>> @@ -384,6 +434,53 @@ static int cdns3_start_all_request(struct cdns3_device 
>>> *priv_dev,
>>>     return ret;
>>>  }
>>>
>>> +/**
>>> + * cdns3_descmiss_copy_data copy data from internal requests to request 
>>> queued
>>> + * by class driver.
>>> + * @priv_ep: extended endpoint object
>>> + * @request: request object
>>> + */
>>> +static void cdns3_descmiss_copy_data(struct cdns3_endpoint *priv_ep,
>>> +                                struct usb_request *request)
>>> +{
>>> +   struct usb_request *descmiss_req;
>>> +   struct cdns3_request *descmiss_priv_req;
>>> +
>>> +   while (!list_empty(&priv_ep->descmiss_req_list)) {
>>> +           int chunk_end;
>>> +           int length;
>>> +
>>> +           descmiss_priv_req =
>>> +                   cdns3_next_priv_request(&priv_ep->descmiss_req_list);
>>> +           descmiss_req = &descmiss_priv_req->request;
>>> +
>>> +           /* driver can't touch pending request */
>>> +           if (descmiss_priv_req->flags & REQUEST_PENDING)
>>> +                   break;
>>> +
>>> +           chunk_end = descmiss_priv_req->flags & REQUEST_INTERNAL_CH;
>>> +           length = request->actual + descmiss_req->actual;
>>> +
>>> +           if (length <= request->length) {
>>> +                   memcpy(&((u8 *)request->buf)[request->actual],
>>> +                          descmiss_req->buf,
>>> +                          descmiss_req->actual);
>>> +                   request->actual = length;
>>> +           } else {
>>> +                   /* It should never occures */
>>> +                   request->status = -ENOMEM;
>>> +           }
>>> +
>>> +           list_del_init(&descmiss_priv_req->list);
>>> +
>>> +           kfree(descmiss_req->buf);
>>> +           cdns3_gadget_ep_free_request(&priv_ep->endpoint, descmiss_req);
>>> +
>>> +           if (!chunk_end)
>>> +                   break;
>>> +   }
>>> +}
>>> +
>>>  /**
>>>   * cdns3_gadget_giveback - call struct usb_request's ->complete callback
>>>   * @priv_ep: The endpoint to whom the request belongs to
>>> @@ -412,6 +509,32 @@ void cdns3_gadget_giveback(struct cdns3_endpoint 
>>> *priv_ep,
>>>     priv_req->flags &= ~REQUEST_PENDING;
>>>     trace_cdns3_gadget_giveback(priv_req);
>>>
>>> +   /* WA2: */
>>> +   if (priv_ep->flags & EP_QUIRK_EXTRA_BUF_EN &&
>>> +       priv_req->flags & REQUEST_INTERNAL) {
>>> +           struct usb_request *req;
>>> +
>>> +           req = cdns3_next_request(&priv_ep->deferred_req_list);
>>> +           request = req;
>>> +           priv_ep->descmis_req = NULL;
>>> +
>>> +           if (!req)
>>> +                   return;
>>> +
>>> +           cdns3_descmiss_copy_data(priv_ep, req);
>>> +           if (!(priv_ep->flags & EP_QUIRK_END_TRANSFER) &&
>>> +               req->length != req->actual) {
>>> +                   /* wait for next part of transfer */
>>> +                   return;
>>> +           }
>>> +
>>> +           if (req->status == -EINPROGRESS)
>>> +                   req->status = 0;
>>> +
>>> +           list_del_init(&req->list);
>>> +           cdns3_start_all_request(priv_dev, priv_ep);
>>> +   }
>>> +
>>>     /* Start all not pending request */
>>>     if (priv_ep->flags & EP_RING_FULL)
>>>             cdns3_start_all_request(priv_dev, priv_ep);
>>> @@ -774,6 +897,59 @@ void cdns3_rearm_transfer(struct cdns3_endpoint 
>>> *priv_ep, u8 rearm)
>>>     }
>>>  }
>>>
>>> +/**
>>> + * cdns3_descmissing_packet - handles descriptor missing event.
>>> + * @priv_dev: extended gadget object
>>> + *
>>> + * This function is used only for WA2. For more information see Work 
>>> around 2
>>> + * description.
>>> + */
>>> +static int cdns3_descmissing_packet(struct cdns3_endpoint *priv_ep)
>>> +{
>>> +   struct cdns3_request *priv_req;
>>> +   struct usb_request *request;
>>> +
>>> +   if (priv_ep->flags & EP_QUIRK_EXTRA_BUF_DET) {
>>> +           priv_ep->flags &= ~EP_QUIRK_EXTRA_BUF_DET;
>>> +           priv_ep->flags |= EP_QUIRK_EXTRA_BUF_EN;
>>> +   }
>>> +
>>> +   cdns3_dbg(priv_ep->cdns3_dev, "WA2: Description Missing detected\n");
>>> +
>>> +   request = cdns3_gadget_ep_alloc_request(&priv_ep->endpoint,
>>> +                                           GFP_ATOMIC);
>>> +   if (!request)
>>> +           return -ENOMEM;
>>> +
>>> +   priv_req = to_cdns3_request(request);
>>> +   priv_req->flags |= REQUEST_INTERNAL;
>>> +
>>> +   /* if this field is still assigned it indicate that transfer related
>>> +    * with this request has not been finished yet. Driver in this
>>> +    * case simply allocate next request and assign flag REQUEST_INTERNAL_CH
>>> +    * flag to previous one. It will indicate that current request is
>>> +    * part of the previous one.
>>> +    */
>>> +   if (priv_ep->descmis_req)
>>> +           priv_ep->descmis_req->flags |= REQUEST_INTERNAL_CH;
>>> +
>>> +   priv_req->request.buf = kzalloc(CDNS3_DESCMIS_BUF_SIZE,
>>> +                                   GFP_ATOMIC);
>>> +   if (!priv_req) {
>>> +           cdns3_gadget_ep_free_request(&priv_ep->endpoint, request);
>>> +           return -ENOMEM;
>>> +   }
>>> +
>>> +   priv_req->request.length = CDNS3_DESCMIS_BUF_SIZE;
>>> +   priv_ep->descmis_req = priv_req;
>>> +
>>> +   __cdns3_gadget_ep_queue(&priv_ep->endpoint,
>>> +                           &priv_ep->descmis_req->request,
>>> +                           GFP_ATOMIC);
>>> +
>>> +   return 0;
>>> +}
>>> +
>>>  /**
>>>   * cdns3_check_ep_interrupt_proceed - Processes interrupt related to 
>>> endpoint
>>>   * @priv_ep: endpoint object
>>> @@ -807,8 +983,31 @@ static int cdns3_check_ep_interrupt_proceed(struct 
>>> cdns3_endpoint *priv_ep)
>>>                     cdns3_rearm_transfer(priv_ep, priv_ep->wa1_set);
>>>     }
>>>
>>> -   if ((ep_sts_reg & EP_STS_IOC) || (ep_sts_reg & EP_STS_ISP))
>>> +   if ((ep_sts_reg & EP_STS_IOC) || (ep_sts_reg & EP_STS_ISP)) {
>>> +           if (priv_ep->flags & EP_QUIRK_EXTRA_BUF_EN) {
>>> +                   if (ep_sts_reg & EP_STS_ISP)
>>> +                           priv_ep->flags |= EP_QUIRK_END_TRANSFER;
>>> +                   else
>>> +                           priv_ep->flags &= ~EP_QUIRK_END_TRANSFER;
>>> +           }
>>> +
>>>             cdns3_transfer_completed(priv_dev, priv_ep);
>>> +   }
>>> +
>>> +   /*
>>> +    * WA2: this condition should only be meet when
>>> +    * priv_ep->flags & EP_QUIRK_EXTRA_BUF_DET or
>>> +    * priv_ep->flags & EP_QUIRK_EXTRA_BUF_EN.
>>> +    * In other cases this interrupt will be disabled/
>>> +    */
>>> +   if (ep_sts_reg & EP_STS_DESCMIS) {
>>> +           int err;
>>> +
>>> +           err = cdns3_descmissing_packet(priv_ep);
>>> +           if (err)
>>> +                   dev_err(priv_dev->dev,
>>> +                           "Failed: No sufficient memory for DESCMIS\n");
>>> +   }
>>>
>>>     return 0;
>>>  }
>>> @@ -1241,13 +1440,26 @@ static int cdns3_gadget_ep_enable(struct usb_ep *ep,
>>>     /* enable interrupt for selected endpoint */
>>>     cdns3_set_register_bit(&priv_dev->regs->ep_ien,
>>>                            BIT(cdns3_ep_addr_to_index(bEndpointAddress)));
>>> +   /*
>>> +    * WA2: Set flag for all not ISOC OUT endpoints. If this flag is set
>>> +    * driver try to detect whether endpoint need additional internal
>>> +    * buffer for unblocking on-chip FIFO buffer. This flag will be cleared
>>> +    * if before first DESCMISS interrupt the DMA will be armed.
>>> +    */
>>> +   if (quirk_internal_buffer) {
>>> +           if (!priv_ep->dir && priv_ep->type != USB_ENDPOINT_XFER_ISOC) {
>>> +                   priv_ep->flags |= EP_QUIRK_EXTRA_BUF_DET;
>>> +                   reg |= EP_STS_EN_DESCMISEN;
>>> +           }
>>> +   }
>>>
>>>     writel(reg, &priv_dev->regs->ep_sts_en);
>>>
>>>     cdns3_set_register_bit(&priv_dev->regs->ep_cfg, EP_CFG_ENABLE);
>>>
>>>     ep->desc = desc;
>>> -   priv_ep->flags &= ~(EP_PENDING_REQUEST | EP_STALL);
>>> +   priv_ep->flags &= ~(EP_PENDING_REQUEST | EP_STALL |
>>> +                       EP_QUIRK_EXTRA_BUF_EN);
>>>     priv_ep->flags |= EP_ENABLED | EP_UPDATE_EP_TRBADDR;
>>>     priv_ep->wa1_set = 0;
>>>     priv_ep->enqueue = 0;
>>> @@ -1272,6 +1484,7 @@ static int cdns3_gadget_ep_enable(struct usb_ep *ep,
>>>  static int cdns3_gadget_ep_disable(struct usb_ep *ep)
>>>  {
>>>     struct cdns3_endpoint *priv_ep;
>>> +   struct cdns3_request *priv_req;
>>>     struct cdns3_device *priv_dev;
>>>     struct usb_request *request;
>>>     unsigned long flags;
>>> @@ -1308,6 +1521,14 @@ static int cdns3_gadget_ep_disable(struct usb_ep *ep)
>>>                                   -ESHUTDOWN);
>>>     }
>>>
>>> +   while (!list_empty(&priv_ep->descmiss_req_list)) {
>>> +           priv_req = cdns3_next_priv_request(&priv_ep->descmiss_req_list);
>>> +
>>> +           kfree(priv_req->request.buf);
>>> +           cdns3_gadget_ep_free_request(&priv_ep->endpoint,
>>> +                                        &priv_req->request);
>>> +   }
>>> +
>>>     while (!list_empty(&priv_ep->deferred_req_list)) {
>>>             request = cdns3_next_request(&priv_ep->deferred_req_list);
>>>
>>> @@ -1348,6 +1569,53 @@ static int __cdns3_gadget_ep_queue(struct usb_ep *ep,
>>>     priv_req = to_cdns3_request(request);
>>>     trace_cdns3_ep_queue(priv_req);
>>>
>>> +   /*
>>> +    * WA2: if transfer was queued before DESCMISS appear than we
>>> +    * can disable handling of DESCMISS interrupt. Driver assumes that it
>>> +    * can disable special treatment for this endpoint.
>>> +    */
>>> +   if (priv_ep->flags & EP_QUIRK_EXTRA_BUF_DET) {
>>> +           u32 reg;
>>> +
>>> +           cdns3_select_ep(priv_dev, priv_ep->num | priv_ep->dir);
>>> +           priv_ep->flags &= ~EP_QUIRK_EXTRA_BUF_DET;
>>> +           reg = readl(&priv_dev->regs->ep_sts_en);
>>> +           reg &= ~EP_STS_EN_DESCMISEN;
>>> +           writel(reg, &priv_dev->regs->ep_sts_en);
>>> +   }
>>> +
>>> +   /* WA2 */
>>> +   if (priv_ep->flags & EP_QUIRK_EXTRA_BUF_EN) {
>>> +           u8 pending_empty = list_empty(&priv_ep->pending_req_list);
>>> +           u8 descmiss_empty = list_empty(&priv_ep->descmiss_req_list);
>>> +
>>> +           /*
>>> +            *  DESCMISS transfer has been finished, so data will be
>>> +            *  directly copied from internal allocated usb_request
>>> +            *  objects.
>>> +            */
>>> +           if (pending_empty && !descmiss_empty &&
>>> +               !(priv_req->flags & REQUEST_INTERNAL)) {
>>> +                   cdns3_descmiss_copy_data(priv_ep, request);
>>> +                   list_add_tail(&request->list,
>>> +                                 &priv_ep->pending_req_list);
>>> +                   cdns3_gadget_giveback(priv_ep, priv_req,
>>> +                                         request->status);
>>> +                   return ret;
>>> +           }
>>> +
>>> +           /*
>>> +            * WA2 driver will wait for completion DESCMISS transfer,
>>> +            * before starts new, not DESCMISS transfer.
>>> +            */
>>> +           if (!pending_empty && !descmiss_empty)
>>> +                   deferred = 1;
>>> +
>>> +           if (priv_req->flags & REQUEST_INTERNAL)
>>> +                   list_add_tail(&priv_req->list,
>>> +                                 &priv_ep->descmiss_req_list);
>>> +   }
>>> +
>>>     ret = usb_gadget_map_request_by_dev(priv_dev->sysdev, request,
>>>                                         usb_endpoint_dir_in(ep->desc));
>>>     if (ret)
>>> @@ -1782,6 +2050,7 @@ static int cdns3_init_eps(struct cdns3_device 
>>> *priv_dev)
>>>
>>>             INIT_LIST_HEAD(&priv_ep->pending_req_list);
>>>             INIT_LIST_HEAD(&priv_ep->deferred_req_list);
>>> +           INIT_LIST_HEAD(&priv_ep->descmiss_req_list);
>>>     }
>>>
>>>     return 0;
>>> diff --git a/drivers/usb/cdns3/gadget.h b/drivers/usb/cdns3/gadget.h
>>> index 817f8ae7a4da..8de733b315e9 100644
>>> --- a/drivers/usb/cdns3/gadget.h
>>> +++ b/drivers/usb/cdns3/gadget.h
>>> @@ -1000,6 +1000,7 @@ struct cdns3_device;
>>>   * @endpoint: usb endpoint
>>>   * @pending_req_list: list of requests queuing on transfer ring.
>>>   * @deferred_req_list: list of requests waiting for queuing on transfer 
>>> ring.
>>> + * @descmiss_req_list: list of requests internally allocated by driver 
>>> (WA2).
>>>   * @trb_pool: transfer ring - array of transaction buffers
>>>   * @trb_pool_dma: dma address of transfer ring
>>>   * @cdns3_dev: device associated with this endpoint
>>> @@ -1026,6 +1027,7 @@ struct cdns3_endpoint {
>>>     struct usb_ep           endpoint;
>>>     struct list_head        pending_req_list;
>>>     struct list_head        deferred_req_list;
>>> +   struct list_head        descmiss_req_list;
>>>
>>>     struct cdns3_trb        *trb_pool;
>>>     dma_addr_t              trb_pool_dma;
>>> @@ -1041,6 +1043,9 @@ struct cdns3_endpoint {
>>>  #define EP_PENDING_REQUEST BIT(5)
>>>  #define EP_RING_FULL               BIT(6)
>>>  #define EP_CLAIMED         BIT(7)
>>> +#define EP_QUIRK_EXTRA_BUF_DET     BIT(8)
>>> +#define EP_QUIRK_EXTRA_BUF_EN      BIT(9)
>>> +#define EP_QUIRK_END_TRANSFER      BIT(10)
>>>
>>>     u32                     flags;
>>>
>>> @@ -1074,6 +1079,7 @@ struct cdns3_endpoint {
>>>   * @start_trb: number of the first TRB in transfer ring
>>>   * @end_trb: number of the last TRB in transfer ring
>>>   * @flags: flag specifying special usage of request
>>> + * @list: used by internally allocated request to add to descmiss_req_list.
>>>   */
>>>  struct cdns3_request {
>>>     struct usb_request      request;
>>> @@ -1086,6 +1092,7 @@ struct cdns3_request {
>>>  #define REQUEST_INTERNAL_CH        BIT(2)
>>>  #define REQUEST_ZLP                BIT(3)
>>>     u32                     flags;
>>> +   struct list_head        list;
>>>  };
>>>
>>>  #define to_cdns3_request(r) (container_of(r, struct cdns3_request, 
>>> request))
>>>
>>
>> --
>> Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
>> Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki

-- 
Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki

Reply via email to