Re: [Patch Part2 v5 21/31] PCI/MSI: Enhance core to support hierarchy irqdomain

2014-11-09 Thread Suravee Suthikulpanit

On 11/9/14 14:15, Jiang Liu wrote:

On 2014/11/9 12:26, Suravee Suthikulpanit wrote:

Hi Gerry,

Please see my comments / questions below.

On 11/6/14 21:20, Jiang Liu wrote:

Enhance PCI MSI core to support hierarchy irqdomain, so the common
code could be shared among architectures.

Signed-off-by: Jiang Liu 
---
Hi Thomas,
 These changes are a temporary solution, I'm working on another
patch set which will refine these interfaces, especially kill
arch_msi_irq_domain_{set|get}_hwirq().
Regards!
Gerry


Would the change includes the struct irqdomain_msi_data proposed by
Thomas here (https://lkml.org/lkml/2014/11/6/210)?

Hi Suravee,
I'm working on another patch set which will refine the way to
create irqdomain for MSI. I hope will solve all issues mentioned by
Thomas. I will post that patch set as RFC within one or two days.


Let me know if I can help.




[...]
+static int msi_domain_alloc(struct irq_domain *domain, unsigned int
virq,
+unsigned int nr_irqs, void *arg)
+{
+int i, ret;
+irq_hw_number_t hwirq = arch_msi_irq_domain_get_hwirq(arg);
+
+if (irq_find_mapping(domain, hwirq) > 0)
+return -EEXIST;
+
+ret = irq_domain_alloc_irqs_parent(domain, virq, nr_irqs, arg);
+if (ret < 0)
+return ret;
+


When executing irq_domain_alloc_irqs_parent(), it triggers the following
warning message due to

 WARN_ON(desc->irq_data.chip == &no_irq_chip)

[ cut here ]
WARNING: CPU: 2 PID: 912 at kernel/irq/chip.c:734
__irq_set_handler+0x138/0x13c()
Modules linked in: mlx4_core(+) rtc_efi efivarfs
CPU: 2 PID: 912 Comm: modprobe Not tainted 3.18.0-rc3-p2v5+ #53
Call trace:
[] dump_backtrace+0x0/0x16c
[] show_stack+0x10/0x1c
[] dump_stack+0x74/0x98
[] warn_slowpath_common+0x84/0xac
[] warn_slowpath_null+0x14/0x20
[] __irq_set_handler+0x134/0x13c
[] gic_irq_domain_map+0x4c/0xac
[] gic_irq_domain_alloc+0x60/0x88
[] gicv2m_domain_alloc+0x30/0xa8
[] __irq_domain_alloc_irqs+0x144/0x30c
[] gicv2m_setup_msi_irq+0xc0/0x118
[] arch_setup_msi_irq+0x34/0x60
[] arch_setup_msi_irqs+0x50/0xb0
[] pci_enable_msix+0x310/0x39c
[] pci_enable_msix_range+0x34/0x9c


However, I think if we call irq_domain_set_hwirq_and_chip() in the
msi_create_irq_domain() as suggested by Thomas, that should also fix
this issue.

We didn't trigger this warning on x86 because only the outer-most
irqdomain will set irq flow handler. On ARM, the inner GIC domain
will set irq flow handler, but the irq_data->chip for outer-most
domain hasn't been set yet.

I'm still find a way out here. Maybe we need to relax the WARN_ON().


What if we just switch the order and call the 
irq_domain_alloc_irqs_parent() after the loop? That way we can still 
keep the WARN_ON().


Suravee
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [Patch Part2 v5 21/31] PCI/MSI: Enhance core to support hierarchy irqdomain

2014-11-08 Thread Jiang Liu
On 2014/11/9 12:26, Suravee Suthikulpanit wrote:
> Hi Gerry,
> 
> Please see my comments / questions below.
> 
> On 11/6/14 21:20, Jiang Liu wrote:
>> Enhance PCI MSI core to support hierarchy irqdomain, so the common
>> code could be shared among architectures.
>>
>> Signed-off-by: Jiang Liu 
>> ---
>> Hi Thomas,
>> These changes are a temporary solution, I'm working on another
>> patch set which will refine these interfaces, especially kill
>> arch_msi_irq_domain_{set|get}_hwirq().
>> Regards!
>> Gerry
> 
> Would the change includes the struct irqdomain_msi_data proposed by
> Thomas here (https://lkml.org/lkml/2014/11/6/210)?
Hi Suravee,
I'm working on another patch set which will refine the way to
create irqdomain for MSI. I hope will solve all issues mentioned by
Thomas. I will post that patch set as RFC within one or two days.

> 
>> [...]
>> +static int msi_domain_alloc(struct irq_domain *domain, unsigned int
>> virq,
>> +unsigned int nr_irqs, void *arg)
>> +{
>> +int i, ret;
>> +irq_hw_number_t hwirq = arch_msi_irq_domain_get_hwirq(arg);
>> +
>> +if (irq_find_mapping(domain, hwirq) > 0)
>> +return -EEXIST;
>> +
>> +ret = irq_domain_alloc_irqs_parent(domain, virq, nr_irqs, arg);
>> +if (ret < 0)
>> +return ret;
>> +
> 
> When executing irq_domain_alloc_irqs_parent(), it triggers the following
> warning message due to
> 
> WARN_ON(desc->irq_data.chip == &no_irq_chip)
> 
> [ cut here ]
> WARNING: CPU: 2 PID: 912 at kernel/irq/chip.c:734
> __irq_set_handler+0x138/0x13c()
> Modules linked in: mlx4_core(+) rtc_efi efivarfs
> CPU: 2 PID: 912 Comm: modprobe Not tainted 3.18.0-rc3-p2v5+ #53
> Call trace:
> [] dump_backtrace+0x0/0x16c
> [] show_stack+0x10/0x1c
> [] dump_stack+0x74/0x98
> [] warn_slowpath_common+0x84/0xac
> [] warn_slowpath_null+0x14/0x20
> [] __irq_set_handler+0x134/0x13c
> [] gic_irq_domain_map+0x4c/0xac
> [] gic_irq_domain_alloc+0x60/0x88
> [] gicv2m_domain_alloc+0x30/0xa8
> [] __irq_domain_alloc_irqs+0x144/0x30c
> [] gicv2m_setup_msi_irq+0xc0/0x118
> [] arch_setup_msi_irq+0x34/0x60
> [] arch_setup_msi_irqs+0x50/0xb0
> [] pci_enable_msix+0x310/0x39c
> [] pci_enable_msix_range+0x34/0x9c
> 
> 
> However, I think if we call irq_domain_set_hwirq_and_chip() in the
> msi_create_irq_domain() as suggested by Thomas, that should also fix
> this issue.
We didn't trigger this warning on x86 because only the outer-most
irqdomain will set irq flow handler. On ARM, the inner GIC domain
will set irq flow handler, but the irq_data->chip for outer-most
domain hasn't been set yet.

I'm still find a way out here. Maybe we need to relax the WARN_ON().

> 
>> +for (i = 0; i < nr_irqs; i++) {
>> +irq_domain_set_hwirq_and_chip(domain, virq + i, hwirq + i,
>> +  domain->host_data,
>> +  (void *)(long)i);
>> +__irq_set_handler(virq + i, handle_edge_irq, 0, "edge");
>> +}
> 
> Is there are a way to specify other type of handler besides edge?
Yes, the new patch set will address this requirement.

> 
>> [...]
>> +static void msi_domain_activate(struct irq_domain *domain,
>> +struct irq_data *irq_data)
>> +{
>> +struct msi_msg msg;
>> +
>> +/*
>> + * irq_data->chip_data is MSI/MSI-X offset.
>> + * MSI-X message is written per-IRQ, the offset is always 0.
>> + * MSI message denotes a contiguous group of IRQs, written for
>> 0th IRQ.
>> + */
>> +if (irq_data->chip_data)
>> +return;
> 
> Actually, I am a bit confused with this comment here.  If you look at
> "/drivers/pci/msi.c: arch_setup_msi_irq()", it calls
> irq_set_chip_data(des->irq, chip) where chip is *msi_chip.
> 
> It looks like if the arch uses this API, it would conflict with what you
> have here where the irq_data->chip_data would be not NULL at the logic
> above, and would always return.
Good point, it's work on x86, but will break ARM or other platforms.
Will refine this part.
Regards!
Gerry
> 
> Thank you,
> 
> Suravee
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [Patch Part2 v5 21/31] PCI/MSI: Enhance core to support hierarchy irqdomain

2014-11-08 Thread Suravee Suthikulpanit

Hi Gerry,

Please see my comments / questions below.

On 11/6/14 21:20, Jiang Liu wrote:

Enhance PCI MSI core to support hierarchy irqdomain, so the common
code could be shared among architectures.

Signed-off-by: Jiang Liu 
---
Hi Thomas,
These changes are a temporary solution, I'm working on another
patch set which will refine these interfaces, especially kill
arch_msi_irq_domain_{set|get}_hwirq().
Regards!
Gerry


Would the change includes the struct irqdomain_msi_data proposed by 
Thomas here (https://lkml.org/lkml/2014/11/6/210)?



[...]
+static int msi_domain_alloc(struct irq_domain *domain, unsigned int virq,
+   unsigned int nr_irqs, void *arg)
+{
+   int i, ret;
+   irq_hw_number_t hwirq = arch_msi_irq_domain_get_hwirq(arg);
+
+   if (irq_find_mapping(domain, hwirq) > 0)
+   return -EEXIST;
+
+   ret = irq_domain_alloc_irqs_parent(domain, virq, nr_irqs, arg);
+   if (ret < 0)
+   return ret;
+


When executing irq_domain_alloc_irqs_parent(), it triggers the following 
warning message due to


WARN_ON(desc->irq_data.chip == &no_irq_chip)

[ cut here ]
WARNING: CPU: 2 PID: 912 at kernel/irq/chip.c:734 
__irq_set_handler+0x138/0x13c()

Modules linked in: mlx4_core(+) rtc_efi efivarfs
CPU: 2 PID: 912 Comm: modprobe Not tainted 3.18.0-rc3-p2v5+ #53
Call trace:
[] dump_backtrace+0x0/0x16c
[] show_stack+0x10/0x1c
[] dump_stack+0x74/0x98
[] warn_slowpath_common+0x84/0xac
[] warn_slowpath_null+0x14/0x20
[] __irq_set_handler+0x134/0x13c
[] gic_irq_domain_map+0x4c/0xac
[] gic_irq_domain_alloc+0x60/0x88
[] gicv2m_domain_alloc+0x30/0xa8
[] __irq_domain_alloc_irqs+0x144/0x30c
[] gicv2m_setup_msi_irq+0xc0/0x118
[] arch_setup_msi_irq+0x34/0x60
[] arch_setup_msi_irqs+0x50/0xb0
[] pci_enable_msix+0x310/0x39c
[] pci_enable_msix_range+0x34/0x9c


However, I think if we call irq_domain_set_hwirq_and_chip() in the 
msi_create_irq_domain() as suggested by Thomas, that should also fix 
this issue.



+   for (i = 0; i < nr_irqs; i++) {
+   irq_domain_set_hwirq_and_chip(domain, virq + i, hwirq + i,
+ domain->host_data,
+ (void *)(long)i);
+   __irq_set_handler(virq + i, handle_edge_irq, 0, "edge");
+   }


Is there are a way to specify other type of handler besides edge?


[...]
+static void msi_domain_activate(struct irq_domain *domain,
+   struct irq_data *irq_data)
+{
+   struct msi_msg msg;
+
+   /*
+* irq_data->chip_data is MSI/MSI-X offset.
+* MSI-X message is written per-IRQ, the offset is always 0.
+* MSI message denotes a contiguous group of IRQs, written for 0th IRQ.
+*/
+   if (irq_data->chip_data)
+   return;


Actually, I am a bit confused with this comment here.  If you look at 
"/drivers/pci/msi.c: arch_setup_msi_irq()", it calls 
irq_set_chip_data(des->irq, chip) where chip is *msi_chip.


It looks like if the arch uses this API, it would conflict with what you 
have here where the irq_data->chip_data would be not NULL at the logic 
above, and would always return.


Thank you,

Suravee
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [Patch Part2 v5 21/31] PCI/MSI: Enhance core to support hierarchy irqdomain

2014-11-07 Thread Thomas Gleixner
On Thu, 6 Nov 2014, Jiang Liu wrote:

> Enhance PCI MSI core to support hierarchy irqdomain, so the common
> code could be shared among architectures.
> 
> Signed-off-by: Jiang Liu 
> ---
> Hi Thomas,
>   These changes are a temporary solution, I'm working on another
> patch set which will refine these interfaces, especially kill 
> arch_msi_irq_domain_{set|get}_hwirq().

Ok. Please don't forget the irq_chip change :)

Thanks,

tglx
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


[Patch Part2 v5 21/31] PCI/MSI: Enhance core to support hierarchy irqdomain

2014-11-06 Thread Jiang Liu
Enhance PCI MSI core to support hierarchy irqdomain, so the common
code could be shared among architectures.

Signed-off-by: Jiang Liu 
---
Hi Thomas,
These changes are a temporary solution, I'm working on another
patch set which will refine these interfaces, especially kill 
arch_msi_irq_domain_{set|get}_hwirq().
Regards!
Gerry
---
 drivers/pci/Kconfig |4 ++
 drivers/pci/msi.c   |  134 +++
 include/linux/msi.h |   14 ++
 3 files changed, 152 insertions(+)

diff --git a/drivers/pci/Kconfig b/drivers/pci/Kconfig
index b9db0f2ce11f..022e89745f86 100644
--- a/drivers/pci/Kconfig
+++ b/drivers/pci/Kconfig
@@ -16,6 +16,10 @@ config PCI_MSI
 
   If you don't know what to do here, say Y.
 
+config PCI_MSI_IRQ_DOMAIN
+   bool
+   depends on PCI_MSI && IRQ_DOMAIN_HIERARCHY
+
 config PCI_DEBUG
bool "PCI Debugging"
depends on PCI && DEBUG_KERNEL
diff --git a/drivers/pci/msi.c b/drivers/pci/msi.c
index da181c59394b..8de7c8774fd2 100644
--- a/drivers/pci/msi.c
+++ b/drivers/pci/msi.c
@@ -19,6 +19,7 @@
 #include 
 #include 
 #include 
+#include 
 
 #include "pci.h"
 
@@ -1098,3 +1099,136 @@ int pci_enable_msix_range(struct pci_dev *dev, struct 
msix_entry *entries,
return nvec;
 }
 EXPORT_SYMBOL(pci_enable_msix_range);
+
+#ifdef CONFIG_PCI_MSI_IRQ_DOMAIN
+/*
+ * Generate a unique ID number for each possible MSI source, the ID number
+ * is only used within the irqdomain.
+ */
+static inline irq_hw_number_t
+msi_get_hwirq(struct pci_dev *dev, struct msi_desc *desc)
+{
+   return (irq_hw_number_t)desc->msi_attrib.entry_nr |
+   PCI_DEVID(dev->bus->number, dev->devfn) << 11 |
+   (pci_domain_nr(dev->bus) & 0x) << 27;
+}
+
+static int msi_domain_alloc(struct irq_domain *domain, unsigned int virq,
+   unsigned int nr_irqs, void *arg)
+{
+   int i, ret;
+   irq_hw_number_t hwirq = arch_msi_irq_domain_get_hwirq(arg);
+
+   if (irq_find_mapping(domain, hwirq) > 0)
+   return -EEXIST;
+
+   ret = irq_domain_alloc_irqs_parent(domain, virq, nr_irqs, arg);
+   if (ret < 0)
+   return ret;
+
+   for (i = 0; i < nr_irqs; i++) {
+   irq_domain_set_hwirq_and_chip(domain, virq + i, hwirq + i,
+ domain->host_data,
+ (void *)(long)i);
+   __irq_set_handler(virq + i, handle_edge_irq, 0, "edge");
+   }
+
+   return ret;
+}
+
+static void msi_domain_free(struct irq_domain *domain, unsigned int virq,
+   unsigned int nr_irqs)
+{
+   int i;
+
+   for (i = 0; i < nr_irqs; i++) {
+   struct msi_desc *desc = irq_get_msi_desc(virq);
+
+   if (desc)
+   desc->irq = 0;
+   }
+   irq_domain_free_irqs_top(domain, virq, nr_irqs);
+}
+
+static void msi_domain_activate(struct irq_domain *domain,
+   struct irq_data *irq_data)
+{
+   struct msi_msg msg;
+
+   /*
+* irq_data->chip_data is MSI/MSI-X offset.
+* MSI-X message is written per-IRQ, the offset is always 0.
+* MSI message denotes a contiguous group of IRQs, written for 0th IRQ.
+*/
+   if (irq_data->chip_data)
+   return;
+
+   BUG_ON(irq_chip_compose_msi_msg(irq_data, &msg));
+   __write_msi_msg(irq_data->msi_desc, &msg);
+}
+
+static void msi_domain_deactivate(struct irq_domain *domain,
+ struct irq_data *irq_data)
+{
+   struct msi_msg msg;
+
+   if (!irq_data->chip_data) {
+   memset(&msg, 0, sizeof(msg));
+   __write_msi_msg(irq_data->msi_desc, &msg);
+   }
+}
+
+static struct irq_domain_ops msi_domain_ops = {
+   .alloc = msi_domain_alloc,
+   .free = msi_domain_free,
+   .activate = msi_domain_activate,
+   .deactivate = msi_domain_deactivate,
+};
+
+struct irq_domain *msi_create_irq_domain(struct device_node *of_node,
+struct irq_chip *chip,
+struct irq_domain *parent)
+{
+   struct irq_domain *domain;
+
+   domain = irq_domain_add_tree(of_node, &msi_domain_ops, chip);
+   if (!domain)
+   return NULL;
+
+   domain->parent = parent;
+
+   return domain;
+}
+
+int msi_irq_domain_alloc_irqs(struct irq_domain *domain, int type,
+ struct pci_dev *dev, void *arg)
+{
+   int i, virq;
+   struct msi_desc *desc;
+   int node = dev_to_node(&dev->dev);
+
+   list_for_each_entry(desc, &dev->msi_list, list) {
+   arch_msi_irq_domain_set_hwirq(arg, msi_get_hwirq(dev, desc));
+   virq = irq_domain_alloc_irqs(domain, desc->nvec_used,
+node, arg);
+   if (virq < 0) {
+   /*