Hi Marek,

ma...@denx.de wrote on Fri, 2 Dec 2022 16:23:29 +0100:

> On 12/2/22 16:00, Miquel Raynal wrote:
> > Hi Marek,  
> 
> Hi,
> 
> > ma...@denx.de wrote on Fri, 2 Dec 2022 15:31:40 +0100:
> >   
> >> On 12/2/22 15:05, Miquel Raynal wrote:  
> >>> Hi Francesco,  
> >>
> >> Hi,
> >>
> >> [...]
> >>  
> >>> I still strongly disagree with the initial proposal but what I think we
> >>> can do is:
> >>>
> >>> 1. To prevent future breakages:
> >>>     Fix fdt_fixup_mtdparts() in u-boot. This way newer U-Boot + any
> >>>     kernel should work.
> >>>
> >>> 2. To help tracking down situations like that:
> >>>     Keep the warning in ofpart.c but continue to fail.
> >>>
> >>> 3. To fix the current situation:
> >>>      Immediately revert commit (and prevent it from being backported):
> >>>      753395ea1e45 ("ARM: dts: imx7: Fix NAND controller size-cells")
> >>>      This way your own boot flow is fixed in the short term.  
> >>
> >> Here I disagree, the fix is correct and I think we shouldn't
> >> proliferate incorrect DTs which don't match the binding document.  
> > 
> > I agree we should not proliferate incorrect DTs, so let's use a modern
> > description then  
> 
> Yes please !
> 
> > , with a controller and a child node which defines the
> > chip.  
> 
> But what if there is no chip connected to the controller node ?
> 
> If I understand the proposal here right (please correct me if I'm wrong), 
> then:

Good idea to summarize.

> 
> 1) This is the original, old, wrong binding:
> &gpmi {
>    #size-cells = <1>;
>    ...
>    partition@N { ... };
> };

Yes.

> 
> 
> 2) This is the newer, but still wrong binding:
> &gpmi {
>    #size-cells = <0>;
>    ...
>    partitions {
>      partition@N { ... };
>    };
> };

Well, this is wrong description, but it would work (for compat reasons,
even though I don't think this is considered valid DT by the schemas).

> 
> 3) This is the newest binding, what we want:
> &gpmi {
>    #size-cells = <0>;
>    ...
>    nand-chip {
>      partitions {
>        partition@N { ... };
>      };
>    };
> };

Yes

> 
> But if there is no physical nand chip connected to the controller, would we 
> end up with empty nand-chip node in DT, like this?
> &gpmi {
>    #size-cells = <X>;
>    ...
>    nand-chip { /* empty */ };
> };

Is this really a concern? If there is no NAND chip, the controller
should be disabled, no? I guess technically you could even use the
status property in the nand-chip node...

However, it should not be empty, at the very least a reg property
should indicate on which CS it is wired, as expected there:
https://git.kernel.org/pub/scm/linux/kernel/git/mtd/linux.git/tree/Documentation/devicetree/bindings/mtd/nand-chip.yaml?h=mtd/next

But, as nand-chip.yaml references mtd.yaml, you can as well use
whatever is described here:
https://git.kernel.org/pub/scm/linux/kernel/git/mtd/linux.git/tree/Documentation/devicetree/bindings/mtd/mtd.yaml?h=mtd/next

> What would be the gpmi controller size cells (X) in that case, still 0, right 
> ? So how does that help solve this problem, wouldn't U-Boot still populate 
> the partitions directly under the gpmi node or into partitions sub-node ?

The commit that was pointed in the original fix clearly stated that the
NAND chip node was targeted, not the NAND controller node. I hope this
is correctly supported in U-Boot though. So if there is a NAND chip
subnode, I suppose U-Boot would try to create the partitions that are
inside, or even in the sub "partitions" container.

> >> Rather, if a bootloader generates incorrect (new) DT entries, I
> >> believe the driver should implement a fixup and warn user about this.
> >> PC does that as well with broken ACPI tables as far as I can tell.
> >>
> >> I'm not convinced making a DT non-compliant with bindings again,  
> > 
> > I am sorry to say so, but while warnings reported by the tools
> > should be fixed, it's not because the tool does not scream at you that
> > the description is valid. We are actively working on enhancing the
> > schema so that "all" improper descriptions get warnings (see the series
> > pointed earlier), but in no way this change makes the node compliant
> > with modern bindings.
> > 
> > I'm not saying the fix is wrong, but let's be pragmatic, it currently
> > leads to boot failures.  
> 
> I fully agree that we do have a problem, and that it trickled into stable 
> makes it even worse. Maybe I don't fully understand the thing with nand-chip 
> proposal, see my question above, esp. the last part.
> 
> >> only to work around a problem induced by bootloader, is the right approach
> >> here.  
> > 
> > When a patch breaks a board and there is no straight fix, you revert
> > it, then you think harder. That's what I am saying. This is a temporary
> > solution.  
> 
> Isn't this patch the straight fix, at least until the bootloader can be 
> updated to generate the nand-chip node correctly ?
> 
> >> This would be setting a dangerous example, where anyone could request a DT 
> >> fix to be reverted because their random bootloader does the wrong thing 
> >> and with valid DT clean up, something broke.  
> > 
> > Please, you know this is not valid DT clean up. We've been decoupling
> > controller and chip description since 2016. What I am proposing is a
> > valid DT cleanup, not to the latest standard, but way closer than the
> > current solution.  
> 
> I think I really need one more explanation of the nand-chip part above.

I hope things are clearer now.

Thanks,
Miquèl

Reply via email to