[ 
https://issues.apache.org/jira/browse/ARROW-18274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17630499#comment-17630499
 ] 

Matthew Topol commented on ARROW-18274:
---------------------------------------

[~lquerel] There's already a need for a v10.0.1 due to issues with pyarrow and 
R, so there will be a v10.0.1 release and I've tagged this issue so it will get 
cherry-picked and included in that release. Keep an eye on the Mailing list for 
when that happens. Thanks!

> [Go] Sparse union of structs is buggy
> -------------------------------------
>
>                 Key: ARROW-18274
>                 URL: https://issues.apache.org/jira/browse/ARROW-18274
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Go
>    Affects Versions: 10.0.0, 10.0.1
>            Reporter: Laurent Querel
>            Assignee: Matthew Topol
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 11.0.0
>
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> There is a bug with union of structs in V10.
> The first unit test crash with a panic (i.e. invalid memory address or nil 
> pointer dereference). The second test works as expected.
>  
> {code:go}
> func TestDoesNotWork(t *testing.T) {
>    dt1 := arrow.SparseUnionOf([]arrow.Field{
>       {Name: "c", Type: arrow2.DictU16String},
>    }, []arrow.UnionTypeCode{0})
>    dt2 := arrow.StructOf(
>       arrow.Field{Name: "b", Type: dt1},
>    )
>    dt3 := arrow.SparseUnionOf([]arrow.Field{
>       {Name: "a", Type: dt2},
>    }, []arrow.UnionTypeCode{0})
>    pool := memory.NewGoAllocator()
>    builder := array.NewSparseUnionBuilder(pool, dt3)
>    defer builder.Release()
>    arr := builder.NewArray()
>    defer arr.Release()
>    assert.Equal(t, 0, arr.Len())
> }
> func TestWorksAsExpected(t *testing.T) {
>    dt1 := arrow.SparseUnionOf([]arrow.Field{
>       {Name: "c", Type: &arrow.DictionaryType{
>          IndexType: arrow.PrimitiveTypes.Uint16,
>          ValueType: arrow.BinaryTypes.String,
>          Ordered:   false,
>       }},
>    }, []arrow.UnionTypeCode{0})
>    dt2 := arrow.SparseUnionOf([]arrow.Field{
>       {Name: "a", Type: dt1},
>    }, []arrow.UnionTypeCode{0})
>    pool := memory.NewGoAllocator()
>    builder := array.NewSparseUnionBuilder(pool, dt2)
>    defer builder.Release()
>    arr := builder.NewArray()
>    defer arr.Release()
>    assert.Equal(t, 0, arr.Len())
> } {code}
>  
> *Analysis:*
>  - The `NewSparseUnionBuilder` calls the builders for each variant and also 
> calls defer builder.Release. 
>  - The Struct Release method calls the Release methods of every field even if 
> the refCount is not 0, so the Release method of the second union is called 
> followed by the Release method of the dictionary. 
>  - Although, the union builder is returned without error, the builder is not 
> usable.
>  - This bug doesn't happen with 2 nested unions. As the internal counter is 
> properly tested.
>  
> First, I don't understand why the Release method of each variant is called 
> right after the Union constructor is created. I also don't understand why the 
> Release method of the structure calls the Release method of each field 
> regardless of the value of the internal refCount. This looks like a bug to 
> me, but I'm not quite sure yet what the right way to fix it will be.
>  
> Any idea?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to