[ 
https://issues.apache.org/jira/browse/BEAM-6772?focusedWorklogId=223451&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-223451
 ]

ASF GitHub Bot logged work on BEAM-6772:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 05/Apr/19 05:36
            Start Date: 05/Apr/19 05:36
    Worklog Time Spent: 10m 
      Work Description: reuvenlax commented on issue #8006: [BEAM-6772] Change 
Select semantics to match what a user expects
URL: https://github.com/apache/beam/pull/8006#issuecomment-480154077
 
 
   @kanterov I went ahead and changed array and map selects to distribute the 
select. This preserves the invariant that the field name selected is always the 
one that appears in the resulting schema, and I believe is closer to what Spark 
does. 
   
   I still think we should have a thread on the dev list to discuss the 
detailed semantics, as we still might want to change them. However I think with 
this last commit, we are probably closer to where we want to end up.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 223451)
    Time Spent: 11h 40m  (was: 11.5h)

> Select transform has non-intuitive semantics
> --------------------------------------------
>
>                 Key: BEAM-6772
>                 URL: https://issues.apache.org/jira/browse/BEAM-6772
>             Project: Beam
>          Issue Type: Sub-task
>          Components: sdk-java-core
>            Reporter: Reuven Lax
>            Assignee: Reuven Lax
>            Priority: Major
>          Time Spent: 11h 40m
>  Remaining Estimate: 0h
>
> Consider the following schema:
> User:
>     name: STRING
>     location: Location
>  
> Location:
>     latitude: DOUBLE
>     longitude: DOUBLE
>  
> If you apply Select.fieldNames("location"), most users expect to get back a 
> row matching the Location schema. Instead you get back an outer schema with a 
> single location field in it. Select should instead unnest the output up to 
> the point where multiple fields are selected.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to