Narendra created SPARK-23291:
--------------------------------

             Summary: SparkR: substr: In a SparkR DataFrame, the starting and 
ending position arguments of "substr" give the wrong result when the position 
is greater than 1
                 Key: SPARK-23291
                 URL: https://issues.apache.org/jira/browse/SPARK-23291
             Project: Spark
          Issue Type: Bug
          Components: SparkR
    Affects Versions: 2.2.1
            Reporter: Narendra


Defect Description :

-----------------------------

For example, an input string "2017-12-01" is read into a SparkR DataFrame "df" 
with the column name "col1".
 The goal is to create a new column named "col2" holding the value "12", which 
is inside the string. "12" can be extracted with a starting position of "6" and 
an ending position of "7"
 (the position of the first character is taken to be "1").
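
As a reference point for the positions described above, base R's own "substr" uses the same 1-based, inclusive convention. The following is a minimal sketch in plain R, with no Spark session involved; the variable names are only illustrative:

```r
# Base R substr(x, start, stop): positions are 1-based and inclusive.
x <- "2017-12-01"

month <- substr(x, 6, 7)   # characters 6 through 7 -> "12"
year  <- substr(x, 1, 4)   # characters 1 through 4 -> "2017"

print(month)
print(year)
```

This is the result one would expect the SparkR column version of "substr" to match for the same start and end positions.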

However, the code that currently has to be written to get "12" is:
 
 df <- withColumn(df,"col2",substr(df$col1,7,8))

Observe that the first argument to the "substr" API, which indicates the 
starting position, has to be given as "7".
 Likewise, the second argument to the "substr" API, which indicates the 
ending position, has to be given as "8".

That is, each position passed to "substr" has to be the 
"actual position + 1".

Expected behavior :

----------------------------

The code that should produce "12" is:
 
 df <- withColumn(df,"col2",substr(df$col1,6,7))


Note :

-----------
 This defect is observed only when the starting position is greater than 1.


