Hi,

how can I do a data step in R which can be best described in SQL terminology as 
an outer join of rows (eg cases) and inner join of columns (variables)? In case 
of conflicting non-missing values the first dataset has higher always priority. 
Missing values should always be replaced by valid values (cf. coalesce function 
in SQL):

INPUT DATASET 1:

  A  B  C
1 11 NA 31
2 12 22 32
3 NA 23 33

INPUT DATASET 2:

  C  D  E
2 NA 42 52   
3 NA 43 53
4 34 44 54

OUTPUT DATASET:
   A  B  C  D  E
1  11 NA 31 NA NA
2  12 22 32 42 52
3  NA 23 33 43 53
4  NA NA 34 44 54

Thanks in advance,

Will

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to