Re: [R] data managment

2006-06-15 Thread Adaikalavan Ramasamy
If your df contains your data, try

 tmp - cbind( paste(df[ ,1], df[ ,2], sep=:), 
   paste(df[ ,3], df[ ,4], sep=:) )
 tmp - t( apply(tmp, 1, sort) )

 out - data.frame( do.call(rbind, strsplit( tmp[,1], split=: )), 
do.call(rbind, strsplit( tmp[,2], split=: )) )
 colnames(out) - colnames(df)
 out

Regards, Adai



On Wed, 2006-06-14 at 16:35 +0100, yohannes alazar wrote:
 First I would really like to thank the mailing list for help I got in the
 past, as a new to R I am really needing some support on hoe to code the
 following problem.
 
 
 
 I am trying to sort some data I have in a big file. The file has 4 columns
 and 19000 rows. An example of it looks like this:-
 
 
 
 G 0.892   A 0.108
 
 G 0.883   T  0.117
 
 T  0.5   C 0.5
 
 A 0.617   G 0.383
 
 G 0.925   A 0.075
 
 A 0.967   G 0.033
 
 C 0.883   T  0.117
 
 C 0.633   T  0.367
 
 G 0.95 A 0.05
 
 C 0.742   G 0.258
 
 G 0.875   T  0.125
 
 T  0.167   C 0.833
 
 C 0.792   A 0.208
 
 
 
 Columns one and three are alphabets while three and four are their
 corresponding values.
 
 I wanted to sort this data so that my first and third columns are in
 alphabetic order. For example in the first row the order is G then A.
 This is not in alphabetic order therefore we swap them along with their
 values and it becomes:
 
  A0.108   G 0.892
 
 Row two looks fine but row three needs the same rearrangement as row one.
 And the final out put looks like:
 
 A 0.108   G 0.892
 
 G 0.883   T  0.117
 
 C 0.5   T  0.5
 
 A 0.617   G 0.383
 
 A 0.075   G 0.925
 
 A 0.967   G 0.033
 
 C 0.883   T  0.117
 
 C 0.633   T  0.367
 
 A 0.05 G 0.95
 
 C 0.742   G 0.258
 
 G 0.875   T  0.125
 
 C 0.833   T  0.167
 
 A 0.208   C 0.792
 
 Please some help with the relevant command names or a technique to code this
 task.
 
 Thank you in advance
 
 Regards Hannes
 
   [[alternative HTML version deleted]]
 
 __
 R-help@stat.math.ethz.ch mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


Re: [R] data managment

2006-06-15 Thread yohannes alazar
Thank you very much this does the work exactlly.
regards Hannes


On 6/15/06, Adaikalavan Ramasamy [EMAIL PROTECTED] wrote:

 If your df contains your data, try

 tmp - cbind( paste(df[ ,1], df[ ,2], sep=:),
   paste(df[ ,3], df[ ,4], sep=:) )
 tmp - t( apply(tmp, 1, sort) )

 out - data.frame( do.call(rbind, strsplit( tmp[,1], split=: )),
do.call(rbind, strsplit( tmp[,2], split=: )) )
 colnames(out) - colnames(df)
 out

 Regards, Adai



 On Wed, 2006-06-14 at 16:35 +0100, yohannes alazar wrote:
  First I would really like to thank the mailing list for help I got in
 the
  past, as a new to R I am really needing some support on hoe to code the
  following problem.
 
 
 
  I am trying to sort some data I have in a big file. The file has 4
 columns
  and 19000 rows. An example of it looks like this:-
 
 
 
  G 0.892   A 0.108
 
  G 0.883   T  0.117
 
  T  0.5   C 0.5
 
  A 0.617   G 0.383
 
  G 0.925   A 0.075
 
  A 0.967   G 0.033
 
  C 0.883   T  0.117
 
  C 0.633   T  0.367
 
  G 0.95 A 0.05
 
  C 0.742   G 0.258
 
  G 0.875   T  0.125
 
  T  0.167   C 0.833
 
  C 0.792   A 0.208
 
 
 
  Columns one and three are alphabets while three and four are their
  corresponding values.
 
  I wanted to sort this data so that my first and third columns are in
  alphabetic order. For example in the first row the order is G then
 A.
  This is not in alphabetic order therefore we swap them along with their
  values and it becomes:
 
   A0.108   G 0.892
 
  Row two looks fine but row three needs the same rearrangement as row
 one.
  And the final out put looks like:
 
  A 0.108   G 0.892
 
  G 0.883   T  0.117
 
  C 0.5   T  0.5
 
  A 0.617   G 0.383
 
  A 0.075   G 0.925
 
  A 0.967   G 0.033
 
  C 0.883   T  0.117
 
  C 0.633   T  0.367
 
  A 0.05 G 0.95
 
  C 0.742   G 0.258
 
  G 0.875   T  0.125
 
  C 0.833   T  0.167
 
  A 0.208   C 0.792
 
  Please some help with the relevant command names or a technique to code
 this
  task.
 
  Thank you in advance
 
  Regards Hannes
 
[[alternative HTML version deleted]]
 
  __
  R-help@stat.math.ethz.ch mailing list
  https://stat.ethz.ch/mailman/listinfo/r-help
  PLEASE do read the posting guide!
 http://www.R-project.org/posting-guide.html
 



[[alternative HTML version deleted]]

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


[R] data managment

2006-06-14 Thread yohannes alazar
First I would really like to thank the mailing list for help I got in the
past, as a new to R I am really needing some support on hoe to code the
following problem.



I am trying to sort some data I have in a big file. The file has 4 columns
and 19000 rows. An example of it looks like this:-



G 0.892   A 0.108

G 0.883   T  0.117

T  0.5   C 0.5

A 0.617   G 0.383

G 0.925   A 0.075

A 0.967   G 0.033

C 0.883   T  0.117

C 0.633   T  0.367

G 0.95 A 0.05

C 0.742   G 0.258

G 0.875   T  0.125

T  0.167   C 0.833

C 0.792   A 0.208



Columns one and three are alphabets while three and four are their
corresponding values.

I wanted to sort this data so that my first and third columns are in
alphabetic order. For example in the first row the order is G then A.
This is not in alphabetic order therefore we swap them along with their
values and it becomes:

 A0.108   G 0.892

Row two looks fine but row three needs the same rearrangement as row one.
And the final out put looks like:

A 0.108   G 0.892

G 0.883   T  0.117

C 0.5   T  0.5

A 0.617   G 0.383

A 0.075   G 0.925

A 0.967   G 0.033

C 0.883   T  0.117

C 0.633   T  0.367

A 0.05 G 0.95

C 0.742   G 0.258

G 0.875   T  0.125

C 0.833   T  0.167

A 0.208   C 0.792

Please some help with the relevant command names or a technique to code this
task.

Thank you in advance

Regards Hannes

[[alternative HTML version deleted]]

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


Re: [R] data managment

2006-06-14 Thread jim holtman
This 'applies' a function to each row to reverse columns if necessary:



 x
   V1V2 V3V4
1   G 0.892  A 0.108
2   G 0.883  T 0.117
3   T 0.500  C 0.500
4   A 0.617  G 0.383
5   G 0.925  A 0.075
6   A 0.967  G 0.033
7   C 0.883  T 0.117
8   C 0.633  T 0.367
9   G 0.950  A 0.050
10  C 0.742  G 0.258
11  G 0.875  T 0.125
12  T 0.167  C 0.833
13  C 0.792  A 0.208
 str(x)
`data.frame':   13 obs. of  4 variables:
 $ V1: chr  G G T A ...
 $ V2: num  0.892 0.883 0.5 0.617 0.925 0.967 0.883 0.633 0.95 0.742 ...
 $ V3: chr  A T C G ...
 $ V4: num  0.108 0.117 0.5 0.383 0.075 0.033 0.117 0.367 0.05 0.258 ...

 t(apply(x, 1, function(z){
+ if (z[1]  z[3]) return(z)
+ else return(z[c(3,4,1,2)])
+ }))
   [,1] [,2][,3] [,4]
1  A  0.108 G  0.892
2  G  0.883 T  0.117
3  C  0.500 T  0.500
4  A  0.617 G  0.383
5  A  0.075 G  0.925
6  A  0.967 G  0.033
7  C  0.883 T  0.117
8  C  0.633 T  0.367
9  A  0.050 G  0.950
10 C  0.742 G  0.258
11 G  0.875 T  0.125
12 C  0.833 T  0.167
13 A  0.208 C  0.792





On 6/14/06, yohannes alazar [EMAIL PROTECTED] wrote:

 First I would really like to thank the mailing list for help I got in the
 past, as a new to R I am really needing some support on hoe to code the
 following problem.



 I am trying to sort some data I have in a big file. The file has 4 columns
 and 19000 rows. An example of it looks like this:-



 G 0.892   A 0.108

 G 0.883   T  0.117

 T  0.5   C 0.5

 A 0.617   G 0.383

 G 0.925   A 0.075

 A 0.967   G 0.033

 C 0.883   T  0.117

 C 0.633   T  0.367

 G 0.95 A 0.05

 C 0.742   G 0.258

 G 0.875   T  0.125

 T  0.167   C 0.833

 C 0.792   A 0.208



 Columns one and three are alphabets while three and four are their
 corresponding values.

 I wanted to sort this data so that my first and third columns are in
 alphabetic order. For example in the first row the order is G then A.
 This is not in alphabetic order therefore we swap them along with their
 values and it becomes:

 A0.108   G 0.892

 Row two looks fine but row three needs the same rearrangement as row one.
 And the final out put looks like:

 A 0.108   G 0.892

 G 0.883   T  0.117

 C 0.5   T  0.5

 A 0.617   G 0.383

 A 0.075   G 0.925

 A 0.967   G 0.033

 C 0.883   T  0.117

 C 0.633   T  0.367

 A 0.05 G 0.95

 C 0.742   G 0.258

 G 0.875   T  0.125

 C 0.833   T  0.167

 A 0.208   C 0.792

 Please some help with the relevant command names or a technique to code
 this
 task.

 Thank you in advance

 Regards Hannes

[[alternative HTML version deleted]]

 __
 R-help@stat.math.ethz.ch mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide!
 http://www.R-project.org/posting-guide.html




-- 
Jim Holtman
Cincinnati, OH
+1 513 646 9390 (Cell)
+1 513 247 0281 (Home)

What is the problem you are trying to solve?

[[alternative HTML version deleted]]

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html