What is the easiest way to calculate a percent rank “by” an index key?


For example, I have a dataset with 3 fields:


Year, State, Income


I wish to calculate the rank, by year, by state.

I also wish to calculate the “percent rank”, where I define percent rank as 


(n is the number of numeric data points within each date-state grouping.)



This is what I am currently doing:


1.  I create a “group by” field by using the paste function to combine date and 
state into a field called date_state.   I then use the rank function to 
calculate the rank by date, by state. 


2. I then add a field called “one” that I set to 1 if the value in income is 
numeric and to 0 if it is not.


3. I then take an aggregate sum of “one”.  This gives me a count (n) for each 
date-state grouping.



4. I next use merge to add this count to the table.


5. Finally, I calculate the percent rank.
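
For illustration, the five steps above might look roughly like the sketch 
below. The toy data frame is hypothetical, using ave() for the grouped rank 
is an assumption (the post only says the rank function is used), "numeric" 
is taken to mean "not NA", and rank / n is only an assumed percent rank 
definition.

```r
# Hypothetical toy data; column names follow the post.
df <- data.frame(
  Year   = c(2000, 2000, 2000, 2001, 2001),
  State  = c("NY", "NY", "NY", "CA", "CA"),
  Income = c(30, 10, 20, 5, NA),
  stringsAsFactors = FALSE
)

# Step 1: build the grouping key and rank Income within each group.
# NA incomes keep an NA rank.
df$date_state <- paste(df$Year, df$State, sep = "_")
df$rank <- ave(df$Income, df$date_state,
               FUN = function(x) rank(x, na.last = "keep"))

# Step 2: flag usable income values (assumption: usable means not NA).
df$one <- as.numeric(!is.na(df$Income))

# Step 3: sum the flags per group to get the count n.
counts <- aggregate(one ~ date_state, data = df, FUN = sum)
names(counts)[2] <- "n"

# Step 4: merge the counts back onto the rows.
df <- merge(df, counts, by = "date_state")

# Step 5: percent rank (assumed definition: rank / n).
df$pct_rank <- df$rank / df$n
```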




The merge takes quite a bit of time to process. 


Is there an easier/more efficient way to calculate the percent rank?
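
One possible way to avoid the merge, sketched here under assumptions: ave() 
returns a vector aligned with the original rows, so both the rank and the 
count can be attached directly. The toy data is hypothetical, non-numeric 
values are assumed to be stored as NA, and rank / n is only an assumed 
percent rank definition.

```r
# Hypothetical toy data; column names follow the post.
df <- data.frame(
  Year   = c(2000, 2000, 2000, 2001, 2001),
  State  = c("NY", "NY", "NY", "CA", "CA"),
  Income = c(30, 10, 20, 5, NA),
  stringsAsFactors = FALSE
)

# Single grouping key, as in the post.
df$date_state <- paste(df$Year, df$State, sep = "_")

# Rank within each group; NA incomes keep an NA rank.
df$rank <- ave(df$Income, df$date_state,
               FUN = function(x) rank(x, na.last = "keep"))

# Count of non-missing incomes per group, repeated on every row of the
# group, so no aggregate() or merge() is needed.
df$n <- ave(df$Income, df$date_state,
            FUN = function(x) sum(!is.na(x)))

# Percent rank (assumed definition: rank / n).
df$pct_rank <- df$rank / df$n
```

If percent rank is defined differently, only the last line would need to 
change.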





R-help@stat.math.ethz.ch mailing list
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
