My data frame looks like:
SightingID PA1 PA2 PlotID InOverlap Area1 2001 1 -99 392
Y 0.22 2002 1 -99 388 Y 0.253 2008 1
NA 104 N 0.344 2010 1 NA 71 N
0.185 2012 1 NA 61 N 0.166 2013 1 NA
61 N 0.227 2014 1 NA 62 N 0.258
2015 1 NA 63 N 0.199 2016 1 NA 63
N 0.310 2017 1 NA 63 N 0.2511
2018 1 NA 63 N 0.2612 2019 1 NA 63 N
0.2613 2020 1 NA 64 N 0.33 14 2021
1 NA 64 N 0.4215 2022 1 NA 85 N
0.0816 2023 0 1 95 Y 0.11 17 2024 1
NA 93 N 0.2318 2025 1 NA 106 N
0.419 2026 1 NA 134 !
N 0.28
The only unique values in the data frame are the SightingID. I would like to
obtain a new data frame with unique PlotID values based on several conditions:-
return the row if there is only one SightingID for the PlotID- if there is
several SightingID for the same PlotID value: -select first the SightingID
for which PA1=0, if there is several SightingID with PA1=0 for the same
PlotID select the one with the highest value in Area, if there is several
SightingID with PA1=0 for the same PlotID with the highest value for Area
select one SightingID at random - select the SightingID for which PA1 is
not equal to 0 based on the highest value in Area (and at random if there are
several with the highest value in Area)
I have no idea how to do that, can someone help me please ? Sarah
[[alternative HTML version deleted]]
______________________________________________
[email protected] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.