Re: [R] any and all

Dénes Tóth Fri, 12 Apr 2024 15:43:34 -0700

Hi Avi,

As Duncan already mentioned, a reproducible example would be helpful toassist you better. Having said that, I think you misunderstand how`dplyr::filter` works: it performs row-wise filtering, so the filteringexpression shall return a logical vector of the same length as thedata.frame, or must be a single boolean value meaning "keep all" (TRUE)or "drop all" (FALSE). If you use `any()` or `all()`, they return asingle boolean value, so you have an all-or-nothing filter in the end,which is probably not what you want.

Note also that you do not need to use `mutate` to use `filter` (read?dpylr::filter carefully):

```
filter(
  .data = mydata,
  !is.na(first.a) | !is.na(first.b),
  !is.na(second.a) | !is.na(second.b),
  !is.na(third.a) | !is.na(third.b)
)
```

Or you can use `base::subset()`:
```
subset(
  mydata,
  (!is.na(first.a) | !is.na(first.b))
  & (!is.na(second.a) | !is.na(second.b))
  & (!is.na(third.a) | !is.na(third.b))
)
```

Regards,
Denes

On 4/12/24 23:59, Duncan Murdoch wrote:

On 12/04/2024 3:52 p.m., avi.e.gr...@gmail.com wrote:

Base R has generic functions called any() and all() that I am havingtrouble

using.
It works fine when I play with it in a base R context as in:

all(any(TRUE, TRUE), any(TRUE, FALSE))

[1] TRUE

all(any(TRUE, TRUE), any(FALSE, FALSE))

[1] FALSE
But in a tidyverse/dplyr environment, it returns wrong answers.
Consider this example. I have data I have joined together with pairs of

columns representing a first generation and several other pairsrepresenting

additional generations. I want to consider any pair where at least one of

the pair is not NA as a success. But in order to keep the entire row,I want

all three pairs to have some valid data. This seems like a fairly common
reasonable thing often needed when evaluating data.
So to make it very general, I chose to do something a bit like this:

We can't really help you without a reproducible example. It's notenough to show us something that doesn't run but is a bit like the realcode.


Duncan Murdoch

result <- filter(mydata,
                  all(
                    any(!is.na(first.a), !is.na(first.b)),
                    any(!is.na(second.a), !is.na(second.b)),
                    any(!is.na(third.a), !is.na(third.b))))
I apologize if the formatting is not seen properly. The above logically
should work. And it should be extendable to scenarios where you want at

least one of M columns to contain data as a group with N such groupsof any

size.

But since it did not work, I tried a plan that did work and feelssilly. I

used mutate() to make new columns such as:
result <-
   mydata |>
   mutate(
     usable.1 = (!is.na(first.a) | !is.na(first.b)),
     usable.2 = (!is.na(second.a) | !is.na(second.b)),
     usable.3 = (!is.na(third.a) | !is.na(third.b)),
     usable = (usable.1 & usable.2 & usable.3)
   ) |>
   filter(usable == TRUE)
The above wastes time and effort making new columns so I can check the
calculations then uses the combined columns to make a Boolean that can be
used to filter the result.

I know this is not the place to discuss dplyr. I want to check firstif I amdoing anything wrong in how I use any/all. One guess is that thegeneric is

messed with by dplyr or other packages I libraried.

And, of course, some aspects of delayed evaluation can interfere insubtle

ways.
I note I have had other problems with these base R functions before and

generally solved them by not using them, as shown above. I would muchrather

use them, or something similar.
Avi

    [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help

PLEASE do read the posting guidehttp://www.R-project.org/posting-guide.html

and provide commented, minimal, self-contained, reproducible code.


______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help

PLEASE do read the posting guidehttp://www.R-project.org/posting-guide.html

and provide commented, minimal, self-contained, reproducible code.


______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] any and all

Reply via email to