我想用group by在一个数据框上用summarise
做几次计算。输入数据:
dat <- data.frame (ID = c(1:10),
var1 = as.factor(c("A","B","A","A","B","B","B","C","A","B")),
Var2 = as.factor(c("low","medium","low","low","medium","high","high","high","high","high")))
现在我想在var1上做一个group by,计算ID,并计算var2 = high的比例。我的输出应该如下所示:
var1 total prop_high
1 A 4 0.25
2 B 5 0.60
3 C 1 1.00
到目前为止,我得到了以下代码,但我被比例计算卡住了
dat2 <- dat %>%
group_by(var1) %>%
summarise(total = n(),
prop_high = )
发布于 2020-11-11 10:00:15
您可以取逻辑值的mean
来获得比例。
library(dplyr)
dat %>%
group_by(var1) %>%
summarise(total = n(),
prop_high = mean(Var2 == 'high'))
#Same as
#prop_high = sum(Var2 == 'high')/n())
# var1 total prop_high
# <fct> <int> <dbl>
#1 A 4 0.25
#2 B 5 0.6
#3 C 1 1
https://stackoverflow.com/questions/64784082
复制相似问题