我正在寻找一种用百分比标记堆积条形图的方法,而y轴显示原始计数(使用ggplot)。这是不带标签的情节的MWE:
library(ggplot2)
df <- as.data.frame(matrix(nrow = 7, ncol= 3,
data = c("ID1", "ID2", "ID3", "ID4", "ID5", "ID6", "ID7",
"north", "north", "north", "north", "south", "south", "south",
"A", "B", "B", "C", "A", "A", "C"),
byrow = FALSE))
colnames(df) <- c("ID", "region", "species")
p <- ggplot(df, aes(x = region, fill = species))
p + geom_bar()
我的桌子要大得多,R很好地统计了每个地区的不同物种。现在,我想同时显示原始计数值(最好在y轴上)和百分比(作为标签),以比较区域之间物种的比例。
我尝试了许多方法,geom_text()
但是我认为与其他问题(例如,这个问题)的主要区别在于
任何帮助深表感谢!!
如@Gregor所述,分别汇总数据,然后将数据摘要输入ggplot。在下面的代码中,我们用于动态dplyr
创建摘要:
library(dplyr)
ggplot(df %>% count(region, species) %>% # Group by region and species, then count number in each group
mutate(pct=n/sum(n), # Calculate percent within each region
ypos = cumsum(n) - 0.5*n), # Calculate label positions
aes(region, n, fill=species)) +
geom_bar(stat="identity") +
geom_text(aes(label=paste0(sprintf("%1.1f", pct*100),"%"), y=ypos))
更新:使用dplyr
0.5及更高版本时,您不再需要提供y值以使文本在每个小节内居中。相反,您可以使用position_stack(vjust=0.5)
:
ggplot(df %>% count(region, species) %>% # Group by region and species, then count number in each group
mutate(pct=n/sum(n)), # Calculate percent within each region
aes(region, n, fill=species)) +
geom_bar(stat="identity") +
geom_text(aes(label=paste0(sprintf("%1.1f", pct*100),"%")),
position=position_stack(vjust=0.5))
本文收集自互联网,转载请注明来源。
如有侵权,请联系 [email protected] 删除。
我来说两句