R: two scatterplots on single graph using ggplot
Please note I am beginner with R. I have merged two data frames with one common column with merge() method. I have obtained data frame like:
x y1 y2
1 3 5
2 2 4
1 2 2
3 5 5
...
etc. i would like to plot such data frame with ggplot. What I have created (using documention of geom_point is
ggplot(data = dat_c, aes(games, variance.x)) +
ge开发者_如何转开发om_point(aes(x = games, y = variance.x), legend= TRUE, xlab="X", ylab="Y", colour=alpha('red', 0.05)) +
geom_point(aes(x = games, y = variance.y), legend = TRUE, colour=alpha('blue', 0.05) )
It works, NaNs do not disturb me because I get warning that they are ignored, which is fine. However I have two problems and I am not sure how to fix them:
- my actual plot is located at the bottom-left corner, I would like to set max values for X and Y axis (in a dynamic manner, for example with highest value from data + 100 or something like this)
- the legend is not displayed
- the axis are not described
Here is how it looks hlike:
See also:
- Plot multiple variables on y-axis using ggplot
- R - creating legend for three data sets on same graph using ggplot
- How to manually add a legend to a ggplot object
- ggplot and R: Two variables over time
(these are the results of searching [r] ggplot melt
, although you might also have gotten there via [r] ggplot legend
...)
If you can, get a copy of the ggplot
book and read it from the beginning -- unfortunately the PDF of the draft is no longer available online, but the book is worth the investment.
You actually have some points with
x
andy
values near the extremes of your plot. It's just hard to see them because they're nearly transparent (it will be a little easier to see them on a white background, i.e. try adding+theme_bw()
to yourggplot
call). You can usexlim
andylim
if you want to restrict the range of the plot. (Trysummary
on your data and check out the Max values ...)the best way to get the axes drawn is to follow the
ggplot
idiom of "melting" your data into a long-format data set with one column for the category (y1
vsy2
) and another for the value, as follows:
d <- data.frame(x=c(1,2,1,3),
y1=c(3,2,2,5),
y2=c(5,4,2,5))
library(ggplot2)
library(reshape2) ## for melt()
dm <- melt(d,id.var=1)
ggplot(data=dm,aes(x,value,colour=variable))+
geom_point(alpha=0.2)+
scale_colour_manual(values=c("red","blue"))+
labs(x="games",y="variance")
(sorry for the slightly odd formatting)
I set the alpha
value a little higher because otherwise it would have been hard to see the points in the figure. I think the default colours (reddish and blue-ish) are OK, but I used scale_colour_manual
to get them the way you specified.
- I'm not sure what you mean.
You should melt
your data into long format and then map the colour aesthetic to the variable column from the melted data.frame. Something like this:
dat <- data.frame(x = c(1,2,1,3), y1 = c(3,2,2,5), y2 = c(5,4,2,5))
dat.m <- melt(dat, id.vars = "x")
ggplot(dat.m, aes(x, value, colour = variable)) +
geom_point() +
scale_colour_manual(values = c("red", "blue"))
You can manually set the limits with xlim()
and ylim()
respectively. It's not clear what you're doing with alpha, so I'll leave that one up to you.
精彩评论