Tidyverse
Материал из Letopisi.Ru — «Время вернуться домой»
Пакет языка R
- Hadley Wickham and Garrett Grolemund - R for Data Science
Import, Tidy, Transform, Visualize, and Model Data
- install.packages("tidyverse")
- library(tidyverse)
Содержание |
Import
Tidy
Transform
dplyr Basics
- Pick observations by their values
- (filter()). - filter(flights, month == 1, day == 1)
- Reorder the rows
- (arrange()) - arrange(flights, desc(arr_delay))
- Pickvariables by their names
- (select()). select(flights, year, month, day) select(flights, -(year:day))
- Create new variables with functions of existing variables
- (mutate()). - mutate() always adds new columns at the end of your dataset
- Collapse many values down to a single summary
- (summarize()).
Filter
Arrange
Select
select(flights, -(year:day))
Mutate
Summarize
sum(!is.na(x))
not_cancelled %>% group_by(dest) %>% summarize(carriers = n_distinct(carrier)) %>% arrange(desc(carriers))
not_cancelled %>% count(dest)
Visualize
ggplot(data = mpg) + geom_point(mapping = aes(x = displ, y = hwy))
ggplot(data = mpg) + geom_point(mapping = aes(x = displ, y = hwy, color = class))
ggplot(data = mpg) + geom_point(mapping = aes(x = displ, y = hwy, size = class))
Facets
ggplot(data = mpg) + geom_point(mapping = aes(x = displ, y = hwy)) + facet_wrap(~ class, nrow = 2)
Statistical Transformations
ggplot(data = diamonds) + geom_bar(mapping = aes(x = cut))
ggplot(data = diamonds) + stat_count(mapping = aes(x = cut))
Coordinate Systems
- coord_flip() switches the x- and y-axes.
ggplot(data = mpg, mapping = aes(x = class, y = hwy)) + geom_boxplot() + coord_flip()