Data Visualization

Code for Quiz 9.

  1. Load the R packages we will use.
  1. Quiz Questions

Question: e_charts-1

Create a bar chart that shows the average hours Americans spend on five activities by year. Use the ‘timeline’ argument to create an animation that will animate through the years.

spend_time  <- read_csv("https://estanny.com/static/week8/spend_time.csv")

e_charts-1

Start with spend_time

spend_time  %>%
  group_by(year)  %>%
  e_charts(x = activity, timeline = TRUE)  %>%
  e_timeline_opts(autoPlay = TRUE) %>%
  e_bar(serie = avg_hours) %>%
  e_title(text = 'Average hours Americans spend per day on activity') %>%
  e_legend(show = FALSE)

Question: echarts-2

Create a line chart for the activities that American spend time on.

Start with spend_time

spend_time %>%
  mutate(year = paste(year, "12", "31", sep = "-")) %>%
  mutate(year = lubridate::ymd(year)) %>%
  group_by(activity) %>%
  e_charts(x = year) %>%
  e_line(serie = avg_hours) %>%
  e_tooltip() %>%
  e_title(text = 'Average hours Americans spend per day on each activity') %>%
  e_legend(top = 40)

Question: Modify slide 82

ggplot(spend_time, aes(x = year, y = avg_hours, color - activity,)) +
geom_point() +
geom_mark_ellipse(aes(filter = activity == "leisure/sports",
description = "Americans spend the most time on leisure/sport"))

Question: Tidyquant

Retrieve stock price for Microsoft, ticker: MSFT, using tq_get - From 2019-08-01 to 2020-07-28 - Assign output to df

df  <- tq_get("MSFT", get = "stock.prices",
              from = "2019-08-01", to = "2020-07-28")

Create a plot with the df data

ggplot(df, aes(x = date, y = close)) +
  geom_line() +
  geom_mark_ellipse(aes(filter = date == "2020-02-03", description = "Woman in Santa Clara County confirmed to be ninth case in the U.S."), fill = "yellow",) +
  geom_mark_ellipse(aes(filter = date == "2020-11-10", description = "U.S. passed over 10 million COVID-19 cases"), color = "red", ) +
  labs(
    title = "Microsoft",
    x = NULL,
    y = "Closing price per share",
    caption = "Source: https://en.wikipedia.org/wiki/Timeline_of_the_COVID-19_pandemic_in_the_United_States"
  )

Save the previous plot to preview.png and add to the yaml chunk at the top

ggsave(filename = "preview.png",
       path = here::here("_posts", "2022-04-02-data-visualization"))