G18 Data Workshop

Workshop: “Doing the most with your data: Processing, products, metadata and web applications"

Organized by Corinna Gries & Hilary Dugan

Objectives

  • Use R to download data from the LTER data repository, the CUAHSI data center, and the GLEON DataONE member node.

  • Learn the basics of Metadata

  • Use the R package EML do develop metadata formatted in the Ecological Metadata Language

  • Upload data and metadata to the GLEON DataONE member node (demonstration only, due to current log-in changes in DataONE)

  • Aggregate data the easy way, using dplyr and tidyr

  • Become familiar with R package Lake Metabolizer. Contains a variety of methods for calculating lake metabolism from buoy data.

  • Build a Shiny App. Shiny is a web application framework for R, which you can use to turn your R code an interactive application.

Tasks to be completed prior to arriving at G18

1. Download and install R.

This is achieved using a CRAN mirror, such as: http://cran.stat.sfu.ca/

-Or-

Update R, if you are using an R version < 3.2.4

 

2. Download and install RStudio

We will be using RStudio for all examples/demonstrations. Please have the RStudio application on your laptop.

https://www.rstudio.com/products/rstudio/download/

3. Install R packages

The following packages should be installed on your computer. To install packages in R, use the following code:

install.packages(‘shiny’)

Please install:

  • LakeMetabolizer

  • rLakeAnalyzer

  • shiny

  • dataone

  • devtools

  • rmarkdown (this will install several other packages)

  • tidyr

  • dplyr

  • lubridate

  • readr

  • WaterML

Some packages are not yet available on CRAN. You can install packages directly from github using the devtools package. We will be using the EML package (this is installed from github, which means that you’ll need to install devtools first and then issue the command:

devtools::install_github("ropensci/EML")

Instructions are at: https://github.com/ropensci/EML

4. Download Data for the Workshop

Sparkling Lake, Wisconsin, 1-minute buoy data. We will be working with these during the workshop. 

Sparkling Lake, Wisconsin, hourly buoy data. These are for reference. We will be making these during the workshop.

Shiny template files. Please download. 

5. Workshop material - Coding vignettes, presentation

  • Vignette: Aggregating buoy data from minute to hourly (pdf)
  • Vignette: R code for metabolism estimates (pdf)
  • Vignette: R code for creating EML metadata (pdf)
  • Presentation: Metadata presentation (pdf)
  • Material: Create EML (zip)
  • Material: Data Download example scripts (zip)
  • Shiny code available on github

    Additional Resources

     

    Schedule, July 4th, 2016

    7:00-8:30

    Breakfast

    8:30-9:00

    Introduction to GSA (led by GSA co-chairs)

    9:00-11:00

    Download data from a repository

    Prepare data for analysis

    Run Analysis using Lake Metabolizer

    11:00-11:30

    Break

    11:30-13:00

    Prepare EML metadata for harmonized data product

    (upload data back to repository - demonstration only due to log in credentials needed)

    13:00-14:00

    Lunch!

    14:00-15:30

    How to build a shiny app to view buoy data.

    (Recommended for intermediate and advanced R users)

    Example: http://poseidon.limnology.wisc.edu:3838/metviewer/

    15:30-16:00

    Wrap-up, evaluations

     16:30-17:30

    Sport activities with students (soccer, frisbee, walking/hiking)

     18:00-19:30

    GLEON/NETLAKE Welcome and mixer

    (Welcome Paul/Kathie/Eleanor/Bas) (CCC to introduce an ice breaker Liz, Ian and Steven).

      19:30

    GLEON/NETLAKE Dinner

     

    Theme by Danetsoft and Danang Probo Sayekti inspired by Maksimer