Elements of Data Science

GBCC DATA210G ⚫️ Spring 2016

Introduction

This course is the foundation for introducing students to key topics in data science, including data acquisition/preparation and exploratory data analysis. Major topics include:

  • an introduction to the R programming language and RStudio integrated development environment
  • working with modern data formats such as XML, CSV, JSON, XLS & XHTML
  • data import/export from/to files, APIs, web sites & databases
  • finding data to augment analyses, and
  • exploratory data analysis & visualization.

Instructor Contact

Course Topics:

  • Programming in the R Programming Language and using the RStudio IDE
  • Organizing data analysis projects
  • Understanding the data analysis pipeline
  • Cleaning data sets and working with “tidy” data
  • Working with discrete & continuous data
  • Working with modern data formats (e.g. CSV, XML, etc)
  • Accessing data via modern APIs
  • Accessing data in databases
  • Performing exploratory data analysis
  • Visualizing data
  • Communicating analysis results

DATA210G Cloud-based R/RStudio

Regardless of whether you have R & RStudio installed locally, you can use the cloud instance provided by Great Bay. This will be the environment used in class.