Elements of Data Science
GBCC DATA210G ⚫️ Spring 2016
Introduction
This course is the foundation for introducing students to key topics in data science, including data acquisition/preparation and exploratory data analysis. Major topics include:
- an introduction to the R programming language and RStudio integrated development environment
- working with modern data formats such as XML, CSV, JSON, XLS & XHTML
- data import/export from/to files, APIs, web sites & databases
- finding data to augment analyses, and
- exploratory data analysis & visualization.
Instructor Contact
Course Topics:
- Programming in the R Programming Language and using the RStudio IDE
- Organizing data analysis projects
- Understanding the data analysis pipeline
- Cleaning data sets and working with “tidy” data
- Working with discrete & continuous data
- Working with modern data formats (e.g. CSV, XML, etc)
- Accessing data via modern APIs
- Accessing data in databases
- Performing exploratory data analysis
- Visualizing data
- Communicating analysis results
DATA210G Cloud-based R/RStudio
Regardless of whether you have R & RStudio installed locally, you can use the cloud instance provided by Great Bay. This will be the environment used in class.