Skip to Main Content
It looks like you're using Internet Explorer 11 or older. This website works best with modern browsers such as the latest versions of Chrome, Firefox, Safari, and Edge. If you continue with this browser, you may see unexpected results.
Florida Tech Evans Library Logo

Data Cleaning

An introductory guide to data cleaning concepts, tools, and methods.

Learn Data Cleaning with R

There are many options available online for learning data cleaning with R. Here are a few of our favorites. 

Learning the Basics:

An introduction to Cleaning Data in R - Data Camp Blog Post

  • This blog post features videos and transcripts which cover a basic introduction to dirty data and the main concepts of data cleaning. 

An Introduction to Data Cleaning With R - Discussion Paper

  • A more comprehensive learning resource, this discussion paper offers a detailed background on data cleaning and how it fits into the research data mining and analysis process. The paper also contains coding examples as well as exercises for learners to test their skills. 

 

Reformatting Data: 

Tidy Data 

  • This guide offers in-depth examples of the common data cleaning issues outlined in Hadley Wickham's "Tidy Data" article, including the code used to resolve those issues. This article is code-heavy and will best serve those who are already familiar with R. 

 

Merging Similar Strings: 

Intro to Refinr

  • A walkthrough offering coding examples of the functions that make up the Refinr package. 

OpenRefine Tutorial:

OpenRefine for Ecologists 

  • A guided tutorial offering a breakdown of various key tools and functions found in OpenRefine.