How to Access Datasets in R

By Datazar

(This article was first published on R Language in Datazar Blog on Medium, and kindly contributed to R-bloggers)

Have you spent hours, pulling your hair out trying to figure out how to access datasets in R? Once imported to a variable, columns from a dataset (eg: CSV) can be very tricky to access. Sometimes columns contain spaces, funky characters or other incosistencies. Here are some examples on how to access the data from CSV and JSON datasets.


We’ll use this CSV dataset since it has *very* long column names.
  1. Read File: csvDataset

2. Access Columns: csvDataset$Total.carbon.emissions.from.fossil.fuel.consumption.and.cement.production..million.metric.tons.of.C.

As you can see here, the blank spaces AND parantheses got replaced by a period . .This is actually pretty useful because you can just use the period for other special characters, helping you get to variables faster.

You can always head(csvDataset) to look at the column names (and first few rows) to get the actual column names used to call them.


We’ll use this simple JSON dataset from NASA showing meteorite impacts.

For JSON, we’re going to load an external library.

  1. Load rjson library: library(rjson)
  2. Read File: jsonDataset
  3. Access an Object: jsonDataset[1] (gives you the first object)

Link to datasets and example R notebook:

How to Access Datasets in R was originally published in Datazar Blog on Medium, where people are continuing the conversation by highlighting and responding to this story.

To leave a comment for the author, please follow the link and comment on their blog: R Language in Datazar Blog on Medium. offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more…

Source:: R News