Details for fitbit-load-in-R.ipynb

Published by gedankenstuecke


This notebooks demonstrates how to use the Fitbit data in R and how to make simple plots with the data.


Tags & Data Sources

activity tracking Fitbit Connection


Please log in to comment.

Last updated 1 year, 10 months ago

Analyzing Fitbit-data in R

This notebook requires you to use the Fitbit connection to get your activity data into Open Humans.

Given that the imported Fitbit data has a rather particular JSON format this notebook just gives an idea of how to extract individual data from the larger data set.

For a start let's load our required packages. To load the JSON we are using rjson, which needs to be installed with install.packages in the first step:

In [3]:
Updating HTML index of packages in '.Library'
Making 'packages.html' ... done

Attaching package: ‘rjson’

The following objects are masked from ‘package:jsonlite’:

    fromJSON, toJSON

With that out of the way we can find the Fitbit data from our Open Humans account:

In [4]:
access_token <- Sys.getenv("OH_ACCESS_TOKEN")
url <- paste("",access_token,sep="")
resp <- GET(url)
user <- content(resp, "parsed")

for (data_source in user$data){
    if (data_source$source == "direct-sharing-102"){
         fitbit_url <- data_source$download_url

temp <- tempfile()
json_data <- fromJSON(file=temp)

Now that we have our fitbit data stored in json_data we can start to extract individual data types from it.

Let's focus on steps for now. We create two empty vectors and from there we first iterate over the yearly_entry in json_data$'tracker-steps'. Each yearly_entry has one main list, which is activities-tracker-steps. Which is why we now can iterate over yearly_entry$'activities-tracker-steps' and get the dateTime and value for each of these entry.

This is not how you should do things in R, but I couldn't find out a better way to extract that data. If you do: Please be in touch at!

In [5]:
dates <- c()
steps <- c()

for (yearly_entry in json_data$'tracker-steps'){
    for (entry in yearly_entry$'activities-tracker-steps'){
    dates <- append(dates,entry$dateTime)
    steps <- append(steps,entry$value)

Once that's done we can merge the dates and steps into a single data frame and convert the data types:

In [6]:
df <- data.frame('date' = dates, 'steps' = steps)
df$steps <- as.numeric(as.character(df$steps))
df$date <- as.Date(as.character(df$date))
df$year <- as.character(format(df$date,"%Y"))

Now we can start our first plot: All of our daily steps

In [7]:
ggplot(subset(df,df$steps != 0),aes(date,steps,color=year)) + geom_point() + theme_minimal()

Let's now also compare the daily step counts between the years. For this we generate fake_dates, which contain only the real day and month but are all normalized onto the same year. With that we can now plot everything onto a single x-axis by using the fake_date for it and still keep the year, using the variable of the same name:

In [8]:
df$date_wo_year <- as.character(format(as.Date(df$date),"%m-%d"))
df$fake_date <- as.Date(paste('2001',df$date_wo_year,sep='-'))

ggplot(subset(df,df$steps != 0),aes(fake_date,steps,color=year)) + geom_line() + stat_smooth() + facet_grid(year ~ .) + theme_minimal()
`geom_smooth()` using method = 'loess'

This should give you an idea of how to read the Fitbit data. To see all the different data types that are in the fitbit data you can use the following:

In [9]:
  1. 'activities-overview'
  2. 'tracker-minutes-very-active'
  3. 'minutes-to-sleep'
  4. 'awake-minutes'
  5. 'sleep-efficiency'
  6. 'sleep-awakenings'
  7. 'heart'
  8. 'tracker-minutes-fairly-active'
  9. 'sleep-minutes-after-wakeup'
  10. 'tracker-minutes-lightly-active'
  11. 'sleep-minutes'
  12. 'tracker-steps'
  13. 'weight'
  14. 'tracker-activity-calories'
  15. 'tracker-elevation'
  16. 'tracker-calories'
  17. 'tracker-minutes-sedentary'
  18. 'tracker-distance'
  19. 'time-in-bed'
  20. 'weight-log'
  21. 'tracker-floors'
  22. 'profile'
  23. 'sleep-start-time'

And with that happy exploring!

In [ ]: