Data resources : Places where you can download data

UN : http://data.un.org/
US : http://www.data.gov/
http://www.gapminder.org/data/
http://www.asdfree.com/ – with R codes
Kaggle : https://www.kaggle.com/datasets
rOpenSci : https://ropensci.org/

Credit to : https://www.coursera.org/learn/data-cleaning/home/info

AND:

http://data.princeton.edu/wws509/datasets

Removing duplicates in R using ‘dplyr’ and ‘data.table’

In this post, I will show how to remove duplicates of observations in a data frame.

 

Package ‘tibble’ in R

What is ‘tibble’ package?

According to Hadley Wickham “Tibbles are a modern reimagining of the data.frame, keeping what time has proven to be effective, and throwing out what is not.

The name comes from dplyr: originally you created these objects with tbl_df(), which was most easily pronounced as “tibble diff”. “

Find its similarities and dissimilarities with data.frame. More info here : tibble

mapply in R – an example

mapply() looks like an interesting function in R. here an example of what you can do with mapply() function

The results are :