Very rarely in a data science project is data easily available as part of a package. It's more typical for the data to be in a file, a database, or extracted from a document such as web pages, tweets, or PDFs. In these cases, the first step is to import the data into R and tidy the data, using the tidyverse package. This usually involves several, often complicated, steps to convert data from its raw form to the tidy form that greatly facilitates the rest of the analysis. We refer to this process as data wrangling.
In this Data Science - Wrangling Certificate, which is part of Data Science Professional Certificate from EdX in partnership with Harvard University - HarvardX, we will cover several common steps of the data wrangling process including importing data into R from files, tidying data, string processing, html parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but data scientist will likely face them all at some point.
Always verify the dates on the programme website programme website .
Length: 4 weeks Effort: 2 to 4 hours per week
This programme requires students to demonstrate proficiency in English.
Check the programme website for information about funding options.
StudyPortals Tip: Students can search online for independent or external scholarships that can help fund their studies. Check the scholarships to see whether you are eligible to apply. Many scholarships are either merit-based or needs-based.
Together with the ISIC Association and British Council IELTS, StudyPortals offers you the chance to receive up to £10000 to expand your horizon and study abroad. We want to ultimately encourage you to study abroad in order to experience and explore new countries, cultures and languages.