Data Science with R
This top selling popular course will deeply introduce you to the modern R for Data Science. Technologies such as RStudio and the Tidyverse will be the core tools to run data manipulation, visualization, modelling and to export results in beautiful and clear reports. A practical course, rich with contents, an ideal basis to start you professional career in Data Science.
Topics include
- An overview of the Data Science toolbox
- Introduction to R and RStudio
- Why Data Science with R
- Advantages of an interpreted language
- Data Objects for Continuous and Categorical Variables
- Data Objects for Scalar, Vector and Matrices
- Tables, Data Frames and Tibbles
- Functions to wrap complex calculations
- Data formats
- Data Import
- Missing data handling
- Data Manipulation with Window functions
- Relational Data Manipulation
- Tidy datasets
- Reproducible analysis with pipelines
- Data Discovery
- Data Visualization: box plots, density plots, histograms, bar charts, scatter and line plots
- Introduction to the Grammar of Graphics
- Introduction to Statistical Models
- Iterative investigation method
- Presenting results with RMarkdown Reports
What you will be able to do
- Use data from several data sources
- Tidy your dataset
- Discover relations among your data
- Create visualizations
- Fit a model for your data
- Deliver insights and results with a clear report or presentation
Duration
2 days.
Pre requisites
None.
Audience
This course is a fundamental for every business area. Different example datasets can be used according to industry type, for a better understanding and faster use of the concepts.