Data Analysis Deep Dive
- Course Code GK821567
- Duration 2 days
Course Delivery
Jump to:
Course Delivery
This course is available in the following formats:
-
Company Event
Event at company
-
Public Classroom
Traditional Classroom Learning
-
Virtual Learning
Learning that is virtual
Request this course in a different delivery format.
Course Overview
TopData Analysis is a process of applying statistical and mathematical techniques systematically to understand, explore, and analyze data to find patterns, and draw inferences that help businesses make data-driven decisions. This typically involves multiple activities such as data collection, exploration, cleaning, pre-processing, and organizing data. Many times, data analysis is an iterative ongoing process where the data is continuously collected and analyzed simultaneously. There are two primary methods for data analysis.
Qualitative techniques and quantitative techniques. Quantitative data analysis techniques involve working with quantitative/numerical data including statistics, percentages, and calculations. These techniques also include working with algorithms, mathematical analysis tools, and software to manipulate data and uncover hidden business value. For example, quantitative data analysis used to assess market data helps a company decide a price for its new product.
Qualitative data analysis involves, working with non-numerical data i.e categorical variables. Qualitative data analysis is also used in many business processes, such as identifying themes and patterns, answering research questions, etc to improve a product.
This course provides an overview of data concepts and what data analysis is and then deep dives into the fundamentals of Data Analysis such as statistics and probability. This course also focuses on widely used data analysis methods such as regression along with detailed steps to perform the same.
*Must have Microsoft Excel in order to complete class activities.
Virtual Learning
This interactive training can be taken from any location, your office or home and is delivered by a trainer. This training does not have any delegates in the class with the instructor, since all delegates are virtually connected. Virtual delegates do not travel to this course, Global Knowledge will send you all the information needed before the start of the course and you can test the logins.
Course Schedule
TopTarget Audience
TopCourse Objectives
Top- Data Analysis process, benefits and use cases
- Basic of Probability and Statistics
- Measure of data spread and distributions
- Inferential Statistics and Hypothesis Testing
- Applications of Statistics and Probability theory
- Forecast trends using linear regression analysis
Course Content
Top- 1. All about Data
Data in the real world - A brief on various formats and sources of data
- 7 V's of Data
- Structured vs Unstructured vs Semi-Structured data
- Data processing types
Introduction to Data Analysis
- Need for Data Analysis
- Applications and Use Cases of Data Analysis
- Data Analysis Methodology
- Types of variables
- Numerical vs Categorial Variables
Descriptive Statistics
- Measures of Central Tendency
- Measures of Dispersion
- Data Skewness and Kurtosis
- Understanding Outliers
- Understanding missing values
- Role of Descriptive Statistics in Data Analysis
Inferential Statistics
- Population and Sample
- Statistics vs Parameters
Introduction to Probability
- Basics of Probability
- Axioms of Probability
- Conditional Probability and Bayes theorem
- 2. Applications of Conditional Probability
Understanding Probability Distributions - Discrete Probability Distributions
- Continuous Probability Distributions
- Performing Distributions in Excel
- Why understanding Data Distributions is important for Data Analysis
Data Analysis Process
- Understanding Covariance and Correlation
- Understanding univariate vs Bivariate vs Multi variate data analysis
- Understanding Regression
- Simple Linear Regression
- Multiple Linear Regression
- Exercise
Introduction to Predictive Analytics
- Exercise
Course Prerequisites
Top- Some familiarity with data terminologies
- MS Excel for Exercises