Data Mining and Analysis Training Course
Objective:
Delegates will be able to analyse big data sets, extract patterns, and choose the variables that affect the results, so that they can build a new model that produces predictive results.
Course Outline
Data preprocessing
- Data Cleaning
- Data integration and transformation
- Data reduction
- Discretization and concept hierarchy generation
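As an illustration of the transformation and discretization topics above, here is a minimal Python sketch. It assumes min-max scaling and equal-width binning as the techniques; the function names and the sample ages are hypothetical and not taken from the course material.

```python
def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: map everything to 0.0 to avoid division by zero.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def equal_width_bins(values, n_bins):
    """Assign each value to one of n_bins equal-width intervals (0-based index)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum value would fall just past the last bin, so clamp it.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


ages = [18, 22, 35, 41, 58, 60]  # hypothetical sample data
scaled = min_max_scale(ages)
bins = equal_width_bins(ages, 3)
```

Equal-width binning is only one possible discretization; equal-frequency bins or bins chosen from a concept hierarchy may suit skewed data better.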
Statistical inference
- Probability distributions, Random variables, Central limit theorem
- Sampling
- Confidence intervals
- Statistical Inference
- Hypothesis testing
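The confidence interval and hypothesis testing topics above can be sketched in a few lines of Python. This is a simplified example that assumes a normal approximation (a t-distribution would usually be preferred for a sample this small); the function names and measurements are hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev


def mean_confidence_interval(sample, confidence=0.95):
    """Normal-approximation confidence interval for the population mean."""
    n = len(sample)
    centre = mean(sample)
    std_err = stdev(sample) / math.sqrt(n)  # sample standard deviation / sqrt(n)
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    return centre - z * std_err, centre + z * std_err


def z_test_p_value(sample, mu0):
    """Two-sided p-value for H0: population mean equals mu0 (large-sample z-test)."""
    n = len(sample)
    z = (mean(sample) - mu0) / (stdev(sample) / math.sqrt(n))
    return 2 * (1 - NormalDist().cdf(abs(z)))


sample = [4.9, 5.1, 5.0, 5.2, 4.8, 5.0, 5.1, 4.9, 5.0, 5.0]  # hypothetical measurements
low, high = mean_confidence_interval(sample)
p_same = z_test_p_value(sample, 5.0)  # H0 is consistent with the data
p_far = z_test_p_value(sample, 6.0)   # H0 is far from the data
```

A small p-value (as for `mu0 = 6.0` here) is evidence against the null hypothesis; a large one only means the data do not contradict it.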
Multivariate linear regression
- Specification
- Subset selection
- Estimation
- Validation
- Prediction
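To make the estimation and prediction steps above concrete, here is a minimal sketch of ordinary least squares using the normal equations. It assumes the predictors are not collinear (otherwise the system is singular), and all names and data are hypothetical; in practice a numerical library would normally be used instead.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Build an augmented matrix so row operations apply to both sides.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in reversed(range(n)):
        rest = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - rest) / m[i][i]
    return x


def fit_ols(rows, targets):
    """Estimate [intercept, b1, b2, ...] by solving (X'X) b = X'y."""
    design = [[1.0] + list(r) for r in rows]  # prepend the intercept column
    p = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(p)] for i in range(p)]
    xty = [sum(d[i] * y for d, y in zip(design, targets)) for i in range(p)]
    return solve_linear_system(xtx, xty)


def predict(coefs, row):
    """Predict the response for one row of predictor values."""
    return coefs[0] + sum(b * x for b, x in zip(coefs[1:], row))


# Hypothetical noise-free data generated from y = 1 + 2*x1 + 3*x2.
rows = [(0, 0), (1, 0), (0, 1), (1, 1), (2, 1), (1, 2)]
targets = [1 + 2 * x1 + 3 * x2 for x1, x2 in rows]
coefs = fit_ols(rows, targets)
```

Because the data are noise-free, the fitted coefficients should recover the generating values up to rounding; with real data, validation on held-out observations is what indicates whether the specification is adequate.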
Classification methods
- Logistic regression
- Linear discriminant analysis
- K-nearest neighbours
- Naive Bayes
- Comparison of Classification methods
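Of the classification methods listed above, k-nearest neighbours is the simplest to sketch. The following Python example assumes Euclidean distance and a plain majority vote; the function name, points and labels are hypothetical.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Classify query by majority vote among its k nearest training points."""
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


# Two hypothetical, well-separated groups of points.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
labels = ["a", "a", "a", "b", "b", "b"]

near_a = knn_predict(points, labels, (1.1, 1.0))
near_b = knn_predict(points, labels, (5.1, 5.0))
```

The choice of `k` trades bias against variance: a small `k` follows the training data closely, while a large `k` gives smoother decision boundaries.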
Neural Networks
- Fitting neural networks
- Issues in training neural networks
Decision trees
- Regression trees
- Classification trees
- Trees Versus Linear Models
Bagging, Random Forests, Boosting
- Bagging
- Random Forests
- Boosting
Support Vector Machines and Flexible Discriminants
- Maximal Margin classifier
- Support vector classifiers
- Support vector machines
- SVMs with two or more classes
- Relationship to logistic regression
Principal Components Analysis
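As a rough illustration of principal components analysis, the sketch below finds the first principal component of two-dimensional data by power iteration on the sample covariance matrix. It assumes the data are not all identical and that the starting vector is not orthogonal to the leading direction; the function name and data are hypothetical.

```python
import math


def first_principal_component(points, iterations=100):
    """Return the unit direction of greatest variance for 2-D points."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    centred = [(x - mean_x, y - mean_y) for x, y in points]
    # Entries of the 2x2 sample covariance matrix.
    sxx = sum(x * x for x, _ in centred) / (n - 1)
    syy = sum(y * y for _, y in centred) / (n - 1)
    sxy = sum(x * y for x, y in centred) / (n - 1)
    v = (1.0, 0.5)  # arbitrary starting vector
    for _ in range(iterations):
        # Multiply by the covariance matrix, then renormalise to unit length.
        w = (sxx * v[0] + sxy * v[1], sxy * v[0] + syy * v[1])
        norm = math.hypot(w[0], w[1])
        v = (w[0] / norm, w[1] / norm)
    return v


points = [(1, 1), (2, 2), (3, 3), (4, 4), (5, 5)]  # hypothetical data on the line y = x
direction = first_principal_component(points)
```

For data lying on the line y = x, the returned direction points along that line, with both components equal.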
Clustering
- K-means clustering
- K-medoids clustering
- Hierarchical clustering
- Density based clustering
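K-means clustering, the first method listed above, can be sketched as follows. This example uses Lloyd's algorithm with fixed starting centroids so that the result is reproducible; practical implementations usually choose starting centroids at random and run several restarts. All names and data are hypothetical.

```python
import math


def k_means(points, centroids, iterations=10):
    """Lloyd's algorithm: alternate assignment and centroid-update steps."""
    centroids = [tuple(c) for c in centroids]
    assignment = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        assignment = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centroids, assignment


points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
centroids, assignment = k_means(points, centroids=[(0.0, 0.0), (10.0, 10.0)])
```

K-medoids follows the same alternating pattern but restricts each cluster centre to be one of the observed points, which makes it less sensitive to outliers.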
Model Assessment and Selection
- Bias, Variance and Model complexity
- In-sample prediction error
- The Bayesian approach
- Cross-validation
- Bootstrap methods
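The cross-validation and bootstrap topics above can be illustrated with a short Python sketch. It assumes interleaved (unshuffled) folds and a fixed random seed for reproducibility; the function names and observations are hypothetical.

```python
import random
from statistics import mean


def k_fold_indices(n, k):
    """Split indices 0..n-1 into k folds; yield (train, test) index lists."""
    folds = [list(range(i, n, k)) for i in range(k)]
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


def bootstrap_std_error(sample, statistic, n_resamples=1000, seed=0):
    """Estimate the standard error of a statistic by resampling with replacement."""
    rng = random.Random(seed)
    estimates = [
        statistic(rng.choices(sample, k=len(sample))) for _ in range(n_resamples)
    ]
    centre = mean(estimates)
    return (sum((e - centre) ** 2 for e in estimates) / (n_resamples - 1)) ** 0.5


splits = list(k_fold_indices(10, 5))
data = [2.1, 2.5, 1.9, 3.0, 2.7, 2.2, 2.8, 2.4]  # hypothetical observations
se = bootstrap_std_error(data, mean)
```

In each split the model would be fitted on the `train` indices and scored on the `test` indices; averaging the scores over all folds gives the cross-validated estimate of prediction error.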
Open Training Courses require 5+ participants.
Testimonials (5)
I benefited from the guidance, the sharing of real-life examples, and the answering of all questions.
Marta Melloch - Amazon Development Center Poland Sp. z o.o.
Course - Data Mining and Analysis
I really enjoyed the all the best.
Halil Polat - Amazon Development Center Poland Sp. z o.o.
Course - Data Mining and Analysis
The information given was interesting, and the best part was towards the end, when we were provided with data from Durex and worked on data we are familiar with, performing operations to get results.
Jessica Chaar
Course - Data Mining and Analysis
The hands-on exercise and the trainer's capacity to explain complex topics in simple terms.
Youssef Chamoun
Course - Data Mining and Analysis
I like the exercises done.
Nour Assaf
Course - Data Mining and Analysis
Related Courses
Algorithmic Trading with Python and R
14 Hours
This instructor-led, live training in Norway (online or onsite) is aimed at business analysts who wish to automate trades using algorithmic trading with Python and R.
By the end of this training, participants will be able to:
- Employ algorithms to buy and sell securities at specialized increments rapidly.
- Reduce costs associated with trade using algorithmic trading.
- Automatically monitor stock prices and place trades.
Programming with Big Data in R
21 Hours
Big Data is a term that refers to solutions destined for storing and processing large data sets. Developed by Google initially, these Big Data solutions have evolved and inspired other similar projects, many of which are available as open-source. R is a popular programming language in the financial industry.
Introductory R (Basic to Intermediate)
14 Hours
This instructor-led, live training in Norway (online or onsite) is aimed at beginner-level data analysts who wish to use R programming to manipulate data, perform basic data analysis, and create compelling visualizations for insights.
By the end of this training, participants will be able to:
- Understand the basics of R Programming.
- Apply fundamental data science processes.
- Create visual representations of data.
R Fundamentals
21 Hours
R is a free, open-source programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has also found followers among statisticians, engineers and scientists without computer programming skills who find it easy to use. Its popularity is due to the increasing use of data mining for various goals such as setting ad prices, finding new drugs more quickly or fine-tuning financial models. R has a wide variety of packages for data mining.
Cluster Analysis with R and SAS
14 Hours
This instructor-led, live training in Norway (online or onsite) is aimed at data analysts who wish to program with R in SAS for cluster analysis.
By the end of this training, participants will be able to:
- Use cluster analysis for data mining.
- Master R syntax for clustering solutions.
- Implement hierarchical and non-hierarchical clustering.
- Make data-driven decisions to help improve business operations.
Data and Analytics - from the ground up
42 Hours
Data analytics is a crucial tool in business today. We will focus throughout on developing skills for practical, hands-on data analysis. The aim is to help delegates give evidence-based answers to the following questions:
What has happened?
- processing and analyzing data
- producing informative data visualizations
What will happen?
- forecasting future performance
- evaluating forecasts
What should happen?
- turning data into evidence-based business decisions
- optimizing processes
The course itself can be delivered either as a 6-day classroom course or remotely over a period of weeks if preferred. We can work with you to deliver the course to best suit your needs.
Data Analysis with Python, R, Power Query, and Power BI
21 Hours
This instructor-led, live training in Norway (online or onsite) is aimed at beginner-level professionals who wish to clean and analyze data, make statistical projections, and create insightful visualizations using these tools.
By the end of this training, participants will be able to:
- Understand the basics of Python, R, Power Query, and Power BI for data analysis.
- Clean and organize datasets using Python and Power Query.
- Perform statistical analysis and projections with R.
- Create professional dashboards and reports with Power BI.
- Integrate and analyze data from multiple sources effectively.
Data Analytics With R
21 Hours
R is a very popular, open-source environment for statistical computing, data analytics and graphics. This course introduces the R programming language to students. It covers language fundamentals, libraries and advanced concepts, as well as advanced data analytics and graphing with real-world data.
Audience
Developers / data analysts
Duration
3 days
Format
Lectures and Hands-on
Data Mining with R
14 Hours
R is a free, open-source programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has a wide variety of packages for data mining.
Econometrics: Eviews and Risk Simulator
21 Hours
This instructor-led, live training in Norway (online or onsite) is aimed at anyone who wishes to learn and master the fundamentals of econometric analysis and modeling.
By the end of this training, participants will be able to:
- Learn and understand the fundamentals of econometrics.
- Utilize Eviews and risk simulators.
HR Analytics for Public Organisations
14 Hours
This instructor-led, live training (online or onsite) is aimed at HR professionals who wish to use analytical methods to improve organisational performance. This course covers qualitative as well as quantitative, empirical and statistical approaches.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Statistical Analysis using SPSS
21 Hours
This instructor-led, live training in Norway (online or onsite) is aimed at beginner-level to intermediate-level professionals who wish to perform statistical analysis using SPSS to interpret data accurately, run complex statistical tests, and generate meaningful insights.
By the end of this training, participants will be able to:
- Navigate the SPSS interface and manage datasets efficiently.
- Perform descriptive and inferential statistical analyses.
- Conduct t-tests, ANOVA, MANOVA, regression, and correlation analyses.
- Apply non-parametric tests, principal component analysis, and factor analysis for advanced data interpretation.
Talent Acquisition Analytics
14 Hours
This instructor-led, live training (online or onsite) is aimed at HR professionals and recruitment specialists who wish to use analytical methods to improve organisational performance. This course covers qualitative as well as quantitative, empirical and statistical approaches.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Introduction to Data Visualization with Tidyverse and R
7 Hours
The Tidyverse is a collection of versatile R packages for cleaning, processing, modeling, and visualizing data. Some of the packages included are: ggplot2, dplyr, tidyr, readr, purrr, and tibble.
In this instructor-led, live training, participants will learn how to manipulate and visualize data using the tools included in the Tidyverse.
By the end of this training, participants will be able to:
- Perform data analysis and create appealing visualizations.
- Draw useful conclusions from various sample datasets.
- Filter, sort and summarize data to answer exploratory questions.
- Turn processed data into informative line plots, bar plots and histograms.
- Import and filter data from diverse data sources, including Excel, CSV, and SPSS files.
Audience
- Beginners to the R language
- Beginners to data analysis and data visualization
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice