Instructor Led Online Classes

Jun 14 Fri

Data Science in Practice - Online Bootcamp (Batch 2: Week 1) (Weekends)

Jun 14 to Jun 15

  • 14 Fri

    Session 1

    09:00 AM to 04:30 PM (EDT)

    08:00 -  09:00   (Optional) Technical Support on Installation of Python and Related Toolsets
    09:00 -  09:30   Module 1.1 - Data Science Landscape
    09:30 -  12:15   Module 2.1 - Python Programming for Data Science
    13:00 -  15:00   Module 2.2 - Exploratory Data Analysis with Python
    15:00 -  16:00   Module 2.3 - Data Visualization
    16:00 -  16:30   EDA Exercises and Project Introduction

  • 15 Sat

    Session 1

    09:00 AM to 04:30 PM (EDT)

    09:00 -  10:00   Module 2.4 - Data Analysis Project Review
    10:00 -  12:00   Module 3.1 - Basics of Machine Learning
    13:00 -  15:30   Module 3.2 - Practical Machine Learning with Python
    15:30 -  16:30   Module 3.3 - ML Project: Mobility Prediction in Metropolis (P2)

Jun 21 Fri

Data Science in Practice - Online Bootcamp (Batch 2: Week 2) (Weekends)

Jun 21 to Jun 22

  • 21 Fri

    Session 1

    09:00 AM to 04:30 PM (EDT)

    09:00 -  09:30   Module 3.3 - ML Project Solution Review
    09:30 -  12:00   Module 3.4 - Advanced ML with Scikit-learn
    12:45 -  13:30   Module 1.2 - Data Science from Business Perspective
    13:30 -  16:30   Module 3.5 - ML Project: Customer Churn Prediction (P3) - Mini-Hackathon and Hints

  • 22 Sat

    Session 1

    09:00 AM to 04:30 PM (EDT)

    09:00 -  11:00   Module 4.1 - Machine Learning for Large Scale Applications
    11:00 -  13:00   Module 4.2 - Building Decentralized Applications: Blockchain Network
    14:00 -  16:00   Module 4.3 - Deep Learning
    16:00 -  16:30   Module 1.3 - Data Science Roles and Methods


Industry recognized certification enables you to add this credential to your resume upon completion of all courses

Dr. Mohammed Kamran Malik, Instructor - Data Science in Practice - An Online Bootcamp

Dr. Mohammed Kamran Malik

Dr. Malik is a Professor of Machine Learning at the University of the Punjab, Lahore. He has more than 15 years of teaching and development experience. He holds a Ph.D. in Computer Science and several patents, and is the author of over 25 journal and conference papers. His focus areas in Data Science include Natural Language Processing and Machine Learning. He is a renowned educator and has won several awards for his teaching methodology. Prof. Malik has provided consultancy to many multinational companies on Natural Language Processing, Machine Learning, and Data Science projects. In particular, he has successfully helped more than 10 teams at corporations and software houses transition into the Data Science landscape.
Dr. Zubair Nawaz, Instructor - Data Science in Practice - An Online Bootcamp

Dr. Zubair Nawaz

Dr. Zubair Nawaz is an Assistant Professor at Punjab University College of Information Technology (PUCIT), University of the Punjab, Lahore. He has completed two postdocs, one at Delft University of Technology, The Netherlands, and the other at Synchrotron-Light for Experimental Science and Applications in the Middle East (SESAME), Jordan. He holds a B.S. in Mechanical Engineering from Ghulam Ishaq Khan Institute of Engineering Sciences and Technology (GIKI), Topi; an M.S. in Computer Science from Lahore University of Management Sciences (LUMS), Lahore; and a Ph.D. in Computer Engineering from Delft University of Technology (TU Delft). He has a diverse background, having worked in Theoretical Computer Science, Cryptography, Compilers, Reconfigurable Computing, High-Performance Computing, Computer Architecture, and Bioinformatics. For the last 3-4 years, he has been working extensively on Data Science, Machine Learning, and Deep Learning with applications in Finance, Computer Vision, Medical Imaging, and Natural Language Processing.
Dr. Chan Naseeb, Instructor - Data Science in Practice - An Online Bootcamp

Dr. Chan Naseeb

Dr. Naseeb is a Lead Data Scientist in the IBM Global Elite team for Data Science & AI. Prior to this, he was a Manager (Principal Lead Data Scientist) on the KPMG data & analytics team, where he led Data Science engagements and client delivery in sectors such as the public sector, the chemical industry, transport, logistics, and FinTech. He was also the technical lead for the development of an open-source Big Data platform, KAVE (KPMG Analytics and Visualization Environment). He led both onshore and offshore teams for Data Science engagements. His focus spanned the end-to-end delivery of Data Science solutions, from strategy and transformation to execution, to enable organizations to become data driven. He was also the R&D lead for Data Science research and for prototyping solutions to industry-relevant problems. He has been doing analysis, design, and verification of systems for many years and has worked in Data Science related areas for more than 13 years. Other dimensions of his role include collaboration with academia, performance development, and capacity building of teams. He is a Data Science and agile evangelist. His expertise includes AI, Machine Learning, Deep Learning, Data Science, and Blockchain. He has been a reviewer for a number of journals and conferences, and a speaker and moderator at some of them. He holds a Professional Doctorate in Engineering from Eindhoven University of Technology.
Dr. Muhammad Shahzad Cheema, Instructor - Data Science in Practice - An Online Bootcamp

Dr. Muhammad Shahzad Cheema

Dr. Cheema is Lead Data Scientist at IBM Watson IoT HQ Munich. He has 17 years of hands-on experience in implementing AI systems. He holds a Ph.D. in Large Scale Machine Learning from the University of Bonn, Germany, and three M.Sc. degrees in Mathematics, Computer Science, and Robotics. He has led a number of AI and Big Data projects across industries. Some of his projects include: a first-of-a-kind AI-based smart home with capabilities such as face recognition, behavior learning, edge analytics, and life-long learning; a Reinforcement Learning based autonomous driving solution that enables a vehicle to learn to drive faster and more smoothly in real-world scenarios; optimization of the world’s largest logistics network; a real-time bidding engine handling nearly a billion requests per day; and a cognitive cooking system that is able to learn recipes and the physics of the devices. Over his career, Dr. Cheema has worked in startup, corporate, technology, research, and academic settings. As an AI Evangelist, he is an advocate of business- and solution-oriented data science. He has developed and mentored several Data Science teams at large tech and corporate organizations. He also leads the Data Science Initiative, which aims at AI resource development by enabling Data Science enthusiasts through focused trainings.

Instructors: Dr. Mohammed Kamran Malik, Dr. Zubair Nawaz, Dr. Chan Naseeb, Dr. Muhammad Shahzad Cheema

A hands-on and end-to-end Data Science Program

  • Perform effective data analysis & implement end-to-end machine learning pipelines.
  • Investigate and present your work in effective visual form.
  • The instructors have delivered this workshop all over the world and bring a combined 40+ years of experience.

Course Description

The bootcamp will cover different aspects of Data Science with hands-on exercises and industrial use-cases. Major modules include Python for Data Science, Data Analysis, Practical Machine Learning, Large Scale Machine Learning, Deep Learning, and Business Perspectives of Data Science. This well-structured offering is by Data Science Initiative (DSI) - a team of industry and academia experts - who have executed 10 hands-on workshops in the past with an excellent rating of 4.5/5.0.

What am I going to get from this course?

  • Perform effective data analysis, which makes up 50% to 80% of Data Science work
  • Investigate and present your work in effective visual form
  • Understand the most popular machine learning algorithms and their uses
  • Implement end-to-end machine learning pipelines for given use cases
  • Implement large-scale machine learning algorithms on the cloud and through APIs
  • Employ deep learning for unstructured data (images and text)
  • Build a small blockchain
  • Differentiate real data science from the buzz around it
  • Understand the Data Science business ecosystem and identify the right use cases

Prerequisites and Target Audience

What will students need to know or do before starting this course?

Participants may want to brush up on some basic concepts of probability, matrices, and programming (in any language).

Who should take this course? Who should not?

You should join this Bootcamp if you are among:
  • Business or IT professionals who wish to transition into a data scientist role
  • Early-stage data scientists and data analysts who wish to learn end-to-end data science from a team of industry experts
  • Students and fresh graduates who want to pursue a career in Data Science, Machine Learning, and Artificial Intelligence
  • Researchers from any field who work with data and would like to employ Machine Learning in their research
  • Non-Python data scientists who would like to go through a quick hands-on transition to Python


Module 1: Data Science Landscape

Lecture 1 Data Science Landscape - Overview

This sub-module will enable participants to differentiate real data science from the buzz around it and develop a solution-oriented mindset. It is scheduled at the beginning of the workshop.

  • The four building blocks of Data Science
  • Common mistakes and best practices for data scientists
  • Relation of Data Science with AI, Machine Learning, Deep Learning, and other fields
  • Technology stack and choosing the best technology
  • An overview of industrial applications and their requirements

Lecture 2 Data Science from Business Perspective

Scheduled in the middle of the workshop, this module will introduce participants to the practical requirements of business use cases and to best practices when following CRISP-DM or the garage approach.

  • Identifying and prioritizing data science use cases within an organization
  • Translating business problems into ML problems
  • Analytics roadmap for organizations
  • Mapping of Design Thinking to Data Science
  • Introduction and requirements for ML project P2, which will follow this session

Lecture 3 Data Science Roles and Methods

At the end of the workshop, when the participants have gone through the end-to-end data science journey, this module will help them understand which role fits them best, the best practices for that role, and how to enter and excel in it.

  • Data Science value perspective
  • Data Science teams and organizations
  • Data Science roles (e.g. Data Engineer, Analyst, Data Scientist, ML Engineer, etc.)
  • Methods such as CRISP-DM, garage, and scrum
  • Communication with non-data scientists
  • The non-technical skills needed for excellence in Data Science

Module 2: Exploratory Data Analysis with Python

Lecture 4 Python Programming for Data Science

Python is an easy-to-understand scripting language, yet its compact programming style and vast number of libraries make it a challenge for learners to focus on what matters most. With a carefully designed content trajectory, this hands-on module will give participants a good basis in Python for the rest of the workshop, which then covers the most relevant libraries and methods.

  • Why Python is the most popular language for data scientists
  • Introduction to Python as a language
  • Python native data structures including lists, sets, dictionaries, and tuples
  • NumPy and SciPy
  • Control structures
  • Functions and classes
  • Hands-on exercises in Python
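
As a small illustration of the native data structures listed for this lecture, the following sketch uses only the standard library. The sample readings and all variable names are hypothetical and are not taken from the course material.

```python
from collections import Counter

# Hypothetical sample data: (city, temperature in degrees Celsius) readings.
readings = [("Lahore", 41.0), ("Munich", 24.5), ("Lahore", 39.5), ("Delft", 19.0)]

# List: ordered, mutable sequence.
temps = [temp for _, temp in readings]

# Tuple: fixed-size, immutable record; here the hottest (city, temp) pair.
hottest = max(readings, key=lambda reading: reading[1])

# Set: unique values with fast membership tests.
cities = {city for city, _ in readings}

# Dictionary: key -> value mapping, here city -> list of temperatures.
by_city = {}
for city, temp in readings:
    by_city.setdefault(city, []).append(temp)


def mean(values):
    """Return the arithmetic mean of a non-empty sequence."""
    return sum(values) / len(values)


# Function + dictionary comprehension: average temperature per city.
avg_by_city = {city: mean(values) for city, values in by_city.items()}

# Counter: a dictionary subclass that counts occurrences.
counts = Counter(city for city, _ in readings)
```

Printing `avg_by_city` or `counts` in an interactive session is a quick way to see how each structure behaves.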

Lecture 5 Exploratory Data Analysis with Python

EDA consumes most of a data scientist's time (50% to 80%). This module will equip participants with the best techniques for efficient and effective EDA.

  • Introduction to EDA, common mistakes, best practices
  • Introduction to the pandas library
  • DataFrame and Series data structures in pandas
  • Reading data from different sources (csv, web, excel, json, SQL, txt, etc.)
  • DataFrame operations, e.g. filtering, filling, merging, conditioning, aggregation
  • Summarization, outlier detection, and bird's-eye-view reporting
  • Map-reduce for efficient operations on DataFrames
  • EDA hands-on exercises
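
The DataFrame operations named for this lecture (filling, merging, filtering, aggregation, summarization) could look roughly like the sketch below. The two toy tables and their column names are hypothetical; the course's own datasets are not shown in this listing.

```python
import pandas as pd

# Hypothetical toy data; column names are illustrative only.
orders = pd.DataFrame({
    "customer_id": [1, 2, 2, 3],
    "amount": [120.0, None, 80.0, 45.0],
})
customers = pd.DataFrame({
    "customer_id": [1, 2, 3],
    "city": ["Lahore", "Munich", "Delft"],
})

# Filling: replace the missing amount with the column median (NaN is skipped).
orders["amount"] = orders["amount"].fillna(orders["amount"].median())

# Merging: attach each customer's city to their orders.
merged = orders.merge(customers, on="customer_id", how="left")

# Filtering: keep orders with an amount of at least 80.
large = merged[merged["amount"] >= 80]

# Aggregation: total and mean amount per city.
summary = merged.groupby("city")["amount"].agg(["sum", "mean"])

# Summarization: count, mean, quartiles, min and max in one call.
stats = merged["amount"].describe()
```

Filling with the median is only one possible choice; which imputation is appropriate depends on the data and the use case.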

Lecture 6 Data Visualization

Data visualization is not only helpful for communicating results and findings to others but is equally important for data scientists themselves in order to understand the data. This module will enable participants to develop quick and pretty visualizations using Python libraries.

  • Good and bad types of visualizations
  • Practical work with Matplotlib: making any visualization
  • Practical work with Seaborn: making interactive and pretty visualizations
  • Practical work with Plotly: making and deploying interactive and pretty visualizations

Lecture 7 Data Analysis Project

This module will allow participants to test and improve the skills developed so far, i.e. EDA with pandas and the visualization libraries in Python. A business use case will be discussed as a reference case for the exercises.

  • Use case and problem statements
  • Data loading and merging
  • Data analysis
  • Data cleaning
  • Data exploration
  • Data visualization

Module 3: Practical Machine Learning

Lecture 8 Basics of Machine Learning

This module will provide a theoretical understanding of machine learning algorithms, how they work, and their advantages and limitations, thereby demystifying them for participants, who should eventually be able to decide on their own ML design if needed.

  • Types of Machine Learning algorithms and application scenarios
  • Classification algorithms (e.g. Naïve Bayes, Decision Trees, KNN, ANN, Support Vector Machines)
  • Regression algorithms (e.g. Linear/Ridge/Lasso Regression and the regression cousins of the classification algorithms)
  • Ensemble methods (e.g. Random Forests, Gradient Boosted Trees)
  • Outlier detection (e.g. One-Class SVM, auto-encoders)
  • Clustering algorithms (e.g. K-Means, DBSCAN, Hierarchical clustering)
  • Feature selection and dimensionality reduction (e.g. PCA, LDA, RFE, and other techniques)
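
To make one of the listed classification algorithms concrete, here is a minimal sketch of k-nearest neighbours (KNN) in plain Python. It assumes Euclidean distance and a simple majority vote; the function name `knn_predict` and the toy data are hypothetical and not part of the course material.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest training points."""
    # Pair every training point's Euclidean distance to the query with its label.
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Hypothetical 2-D toy data with two well-separated classes.
train_X = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
train_y = ["a", "a", "a", "b", "b", "b"]

prediction_near_a = knn_predict(train_X, train_y, (1.1, 1.0))
prediction_near_b = knn_predict(train_X, train_y, (5.1, 5.0))
```

KNN has no training step beyond storing the data, which makes it a convenient first algorithm to reason about before moving to model-based methods.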

Lecture 9 Practical Machine Learning with Python

Using Scikit-learn as the main library, this module will enable participants to apply the machine learning process through model selection, model building, parameter optimization, and evaluation.

  • Introduction to Scikit-learn
  • Exploration of built-in datasets
  • Building an ML model using Scikit-learn
  • Splitting data into training, validation, and testing sets
  • Cross-validation techniques
  • Hyperparameter search using GridSearchCV
  • ML exercises and assignment
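
The workflow outlined for this lecture (a built-in dataset, a held-out test split, cross-validation, and a GridSearchCV hyperparameter search) can be sketched as follows. The choice of the iris dataset, the KNN classifier, and the parameter grid are assumptions made for illustration only.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Built-in dataset: 150 iris flowers, 4 features, 3 classes.
X, y = load_iris(return_X_y=True)

# Hold out a test set that is never touched during model selection.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# 5-fold cross-validation on the training split plays the role of the validation set.
search = GridSearchCV(
    KNeighborsClassifier(),
    param_grid={"n_neighbors": [1, 3, 5, 7, 9]},
    cv=5,
)
search.fit(X_train, y_train)

best_k = search.best_params_["n_neighbors"]
# GridSearchCV refits the best model on the full training split by default.
test_accuracy = search.score(X_test, y_test)
```

Reporting `test_accuracy` only once, after the search is finished, keeps the test set an honest estimate of performance on unseen data.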

Lecture 10 ML Project – Mobility Prediction in Metropolis

This is the first of two projects in Machine Learning. The objective of this project is to enable participants to apply the knowledge they have gained so far of the end-to-end data science process (EDA + ML) to a real-world scenario.

  • Quick introduction to the use case
  • Loading data from different sources
  • EDA and data cleaning
  • Building machine learning models
  • Validation and testing of models
  • Debugging machine learning models with respect to overfitting and underfitting

Lecture 11 Advanced ML with Scikit-learn

The primary objective of this module is to enable participants to use Pipelines as an end-to-end machine learning construct. The module will also cover feature selection and dimensionality reduction in more detail, with exercises.

  • Curse of dimensionality
  • Feature selection methods (univariate and multivariate methods)
  • Dimensionality reduction techniques and exercises (PCA, LDA, RFE, etc.)
  • Concept of transformations and operations in Python
  • Introduction to Machine Learning Pipelines in Scikit-learn
  • Building 2-step, 3-step, and k-step ML Pipelines (e.g. Feature Selection + Feature Engineering + Classification)
  • Pipelines and GridSearchCV
  • Exercises on all of the above
  • The untold truth of Machine Learning
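
A 3-step Pipeline combined with GridSearchCV, as described for this lecture, might look like the sketch below. The specific steps (scaling, univariate feature selection, logistic regression), the breast cancer dataset, and the grid values are assumptions chosen for illustration, not the course's own exercise.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, random_state=0
)

# Three named steps: transformation, feature selection, classification.
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("select", SelectKBest(score_func=f_classif)),
    ("classify", LogisticRegression(max_iter=1000)),
])

# Step parameters are addressed as <step name>__<parameter name>.
param_grid = {
    "select__k": [5, 10, 20],
    "classify__C": [0.1, 1.0, 10.0],
}

# Each cross-validation fold refits the whole pipeline, so the scaler and the
# selector only ever see that fold's training portion.
search = GridSearchCV(pipeline, param_grid, cv=5)
search.fit(X_train, y_train)

best_params = search.best_params_
test_accuracy = search.score(X_test, y_test)
```

Searching over the whole pipeline rather than the classifier alone avoids leaking information from validation folds into the preprocessing steps.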

Lecture 12 ML Project – Customer Churn Prediction

This is the second ML project, with the objective of exercising a full CRISP-DM cycle, i.e. domain understanding + data understanding + data analysis + model building + model optimization + model deployment (as a pipeline).

  • Domain understanding from a problem statement
  • Exploratory Data Analysis
  • Feature engineering and feature selection
  • Building machine learning models
  • Validation and testing of models
  • Doing it all within a Machine Learning Pipeline
  • Applying CRISP-DM

Module 4: Data Science at Scale

Lecture 13 Machine Learning for Large Scale Applications

This module covers implementing large-scale machine learning algorithms on the cloud and through APIs.

  • Building ML apps with REST APIs using Flask
  • Introduction to Apache Spark
  • Machine Learning on Spark: a hands-on session
  • Introduction to cloud-based Artificial Intelligence
  • Architecture of large-scale AI applications
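
As a rough idea of what an ML app behind a REST API in Flask can look like, consider the sketch below. The `/predict` route, the feature names, and the fixed linear scoring rule standing in for a trained model are all hypothetical; a real service would load a fitted model instead.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

# Hypothetical stand-in for a trained model: a fixed linear scoring rule.
WEIGHTS = {"tenure_months": -0.02, "support_calls": 0.15}
BIAS = 0.3


def predict_churn_score(features):
    """Return a score clipped to [0, 1]; a real service would call a fitted model here."""
    raw = BIAS + sum(WEIGHTS[name] * float(features[name]) for name in WEIGHTS)
    return max(0.0, min(1.0, raw))


@app.route("/predict", methods=["POST"])
def predict():
    # Parse the JSON body; fall back to an empty dict if it is missing or invalid.
    payload = request.get_json(silent=True) or {}
    missing = [name for name in WEIGHTS if name not in payload]
    if missing:
        return jsonify({"error": f"missing features: {missing}"}), 400
    return jsonify({"churn_score": predict_churn_score(payload)})
```

The app can be exercised without starting a server through Flask's built-in test client, for example `app.test_client().post("/predict", json={"tenure_months": 10, "support_calls": 2})`.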

Lecture 14 Building Decentralized Applications: Blockchain Network

This module will enable participants to understand and experiment with the concepts of decentralized applications.

  • Understanding Blockchain
  • Distributed ledgers
  • Cryptocurrencies
  • Blockchain potential: use cases
  • Building a Blockchain application with Python (hands-on)
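
The core idea behind a blockchain, blocks linked by hashes so that tampering becomes detectable, can be sketched with the standard library alone. The block layout, the toy proof of work, and all function names below are assumptions for illustration; the hands-on application built in the course may differ.

```python
import hashlib
import json


def hash_block(block):
    """Return the SHA-256 hex digest of a block's contents (excluding its own hash)."""
    payload = {key: block[key] for key in ("index", "data", "previous_hash", "nonce")}
    return hashlib.sha256(json.dumps(payload, sort_keys=True).encode()).hexdigest()


def mine_block(index, data, previous_hash, difficulty=2):
    """Search for a nonce whose block hash starts with `difficulty` zeros (toy proof of work)."""
    block = {"index": index, "data": data, "previous_hash": previous_hash, "nonce": 0}
    while not hash_block(block).startswith("0" * difficulty):
        block["nonce"] += 1
    block["hash"] = hash_block(block)
    return block


def is_valid(chain):
    """Check that every block's stored hash is correct and links to the block before it."""
    for previous, current in zip(chain, chain[1:]):
        if current["previous_hash"] != previous["hash"]:
            return False
    return all(block["hash"] == hash_block(block) for block in chain)


# Build a three-block chain with hypothetical transaction strings as data.
chain = [mine_block(0, "genesis", "0" * 64)]
chain.append(mine_block(1, "Alice pays Bob 5", chain[-1]["hash"]))
chain.append(mine_block(2, "Bob pays Carol 2", chain[-1]["hash"]))
```

Changing the `data` of any earlier block makes `is_valid(chain)` return `False`, because the stored hash no longer matches the recomputed one. This sketch covers a single local chain only and leaves out the networking and consensus parts of a distributed ledger.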

Lecture 15 Deep Learning

Deep Learning is probably the most talked-about area in Data Science. This module will equip participants with the understanding and practical experience needed to build a deep neural network using Keras/TensorFlow.

  • Types and applications of Neural Networks
  • Multi-layer Backpropagation Networks
  • Activation functions (Sigmoid, Tanh, ReLU, etc.)
  • Introduction to Convolutional Neural Networks
  • Introduction to Keras
  • Development of image classification using a Deep Neural Network
  • Development of image classification using a Convolutional Neural Network
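
The activation functions named in this lecture, and the forward pass of a fully connected layer, can be illustrated without any deep learning framework. The sketch below deliberately uses plain Python rather than Keras/TensorFlow, and the weights are hypothetical hand-picked values rather than trained parameters.

```python
import math


def sigmoid(x):
    """Squash x into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def tanh(x):
    """Squash x into the interval (-1, 1)."""
    return math.tanh(x)


def relu(x):
    """Pass positive values through and clamp negatives to zero."""
    return max(0.0, x)


def dense(inputs, weights, biases, activation):
    """Forward pass of one fully connected layer: activation(W @ inputs + b)."""
    return [
        activation(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


# Hypothetical hand-picked weights for a 2-input, 2-hidden-unit, 1-output network.
hidden = dense([1.0, 2.0], [[0.5, 0.25], [-1.0, 0.25]], [0.0, 0.0], relu)
output = dense(hidden, [[1.0, -1.0]], [0.0], sigmoid)
```

Here the second hidden unit receives a negative pre-activation and is switched off by ReLU, while the sigmoid output layer maps the remaining signal to a value between 0 and 1. Frameworks such as Keras wrap the same computation in layer objects and add the backpropagation step that learns the weights.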