Improve your skills from anywhere, anytime on your schedule

(+254) 721 331 808    training@upskilldevelopment.com

Data Management and Statistical Data Analysis using Python Course

NOTE: To view the training dates and registration button clearly put your mobile phone, tablet on landscape layout. Thank you

Online/ On-site (Nairobi, Kenya) Training Dates

Course Date Onsite fee: Live Online fee: Register for Physical/Online Training
13/05/2024 To 17/05/2024 1,500 USD 900 USD Register for On-site Register for Online
10/06/2024 To 14/06/2024 1,500 USD 900 USD Register for On-site Register for Online
08/07/2024 To 12/07/2024 1,500 USD 900 USD Register for On-site Register for Online
12/08/2024 To 16/08/2024 1,500 USD 900 USD Register for On-site Register for Online

Introduction

This comprehensive course will be your guide to learning how to use the power of Python to analyze big data, create beautiful visualizations, and use powerful machine learning algorithms. This course is designed for both beginners with basic programming experience or experienced developers looking to make the jump to Data Science and big data Analysis.

Duration

5 Days

Course Objective

  o   Research Design

o   Python for Data Science and Machine

o   Spark for Big Data Analysis

o   Implement Machine Learning Algorithms

o   Numbly for Numerical Data

o   Pandas for Data Analysis

o   Matplotlib for Python Plotting

o   Seaborn for statistical plots

o   Interactive dynamic visualizations

o   SciKit-Learn for Machine Learning Tasks

o   K-Means Clustering, Logistic Regression and Linear Regression

o   Random Forest and Decision Trees

o   Natural Language Processing and Spam Filters

o   Neural Networks

o   Support Vector Machines

o   Research report writing

Who Should Attend?

This is a general course targeting participants with elementary knowledge of Statistics from Agriculture, Economics, Food Security and Livelihoods, Nutrition, Education, Medical or public health professionals among others who already have some statistical knowledge, but wish to be conversant with the concepts and applications of statistical modeling using Python

Course content

Module1: Basic statistical terms and concepts

  o   Introduction to statistical concepts

o   Descriptive Statistics

o   Inferential statistics

Module 2: Research Design

  o   The role and purpose of research design

o   Types of research designs

o   The research process

o   Which method to choose?

o   Exercise: Identify a project of choice and developing a research design

Module 3: Survey Planning, Implementation and Completion

  o   Types of surveys

o   The survey process

o   Survey design

o   Methods of survey sampling

o   Determining the Sample size

o   Planning a survey

o   Conducting the survey

o   After the survey

o   Exercise: Planning for a survey based on the research design selected

Module 4: Introduction to Phython

  o   Course Intro

o   Setup

o   Installation Setup and Overview

o   IDEs and Course Resources

o   iPython/Jupyter Notebook Overview

Module 5: Learning Numpy

  o   Intro to numpy

o   Creating arrays

o   Using arrays and scalars

o   Indexing Arrays

o   Array Transposition

o   Universal Array Function

o   Array Processing

o   Array Input and Output

Module 6: Intro to Pandas

  o   DataFrames

o   Index objects

o   Reindex

o   Drop Entry

o   Selecting Entries

o   Data Alignment

o   Rank and Sort

o   Summary Statistics

o   Missing Data

o   Index Hierarchy

Module 7: Working with Data

  o   Reading and Writing Text Files

o   JSON with Python

o   HTML with Python

o   Microsoft Excel files with Python

o   Merge and Merge on Index

o   Concatenate and Combining DataFrames

o   Reshaping, Pivoting and Duplicates in Data Frames

o   Mapping,Replace,Rename Index,Binning,Outliers and Permutation

o   GroupBy on DataFrames

o   GroupBy on Dict and Series

o   Splitting Applying and Combining

o   Cross Tabulation

Module 8: Big Data and Spark with Python

  o   Welcome to the Big Data Section!

o   Big Data Overview

o   Spark Overview

o   Local Spark Set-Up

o   AWS Account Set-Up

o   Quick Note on AWS Security

o   EC2 Instance Set-Up

o   SSH with Mac or Linux

o   PySpark Setup

o   Lambda Expressions Review

o   Introduction to Spark and Python

o   RDD Transformations and Actions

Module 9: Data Visualization

  o   Installing Seaborn

o   Histograms

o   Kernel Density Estimate Plots

o   Combining Plot Styles

o   Box and Violin Plots

o   Regression Plots

o   Heatmaps and Clustered Matrices

Module 10: Data Analysis

  o   Linear Regression

o   Support Vector

o   Decision Trees and Random Forests

o   Natural Language Processing

o   Discrete Uniform Distribution

o   Continuous Uniform Distribution

o   Binomial Distribution

o   Poisson Distribution

o   Normal Distribution

o   Sampling Techniques

o   T-Distribution

o   Hypothesis Testing and Confidence Intervals

o   Chi Square Test and Distribution

Module 11: Report writing for surveys, data dissemination, demand and use

  o   Writing a report from survey data

o   Communication and dissemination strategy

o   Context of Decision Making

o   Improving data use in decision making

o   Culture Change and Change Management

o   Preparing a report for the survey, a communication and dissemination plan and a demand and use strategy.

o   Presentations and joint action planning

Training Approach

This course will be delivered by our skilled trainers who have vast knowledge and experience as expert professionals in the fields. The course is taught in English and through a mix of theory, practical activities, group discussion and case studies. Course manuals and additional training materials will be provided to the participants upon completion of the training.

Tailor-Made Course

This course can also be tailor-made to meet organization requirement. For further inquiries, please contact us on: Email: training@upskilldevelopment.com Tel: +254 721 331 808

Training Venue 

The training will be held at our Upskill Training Centre. We also offer training for a group at requested location all over the world. The course fee covers the course tuition, training materials, two break refreshments, and buffet lunch.

Visa application, travel expenses, airport transfers, dinners, accommodation, insurance, and other personal expenses are catered by the participant

Certification

Participants will be issued with Upskill certificate upon completion of this course.

Airport Pickup and Accommodation

Airport pickup and accommodation is arranged upon request. For booking contact our Training Coordinator through Email: training@upskilldevelopment.com, +254 721 331 808 

Terms of Payment: Unless otherwise agreed between the two parties payment of the course fee should be done 3 working days before commencement of the training so as to enable us to prepare better.

Online/ On-site (Nairobi, Kenya) Training Schedule

Course Date Onsite fee Live Online fee Click to Register, Physical/Online Training
13/05/2024 to 17/05/2024 1,500 USD 900 USD Register for Onsite Register for Online
10/06/2024 to 14/06/2024 1,500 USD 900 USD Register for Onsite Register for Online
08/07/2024 to 12/07/2024 1,500 USD 900 USD Register for Onsite Register for Online
12/08/2024 to 16/08/2024 1,500 USD 900 USD Register for Onsite Register for Online

Some of Our Recent Clients

Training that focuses on providing skills for work?

We support the development of a skilled and confident workforce to meet the changing demands of growing sectors by offering the best possible training to enable them to fulfil learning goals.

Make a Mark in You Day to Day work
Click to chat with us