s

Python for Data Science

Python is a powerful, flexible, open source and widely-used programming language with many applications. I had an opportunity to facilitate a workshop on python4Data science at Arusha Technical College (ATC), Arusha -Tanzania. The workshop aimed to explores Python’s place in the scientific ecosystem, and how the language, with several readily-available open-source libraries, can serve as a powerful tool for data science.

The workshop covered the following aspects:

  1. A thorough introduction to python programming.
  2. Introduction to scientific computing with numpy.
  3. Data Analysis and Data Visulisation with pandas, matplotlib and seaborn.
  4. Introduction to Machine Learning.

To demonstrate the power of python for data-science we used the following datasets from Tanzania opendata portal: Primary School Enrolment by Sex and Age and University Students Enrollment Degree & Non-Degree

All resources for this workshop can be accessed here.

We conducted this program together with Anthony Faustine, Dr. Dina Machuve and Eliyah Masesa.