A free online introduction to reproducible data analysis using Python and open data
Created by Michel Wermelinger
last edited May 03 2017 by Michel Wermelinger
“Learn to Code for Data Analysis” is a free online course by The Open University, continuously available here.
It is an introduction on how to obtain, clean, process, analyse, and visualise open data, and publicly share the results according to a reproducible research approach. The course uses real health, weather, development and economic data from the World Health Organisation, the Weather Underground, the World Bank, and the United Nations Comtrade databases.
The course assumes no knowledge of programming and statistics and does not require any software installation: it uses an online Python-based environment. The course uses pandas (a state of the art data analysis library for Python), and Jupyter notebooks as the programming and documentation environment. All these are used by professional scientists.
The course aims to promote data literacy and may be of interest to teachers and A-level students not just in Computing but also in maths, natural sciences, engineering, sociology, human geography and other disciplines where data analysis can be used to investigate a topic more in depth. I’d appreciate if you can share this in your school.
The pedagogy of this course is outlined here.
Duration: 20-30h (depending on how many of the exercises and projects are tackled)