Exercise 5

Note

Please complete this exercise by the start of the next lesson.

Start your assignment

You can start working on your copy of Exercise 5 by accepting the GitHub Classroom assignment.

You can also take a look at the template repository for Exercise 5 on GitHub (does not require logging in). Note that you should not try to make changes to this copy of the exercise, but rather only to the copy available via GitHub Classroom.

Pair programming (optional!)

Students attending the course in Helsinki, if you wish, you can continue working in pairs. See more information in Slack, and in week 2: Why are we working in pairs?. Those students who want to submit their own solutions, please contact the course assistant that is grading your assignments (see list in Slack).

Cloud computing environments

https://img.shields.io/badge/launch-binder-red.svg https://img.shields.io/badge/launch-CSC%20notebook-blue.svg

Exercise 5 hints

Below are some tips for working on Exercise 5.

Selecting date ranges

In the Problem 4 part 2, the aim is to select rows that belong to certain month. The key here is to understand that the data values in YR--MODAHRMN column are integer numbers using a format YYYYMMDDHHmm where YYYY is the year of the observation, MM is the month, DD is the day, HH is the hour, and mm is the minute.

Using these values it is possible to make simple mathematical queries such as finding the values starting from August:

august_values = data.loc[data['YR--MODAHRMN'] >= 201708010000]

Here, the value 201708010000 corresponds to the first day of August at 00:00 hour.