Please complete this exercise by the start of the next lesson.
Start your assignment
You can start working on your copy of Exercise 5 by accepting the GitHub Classroom assignment.
You can also take a look at the template repository for Exercise 5 on GitHub (does not require logging in). Note that you should not try to make changes to this copy of the exercise, but rather only to the copy available via GitHub Classroom.
Pair programming (optional!)
Students attending the course in Helsinki: if you wish, you can continue working in pairs. See more information in Slack and in week 2 ("Why are we working in pairs?"). Students who want to submit their own solutions should contact the course assistant who is grading their assignments (see the list in Slack).
Cloud computing environments
Exercise 5 hints
Below are some tips for working on Exercise 5.
Selecting date ranges
In Part 3 of Problem 3, the aim is to select rows that belong to a certain month. The key here is to understand that the data values in the YR--MODAHRMN column are integers in the format YYYYMMDDHHmm, where
YYYY is the year of the observation,
MM is the month,
DD is the day,
HH is the hour, and
mm is the minute.
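To see how the parts are packed into a single integer, a YYYYMMDDHHmm value can be taken apart with integer division and remainders. The following is only a minimal sketch; the helper name split_timestamp is hypothetical and is not part of the exercise.

```python
def split_timestamp(value):
    """Split a YYYYMMDDHHmm integer into (year, month, day, hour, minute)."""
    # Peel off two digits at a time, starting from the right-hand end
    value, minute = divmod(value, 100)
    value, hour = divmod(value, 100)
    value, day = divmod(value, 100)
    year, month = divmod(value, 100)
    return year, month, day, hour, minute


parts = split_timestamp(201708010000)
print(parts)  # (2017, 8, 1, 0, 0)
```

Because the year occupies the leftmost digits, followed by the month, day, hour, and minute, a later observation always has a larger integer value than an earlier one.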
Because the column values are plain integers, it is possible to make simple numerical queries, such as selecting all rows from the beginning of August 2017 onward:
august_values = data.loc[data['YR--MODAHRMN'] >= 201708010000]
Here, the value 201708010000 corresponds to 00:00 on the first day of August 2017.
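The same idea can be extended to a single month by adding an upper bound to the comparison. The sketch below assumes a small made-up DataFrame in place of the exercise data; the TEMP column and all of its values are illustrative only.

```python
import pandas as pd

# Made-up table standing in for the exercise data (values are illustrative only)
data = pd.DataFrame({
    "YR--MODAHRMN": [201707312350, 201708010000, 201708151200,
                     201708312350, 201709010000],
    "TEMP": [60.0, 61.0, 65.0, 59.0, 58.0],
})

# Keep rows from 2017-08-01 00:00 up to, but not including, 2017-09-01 00:00
august = data.loc[
    (data["YR--MODAHRMN"] >= 201708010000)
    & (data["YR--MODAHRMN"] < 201709010000)
]
print(august)
```

Each comparison is wrapped in parentheses because the & operator binds more tightly than >= and < in Python. Using the first moment of the following month as an exclusive upper bound avoids having to know how many days the month has.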