In the course so far you have learnt the building blocks of Python. With this you can write a surprising amount of useful code! However our efforts are rewarded many times over if we can share the code we have written. Sharing your code:

clearly demonstrates your programming skills to future employers and/or supervisors
enables other researchers to use your code, progressing science forwards
establishes you an active participant in a particular domain or community of practice (people know who you are)

However it is unfortunately not enough to stick our Python scripts / modules online and hope that other people will use them. We need to use tests to demonstrate that our software is reliable, and we need to write documentation so that others understand how best to use the code. These two topics - testing and documentation - are the focus of this lesson.

Please note that the following text has been closeley adapted, with permission, from the Code Refinery software testing lesson and the Code Refinery documentation lesson.

Untested software can be compared to uncalibrated detectors

“Before relying on a new experimental device, an experimental scientist always establishes its accuracy. A new detector is calibrated when the scientist observes > its responses to known input signals. The results of this calibration are compared against the expected response.”

[From Testing and Continuous Integration with Python, created by K. Huff]

Simulations and analysis using software should be held to the same standards as experimental measurement devices!

Further motivation for testing:

Testing helps to detect errors before they cause problems

In software tests, expected results are compared with observed results in order to establish accuracy:

As projects grow, it becomes easier to break things without noticing immediately
Software defects can be caused by both human errors and non-controllable events (i.e. environmental conditions)
Testing is essential for research software because we care about reproducibility of scientific results

Testing also encourages others to use your code:

It provides a way for users to see if the code is installed correctly
It allows users to better judge the quality of the code

Finally, testing encourages other developers to contribute to your code as it is easier for external developers to contribute to the project without breaking your code (or at least it is clear when they have broken the code!)

However bear in mind that tested code does not mean the code is perfect; “Program testing can be used to show the presence of bugs, but never to show their absence!” (Edsger W. Dijkstra)

Discussion question: Can you think of examples where it is not necessary to share your code?

Show answer

Some examples where you may choose not to test your code:

A Jupyter notebook which produces a plot and you know by looking at the plot whether it worked
A short, ‘obviously correct’ Python script which you never intend to reuse

Use assertions for things you believe will/should never happen.

def kelvin_to_celsius(temp_k):
    """
    Converts temperature in Kelvin
    to Celsius.
    """
    assert temp_k >= 0.0, "ERROR: negative T_K"
    temp_c = temp_k - 273.15
    return temp_c

Units testing is for small components (units) of a code

def fahrenheit_to_celsius(temp_f):
    """Converts temperature in Fahrenheit
    to Celsius.
    """
    temp_c = (temp_f - 32.0) * (5.0/9.0)
    return temp_c

# This is the test function: `assert` raises an error if something
# is wrong.
def test_fahrenheit_to_celsius():
    temp_c = fahrenheit_to_celsius(temp_f=100.0)
    expected_result = 37.777777
    assert abs(temp_c - expected_result) < 1.0e-6

Unit tests are functions
Unit testing is used to test one unit: for example, a single function

Question: In the example above we want to check if the calculated temp_c is equal to the expected_result. Why do we use the assert statement abs(temp_c - expected_result) < 1.0e-6 instead of the simpler assert statement temp_c == expected_result?

Show answer

Due to the inherent limitations to computational accuracy, we should not test for the exact equality between two floats. Instead we check that they are the same to within a numerical tolerance (in this case 1.0e-6). For more details please see the lesson Evaluating numerical errors and accuracy.

Another commonly used type of testing is called end-to-end. End-to-end tests check if the software is working as a whole, from beginning to end. For example, you input example data at the start of a simulation and then check that the end results of the simulation are correct.

Pytest can be used to implement a Python test suite

Rather than running unit tests one-by-one we can use pytest to automatically find and run all the tests within a project; pytest collects and runs all test functions starting with test_.

In the following steps we will make a simple Python function and use pytest to test it.

Create a new directory and change into it:

mkdir pytest-example
cd pytest-example

Then create a file called example.py and copy-paste the following code into it:

def add(a, b):
    return a + b


def test_add():
    assert add(2, 3) == 5
    assert add('space', 'ship') == 'spaceship'

This code contains one genuine function and a test function. pytest finds any functions beginning with test_ and treats them as tests.

Let us try to test it with pytest:

pytest -v example.py

============================================================ test session starts =================================
platform linux -- Python 3.7.2, pytest-4.3.1, py-1.8.0, pluggy-0.9.0 -- /home/user/pytest-example/venv/bin/python3
cachedir: .pytest_cache
rootdir: /home/user/pytest-example, inifile:
collected 1 item

example.py::test_add PASSED

========================================================= 1 passed in 0.01 seconds ===============================

Yay! The test passed!

Let us break the test! Introduce a code change which breaks the code and check whether pytest detects the change:

pytest -v example.py

============================================================ test session starts =================================
platform linux -- Python 3.7.2, pytest-4.3.1, py-1.8.0, pluggy-0.9.0 -- /home/user/pytest-example/venv/bin/python3
cachedir: .pytest_cache
rootdir: /home/user/pytest-example, inifile:
collected 1 item

example.py::test_add FAILED

================================================================= FAILURES =======================================
_________________________________________________________________ test_add _______________________________________

    def test_add():
>       assert add(2, 3) == 5
E       assert -1 == 5
E         --1
E         +5

example.py:6: AssertionError
========================================================= 1 failed in 0.05 seconds ==============

Notice how pytest is smart and includes context: lines that failed, values of the relevant variables.

Question: In the example above we have compared integers. In this optional exercise we want to learn how to compare floating point numbers since they are more tricky.

The following test will fail and this might be surprising. Try it out:

def add(a, b):
    return a + b


def test_add():
    assert add(0.1, 0.2) == 0.3

Your goal: find a more robust way to test this addition.

Show answer

One solution is to use pytest.approx:

from pytest import approx

def add(a, b):
    return a + b

def test_add():
    assert add(0.1, 0.2) == approx(0.3)

But maybe you didn’t know about pytest.approx: and did this instead:

def test_add():
    result = add(0.1, 0.2)
    assert abs(result - 0.3) < 1.0e-7

This is OK but the 1.0e-7 can be a bit arbitrary.

Documentation comes in different forms

There is nothing in the programming field more despicable than an undocumented program. — Ed Yourdon, Software Engineering pioneer

Tutorials: learning-oriented, allows the newcomer to get started
How-to guides: goal-oriented, shows how to solve a specific problem
Explanation: understanding-oriented, explains a concept
Reference: information-oriented, describes the machinery

These are distinct. For an excellent discussion, please see What nobody tells you about documentation.

There is no one size fits all: however a good starting point is almost always to include a README file with your code. Your code may, depending on the number of users apart from yourself, also include:

Purpose
Authors
License
Recommended citation
Copy-paste-able example to get started
Dependencies and their versions or version ranges
Installation instructions
Tutorials covering key functionality
Reference documentation (e.g. API) covering all functionality
How do you want to be asked questions (mailing list or forum or chat or issue tracker)
FAQ section
Contribution guide

Question: Visit the Github repository for effmass and explore the repository and documentation site.

Which different types (tutorials/how-to/explanation/reference) of documentation can you find?
Are all the items in the bullet point list included?

Show answer

- There is a README file with basic installation how-to instructions and an overview of the functionality.
- There are notebook tutorials for allowing newcomers to get started.
- There is API reference information here which describes how each function works.
- There is a paper which explains some of the concepts behind the code (for example the different definitions of effective mass).
All of the bullet points on the list are included except i) there is no FAQ ii) the example is a Jupyter notebook and is not quickly copy-pastable (you would end up with the code and the surrounding text, which would then need to be removed).

The README file is important for users

The README file is important as it is usually the first thing people look at when visiting a code repository.
Use it to communitcate important information about your project.
Write your README using a markup language such as Markdown. Github will automatically format your markdown file when you push it to your Github repository.
The README should be in the top-level of your repository
To make your README extra friendly you can use emojis.
A README file might contains: A summary of the software purpose, author list, a link to the license file, a recommended citation, installation instructions and a list of code dependencies (e.g. numpy, matplotlib), a link to the issues tracker, and any other contact details.

In-code documentation is important for code developers

In-code documentation are very useful for people wanting to contribute to your code.
They can also be used to auto-generate online documentation for functions/classes (for more information see the extension task below).
The most commonly used Python in-code documentation are comments starting with # and Python docstrings

Question: Which of the comments below is the best and why?

# Now we check if temperature is larger then -50:
if temperature > -50:
    print('do something')

# We regard temperatures below -50 degrees as measurement errors
if temperature > -50:
    print('do something')

Show answer

The first comment describes what happens in this piece of code, whereas the second describes why this piece of code is there, i.e. its purpose. Comments like the second comment are much more useful.

A docstring is a structured comment associated to a segment of code (i.e. a function or class).

We have already written basic docstrings. Here we will see how to write Google style docstrings. Let’s look at the following function:

def mean_temperature(temperatures):
    return sum(temperatures)/len(temperatures)

We can make it clearer what this function does and how to use it using a docstring. A good docstring describes:

What the function does
What goes in (including the type of the input variables)
What goes out (including the return type)

def mean_temperature(temperatures):
    """
      Get the mean temperature

      Args:
          data (list): A list with air temperature measurements.

      Returns:
          The mean air temperature (float)
    """
    return sum(temperatures)/len(temperatures)

Two key advantages of docstrings are:

Python parses docstrings, for example calling the help function will display the docstring: try help(mean_temperature)
You can generate your documentation as you are generating the code. In fact, thinking carefully about what the input/output/behaviour should be may encourage you to write better code!

TASKS

Write and run a unit test for the Lorenz() function in the Lorenz.py script you developed in a previous lesson.
Write a docstring for the Lorenz() function in the Lorenz.py script. Follow the Google docstring format as in the example above.
Upload your new file to the remote Github repository you created in the previous lesson.

EXTENSION TASKS: Automate everything!

Automated testing using Continuous Integration allows us to automatically run tests when there is a commit to a Github repository. Following the tutorial from The Code Refinery, implement continuous integration for a Github repository holding Lorenz.py (this could be the Github repository you created in the previous lesson). Note that you will have to adapt the workflow file so that pip installs the python packages imported in your script (for Lorenz.py the packages are numpy and matplotlib).
API (Application Programming Interface) documentation can be automatically generated using a tool such as pdoc3. These tools read in all of the docstrings within a Python package and generate a webpage accordingly. For example, see the API documentation for the effmass project. Automatic API documentation is a real gamechanger; as long as we write update the docstrings alongside our code, we can quickly generate new documentation. Following the documentation on pdoc3 generate a html page with the API documentation for Lorenz.py. In the next lesson you will learn how to host this page using Github Pages.