Web Server

Overview

tl;dr: Runserver v. Gunicorn

The runserver management command, producing the URL http://127.0.0.1:8000/, is useful only in local development environments. Gunicorn, for synchronous operations, is more suited for production environments.

What is a web server? Testdriven.io makes a colorful introduction to the concept:

Imagine for a moment that you are a web server, like Gunicorn. Your job consists of the following parts:

You sit around and wait patiently for a request from some kind of a client.

When a client comes to you with a request, you receive it.

Then, you take this request to someone called PythonApp and say to him, "Hey dude, wake up! Here's a request from a very important client. Please, do something about it."

You get a response from this PythonApp.

You then deliver this response back to your client.

This is the only thing you do. You just serve your clients. You know nothing about the content or anything else. That's why you are so good at it. You can even scale up and down processing depending on the demand from the clients. You are focused on this single task.

More specifically, gunicorn is a WSGI - a web server gateway interface; a web server would be something like nginx. And this gateway interface is what python apps like Django/Flask interface with to reach the actual web server:

pyproject.toml include's gunicorn
[tool.poetry.dependencies]
python = "^3.11.3"
django = {version = "^4.2.1", allow-prereleases = true}
gunicorn = "^20.1"
...

How I presently understand the relationship:

flowchart LR
    subgraph server-side
        subgraph python
            gunicorn---django
        end
        server(((web server)))<--->python
    end
    subgraph client-side
        browser<--http request-response--->server
    end

Gunicorn receives requests and processes it through workers. Based on its docs:

Gunicorn is based on the pre-fork worker model. This means that there is a central master process that manages a set of worker processes. The master never knows anything about individual clients. All requests and responses are handled completely by worker processes.

Concretizing this description, I create this visual, mental model:

flowchart LR
    subgraph gunicorn-master-process
        subgraph django-app-worker-process-1
            d1req(request: get endpoint '/about/')
            d1res(response: render html template)
            d1req--route request url to response view-->d1res
        end
        subgraph django-app-worker-process-2
            d2req(request: search 'hello world')
            d2res(response: query db, show results)
            d2req--route request url to response view-->d2res
        end
        subgraph django-app-worker-process-3
            d3req(request: user signup)
            d3res(response: send email, request confirmation)
            d3req--route request url to response view-->d3res
        end
    end
    nginx(((web server)))<--request-response managed by gunicorn--->django-app-worker-process-1
    nginx(((web server)))<--request-response managed by gunicorn--->django-app-worker-process-2
    nginx(((web server)))<--request-response managed by gunicorn--->django-app-worker-process-3

Runserver using config.wsgi

How is runserver related to config.wsgi?

According to docs:

runserver: ... This server uses the WSGI application object specified by the WSGI_APPLICATION setting.

And in our settings, we see that

config/settings/_settings.py

WSGI_APPLICATION = "config.wsgi.application"

Gunicorn using config.wsgi

Async

Lately, Django's interest has veered towards ASGI - asynchronous server gateway interface - as well. See Mariusz Felisiak's initial take on this in Running Tasks Concurrently in Django Asynchronous Views. This boilerplate implementation is limited to the synchronous processes.

gunicorn replaces the built-in python manage.py runserver since the latter should NOT run in production. Django's warning is explicit:

runserver: ... DO NOT USE THIS SERVER IN A PRODUCTION SETTING. It has not gone through security audits or performance tests. (And that’s how it’s gonna stay. We’re in the business of making web frameworks, not web servers, so improving this server to be able to handle a production environment is outside the scope of Django.)

The gunicorn docs contains a specific section on how it integrates with Django:

Gunicorn will look for a WSGI callable named application if not specified. So for a typical Django project, invoking Gunicorn would look like:

$ gunicorn myproject.wsgi

scripts/web.sh
gunicorn config.wsgi:application \
    --bind 0.0.0.0:"$PORT" \  # (1)
    --workers=2 \ # (2)
    --worker-tmp-dir /dev/shm \ # (3)
    --capture-output \
    --enable-stdio-inheritance

Instead of running the server in address 127.0.0.1 with port 8000; we use --bind 0.0.0.0 to an environment variable PORT. Why 0.0.0.0? Itamar Turner-Trauring explains this in relation to Docker here and concludes with 0.0.0.0 means "listen on all interfaces".
The most relevant setting affects the app's scalability are the type and number of workers.

Re: type of worker, Gunicorn docs state:

The most basic and the default worker type is a synchronous worker class that handles a single request at a time. This model is the simplest to reason about as any errors will affect at most a single request. Though as we describe below only processing a single request at a time requires some assumptions about how applications are programmed.

sync worker does not support persistent connections - each connection is closed after response has been sent (even if you manually add Keep-Alive or Connection: keep-alive header in your application).

Re: number of workers, Gunicorn docs warn:

DO NOT scale the number of workers to the number of clients you expect to have. Gunicorn should only need 4-12 worker processes to handle hundreds or thousands of requests per second.
Helps avoid blocking requests. See gunicorn docs and Itamar Turner-Trauring's notes.

Re: config.wsgi:application.

We know, by now, that config refers to the project folder. The config folder contains a wsgi.py file. This was originally created when running django-admin startproject config. This wsgi.py file refers to an application:

/config/wsgi.py
import os
from django.core.wsgi import get_wsgi_application

os.environ.setdefault("DJANGO_SETTINGS_MODULE", "config.settings")
application = get_wsgi_application()

This "application callable", as described, points to the Python path that the Django application will use.

WSGI_APPLICATION: The full Python path of the WSGI application object that Django’s built-in servers (e.g. runserver) will use. The django-admin startproject management command will create a standard wsgi.py file with an application callable in it, and point this setting to that application.