Schelling Model of Segregation

Background

The 1 November 2024 edition of the Daily Skimm discussed an article in the New York Times that analysed the movement of voters in the United States. The article found that voters are increasingly moving to neighbourhoods where the majority of residents share their political views.

This change is leading to increased polarisation in the United States (not to mention that some do not think being an adjudicated rapist is a barrier to being elected).

There are a number of models that show how segregation can appear even when the individuals in the system have only mild preferences. One of these is the Schelling model, an agent-based model where:

The world contains a discrete spatial grid.
Agents are placed randomly on the grid, with at most one in each grid location.
Agents come in two colors: red and blue.
At each step, each agent is happy if a certain number of their possible neighbours have the same color, and unhappy otherwise.
Unhappy agents will pick a random empty cell to move to each step.
The model keeps running until there are no unhappy agents.

By default, the number of similar neighbours the agents need to be happy is set to 3 and they have up to 8 possible neighbours (Moore neighbourhood). That means the agents would be perfectly happy with a majority of their neighbours being of a different color (e.g. a Blue agent would be happy with five Red neighbours and three Blue ones). Despite this, the model consistently leads to a high degree of segregation, with most agents ending up with no neighbours of a different color.
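The default rule can be checked with a few lines of Python. This is only a minimal sketch of the counting rule described above, not part of the model code; the function name `is_happy_count` and its arguments are illustrative assumptions.

```python
def is_happy_count(own, neighbours, threshold=3):
    """Return True if at least `threshold` neighbours share the agent's color."""
    same = sum(1 for color in neighbours if color == own)
    return same >= threshold


# A Blue agent with five Red and three Blue neighbours is in the minority,
# yet it still meets the default threshold of 3.
print(is_happy_count("blue", ["red"] * 5 + ["blue"] * 3))  # True

# With only two Blue neighbours the same agent is unhappy.
print(is_happy_count("blue", ["red"] * 6 + ["blue"] * 2))  # False
```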

Before we start implementing the model, you should seriously have a look at the blog entry created by Vi Hart and Nicky Case. Vi Hart produces amazing content and is well worth following.

Finally, if you are interested in reading the original paper on this, see Schelling, Thomas C. Dynamic Models of Segregation. Journal of Mathematical Sociology. 1971, Vol. 1, pp 143-186.

Implementation

The Schelling model traditionally talks about agents having different colours. I switched to state so that my implementation is more similar to the Game of Life and Fire models. The Agent class has a state attribute that is either JETS or SHARKS. (I did not want to use RED and BLUE.)

There are two differences between this model and the previous models:

The first is that the agents are not updated simultaneously. Instead, we update the agents in a random order. This is because if we update the agents simultaneously, the model will never reach an equilibrium.
The second difference is that the agents move. We will look at two versions of the model:
- The first version is the traditional Schelling model where the agents are moved to a random empty cell.
- The second version is where the agents are moved to a random cell that has at least one neighbour of the same state. This is because if we move the agents to a random empty cell, the model will be slower to reach an equilibrium.

To move an agent to a random empty location we can use the following code:

    self.model.grid.move_to_empty(self)

To move an agent to a random location with at least one neighbour of the same state, we could do something like the following inside the Agent class:

    for _ in range(5):  # try up to 5 times to find a suitable location
        self.model.grid.move_to_empty(self)
        # stop as soon as some neighbour shares this agent's state
        if any(n.state == self.state for n in self.get_neighbors()):
            break

Outline of core classes `Agent` and `Model`

`Agent` class

stores its state, which is either JETS or SHARKS. This corresponds to the colour of the agent in the traditional Schelling model.
has a function, get_neighbors, to return a list of all neighbours.
has a function, is_happy, to check if the agent is happy.
has a function, step, to compute its behaviour based on its state and the state of its neighbours.

class Agent(mesa.Agent):

    def __init__(self, model):
        super().__init__(model)

    def get_neighbors(self): ...

    def is_happy(self): ...

    def step(self): ...

The `Model` class

The Model class stores the model parameters, the model geometry (space/grid), and the agents in the model.

The basic structure of this class is

class Model(mesa.Model):

    def __init__(self, seed=None):
        super().__init__(seed=seed)

        # store model parameters

        # create the model space/grid

        # create agents in the model

        # setup data collector

        # init simulation

    def step(self):
        self.agents.shuffle_do("step")

Implementing the Schelling Model

`Agent` class

The get_neighbors method:

is the same as that used in the Game of Life or in the Fire model.
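To make the geometry concrete, the sketch below lists the cells that count as neighbours of a given cell in a Moore neighbourhood on a grid that wraps around. It does not use mesa and is not the method used in the model; the function name `moore_neighbourhood` is an illustrative assumption, and the grid is assumed to be at least 3 cells wide and high so that no cell is listed twice.

```python
def moore_neighbourhood(x, y, width, height):
    """Return the 8 cells surrounding (x, y) on a grid with wrap-around edges."""
    cells = []
    for dx in (-1, 0, 1):
        for dy in (-1, 0, 1):
            if dx == 0 and dy == 0:
                continue  # skip the centre cell itself
            # the modulo makes coordinates wrap around at the edges
            cells.append(((x + dx) % width, (y + dy) % height))
    return cells


# The corner cell (0, 0) of a 5x5 grid has neighbours on the opposite edges.
print(moore_neighbourhood(0, 0, 5, 5))
# [(4, 4), (4, 0), (4, 1), (0, 4), (0, 1), (1, 4), (1, 0), (1, 1)]
```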

The is_happy method:

gets its list of neighbours
counts the number of neighbours that have the same state as the agent
checks whether the proportion of neighbours that have the same state as the agent is at least as large as the model threshold parameter, model.pr_same; if so, the agent is happy.
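The three steps above can be sketched as a plain function. This is a minimal illustration rather than the method itself: it takes the states directly instead of reading them from agents, and treating an agent with no neighbours as happy is an assumption, since that case is not specified here.

```python
def is_happy(own_state, neighbour_states, pr_same):
    """Return True if the share of same-state neighbours is at least pr_same."""
    if not neighbour_states:
        return True  # assumption: an agent with no neighbours is happy
    same = sum(1 for state in neighbour_states if state == own_state)
    return same / len(neighbour_states) >= pr_same


neighbours = ["JETS", "SHARKS", "SHARKS"]
print(is_happy("JETS", neighbours, pr_same=0.3))  # True, since 1/3 >= 0.3
print(is_happy("JETS", neighbours, pr_same=0.5))  # False, since 1/3 < 0.5
```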

The step method:

checks if the agent is happy
if the agent is not happy, it moves to a random empty location, using the self.model.grid.move_to_empty method.

`Model` class

Store model parameters

The Model class has parameters width, height, density, pr_minority, and pr_same. The density parameter is the proportion of cells that are occupied by agents. The pr_minority parameter is the proportion of agents in the minority class, JETS, and the pr_same parameter is the proportion of neighbours that need to have the same state as the agent for the agent to be happy.

Create the model space/grid

The model space is a SingleGrid with toroidal boundary conditions. This is the same as that used in the Game of Life.

Create agents in the model

Creating the agents is done using a loop similar to the one in the Fire model: we iterate over the grid locations and create an agent at each location with probability density. When creating an agent we set its state to JETS with probability pr_minority, and to SHARKS otherwise.
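The placement logic can be tried out without mesa. The sketch below uses a plain dictionary in place of the grid and strings in place of agents, so the function name `populate` and the dictionary layout are illustrative assumptions; only the two random draws per cell mirror the description above.

```python
import random

JETS, SHARKS = "JETS", "SHARKS"


def populate(width, height, density, pr_minority, seed=None):
    """Return a dict mapping occupied (x, y) cells to a state."""
    rng = random.Random(seed)
    cells = {}
    for x in range(width):
        for y in range(height):
            # first draw: is this cell occupied at all?
            if rng.random() < density:
                # second draw: which group does the new agent belong to?
                cells[(x, y)] = JETS if rng.random() < pr_minority else SHARKS
    return cells


cells = populate(20, 30, density=0.5, pr_minority=0.4, seed=667)
jets = sum(1 for state in cells.values() if state == JETS)
print(f"{len(cells)} agents on {20 * 30} cells, of which {jets} are JETS")
```

Because each cell is filled independently, the number of agents varies from run to run around width * height * density rather than matching it exactly.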

Init simulation

We will add stuff here later, when we add data collection facilities.

Visualisation

The visualisation and animation are practically the same as in the Fire model; the only addition is the letter X for agents that are not happy.

Data Collector

The mesa library has a DataCollector class that can be used to collect data during the simulation. We will add a data collector to collect the number of happy agents at each step. This will allow us to plot the number of happy agents over time.

Adding `running` flag to `Model` class

In Model.__init__, in the "# init simulation" section, we set the running flag to True to indicate that the model is still running. This flag will be set to False when there are no unhappy agents.
In Model.step we first check whether the model is still running; if not, we return immediately. Otherwise we perform the model step and then check whether there are any unhappy agents; if there are none, we set the running flag to False.

    def step(self):
        if not self.running:
            return

        self.agents.shuffle_do("step")

        # keep running while at least one agent is unhappy
        self.running = any(not a.is_happy() for a in self.agents)

Now, with running flag implemented we can run a simulation as follows

    model = Model(width=20, height=30, density=0.5, seed=667)
    while model.running:
        model.step()

Adding `moved` counter variable to `Model` class

Instead of (or in addition to) counting the number of unhappy agents at each step, we can add a counter variable, moved, to the Model class that is incremented each time an agent moves. This counter is reset at the start of each Model.step call. So:

In Model.__init__, in the "# init simulation" section, we set the moved counter to zero.
In Model.step we first reset the moved counter to zero.
Then we increment the counter each time an agent is moved, inside the Agent.step method.
We set the running flag to False when moved is zero at the end of the step, so the simulation stops when no agents have moved rather than when no agents are unhappy. So we use

    self.running = (self.moved!=0)
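To see the random activation order, the moved counter, and the running flag working together, here is a small stand-alone sketch that uses only the standard library. It is not the mesa implementation: the class name `ToySchelling`, the dictionary used as a grid, and the rule that an agent with no neighbours is happy are all assumptions made for illustration.

```python
import random

JETS, SHARKS = "JETS", "SHARKS"


class ToySchelling:
    """Stdlib-only stand-in for the mesa Model (illustrative, hypothetical)."""

    def __init__(self, width=10, height=10, density=0.5,
                 pr_minority=0.5, pr_same=0.3, seed=None):
        self.width, self.height, self.pr_same = width, height, pr_same
        self.rng = random.Random(seed)
        self.cells = {}  # (x, y) -> state, occupied cells only
        for x in range(width):
            for y in range(height):
                if self.rng.random() < density:
                    minority = self.rng.random() < pr_minority
                    self.cells[(x, y)] = JETS if minority else SHARKS
        self.steps = 0
        self.moved = 0
        self.running = True

    def is_happy(self, pos):
        x, y = pos
        near = [self.cells.get(((x + dx) % self.width, (y + dy) % self.height))
                for dx in (-1, 0, 1) for dy in (-1, 0, 1) if (dx, dy) != (0, 0)]
        near = [state for state in near if state is not None]
        if not near:
            return True  # assumption: no neighbours means happy
        same = sum(1 for state in near if state == self.cells[pos])
        return same / len(near) >= self.pr_same

    def step(self):
        if not self.running:
            return
        self.moved = 0  # reset the counter at the start of each step
        order = list(self.cells)
        self.rng.shuffle(order)  # agents act one at a time, in random order
        for pos in order:
            if self.is_happy(pos):
                continue
            empty = [(x, y) for x in range(self.width)
                     for y in range(self.height) if (x, y) not in self.cells]
            if not empty:
                continue  # nowhere to move to
            state = self.cells.pop(pos)
            self.cells[self.rng.choice(empty)] = state
            self.moved += 1
        self.steps += 1
        self.running = self.moved != 0  # stop once nobody moved


model = ToySchelling(width=10, height=10, density=0.5, seed=667)
while model.running and model.steps < 1000:  # cap the run as a safety net
    model.step()
print(f"stopped after {model.steps} steps")
```

The final step of a finished run is one in which every agent was checked and none moved, which is why the moved counter can replace a separate count of unhappy agents as the stopping test.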

Setting up the data collector

To use the DataCollector class, we create an instance of it in the Model.__init__ method, where we list the information we want to collect. We can use the DataCollector to collect the number of steps, the number of agents that have moved, and the number of unhappy agents at each step.

    self.datacollector = mesa.DataCollector(
        model_reporters = {
            "steps": "steps",
            "moved": "moved",
            "unhappy": lambda m: sum(1 for a in m.agents if not a.is_happy()),
        },
    )

Then in Model.__init__ we add the following line to record the initial state, before any step has been taken:

        self.datacollector.collect(self)

Finally, in the Model.step method we add the following line to collect the data at each step:

        self.datacollector.collect(self)
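Conceptually, each call to collect evaluates every reporter against the model and stores the results as one row. The sketch below imitates that idea in plain Python. It is not mesa's implementation: the function `collect`, the class `FakeModel`, and its `unhappy_count` attribute are hypothetical stand-ins, and the values are taken from one row of the example output in this section.

```python
def collect(model, reporters):
    """Build one row of data from a dict of reporters."""
    row = {}
    for name, reporter in reporters.items():
        if isinstance(reporter, str):
            row[name] = getattr(model, reporter)  # a string names an attribute
        else:
            row[name] = reporter(model)  # a callable is given the model
    return row


class FakeModel:
    """Hypothetical stand-in for the real Model."""
    steps = 2
    moved = 31
    unhappy_count = 23


reporters = {
    "steps": "steps",
    "moved": "moved",
    "unhappy": lambda m: m.unhappy_count,
}
print(collect(FakeModel(), reporters))
# {'steps': 2, 'moved': 31, 'unhappy': 23}
```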

Getting data from the data collector

The DataCollector class collects the data in a pandas DataFrame. We can access the data using the get_model_vars_dataframe method. This method returns a DataFrame with the data collected.

    df = model.datacollector.get_model_vars_dataframe()
    df.head()

       steps  moved  unhappy
    0      0      0       48
    1      1     43       31
    2      2     31       23
    3      3     22       14
    4      4     17       13

Plotting the data

We can plot the data using the matplotlib library. For example, we can plot the number of unhappy agents and the number moved over time using the following code:

import matplotlib.pyplot as plt

plt.plot(df.steps, df.unhappy, label='unhappy')
plt.plot(df.steps, df.moved, label='moved')
plt.legend()
plt.title(f"Agents behaviour step (pop={len(model.agents)})")
plt.show()

This will produce a plot similar to the following:

Batch Running

The mesa library also has a batch_run function that can be used to run multiple simulations with different parameters. For example, we can use batch_run to run the Schelling model with different values of the density parameter. So we could write

import numpy as np

params = { "width": 10, "height": 10, "density": np.linspace(0.1,0.6,11) }

and then run the simulations using the batch_run function:

import pandas as pd

results = mesa.batchrunner.batch_run(
    Model,
    parameters=params,
    iterations=50,
    max_steps=1000,
    display_progress=True,
)
df = pd.DataFrame(results)

This code creates a DataFrame with the results of the simulations. The DataFrame has a row for each simulation and a column for each parameter and model variable that was collected.

df.head()

       RunId  Step  width  height  density  steps  moved
    0      0     1     10      10      0.1      1      3
    1      1     1     10      10     0.15      1      1
    2      2     3     10      10      0.2      3      1
    3      3     6     10      10     0.25      6      1
    4      4     4     10      10      0.3      4      2
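The number of rows follows from how the parameter dictionary is expanded: every combination of parameter values is run, and each combination is repeated once per iteration. The sketch below reproduces that bookkeeping with the standard library only. The helper `expand` is a hypothetical illustration, not mesa's code, and the density values are written out as a list instead of using np.linspace.

```python
from itertools import product

params = {
    "width": 10,
    "height": 10,
    "density": [round(0.1 + 0.05 * i, 2) for i in range(11)],  # 0.1 to 0.6
}
iterations = 50


def expand(params):
    """Yield one dict of keyword arguments per parameter combination."""
    names = list(params)
    # wrap fixed values in a list so that every parameter can be iterated
    values = [v if isinstance(v, (list, tuple)) else [v] for v in params.values()]
    for combo in product(*values):
        yield dict(zip(names, combo))


combos = list(expand(params))
print(len(combos))               # 11 parameter combinations
print(len(combos) * iterations)  # 550 runs in total
```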

We could generate a boxplot of the number of steps in the simulation by density using the following code:

df.boxplot(column='steps', by='density')

positions = [1+p for p in range(df.density.nunique())]
labels = [f"{v:.2}" for v in  df.density.unique()]
plt.suptitle("Number of steps in simulation by density")
plt.xticks(positions, labels)
plt.show()

This will produce a plot similar to the following:

As the population density increases, so does the number of steps needed before all agents are happy.
