# Reinforcement learning in optimizing forest management

Pekka Malo, Olli Tahvonen, Antti Suominen, Philipp Back, and Lauri Viitasaari (Finland)

Canadian Journal of Forest Research, March 2021

Presented by Zachary Sunberg, August 30th, 2023

## Motivation

- Forests have are important sources for building materials (historically for fuel)
- Biodiversity, carbon capture, are becoming more important factors
- Managing involves reasoning about costs, rewards, growth dynamics, fires, etc.
- Previous state-of-the-art-optimization was not scalable

## Contributions

- Bringing Reinforcement Learning to Forest management!
- Biggest stochastic forestry optimization problem ever solved
- Drastically reduced computation time
- No need to simplify models

- Demonstrated that RL converges to previously-known optimal strategy
- Discovered new strategies when the uncertainty of disasters is included
- Explained why some results were expected and some unexpected with a classic principle

## Forestry 101

- Four tree species:
- Norway Spruce
- Silver Birch
- Scots Pine
- European Aspen

- CCF = Continuous Cover Management
- RF = Rotation Forestry
- Central Finland
- Disasters

## Context in Literature

- \(j\) = species
- \(s\) = size class
- \(\tilde{x}_{j,s,t}\) = number of trees
- \(\alpha_{j,s}(\tilde{x}_t)\) = fraction to new size class
- \(\mu_{j,s}(\tilde{x}_t)\) = death fraction
- \(\phi_j(\tilde{x}_t) \geq 0\) = ingrowth rate
- \(\Delta\) = period length (5 years)
- \(r\) = interest rate (discount \(b = 1/(1+r)\))
- \(h\) = number of harvested trees
- \(\pi\) = profit
- \(w\) = "cost of artificial regeneration"

Important Previous Work:

- Faustmann, 1849: Basic optimal rotation problem
- Reed, 1984: Disasters
- Parkatti and Tahvonen, 2020: Nonlinear Programming

## Problem Setting

- State Space: 44-dimensional continuous
- Action Space: 44 continuous dimensions, 2 binary choices

## Problem Setting

## Approach

- Hybrid PPO with GAE

"The forest state variables x for all species j and size classes s are passed (appropriately normalized) to the input layer."

- Relu activations
- Shared layer of 500
- Actor networks each have a layer of 200
- Value network has a layer of 300

## Computational Resources

## Numerical Results: Deterministic Model

BLV = 2336

Close to optimized BLV: 2335

## Numerical Results: Initial Clearcut?

## Numerical Results:

Initial Clearcut?

## Numerical Results:

Stochastic Outcomes with No Disasters

## Numerical Results:

Stochastic Outcomes with No Disasters

Mean: 2357 Euros

Mean: 2355 Euros

Deterministic Mean:

2336 Euros

## Numerical Results:

Stochastic Outcomes with Disasters

0% Fire

1% Fire

\(r\) = 1%

2% Fire

Disaster Probability = 1%

Disaster Probability = 2%

Policy optimized with disasters

Policy optimized with no disasters

Mean 15,151

Mean 14,753

Mean 11,101

Mean 10,060

## Numerical Result Summary

- "Higher interest rate favors CCF"
- Birch is sometimes dominant
- With no disasters, similar value to deterministic
- With disasters, RL significantly better

**Confirmed Previous****Previously Unknown****Confirmed Previous****New Result!**

RL Performance = Deterministic Optimization Performance

RL Performance > Deterministic Optimization Performance

No disasters

Disasters

**Certainty Equivalence!**

## Critique

**Positive**

- Explains RL well to a new field

- Enough details to reproduce (especially problem parameters!)

- Not overly complex

- Well written

**Negative**

- Some minor details missing

- Did not release code?

- Too many results distract from story

## Impact and Legacy

18 citations since 2021 = Pretty Good!

## Contributions (Recap)

- Brought Reinforcement Learning to Forest management!
- Biggest stochastic forestry optimization problem ever solved
- Drastically reduced computation time (170 h -> 6 min)
- No need to simplify models (Could include more stochasticity)

- Demonstrated that RL converges to previously-known optimal strategy
- Discovered new strategies when the uncertainty of disasters is included
- Explained why some results were expected and some unexpected with a classic principle:
**Certainty Equivalence**

#### RL-For-Forest-Management

By Zachary Sunberg

# RL-For-Forest-Management

- 109