Exploration Bias

Batch MCTS, Part 2

The previous post described a batch-inference MCTS implementation for self-play. Batches consisted of one state from each of N concurrent episodes. Unfortunately that approach doesn’t work for playing a competitive episode since in that case there’s ...

Virgil King

Batch MCTS, Part 1

Virgil King

AlphaZero modifications

Virgil King

Introduction
