How does AirRAG use Monte Carlo Tree Search (MCTS)?

Question

Answers ( 1 )

    0
    2025-03-28T02:29:31+00:00

    AirRAG uses Monte Carlo Tree Search (MCTS) to generate, expand, and backtrack the reasoning process. It employs the Upper Confidence Bound applied to trees (UCT) for node selection, with the formula: `UCT(s, p) = Q(s, a) / N(s) + w * sqrt(log N(p) / N(s))`, where `Q(s, a)` is the reward, `N(s)` is the number of visits, and `w` balances exploration and exploitation.

Leave an answer