🌳

Tree of Thought (ToT) Technique

Tree of Thoughts is a language model prompting technique proposed in 2023 by Yao et al. ("Tree of Thoughts: Deliberate Problem Solving with Large Language Models") and, independently, by Long ("Large Language Model Guided Tree-of-Thought"). It is particularly suited to complex tasks that require strategic thinking and exploration. ToT extends the Chain-of-Thought (CoT) prompting concept by organizing thoughts into a tree-like structure and combining it with a search algorithm for systematic problem solving.
How ToT works
Simply put, the Tree of Thoughts (ToT) technique represents the problem-solving process as a tree in which multiple possibilities are explored to find the best solution. This lets the language model consider several directions at once, like a human, and, when necessary, go back to a previous step and try a different approach.
Why ToT is getting attention
Tree structure: ToT represents the problem-solving process as a tree with multiple paths, where each 'branch' is a single idea or step toward the solution. (Think of the folder structure we commonly see in Explorer.)
Idea generation and evaluation: Just as a person comes up with several ideas for solving a problem and judges which is best, the language model generates several candidate solutions and picks the most promising one.
Exploration and backtracking: The model explores multiple paths toward a solution, going back to a previous step and trying a different direction when necessary.
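The three ideas above can be sketched as a generic depth-first search over partial "thoughts". This is a minimal illustration, not the paper authors' implementation: the `propose`, `is_solution`, and `is_viable` callbacks and the toy digit-sum problem are invented here for demonstration.

```python
from typing import Callable, Iterable, List, Optional

def tree_of_thoughts(
    state: List[int],
    propose: Callable[[List[int]], Iterable[int]],
    is_solution: Callable[[List[int]], bool],
    is_viable: Callable[[List[int]], bool],
    max_depth: int,
) -> Optional[List[int]]:
    """Depth-first search over a tree of partial 'thoughts'.

    Each node is a partial solution. `propose` generates candidate next
    steps, `is_viable` prunes hopeless branches, and returning None from
    a branch makes the caller backtrack and try the next candidate.
    """
    if is_solution(state):
        return state
    if len(state) >= max_depth:
        return None
    for step in propose(state):
        child = state + [step]
        if not is_viable(child):
            continue  # prune: this branch cannot lead to a solution
        found = tree_of_thoughts(child, propose, is_solution, is_viable, max_depth)
        if found is not None:
            return found
    return None  # dead end: caller backtracks

# Toy problem: find three digits (1-9) that sum to exactly 15.
solution = tree_of_thoughts(
    state=[],
    propose=lambda s: range(1, 10),
    is_solution=lambda s: len(s) == 3 and sum(s) == 15,
    is_viable=lambda s: sum(s) <= 15,  # evaluation step: discard overshooting branches
    max_depth=3,
)
print(solution)  # → [1, 5, 9]
```

In an actual ToT prompt, `propose` and `is_viable` would themselves be LLM calls (generate candidate thoughts, score them); the search skeleton stays the same.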
Practical Applications
The problem with CoT is that it cannot backtrack: once the chain starts, you have to follow it all the way to a result. ToT's biggest appeal is that it can backtrack midway and correct course. As mentioned earlier, suppose we give the LLM a math problem it finds difficult. For example, let's say there is a quiz like the one below.
"Fill in the blank cells of the 4x4 Sudoku puzzle."
General method
Progress: The usual way is to find the missing numbers one by one in each row, column, and 2x2 grid and fill in the blanks.
Result: Complete the puzzle by filling in the blanks sequentially.
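The sequential approach can be sketched as a greedy filler that writes the smallest valid number into each blank and never revisits a cell (a hypothetical illustration, not code from the original post; the example puzzle is invented here). Without backtracking, an early choice can leave a later cell with no valid number.

```python
def valid(grid, r, c, v):
    """Check whether value v can be placed at (r, c) in a 4x4 Sudoku."""
    if v in grid[r]:
        return False
    if any(grid[i][c] == v for i in range(4)):
        return False
    br, bc = (r // 2) * 2, (c // 2) * 2  # top-left corner of the 2x2 box
    return all(grid[br + i][bc + j] != v for i in range(2) for j in range(2))

def greedy_fill(grid):
    """Fill blanks (0) row by row with the smallest valid number.

    No backtracking: once a cell is filled it is never revisited,
    so an early choice can strand a later cell with no candidates.
    """
    for r in range(4):
        for c in range(4):
            if grid[r][c] == 0:
                for v in range(1, 5):
                    if valid(grid, r, c, v):
                        grid[r][c] = v
                        break
                else:
                    return False  # dead end: no number fits this cell
    return True

puzzle = [
    [0, 0, 0, 1],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
print(greedy_fill(puzzle))  # → False: the greedy fill gets stuck
```

This puzzle does have a solution, but the greedy pass paints itself into a corner, which is exactly the weakness ToT's backtracking addresses.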
🤖
The 4x4 Sudoku puzzle I created has all the squares blank. So the completed Sudoku board for solving this puzzle would look like this:
1 2 3 4
3 4 1 2
2 1 4 3
4 3 2 1
ToT technique
Add ToT Process Prompt
Step 1: The language model suggests a number to fill the first blank space.
Step 2: Consider the numbers that will go in the following blank spaces.
Backtracking: If at any step the puzzle is determined to be unsolvable, the language model goes back to a previous step and tries a different number.
Final Output: Complete the puzzle by correctly filling in all the blanks.
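The four steps above amount to classic backtracking search. Here is a self-contained sketch of how such a solver might look (an assumed illustration, not the code from the post; the puzzle is invented for demonstration):

```python
def valid(grid, r, c, v):
    """Check whether value v can be placed at (r, c) in a 4x4 Sudoku."""
    if v in grid[r]:
        return False
    if any(grid[i][c] == v for i in range(4)):
        return False
    br, bc = (r // 2) * 2, (c // 2) * 2  # top-left corner of the 2x2 box
    return all(grid[br + i][bc + j] != v for i in range(2) for j in range(2))

def solve(grid):
    """Backtracking search: try each candidate, recurse, undo on failure."""
    for r in range(4):
        for c in range(4):
            if grid[r][c] == 0:
                for v in range(1, 5):
                    if valid(grid, r, c, v):
                        grid[r][c] = v        # Step 1/2: propose a number
                        if solve(grid):
                            return True
                        grid[r][c] = 0        # Backtracking: undo, try next
                return False  # no number fits this cell: caller backtracks
    return True  # Final output: no blanks left, puzzle solved

puzzle = [
    [0, 0, 0, 1],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
solved = solve(puzzle)
for row in puzzle:
    print(row)
```

The `grid[r][c] = 0` line is the key difference from a sequential fill: a bad choice is undone rather than fatal, mirroring how ToT returns to a previous step and tries a different number.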
🤖
I went through the process of solving a 4x4 Sudoku puzzle step by step, and after a total of 14 steps, I filled in all the blanks correctly. The final completed Sudoku puzzle looks like this:
1 2 3 4
3 4 2 1
4 3 1 2
2 1 4 3
Actually, looking at it this way, it's hard to feel the difference. But if you look at the code the model used to solve it, the difference becomes clear.
General prompt mode
ToT method
If we ran this in GPT-3.5, we could see the following:
As the results show, prompting techniques like this can reproduce in GPT-3.5 or LLaMA 2 much of what GPT-4 can do. (GPT-4, for its part, simply solves the puzzle by writing code.)
ⓒ 2023. Haebom, all rights reserved.
It may be used for commercial purposes with permission from the copyright holder, provided the source is cited.