🧮

Demonstration

Reasoning with language models is currently one of the most important and challenging areas in artificial intelligence. Reasoning tasks involve analyzing information and drawing logical conclusions from it, which goes beyond simple information retrieval or text generation.
🤖
The result of multiplying 57 and 32 is 1824.
Models can now solve simple problems like this easily. Let's try a slightly harder one.
🤖
Let's add odd numbers among the given numbers:
Odd numbers: 343, 1, 423, 3, 433, 21, 51
Adding: 343 + 1 + 423 + 3 + 433 + 21 + 51 = 1675
Therefore, if you add the odd numbers among the given numbers, you get 1675.
Is this correct? No, it is wrong: the correct answer is 1275. Why does this happen?
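We can check the model's claimed sum ourselves by adding the listed odd numbers directly:

```python
# Add the odd numbers the model listed, one at a time.
odds = [343, 1, 423, 3, 433, 21, 51]
total = 0
for n in odds:
    total += n
print(total)  # 1275, not the 1675 the model reported
```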
Language models, especially Large Language Models (LLMs), have the ability to process various types of data and extract meaning from complex contexts. However, these models still have limitations in performing inference tasks.
Text-based training: Language models are trained mainly on text data such as sentences, paragraphs, and conversations. This data reflects the meaning, context, and syntax of language, emphasizing linguistic meaning rather than numerical computation or precise mathematical processing.
Statistical inference: Language models use statistical methods to predict the most likely next word or sentence. This is effective for recognizing and imitating patterns in language, but limited for complex mathematical calculation or logical reasoning.
Data diversity and limitations: If the training data does not contain enough examples of numerical computation or mathematical problem solving, the model may struggle with these types of problems.
Language-centric design: Language models are designed primarily for language understanding and generation; mathematical ability is not their primary purpose or function.
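The point about statistical prediction can be illustrated with a toy example (the probabilities below are invented for illustration, not taken from any real model):

```python
# A language model selects the statistically most probable next token,
# which need not be the arithmetically correct one.
next_token_probs = {"1675": 0.40, "1275": 0.35, "1875": 0.25}  # hypothetical values
prediction = max(next_token_probs, key=next_token_probs.get)
print(prediction)  # prints "1675": plausible-looking, but wrong
```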
Of course, AI researchers have not left the problem there. GPT-4 and similar systems address it by generating code that calls libraries specialized for computation and inference; when the problem above is actually run on GPT-4 this way, it arrives at the correct answer.
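The code such a model generates might look roughly like this (a sketch under the assumption that the relevant numbers are the odd ones listed above; the actual code GPT-4 writes will differ):

```python
# Filter the odd numbers from the input, then sum them with exact
# integer arithmetic instead of token-by-token prediction.
numbers = [343, 1, 423, 3, 433, 21, 51]
odd_sum = sum(n for n in numbers if n % 2 == 1)
print(odd_sum)  # 1275
```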
Overcoming these limitations calls for more advanced prompt engineering. Prompt engineering is the practice of giving the model clearer, more specific instructions so that it can derive the desired information or answer more accurately. For example, it involves clearly stating the variables and conditions needed for a complex problem or decision so the model can follow them. Applied to the example above, it works like this.
Before
After
💡
Let's distinguish between odd and even numbers and add the odd numbers in order:
Odd numbers: 343, 1, 423, 3, 433, 21, 51
Adding: 343 + 1 + 423 + 3 + 433 + 21 + 51 = 1275
Therefore, if you add up all odd numbers among the given numbers, you get 1275.
We changed only the prompt, yet the model now arrives at the correct answer. As when teaching a person, having the model work through a problem step by step leads to faster understanding and higher-quality results.
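A minimal sketch of the prompt change involved; the exact wording of both prompts is an assumption, since the original prompts are not reproduced here:

```python
# Hypothetical before/after prompts illustrating step-by-step instruction.
before = "Add the odd numbers among the given numbers."
after = (
    "First, list which of the given numbers are odd. "
    "Then add the odd numbers one at a time, showing each step. "
    "Finally, state the total."
)
print(after)
```

The improved prompt breaks the task into explicit stages, which encourages the model to produce intermediate steps instead of predicting the final number in one shot.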
Reasoning with language models is still a developing area, and doing it effectively requires the ability to give clear instructions. Techniques like these draw out the model's full reasoning ability and can help solve complex problems.
ⓒ 2023. Haebom, all rights reserved.
The source must be credited; commercial use requires the permission of the copyright holder.