2023-02-21
- I was asked this question in person a few days ago and answered it verbally, but I drew the diagram again because I couldn't convey the image in my head in words.
- (I later realized that I had unexpectedly done "What the language model cannot do.")
- For Thumbnail
Write down your thoughts on the question, "Can creating a language model specifically for the Japanese language give it an advantage over existing language models due to circumstances unique to the Japanese language?"
-
Q: Isn't the Japanese language more expressive than English?
-
A: I don't think so.
- Natural language has developed for the purpose of communication between homo sapiens
- So "getting the message across" is most important.
- The average listener's ability to understand is the bottleneck, so the expressive power of a language does not rise above it.
- If the "average listener's ability to understand" does not differ significantly by ethnicity or race, then the natural languages used by those groups will have roughly the same expressive power.
-
- (In science fiction terms, people who have lived in a harsh colony for generations might have a higher level of comprehension than the average homo sapiens, but, well, it's not much different on Earth today.)
- For example, in mathematics, artificial vocabulary and syntax are used to distinguish "concepts that are used vaguely and indistinguishably in natural language."
- This artificial language need not be understood by the average homo sapiens.
- So it can have higher expressive power than natural language.
- However, expressive power in areas it is not interested in is sometimes discarded.
- Both English and Japanese are natural languages, so their expressive power is capped in the same way.
- Both are easy to understand compared with "the type of language that can only be understood by trained experts," such as technical discussions among mathematicians or descriptions of the behavior of complex algorithms.
- Side-by-side view
- Natural language has developed for the purpose of communication between homo sapiens
-
There is a difference, or there isn't.
-
For example, in Japanese the subject is often omitted as a matter of course, so it is easy to drop it.
-
This, in turn, means that it is easy to overlook the subject distinction.
-
This means that there is a cultural difference in what is chopped up into small pieces and what is chopped up into large pieces in articulating the world.
-
It is not that one language is superior to the other.
-
- A person of one culture looks at the language of another culture and thinks, "How foolish to call it B without distinguishing between B and C," but people of the other culture think likewise, "How foolish to call it D without distinguishing between C and D."
- There won't be much difference in the performance of the human brain, so there won't be much difference in the number of pieces the world as a whole is carved into.
- concrete example
-
- I told the AI that it was an A. It said no, it's a B.
-
Using multiple languages, with the most finely divided language for each domain, provides a better understanding of the world than any single language.
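As a rough illustration of this point, the toy sketch below treats each language as a mapping from things to words. The two dictionaries and the items `b`, `c`, `d` are hypothetical and chosen only for illustration: one language merges b and c, the other merges c and d. Counting the distinct label combinations shows that using both languages together separates more things than either does alone.

```python
# Toy model (assumption): a "language" is a dict from things to the word used for them.
# The items and words are hypothetical and only illustrate the argument.
lang_1 = {"b": "B", "c": "B", "d": "D"}  # does not distinguish b from c
lang_2 = {"b": "B", "c": "D", "d": "D"}  # does not distinguish c from d


def distinct_categories(items, *languages):
    """Count how many items can be told apart using all given languages together."""
    labels = {tuple(lang[item] for lang in languages) for item in items}
    return len(labels)


items = ["b", "c", "d"]
alone_1 = distinct_categories(items, lang_1)
alone_2 = distinct_categories(items, lang_2)
together = distinct_categories(items, lang_1, lang_2)
print(alone_1, alone_2, together)
```

Each language alone carves the three items into the same number of pieces, which matches the claim that neither is superior; only the combination separates all of them.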
-
- Related: Cognitive Resolution.
- To use a high school math analogy, if you know both vectors and complex numbers, when you want to rotate a vector, you can convert it to a complex number, rotate it, and then convert it back. (Vector and complex number metaphor)
- By mapping a problem to the system where it is easiest to operate on, operating on it there, and then mapping the result back, it is easy to do things that were difficult to do in the original system.
- Better to have several different systems.
- Ancient India, whose thought developed before European ideas flowed in and did not flow back to Europe much due to high translation costs.
- The Muslim world, which had an incentive not to share knowledge due to religious opposition to Christianity.
- It would be interesting to see better language models and low-cost references to literature from these cultures.
- It is also worth noting that these areas are still likely to see population growth
- A Japanese person asks a question in Japanese, and the AI thinks once in Arabic, then thinks in Pali based on the result, and finally answers by translating the conclusion into Japanese, saying, "This person only understands Japanese, so it can't be helped."
-
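The vector and complex number analogy above can be sketched in a few lines. This is a minimal sketch using only Python's standard library; the function name `rotate_via_complex` is made up for illustration. It maps a 2D vector into the complex plane, where rotation is a single multiplication, and then maps the result back.

```python
import cmath
import math


def rotate_via_complex(x, y, angle_rad):
    """Rotate the 2D vector (x, y) counterclockwise by angle_rad radians."""
    z = complex(x, y)                      # map the vector into the complex plane
    z_rotated = z * cmath.exp(1j * angle_rad)  # rotation is one multiplication there
    return (z_rotated.real, z_rotated.imag)    # map the result back to a vector


# Rotating the unit vector along the x axis by 90 degrees
# should give (approximately) the unit vector along the y axis.
rx, ry = rotate_via_complex(1.0, 0.0, math.pi / 2)
print(rx, ry)
```

The same rotation can be written directly on vectors with sines and cosines, but the detour through complex numbers turns it into one operation, which is the point of having several systems available.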
Q: Is there value in a better language model for Japanese?
- A: Yes.
- This does not mean that "Japanese-only models have value."
- What matters is having a thicker pipeline to the "AI that thinks across languages" that will keep growing in the future.
- "AI that thinks across languages" is like a newly discovered oil field, with value bubbling up.
- Users of languages with narrower pipes cannot enjoy much of the value that springs from it.
Related
- Work to open up the forest of what has yet to be written.
- In the process of trying to summarize the contents of this page (Worldview after LLM), the language
This page is auto-translated from /nishio/日本語言語モデルについて考えたこと using DeepL. If you find something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I'm very happy to spread my thoughts to non-Japanese readers.