pContinuousTranslation image

nishio Iā€™m considering whether to improve the current DeepL-based automatic translation system for Scrapbox or switch to a GPT-based system.

nishio DeepL based method Pros 惻There is a glossary function, so that you can control the way certain words go to certain translations. 惻Implemented a mechanism to replace existing mistranslations without using the API by checking the occurrence points 惻If you work hard, you can have it translated while maintaining its structure (via XML) Cons 惻Hard work is troublesome

nishio GPT-based method Pros 惻Seems great (bluff) 惻 Actually, 3.5 would be cheap. 惻Translation can be adjusted simply by indicating in natural language Cons

nishio The theory is that if you have to work hard either way, you might as well work hard on GPT.

The theory is that the people in DeepL are doing a great job, so we might as well get on with it.

nishio Discussion on whether translation is necessary in the first place.

Instead of translating the Japanese text as it is, it would be better for the LLM to write it in English with that as a reference.

Currently, lines containing Japanese characters are considered Japanese and translated, but English sentences referring to Japanese words, Chinese sentences containing Chinese characters, machine translation of English sentences, etc. are also translated.

nishio If we can create a pipeline of ā€œGPT reads my recent Scrapbox pagesā€, it could be derived from there, like ā€œsend my thoughts and questions about what I read to Slackā€. This canā€™t be done by a translation engine.

nishio.iconLet ā€œDeepL-based methodā€ be X and ā€œGPT-based methodā€ be Y, and abstract them. GPT-4.icon

  • Both X (DeepL-based methods) and Y (GPT-based methods) are means of text translation, but each has its own unique advantages and disadvantages.
  • X is already implemented and has specific control features (control of translation of terms and replacement of mistranslations). This is an effective means of solving existing problems and ensuring consistent quality. However, improving and managing X can be laborious and it requires an investment of time and resources.
  • Y, on the other hand, is an advanced natural language processing tool that allows for flexible and natural translation. It has great evolutionary potential, but requires specific control and understanding to take advantage of its potential.
  • The choice between the two depends on your specific needs, use case, and available resources. Whether you choose X, which requires a higher degree of control and effort, or Y, which offers greater flexibility and evolutionary potential, will depend largely on your goals. nishio.iconMap X and Y by analogy to a region with the same structure. Prohibited Concepts: Translation, AI GPT-4.iconLetā€™s mirror each translation method to another domain: driving a car.
  • X (DeepL-based methodology) is similar to self-driving cars. Self-driving cars follow predefined routes and rules on the road. Its routes and rules can be freely customized to ensure a certain level of safety and predictability. However, new data and rules must be added to accommodate new routes and situations. In addition, each new route requires time and effort.
  • Y (GPT-based method), on the other hand, resembles an experienced human driver. This driver has the ability to adapt to different road conditions and unexpected situations. However, in order to behave optimally on a particular route or situation, that driver must learn from past experience.
  • Viewed using this analogy, which method you choose depends on how much control you want and how much flexibility and adaptability you need. Whether you choose an automated vehicle that follows certain routes and rules (X) or a human driver that learns from experience and adapts to new situations (Y) depends largely on your needs and goals.

I saw this parable and thought GPT-based would be more fun.

  • Not trying to drive the right way on a set path, but on a path we donā€™t yet understand well.

This page is auto-translated from /nishio/pContinuousTranslation2023-06-01 using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. Iā€™m very happy to spread my thought to non-Japanese readers.