- It is impossible to prepare a clean corpus in advance
- Need a mechanism to put it in the right place and clean it up after the fact.
- Need a mechanism to refine an incomplete corpus, rather than being given a finished corpus and learning from it
- Making humans into components
- It works without humans.
- Elicit information by triggering human action rather than asking people to verbalize it in advance.
This page is auto-translated from /nishio/コーパス精錬 using DeepL. If something looks interesting but the auto-translated English is not good enough to understand, feel free to let me know at @nishio_en. I’m very happy to spread my thoughts to non-Japanese readers.