from Talk to the City Try Polis is Opinion Clustering, Talk to the City is Topic Clustering
TTTC is not âclustering of opinionsâ but âclustering of topicsâ - I thought this might be the case when I saw Talk to the City clustering, but was captured by the concrete facts. - BERTopic, TF-IDF, and the actual implementation uses CountVectorizer.
- What this means is that âban Xâ and âdo not hinder Xâ fall into the same cluster.
-
- The Copyright Act, Article 30-4. restricts copyright with respect to (Use not for the purpose of enjoying the ideas or sentiments expressed in the work).
- The former says this provision is bad because it undermines the welfare of creators.
- The latter is saying that if right holders such as the media interpret the proviso of this provision [âHowever, this shall not apply in cases where the interests of the copyright holder would be unreasonably impaired in light of the type and use of the work and the manner of such useâ] in a broad and strong manner, it would prevent âlarge-scale linguistic data collection for AI developmentâ (and is therefore âbadâ). (and therefore bad).
- So heâs saying âlarge scale linguistic data collection for AI developmentâ is Good.
- These two are plotted close together
- There is no mechanism in place to analyze this type of good/bad sentiment (sentiment).
This page is auto-translated from /nishio/PolisăŻæèŠăźăŻă©ăčăżăȘăłă°ăTTTCăŻăăăăŻăźăŻă©ăčăżăȘăłă° using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. Iâm very happy to spread my thought to non-Japanese readers.