2024-06-14 Cybozu Labs Study Session Lecture Presentation by Yasukazu Nishio

  • I’d like to add my verbal questions, but I’m busy and behind.

Shift to the past

current topics

  • 2024-06-06 Takahiro Yasuno to run for Governor of Tokyo

    • In the past, the election period has been a time to “broadcast the candidate’s views,” that is, to tell voters what the candidate thinks one way or the other. I believe that this time around we can use technology to make it a period where people’s opinions are heard. In the world of digital democracy, we call this “[broad listening”. Through this broad listening, I would like to make the election period a time for everyone to think about ideal policies.

    • My number card to verify your identity and achieve one-person, one-vote voting.
      • Direct voting results to capture the will of the people with a high resolution.

    • and
    • Audrey Tang also commented
      • “I like the way Mr. Anno combines broad listening, the my number system, and the open source spirit.”

      • “Mr. Anno is not protesting. I like that very much. He is saying, ‘We can change for the better’.”

  • impressions

    • golden_lucky Plurality, I didn’t expect it to be a realistic option so soon in Japan.

    • nishio Really that()

    • When I first learned about the concept of broad listening, I thought this was a technique that would allow me to “hear the multitude of opinions” at a level that was impossible for a live person.
    • So I thought it was a kind of human enhancement or increased intellectual productivity.
    • With the future development of LLM, the cost of realization will come down and it will become an “easy-to-use technology” in a few years, and in 5 or 10 years, it will be a technology that cannot be ignored by groupware companies like Cybozu.
    • 〜I had been thinking that the time span of the world would be about 1.5 to 2 years, but to come this far in one year, the world is changing at a rapid pace! That’s how I feel.
    • Talk to the City for Governor
    • takahiroanno Public tweets aboutTOKYOAI will be crawled via API and visualized using LLM, Talk to the city, etc. We will try to visualize them using LLM, Talk to the city, etc.

    • I’m helping!
  • Review of [broad listening

  • image

    • Figure from “[Not subjective or objective, but from the subjectivity of one to the subjectivity of many.
  • explanation

    • The development of technology to duplicate and distribute information has made it easier for one person to communicate ideas to a large number of people.
    • However, when a large number of people transmit information, the volume of information received by one person becomes enormous, and the person drowns in the flood of information.
    • In a society and company where people actively communicate, technology that supports listening to the opinions of many will be important.
  • Technical

    • This is a kind of “summary” in the broadest sense
    • But in LLM training, the “summary task” is trained with training data, such as putting in the text of the paper and taking out the abstract portion.
    • This assumes “a well-formed sentence written by one person” as input.
    • Summarizing techniques are needed for “a set of sentences written by many people, each with his or her own writing style.”
    • Trial and error on how to do that is taking place.
  • digression

  • Haruyuki Seki of Code for Japan gave a keynote presentation titled “Updating Society and Democracy through Civic Tech” at the Organized Session “AI and Democracy” of the 2024 National Conference on Artificial Intelligence on 2024-05-31. The presentation was titled “Updating Society and Democracy through Civic Tech”.

    • image
    • image
    • My Hand drawn diagram suddenly appears on a beautiful slide.
    • (I wonder if someone could make a cleaned up version of this in CC0 or something…)
    • (I have given permission to Mr. Seki to use the images in Scrapbox as he likes.)

Review of [Polis

  • image
  • Opinion vectors are “agree (+1)”, “disagree (-1)”, and “don’t know/neutral (0)” to the text.
  • This vector is clustered, and the sentences and other information that caused the clusters to separate are displayed.
  • What we actually did - Polis Experience Report: Should Same-Sex Marriage Be Legalized?
    • image
    • Found a higher percentage of sentences in favor of legalization (73%) than in favor of legalization (73%).
    • Example: Marriage as social proof and marriage as taxation/social security have their own issues to be considered in different layers. : 74% in favor, 11% opposed, 13% neutral
    • 86% in Cluster A, where most people are against same-sex marriage, and 69% in Cluster B, where all are in favor of same-sex marriage, both majorities.
    • In other words, in order to resolve this “pro-same-sex marriage” conflict, it would be better to first proceed along the lines of “let’s separate social proof from financial social security,” which both sides have agreed to do.
    • Incidentally, Same-sex marriages in Taiwan are not kinship.
      • It is clear that many opponents of same-sex marriage are not opposed to same-sex marriage per se, but rather feel, for example, that “if my son marries a man and my son is happy, that is acceptable,” “but I am not sure about this ‘male spouse of my son,’ so I am not comfortable making him a member of my family. (p. 3)
  • Visualization and analysis allow us to observe the conflict composition from a different angle.
  • Polis is used in vTaiwan in 2015.
  • It’s from 2015, so no LLM or other AI-like features.

What is Talk to the City?

  • Audrey Tang (summary): In 2014 it was impossible to gather the opinions of so many people and retain all the nuances, but now, thanks to the “Talk to the City” tool, it is very easy and inexpensive to do so.

  • Talk to the City is an open source analysis tool developed by the AI Objectives Institute (AOI)
    • Purpose of the tool: Improve collective discussion/[collective decision-making
      • Designed to facilitate dialogue with communities, especially cities
    • Collect and analyze Twitter tweets, Polis results, etc.
    • Visualize clusters of opinions and arguments and generate commentary for each cluster
    • This helps people get a fuller picture of the comment distribution and promotes constructive dialogue
  • What is AI Objectives Institute?

How Talk to the City works technically

  • Unlike Polis, input is text data
    • So if you have text data, you can use it for anything.
    • Utilize a variety of data sources, not only Polis, but also Twitter, Google Forms, comment sections of news articles, groupware, etc.
    • As an outlandish experiment, I included each paragraph of the book:. - Visualize the contents of the Plurality book with Talk to the City
      • Useful or not, technically possible.
  • Use LLM to extract concise, readable messages for input text
    • Obtained in JSON list format, so multiple messages may be extracted for a single input.
    • The default is GPT-3.5-turbo with Few-shots as an example.
  • Cluster this set of messages with [BERTopic
    • (2022) BERTopic: Neural topic modeling with a class-based TF-IDF
      • First, embed documents into high-dimensional vectors in the language model
      • UMAP (Uniform Manifold Approximation and Projection) projection to low-dimensional space
      • Hierarchical clustering with HDBSCAN(Hierarchical Density-Based Spatial Clustering of Applications with Noise)
  • Visualization by dropping into 2D space with UMAP
  • TF-IDF, which considers a cluster as a document, and identifies keywords that characterize the cluster.
  • LLM generates cluster commentary

Let’s see it in action

  • 2024-06-12 The first part of the report was published!
    • takahiroanno We extracted comments on the runoff announcement and created an AI-based analysis report.

    • I now have a more concrete picture of where your interests and concerns about me lie. We will also use data and AI to face the thoughts and feelings of the people of Tokyo!

    • takahiroanno Listening to people’s opinions through the use of technology is called “broad listening” in the world of digital democracy. As a concrete example, we have created this analysis report. We will continue to update this report and listen to your opinions!

  • image

Polis 2.0

  • Idea proposed by Audrey Tang
    • It’s like a development code name that has never been officially announced.
    • (Confirmed by mashbean, MODA, Ministry of Digital Development, Taiwan)
  • Create clusters in TTTC from Polis data
  • Allow chat with virtual opinion leaders generated from that cluster
  • image
  • This is what Audrey was talking about in [Plurality Seoul
    • image
    • For many participants, this is the first time interacting with AI in the context of careful deliberation.

    • In other words, instead of talking to the AI alone, we are discussing it as a group for the benefit of all.

    • With this augmented deliberation, everyone can contribute to and benefit from digital democracy.

    • Increased deliberation increases the resilience (ability to overcome difficulties) of a society.

    • Because it deepens understanding and empathy for different cultures and integrates information in a consistent manner.

    • This discussion platform is designed as an open source (software, etc.) tool.

  • This “augmented deliberation” is very interesting from the perspective of my interest in increasing intellectual productivity!
    • but not very exciting.
    • The reason for this is probably
      • People are not used to a situation where the person they are talking to is a “cluster” with no characterization.
      • I’m unfamiliar with this form of communication, so I want to see what others are doing first.
      • Even if you can see other people’s interactions, you can’t meet other people unless you get a certain number of people together.
  • It would be interesting to see Yasuno’s measures from that perspective.
  • In terms of candidates for governor of Tokyo.
    • A flesh and blood person can only use a portion of the 24 hours in a day.
      • Communicate with voters who could physically come there by standing in front of the station or
      • The content of communication is clouded.
    • A digital person (virtual personality) can remain active 24 hours a day.
      • You can stay on a specific YouTube channel all the time and access and communicate from anywhere in the world.
      • The content of the communication is preserved.
        • This means that comments made on YouTube Live can be included in TTTC’s analysis.
  • I’m thinking it’s a good idea.

interactive mass communication.

  • In one-on-one communication, I say things like, “Don’t just talk one way, listen to what the other person is saying,” and “Since we have one mouth and two ears, we should be willing to listen twice as much.

  • However, in conventional mass communication via TV, newspapers, etc., a large number of viewers only receive what is transmitted one way.

  • The reason why this happened is because the technology to duplicate and deliver information to a large number of people developed faster than the technology to summarize information.

  • image

  • It is difficult to generalize, since various technologies spread at different rates, but one example is Germany’s national radio (esp. an NHK radio station), which reached 70% of the population in 1939.

  • Over the last 100 years, viewers accustomed to one-way mass communication have become less confident that it’s okay to speak up and that there are people who will listen.

  • First, we launch “Attitude of listening” and collect comments, visualize them and show “this is what happened as a result of listening to your voices”, and then people who see them update their perception and write better comments to … By repeating this cycle, better “interactive mass communication” will be realized.

  • image

    • from Polis 2.0: forging ahead with a critical review by mashbean
      • gpt.iconFormation of a feedback loop allows automatic generation of seed opinions and expansion of the Polis database

    • There are various ways of expression, but the direction of “making mass communication bidirectional by repeating the cycle” is being sought in many parts of the world.
    • As Audrey Tang says, “Digital natives don’t think that once-every-four-years upload bandwidth is enough.”
      • It is important to increase upload bandwidth.
  • The image of a snowball that rolls and gets bigger and bigger.

summary

  • In this workshop, broad listening, Polis, Talk to the City, and Polis 2.0 were explained and the future direction of interactive mass communication was explained.
    • digital democracy, Takahiro Yasuno’s campaign for governor of Tokyo utilizes the concept of broad listening to listen to the voices of a large number of voters.
    • Interactive communication strategies utilizing YouTube have the potential to transform traditional one-way mass communication.
    • This campaign will raise the profile of broad listening in Japan, and if it is shown that many voters consider it important, other politicians will embrace broad listening.
  • Polis and Talk to the City are both copylefted, open source (AGPL) tools that become more sophisticated the more people use them.
    • Technological developments make it possible to efficiently aggregate the opinions of large numbers of people and promote constructive dialogue.
    • Through these efforts, more people will be able to participate in the democratic process and contribute to social decision-making.
  • The interactive nature of mass communication is gradually approaching realization through repeated feedback loops.
    • Just as a snowball grows as it rolls, we hope that these efforts will accumulate and lead to the realization of a better society.

Aside: I’ll write when I have room.

  • Analogy with KJ method
  • image
  • In his mind, he has a dogmatic principle of grouping, such as, “In my opinion, it is correct to divide this paper material into three major categories: market research, quality control, and labor management. They are simply applying this dogmatic classification scheme and sifting and fitting the paper materials into a ready-made framework. This is a mere sifting and fitting of paper materials into the ready-made wakku, which completely defeats the conceptual significance of the KJ method.

    • Jiro Kawakita criticized human beings for sieving, fitting into [* dogmatic classification wakugumi
  • As a way to deal with this problem, I proposed the KJ method, which is to look at each piece one by one and attach it to the related pieces.
  • This is Agglomerative Hierarchical Clustering in data science terms.
  • TTTC avoids “dogmatic human fitting” by having machines, rather than humans, perform the clustering.
  • I think it’s an easy way to get some of the benefits of the KJ method.
  • Replace the KJ method in its entirety?
    • Skin feeling that it is not.
    • For the moment, “proximity” is “cosine distance in the embedded vector space”.
    • I suspect this is where the human KJ method differs from the human KJ method, but it’s not really different, and humans may be processing it in a similar way.
    • I think it would be more linguistic to compare a human doing KJ method and TTTC for the same data, future research.
    • Analogy with [U-theory
    • image
      • 1: Stuck inside the frame
      • 2: Look outside the box
      • 3: The frame you originally had fades away with outside perspectives.
      • 4: Gathering perspectives from outside the company creates “understanding from a number of perspectives.
    • TTTC seems to be a system that encourages people to move to level 3~4

This page is auto-translated from [/nishio/Talk to the City勉強会](https://scrapbox.io/nishio/Talk to the City勉強会) using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I’m very happy to spread my thought to non-Japanese readers.