2023-06-23ă Cybozu Labs Study Session
-
In the last Plurality and Polis Study Group, we mainly explained the context of Plurality, and only lightly touched on Polis.
-
thereafter
- Call for a paper describing the algorithm Polis: Scaling Deliberation by Mapping High Dimensional Opinion Spaces or
- Polis in EC2 and running instances of
- [Export data from Polis DB
- Public pol.is instances are easy and convenient to use, but they do not provide access to raw data, so we can experiment with raw data by setting up our own instances.
- Call for a paper describing the algorithm Polis: Scaling Deliberation by Mapping High Dimensional Opinion Spaces or
-
Polis is a useful tool for better deliberation
- So itâs used in many applications here and there.
- But itâs not a magic wand like âput Polis in and itâs a deliberative processâ.
- In observing the various applications, I could see specific areas of mismatch with the situation and areas I wanted to change.
- To adjust a tool to a situation, we need to know how it works.
- So, in this issue, we delve into the inner workings of Polis.
-
Before I do that, Iâll add a little context about democratic process.
-
About [democratic process
-
What I thought after talking to various people and seeing their reactions.
- Sometimes, âdemocratic processâ and âmajority rule by votingâ are equated.
- As a result, âdigital democracyâ is sometimes equated with âdigital voting.â
- Polis are sometimes misunderstood as a tool for decision-making by voting.
-
Definition by OpenAI
- What is a âdemocratic process?â
- (A) a broadly representative group of people exchanging views and careful deliberation; and (B) a broadly representative group of people exchanging views and careful deliberation,
- (b) Through transparent decision-making process.
- (c) It means the process that ultimately determines the outcome.
- What is a âdemocratic process?â
-
Majority vote by ballot satisfies (B) and (C), but (A) is doubtful.
-
We need to support more deliberations.
- The word in focus here is âdeliberation democracy.â
- Digital technology has enabled new forms of communication, so an important aspect of digital democracy is to support deliberation through digital technology.
- Communication before digital technology meant, for example, politicians making speeches in front of train stations or going to petition politicians to express their opinions.
- Communication was inefficient.
- Polis is âtechnology that scales deliberations.â
- Polis: Scaling Deliberation by Mapping High Dimensional Opinion Spaces
- Designing a form of communication that will allow deliberations to be more scalable.
- Non-traditional formats and communication mechanisms more suited to deliberative discussions are being explored
- This year, advances in LLM technology have led to the state of âCouldnât we create a better deliberation mechanism by combining LLMs?â This is the state of the art.
- On 5/25, OpenAI announced [Democratic Inputs to AI
- The definition of âdemocratic processâ by OpenAI mentioned earlier is also written in this
- On June 20, Anthropic and Polisâ The Computational Democracy Project teamed up to publish a paper entitled âOpportunities and Risks of LLMs for Scalable Deliberation with Polis. The Computational Democracy Project] and Polis
- Anthropic is the organization that makes Claude, which has a wider context range than OpenAI.
- On 5/25, OpenAI announced [Democratic Inputs to AI
-
Additional information on âDigital Voting
- Taiwan and Estonia are active in digital voting due to geopolitical risks.
- In 2014-, Hong Kong experienced âelection nonsenseâ and demonstrations against it (campaign to prevent a public construction work by acquiring a building or landmark (e.g. housing development, etc. .)) took place.
- Taiwan fears a similar âelectoral nonsenseâ will take place.
- Military power could change âelections in the physical world todayâ into âbad onesâ.
- I would like to move âelectionsâ to the online world to counter this possibility.
- Estonia once thought the same thing and started electronic voting.
-
Estonia still has a strong sense of urgency that its land may one day be controlled againâŠ
-
⊠Even if a country is invaded and physically loses its âterritory,â as long as it has the âdataâ of its citizens, it can regenerate itself⊠Estonia is working on an idea of âdata embassiesâ where all the countryâs data will be stored in embassies outside the country, with the first location to be opened in Luxembourg in 2018.
-
Even if the government ceases to function due to physical occupation of the territory, if there is a government as software on the Internet, citizens with ID cards can access it and function as a country called Estonia. This is the very idea of âGovernment as a Service. The former CIO of the Estonian government also said that this idea is the ultimate goal of national security that the country should aim for.
-
- 2023, Majority of votes cast electronically in Estonia.
- Estonia once thought the same thing and started electronic voting.
- Is the online world easier to defend?
- Online here is supposed to be moved onto Ethereum, a world computer distributed all over the planet.
- That would make it harder to destroy them.
- Thatâs why Taiwanâs Ministry of Digital Development Participates in Distributed ID Standardization.
- Because it is more suitable for their purposes to create an internationally standardized standard than to create a Taiwanese-only standard.
- Online here is supposed to be moved onto Ethereum, a world computer distributed all over the planet.
- This is the background behind these countriesâ promotion of digital voting, and to be frank, Japan does not have the threat that Taiwan feels today, so it is not surprising that most Japanese do not share this awareness of the issue.
- Therefore, Nishioâs personal opinion is that âin promoting Plurality in Japan, it is difficult to be understood when talking about digital voting, so it is better to explain it by putting more weight on the aspect of âit is beneficial to visualize the distribution of everyoneâs opinionsââ.
- In other words, it becomes âPolis interesting, letâs try it.
-
Japanâs problem is silver democracy, but I digress.
- When we try to implement âa voting system that is not one vote per personâ because we want to correct the inequality that minors are not entitled to vote (domain voting system) or because it is unequal to have the same voting power regardless of whether a person has a long or short life expectancy (life expectancy voting system), people say âit is difficult to implement paper and box voting, so digital voting would be better. and box voting] is difficult to implement, so digital voting would be better,â but I have a feeling that these voting systems have not been widely discussed yet.
- For the time being, I am not interested in changing the electoral system, but in the form of unsuccessful campaigns such as Operation Coconut, or technological use of force.
-
Regarding ârepresentativeness,â can digital technology reduce ârepresentativesâ to ultimately âindividualsâ?
-
Offline representative democracy had to represent the entire population by a few hundred people at most, so minority opinions could not be represented
- Digital technology allows individuals to express their opinions.
- However, the current social networking sites are not able to unite them, resulting in a flood of information, and being small alone is not an ideal state of affairs.
- So a good balancing and clustering (=representation) mechanism is needed.
What we would like to change about Polis
-
Not visible until there are at least 7 voters.
- Not visualizing when there are fewer people has the advantage of making individual opinions less visible.
- Inconvenient for situations where you want to visualize internal discussions with a small number of people
-
Opinion submissions from general users are considered âagreeâ.
- In other words, âWhoâs more in favor of it, those who are for it or against it?â I canât post with the feeling that
- But when I see situations where polls are made on Twitter and elsewhere, I wonder, âWhat do people think?â Thereâs a lot of âWhat do you think?
- The current Polis makes it difficult to know how others voted on the opinions you posted.
- View individual opinion polls in the detailed report.
- If youâre lucky enough to be chosen as the main opinion of the cluster, youâll see it in the visualization.
- There should be permalinks to individual opinion polls.
- When you vote, you can see the results of everyoneâs vote.
- If you have already voted, you can see recent results.
- Share this URL so others can vote too!
- Create incentives for opinion contributors to post on social networking sites
-
Unable to update vote.
- When you see it in the image of âelectionâ and âvoting,â you think you shouldnât change your vote after the fact.
- But to see the output of others and change oneâs opinion is what âdeliberationâ is all about.
- If it doesnât change, itâs just a statistical survey.
- Currently, there is no view to look back on what I voted for and how I voted.
- Iâd like to be able to change my vote from there.
- Maybe this is because Polis focuses on collecting the âemotionsâ of âpeople who donât thinkâ.
- The README expresses SENTIMENT GATHERING PLATFORM in the README.
- [Sentiment / emotion collection platform
- When Nishio did Polis, many of the people who participated and returned feedback were âthinkersâ.
- I, personally, would rather observe â10 thinking peopleâs thoughtsâ than â1000 unthinking peopleâs feelingsâ.
- As a democratic process, âa few people thought about it and decidedâ is not convincing to those who could not participate.
- Need to create a subjective feeling of âunderstandingâ
- The README expresses SENTIMENT GATHERING PLATFORM in the README.
- The current design cannot respond to the âthinking personâsâ need to know.
- Some say, âIt would be nice if some material was presented to help us make a decision before we answer Yes/No.â
- If we actually do that, the majority who donât want to read the material will disengage.
- People have individual differences. to no one way can meet everyoneâs needs.
- Polis needs an approach to broaden its base, a mechanism to reach out to a wide range of people and then scoop up and gather those who are interested and feel a desire to become deeply involved.
- Some say, âIt would be nice if some material was presented to help us make a decision before we answer Yes/No.â
-
comments make me want to write a reply.
- Polisâ deliberate design is not to apply opinions directly to opinions
- Avoid framing (wrestling)
- (Internet term) Framing (Internet term) - Wikipedia
- Often, however, usersâ âdesire to write their thoughtsâ increases
- This state has a heightened incentive to verbalize.
- Wouldnât it be more beneficial to have more data if we let them write honestly?
- I just have a problem with feeding it directly back to âthe person who wrote the original opinionâ.
- Writing a reply is not in itself a problem.
- It would be nice if the LLM could take the replies and convert them into useful feedback.
- Polisâ deliberate design is not to apply opinions directly to opinions
-
Moderation issues.
- In Nishioâs Polis operation, it is usually always set to âno moderation, user posts are instantly visible to other usersâ.
- âNo comments shown without moderator approvalâ
- If this setting is turned off, it will not appear until approved by a human moderator (still off by default).
- I have a problem with this moderation.
- The human burden is heavy.
- Opportunity loss due to hidden period of time waiting for moderation
- Concern that moderator bias may be introduced.
- In fact, the bias comes in.
- Because the determination of âopinion not relevant to the topicâ is influenced by the individualâs opinion on the topic
- Nishio thinks this way because of his background in the KJ method.
- What may appear to be unrelated may later be found to be related.
- At least one person thought it was relevant and posted it, so donât be too quick to throw it away. - ape
- âIf it were, you wouldnât have to moderate it because you wouldnât throw it away anyway.
- In reality, there will be a lot of âemotional or unclear documentsâ posted, which will cause problems for voters to be upset.
- The LLM will do something about it.
- In Nishioâs Polis operation, it is usually always set to âno moderation, user posts are instantly visible to other usersâ.
-
Tends to become less interesting when there are two clusters.
- If you take a vote on a subject that is divided into two sides, and you are given a visualization of the two sides, you will say, âOf courseâŠ
- When you look at the details, there is some interesting data, but most people are not aware of it.
- Youâre splitting it in two.
- Iâm in the majority, thank God.
- I am in the minority, this system is not a good one.
- In encouraging deeper thinking, maybe it would be better to have more than three clusters?
- Are we aiming for âfunâ or âproblem solvingâ?
- That is an important perspective. I think it is important to be interesting first in order to encourage the participation of many people.
- Skewing statistical results for the sake of fun would be a hindrance to solving the problem.
- Thatâs true, it seems like a good idea to add another view instead of distorting it
-
I want to take over the voting results from a past dialogue.
- Using one CONVERSATION for a long time is not a good experience.
- The number of comments is only increasing.
- I want to re-partition periodically.
- But every time itâs reset to zero, thatâs not so subtle.
- What do we take over?
- Comments that were agreed upon need not be carried over.
- Comments with many âI donât knowâ would be redone with additional information or rewritten to make them easier to understand.
- Iâd like to recycle the ones that are split, if itâs possible to re-vote.
- However, I donât want several comments with nearly the same voting trends.
- Using one CONVERSATION for a long time is not a good experience.
How Polis Works
- Papers:.
- Create [voting matrix
- Matrix with +1/0/-1/NaN for each user and opinion
- What you havenât voted for yet is represented by NaN.
- Fill in missing values with user averages per opinion
- For an agenda item for which everyone who voted for it is in favor of it, even those who did not vote for it are considered to be in favor of it.
- This will create a matrix with â-1 to +1â values.
- Basically just do âhigh-dimensional data visualizationâ from here.
- There are many and varied ways to do this.
- Various things are being tried in Polis.
- Basically principal component analysis (PCA)
- Average Care
- The missing values are filled with the mean, which inevitably leads to a large number of samples near the mean.
- Clustering this as it is would create a cluster of âpeople who havenât voted much yetâ around the mean, which isnât interesting.
- So weâll use the number of opinions voted on to make adjustments.
- Vector from the mean by a factor of 1 for those voting for all, and by a factor of sqrt(2) for those voting for only half.
- He writes that this was used at least in the early days, but that the need for this adjustment has since been reduced by âshowing users the pros and cons of the opinions firstâ.
- It wasnât clear from reading the code whether they were making adjustments in the current implementation or not.
- Clustering with an undetermined number of clusters
- There have always been diverse ways to do this, too.
- Hierarchical clustering is typical
- However, the number of clusters does not have to be determined, but the threshold must be determined
- Polis uses K-means method and [silhouette coefficient
- What is silhouette coefficient?
- Try K=2~5 and choose the one with the best score
- There have always been diverse ways to do this, too.
-
- I was able to get a similar result with the Python implementation of the calculation, but I wonder why the shape is slightly different.
- Opinions on behalf of the cluster
- Representativeness measure .
- I think Pv is the mode of beta distribution.
- Beta distribution is conjugate prior distribution of Bernoulli distribution
-
The Bernoulli distribution is a discrete probability distribution in mathematics that takes 1 with probability p and 0 with probability q = 1 - p.
- That is, Iâm making a Bayesian estimation of this p, thinking that whether a person in a given group g votes v for a given comment c follows a Bernoulli distribution with probability p.
- I think Pv is the mode of beta distribution.
- The paper further states that Fisherâs exact probability test.
- Fisherâs exact probability test - Wikipedia
-
Fisherâs exact probability test ⊠It is used to examine whether there is a statistically significant association between two variables in a 2 x 2 contingency table (when dealing with data where two populations are classified into two categories, with one degree of freedom)⊠When the sample size is large, the chi-square test is used because the sample distribution of the statistic is approximately equal to the chi-square distribution⊠Fisher showed that the probability p of obtaining such a combination of numbers is represented by the following hypergeometric distribution
- Source code-wise, theyâre using hypergeometric functions.
- Hypergeometric distribution - Wikipedia
-
The hypergeometric distribution is a type of discrete probability distribution that gives the probability of how many success states there are in a non-restored extraction from a population with success states. It is applied to non-repeat extraction from a finite population that can be divided into two exclusive attributes, such as male/female, pass/fail, and so on.
- So there are two exclusive characteristics of voting for comment c, âv or otherwise,â and theyâre collected from two populations, âpeople in group gâ and âpeople outside group g.â
- âDoes it make a difference to the group?â I would like to know
- So, assuming that âthere is no difference between groups,â find the probability that the results are more radical than observed in that case
- So much for the accuracy test.
- I donât know why Iâm multiplying these probabilities by Rvgc, but why am I doing this?
- The paper states that it âreflects both the estimated effect size and the statistical confidence associated with the effect.â
- The mathematical interpretation of the resulting values is not quite clear.
- Finally, sorting by this value and choosing from the smallest, comments that characterize the group are selected
- Itâs a mystery, multiplying a smaller better p-value by an odds ratio that looks like a larger better p-value.
- Why not just use the p-value?
- The paper states, âFirst select the significant ones from the affirmative comments, and if none, from the negative comments.â
- Human beings, because the negative form is hard to understand.
- Degree of consensus per comment is calculated by [$ \prod_g P_v(g,c)
- This means ignoring the number of people in the group and multiplying the probability that each group agrees
- If one of them is zero, itâs zero.
- It means that the decision is not made by the majority group alone, but must be supported by all groups, including the minority group.
- Itâs like the veto of the Permanent Council.
- Resemblance.
- The good thing is that âcloser to consensusâ can now be expressed rather than a binary Yes or No.
- It would be nice to be able to visualize this and make suggestions like âwe need to convince this group of people to get closer to a consensusâ.
- Cases where a minority group of experts is opposed, persuade the experts?
- If we have a majority vote, the opinions of a minority of experts will be crushed, but this method sounds good because it generates communication for persuasion.
- The expert side will come up with evidence and other counter evidence.
- I tend to implicitly assume the direction of âexperts persuade the masses,â but the masses can âclose their ears because itâs too hard and they donât understand it well,â and it would be interesting to do the opposite.
- LLM support seems necessary.
- The current Polis does not involve a place for âcommunication for persuasionâ, so something else is needed.
- Representativeness measure .
- So, with some trepidation, I was able to reproduce the statistical processing done by the Polis backend using Python from the raw CSV of the polling data.
Reexamination of proposed improvements
- Not visible until there are at least 7 voters.
- Aside from appearance, visualization (digest report) can be done from the second person voting stage.
- So we can share the URL after the people who prepared it have voted.
- Of course, in this case, we know what the first person voted for, so this is for use cases where this is not a problem.
- I donât think thereâs anything in particular that you want to keep secret if itâs a situation where youâre trying to organize an internal verbal discussion thatâs getting out of hand, so I think this is fine.
- Especially in small cases of about 10 people, it may be useful in some cases to summarize in a way that does not involve dimensionality reduction by PCA.
- Specifically, LLMs can give feedback on a controversial opinion, such as, âThis opinion is divided, but letâs delve into it from different perspectives.
- Opinion submissions from general users are considered âagreeâ.
- Just a UI issue, just stop writing one to the DB.
- Unable to update vote.
- Just a UI issue, just update the DB.
- comments make me want to write a reply.
- How to use this comment in a situation where participants could fight would require a layer of ingenuity that has nothing to do with statistics.
- If there is trust between members, such as for internal use, you could allow them to reply to each other.
- Do you want a Cotonoha-like screen?
- Moderation issues.
- This is another LLM layer story.
- If thereâs trust among the members, it might be better to go through bare bones.
- Tends to become less interesting when there are two clusters.
- Can force clusters to three or more
- If you fix it to three, it looks like MAGI System.
- In this situation, the âfirst principal component axisâ is the âobvious axis of conflictâ.
- So it would be interesting to discard that and cluster them in other dimensions?
- When there is confrontation, sometimes it works better to seek another axis rather than push and pull on that axis of conflict.
- A case study of the debate over whether to change Taiwanâs time zone: 6489dff1aff09e00000cfcf1.
- So it would be interesting to discard that and cluster them in other dimensions?
- Can force clusters to three or more
- I want to take over the voting results from a past dialogue.
- be made
-
I donât need several comments with nearly the same voting trends.
- This is, in essence, just picking out the comments with the largest variance along each principal component axis of the vote distribution.
- summary
- I have a good idea of how to solve some of these problems with the statistical layer.
- Some of the rest are UI layers, some are LLM layers
- Polisâ visualization of updates per second is interesting to watch
- Whether it is more than interesting to be updated every second is tenuous.
- Maybe we could explore more alternative visualization methods?
This page is auto-translated from /nishio/PolisććŒ·äŒ using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. Iâm very happy to spread my thought to non-Japanese readers.