2021-05-04 An experiment: pick up keywords from the last day of updates in Scrapbox project A, then present the pages in project B that contain those keywords.

  • In this case, A is this public project and B is a private project of about 400 pages

If it turns out to be useful, I will run it once a day.

context - Diary of 2021-05-04 - About Scrapbox private to public transfer

What I did

  • ✅ Get the most recently updated pages of the public project
  • ✅ Extract keywords from their content
    • For now, only the ones explicitly enclosed in brackets.
    • These are available from the API.
  • ✅ Find pages in the other project that contain those keywords and present them
    • There are two ways to search: export to JSON and search locally, or call the search API.
    • This time I used the search API.
  • How to present
    • Create a page in the private project
      • Because the search results of a private project cannot be made public
    • Sending a DM in Slack is another option.
    • ✅ This time I printed the results and pasted them into Scrapbox by hand.
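
The steps above can be sketched roughly as follows. This is a minimal illustration, not the code in scbot: it works on JSON that has already been fetched, and the field names (`updated` as Unix seconds, `title`, `links`), the search endpoint path, and all function names are assumptions made for the example.

```python
from urllib.parse import quote

DAY = 24 * 60 * 60  # one day in seconds


def recently_updated(pages, now, window=DAY):
    """Keep pages whose 'updated' timestamp (assumed Unix seconds) is within the window."""
    return [p for p in pages if now - p["updated"] <= window]


def collect_keywords(page_details):
    """Gather bracketed link titles (assumed 'links' field), in first-seen order."""
    seen = {}
    for page in page_details:
        for link in page.get("links", []):
            # Remember only the first page on which each keyword appeared.
            seen.setdefault(link, page["title"])
    return list(seen)


def search_url(project, keyword):
    """Build a search URL for the target project (endpoint path is an assumption)."""
    return (
        f"https://scrapbox.io/api/pages/{quote(project)}"
        f"/search/query?q={quote(keyword)}"
    )
```

Fetching a private project would additionally require an authenticated session, which is left out of this sketch.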

result

  • Looks promising
  • I thought it was a private memo repository with not much information, but there were some surprisingly interesting pages.
    • The experiment surfaced pages such as one where I had transcribed a recording of a video conference.
  • 105 keywords extracted from 24 hours of activity
  • The search found 11 hits.
    • The search API returns the matching lines.
  • Some keywords are not very useful.
    • Research, Organization, Scrapbox, Unexplored Jr.
    • They produce too many hits.
    • When the number of hits exceeds some threshold, it is better to display only a “many hits” notice.
    • Within the original project, two pages are connected only when the keyword is bracketed on both, so the number of links stays small.
    • Three of the four cases were bracketed.
      • So when there are many search results, show only the number of results instead of the results themselves, and bracket the search term.
      • Always bracket the search keyword.
      • Or rather, check whether a link already exists, and if so, perhaps it is better not to search at all?
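
The ideas in this list could be combined along these lines. This is only a sketch: the function name, the `max_hits` threshold of 5, and the choice to skip keywords that are already linked are assumptions for illustration, and the last of these is still an open question above.

```python
def summarize_hits(keyword, hits, existing_links, max_hits=5):
    """Decide how to present the search results for one keyword.

    hits: titles of pages found by the search (hypothetical input shape).
    existing_links: keywords already bracketed in the searched project.
    """
    if keyword in existing_links:
        # Already connected by a normal link, so searching adds nothing.
        return None
    if not hits:
        return None
    if len(hits) > max_hits:
        # Too many results: show only the count, with the keyword bracketed.
        return f"[{keyword}]: many hits ({len(hits)})"
    return f"[{keyword}]: " + ", ".join(hits)
```

Because the keyword is always emitted in brackets, pasting the output into Scrapbox creates the link even when the individual results are suppressed.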

impressions

  • relatively good
  • Several other projects also seem to have a lot of interesting content lying dormant.
    • I want to have more than one target project.

additional experiment

I updated the script to handle the case where the output project and the project being searched are different.

impressions

  • It’s nice to see other people’s presentations on technical keywords that I’m interested in and have written down.
  • We all think we are using the same Japanese, but there are differences in the choice of words we use to express our thoughts.
    • In my project, I use the word “exclusion” to refer to the removal of a person from a group of people (this is not my word either, but something I wrote down after seeing other people’s expressions).
    • shokai, on the other hand, uses it to mean “removing harmful features or functions from software.”
    • Comparing the two makes the difference noticeable.
  • If I look at the displayed keywords and edit the page in various ways, it will of course count as an “updated page” again tomorrow, and the same results will be displayed.
    • That would be boring.
    • Remember what has been shown once and do not show it again for a week to a month, or something along those lines.
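
One way to remember what has already been shown is a simple cooldown table. This is a sketch under assumptions: the function name, the in-memory dict, and the one-week default are illustrative only, and a real run would need to persist the history between days (for example in a JSON file).

```python
DAY = 24 * 60 * 60  # one day in seconds


def filter_fresh(keywords, last_shown, now, cooldown=7 * DAY):
    """Return keywords not shown within the cooldown and record them as shown.

    last_shown maps keyword -> Unix time when it was last presented.
    """
    fresh = []
    for kw in keywords:
        shown_at = last_shown.get(kw)
        if shown_at is not None and now - shown_at < cooldown:
            continue  # shown too recently; suppress it this time
        fresh.append(kw)
        last_shown[kw] = now
    return fresh
```

With a one-week cooldown, a keyword presented today is suppressed tomorrow even if its page is updated again, and becomes eligible once seven days have passed.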

source https://github.com/nishio/scbot/blob/0.2/scbio.py#L113


This page is auto-translated from /nishio/è€‡æ•°ăźăƒ—ăƒ­ă‚žă‚§ă‚Żăƒˆé–“ăźăƒȘăƒłă‚Żă‚’èŠ‹ă€ă‘ă‚‹ using DeepL. If you find something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I’m very happy to spread my thoughts to non-Japanese readers.