- Related: keyphrase extraction.
- The term is used when we want to emphasize that extraction should not stop at single words.
- I'd like to extract phrases such as "100 Personnel Systems for 100 People" or "Evaluating by Harmony, Generalists are Chosen."
- Even what is generally called "key phrase extraction" is often restricted to patterns such as "noun sequence" or the `adjective*noun+` form (the noun phrase approach).
- Such a constrained method cannot extract key phrases like the ones above.
- Often you want to use as a key phrase a string that does not appear verbatim in the text.
- For example, when the text contains both "information sharing" and "sharing information", I want both to be connected by a single "information sharing" link.
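To make the `adjective*noun+` restriction mentioned above concrete, here is a minimal sketch. It assumes the tokens are already part-of-speech tagged; the tag names (`ADJ`, `NOUN`, `VERB`) and the `noun_phrases` helper are hypothetical and not taken from any particular tagger.

```python
import re

def noun_phrases(tagged):
    """Return the spans matching adjective* noun+ in a list of (word, tag) pairs."""
    # Encode each tag as one character so a regex can scan the tag sequence.
    codes = "".join({"ADJ": "A", "NOUN": "N"}.get(tag, "x") for _, tag in tagged)
    return [
        " ".join(word for word, _ in tagged[m.start():m.end()])
        for m in re.finditer(r"A*N+", codes)
    ]

# Hand-tagged example input (an assumption; a real pipeline would use a tagger).
tagged = [
    ("sharing", "VERB"), ("useful", "ADJ"), ("information", "NOUN"),
    ("improves", "VERB"), ("team", "NOUN"), ("performance", "NOUN"),
]
phrases = noun_phrases(tagged)
print(phrases)
```

Only runs of adjectives followed by nouns survive, so a phrase that contains a verb or a particle can never be extracted this way.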
Techniques
- An approach that does not use linguistic knowledge
- Simple word frequency.
- Needs a stopword list.
- Discards word-order information.
- The "General Manager's Association" problem, where a multi-word expression is split into separate words.
- Synonyms are treated as different words.
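A minimal sketch of simple word frequency with a stopword list may help here. The stopword set and the `top_words` helper are illustrative assumptions, not a recommended list or API.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"the", "of", "and", "a", "is", "to", "in"}

def top_words(text, k=3):
    """Count non-stopword tokens and return the k most frequent."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    return counts.most_common(k)

text = "Sharing information is the key. Information sharing is the goal of the team."
top = top_words(text, 2)
print(top)
```

Both "sharing information" and "information sharing" contribute to the same two counters, which shows how word order is lost, and without the stopword list "the" would rank first.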
- Co-occurrence
- Collocation
- N-grams, etc.
- Intra-window co-occurrence
- Intra-document co-occurrence - concentration (of one's attention)
- Discards word-order information.
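Intra-window co-occurrence can be sketched as follows. The `window_cooccurrence` name and the choice to store pairs as unordered `frozenset` keys are assumptions made for illustration.

```python
from collections import Counter

def window_cooccurrence(tokens, window=2):
    """Count unordered word pairs that appear within `window` tokens of each other."""
    pairs = Counter()
    for i, left in enumerate(tokens):
        for right in tokens[i + 1 : i + 1 + window]:
            if left != right:
                # frozenset makes (a, b) and (b, a) the same key.
                pairs[frozenset((left, right))] += 1
    return pairs

tokens = "information sharing helps sharing information".split()
pairs = window_cooccurrence(tokens, window=1)
print(pairs[frozenset(("information", "sharing"))])
```

Because the pairs are unordered, "information sharing" and "sharing information" land in the same bucket. That is convenient for linking variants, but it is also exactly where word order is thrown away.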
- tf-idf
- Assigns real-valued scores, whereas a stopword list makes a binary 0/1 decision.
- "The less frequently a word appears in other texts, the better it characterizes this text."
- This is the IDF part.
- A word may be frequent on its own, yet sometimes it forms an important key phrase as part of a multi-word expression.
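A minimal tf-idf sketch, assuming the plain unsmoothed variant `tf * log(N / df)`; other weightings exist, and the `tf_idf` helper is hypothetical.

```python
import math
from collections import Counter

def tf_idf(docs):
    """docs: list of non-empty token lists. Returns one {word: score} dict per document."""
    n = len(docs)
    # Document frequency: in how many documents each word appears.
    df = Counter(w for doc in docs for w in set(doc))
    scores = []
    for doc in docs:
        tf = Counter(doc)
        scores.append(
            {w: (c / len(doc)) * math.log(n / df[w]) for w, c in tf.items()}
        )
    return scores

docs = [
    "the manager shares information".split(),
    "the team shares code".split(),
    "the manager reviews the code".split(),
]
scores = tf_idf(docs)
print(scores[0])
```

A word that appears in every document, such as "the", gets a score of exactly zero, so it behaves like a stopword without needing a list. Rarer words get graded scores instead of a 0/1 cut.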
- RAKE
- Simple word frequency.
- Needs a stopword list.
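A simplified sketch of the RAKE idea: split the text at stopwords and punctuation to get candidate phrases, score each word by degree divided by frequency, and sum the word scores per phrase. The stopword set and the tokenization regex are assumptions, and published implementations differ in details.

```python
import re
from collections import defaultdict

# Tiny illustrative stopword list (an assumption).
STOPWORDS = {"the", "of", "and", "a", "is", "to", "in", "for", "by", "are"}

def rake(text):
    """Return (phrase, score) pairs sorted by descending score."""
    phrases = []
    for fragment in re.split(r"[.,;:!?()]", text.lower()):
        current = []
        for word in re.findall(r"[a-z0-9']+", fragment):
            if word in STOPWORDS:
                # A stopword ends the current candidate phrase.
                if current:
                    phrases.append(current)
                current = []
            else:
                current.append(word)
        if current:
            phrases.append(current)
    freq = defaultdict(int)    # how often each word occurs in candidates
    degree = defaultdict(int)  # total length of the candidates containing it
    for phrase in phrases:
        for word in phrase:
            freq[word] += 1
            degree[word] += len(phrase)
    scores = {" ".join(p): sum(degree[w] / freq[w] for w in p) for p in phrases}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

ranked = rake("Evaluation by harmony is common. Key phrase extraction is the goal of the system.")
print(ranked[0])
```

Since stopwords act as phrase boundaries, the result depends heavily on the stopword list, and a phrase that contains a stopword is always split.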
- Graph-based
- Build a graph of word adjacencies and choose the words with the highest rank.
- Use PageRank
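The graph-based approach can be sketched with a small PageRank power iteration over an undirected word-adjacency graph. The damping factor of 0.85, the fixed iteration count, and the `rank_words` helper are assumptions for illustration; real systems usually also filter the words before building the graph.

```python
from collections import defaultdict

def rank_words(tokens, damping=0.85, iterations=50):
    """Rank words by PageRank on the graph of adjacent-word pairs."""
    neighbors = defaultdict(set)
    for a, b in zip(tokens, tokens[1:]):
        if a != b:
            neighbors[a].add(b)
            neighbors[b].add(a)
    nodes = list(neighbors)
    rank = {w: 1.0 / len(nodes) for w in nodes}
    for _ in range(iterations):
        # Each word receives an equal share of the rank of every neighbor.
        rank = {
            w: (1 - damping) / len(nodes)
            + damping * sum(rank[v] / len(neighbors[v]) for v in neighbors[w])
            for w in nodes
        }
    return sorted(rank.items(), key=lambda kv: kv[1], reverse=True)

tokens = "information sharing improves team information flow".split()
ranked = rank_words(tokens)
print(ranked[0][0])
```

Here "information" is adjacent to three different words, so it ends up with the highest rank even though no stopword list or frequency threshold is used.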
This page is auto-translated from /nishio/キーワード抽出 using DeepL. If you find something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I'm very happy to spread my thoughts to non-Japanese readers.