I am looking for a simple-to-use R package that will allow me to:
I am looking for something that a beginning R user could use. Thanks.
Posted on 2022-01-02 19:29:50
tidytext lets you do this.
There is also a free book about tidytext at https://www.tidytextmining.com/index.html.
library(tidytext)
library(dplyr)

somewords <- "the lorem ipsum text with some common words.. very common words can be found quite often. these are often commonly used words enmeshed with lorem ipsum."

# Word frequencies, most common first
tibble(somewords) %>%
  unnest_tokens(word, somewords) %>%
  group_by(word) %>%
  count() %>%
  arrange(desc(n))

# Split into words and drop common stop words
tibble(somewords) %>%
  unnest_tokens(word, somewords) %>%
  filter(!word %in% stop_words$word)

# Bigram (two-word sequence) frequencies, most common first
bigrams <- tibble(somewords) %>%
  unnest_tokens(bigram, somewords, token = "ngrams", n = 2)
bigrams <- bigrams %>%
  count(bigram, sort = TRUE)

https://softwarerecs.stackexchange.com/questions/81725