docs = docs.apply(lambda s: clean_email_text(s))
然后我们呢把里面的email提取出来:
doclist=docs.values
接下来,我们使用gensim...库来进行LDA模型的构建,gensim可用指令pip install -U gensim安装。...above', 'a', 'at', 'your', 'theirs', 'below', 'other', 'not', 're', 'him', 'during', 'which']
然后我们将输入转换成gensim...import corpora, models, similarities
import gensim
dictionary = corpora.Dictionary(texts)
?...最后,就可以开始构建我们的模型了:
lda = gensim.models.ldamodel.LdaModel(corpus=corpus, id2word=dictionary, num_topics