Skip to content

myarist/Topic-Clustering-for-Covid-19

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Topic Modelling for Covid-19

Here, I using LDA (Latent Dirichlet Allocation)

LDA

LDA

There are two panels displayed by the pyLDAvis graph. The left panel shows a global view of the model, such as how common topics are and how topics relate to each other. The circle on the left panel shows the topic, the distance between circles shows the distance between topics, the prevalence of each topic is indicated by the circle area. Although topic prevalence can indicate which is the most dominant or important topic, prevalence has a disadvantage when comparing topics between topics because it is difficult to compare almost the same circle size. The distance of the circle also has the same weakness, which is difficult to measure the similarity of topics in a topic cluster.

The right panel contains bar charts representing each word that is useful in interpreting the topic. The red diagram shows the frequency of words from related topics (red circle) and the blue diagram shows the wide frequency of the corpus. Above the bar chart is a tool to regulate relevance.

Releases

No releases published

Packages

No packages published

Languages