The Data Mining Forum
This forum is about data mining, data science and big data: algorithms, source code, datasets, implementations, optimizations, etc. You are welcome to post call for papers, data mining job ads, link to source code of data mining algorithms or anything else related to data mining. The forum is hosted by P. Fournier-Viger. No registration is required to use this forum!
Re: Using itemset mining in text mining
Date: October 17, 2018 09:40PM
Welcome to the forum.
There are many ways that pattern mining or itemset mining can be used to analyze text.
For example, itemset mining has been used in information retrieval to search for documents. In that case, each document is treated as a transaction (a set of words), and we extract itemsets representing the frequent words shared by a group of similar documents.
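To make the idea concrete, here is a minimal sketch in Python. It is only an illustration: it enumerates word combinations by brute force rather than using a real algorithm such as Apriori or FP-Growth, and the function name `frequent_word_itemsets`, the parameters, and the sample documents are all made up for this example.

```python
from collections import Counter
from itertools import combinations

def frequent_word_itemsets(documents, min_support, max_size=2):
    """Return {itemset: support} for word sets found in at least
    min_support documents. Brute force; fine only for tiny inputs."""
    # Each document becomes a transaction: its set of distinct lowercase words.
    transactions = [frozenset(doc.lower().split()) for doc in documents]
    counts = Counter()
    for words in transactions:
        for size in range(1, max_size + 1):
            # Sorting gives each itemset one canonical tuple form.
            for itemset in combinations(sorted(words), size):
                counts[itemset] += 1
    return {itemset: n for itemset, n in counts.items() if n >= min_support}

# Hypothetical toy documents.
docs = [
    "data mining finds patterns",
    "pattern mining finds frequent itemsets",
    "text mining uses data",
]
patterns = frequent_word_itemsets(docs, min_support=2)
for itemset, support in sorted(patterns.items()):
    print(itemset, support)
```

On real text you would normally remove stop words first and use an efficient miner, since the number of candidate itemsets grows very quickly with document length.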
We can also use pattern mining to discover hidden features in text, such as the writing style of the author. These features can then be used to identify the author of an anonymous text (a problem called authorship attribution). I have done some work in that direction using frequent sequential pattern mining. If you are interested, you can send me an email at philfv8 AT yahoo.com
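As a rough illustration of what a sequential pattern over text looks like (this is a toy sketch, not the method used in the work mentioned above), the snippet below counts ordered word pairs that appear in at least `min_support` sentences, allowing gaps between the two words. The function name and the sample sentences are hypothetical.

```python
from collections import Counter
from itertools import combinations

def frequent_sequential_pairs(sentences, min_support):
    """Return {(a, b): support} where word a occurs somewhere before
    word b in at least min_support sentences (gaps allowed)."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.lower().split()
        # combinations() keeps the original order, so (a, b) means
        # "a before b". The set counts each pair once per sentence.
        counts.update(set(combinations(tokens, 2)))
    return {pair: n for pair, n in counts.items() if n >= min_support}

# Hypothetical sentences from one author.
sentences = [
    "i think that it works",
    "i really think it works",
    "it works i think",
]
seq_patterns = frequent_sequential_pairs(sentences, min_support=3)
print(seq_patterns)
```

Patterns like these, mined separately for each known author, could then be compared against the patterns found in the anonymous text. Real sequential pattern mining algorithms handle longer patterns and do so far more efficiently.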
I think there are many other possibilities. For instance, itemsets could be used as features to build a classifier that assigns texts to categories such as news, sport, or entertainment.
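A small sketch of that classification idea, under some assumptions: the itemsets below are hypothetical and assumed to have been mined beforehand from labeled texts, each text becomes a binary vector (1 if it contains every word of the itemset), and the classifier is a simple 1-nearest-neighbour on Hamming distance, chosen only to keep the example short.

```python
def itemset_features(text, itemsets):
    """Binary vector: 1 if the text contains every word of the itemset."""
    words = set(text.lower().split())
    return [1 if set(itemset) <= words else 0 for itemset in itemsets]

def classify(text, train, itemsets):
    """Label of the training text whose feature vector is closest
    (Hamming distance) to that of the given text."""
    target = itemset_features(text, itemsets)

    def distance(example):
        vector = itemset_features(example[0], itemsets)
        return sum(a != b for a, b in zip(target, vector))

    return min(train, key=distance)[1]

# Hypothetical itemsets, assumed to be mined from labeled training texts.
itemsets = [("match", "team"), ("election", "vote"), ("film", "actor")]

# Hypothetical labeled training texts.
train = [
    ("the team won the match", "sport"),
    ("the vote in the election was close", "news"),
    ("the actor starred in a new film", "entertainment"),
]

label = classify("our team lost the match yesterday", train, itemsets)
print(label)
```

In practice the same feature vectors could be given to any standard classifier, and you would want to select itemsets that discriminate well between the categories rather than just frequent ones.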
I think you can find a lot of applications.