The Data Mining Forum
This forum is about data mining
, data science
and big data
: algorithms, source code, datasets, implementations, optimizations, etc. You are welcome to post call for papers, data mining job ads, link to source code of data mining algorithms or anything else related to data mining. The forum is hosted by P. Fournier-Viger
. No registration is required to use this forum!
Standard known solution for Utility transaction dataset
Date: May 01, 2021 06:47AM
I am doing research on HUIM.
For result discussion, I planned to measure the performance of algorithm by comparing the no HUIs mined with standard known HUIs for particular dataset sucha as Chess, Mushroom, Connect.
Where I can found the standard known HUIs and its utility for any benchmark transaction dataset.?
Re: Standard known solution for Utility transaction dataset
Date: May 02, 2021 06:22PM
If you want to know the number of HUIs in a dataset, you can run the exact algorithms like EFIM, FHM, and HUI-Miner. All of these algorithms are complete, which means that they always find ALL the high utility itemsets. So if you want to know how many HUIs for some minutil value, you can just use one of those algorithms and you will know.
In SPMF, there are also some approximate algorithms like HUIM-GA, HUIM-BPSO etc. Those algorithms may not find all the HUIs because they are not complete algorithms. They use evolutionary or swarm intelligence techniques to try to find an approximate solution more quickly.
Besides that you can also find details about experiments and number of HUIs in experimental evaluation of HUIM papers.