The Data Mining Forum
This forum is about data mining
, data science
and big data
: algorithms, source code, datasets, implementations, optimizations, etc. You are welcome to post call for papers, data mining job ads, link to source code of data mining algorithms or anything else related to data mining. The forum is hosted by P. Fournier-Viger
. No registration is required to use this forum!
CM-Spade/PrefixSpan do not identify frequent sequence with major support
Date: July 05, 2017 09:16AM
I have the following sequence database: https://paste.ee/p/DLVu3
It is clear that
9002910008 -1 9003360008 -1 9009590008 -1 9008310008 -1 9008090008 -1 9003400008 -1 9008230008 -1 9006600008 -1 9001510008 -1
is a frequent sequence, with high support.
When I try to run CM-Spade or PrefixSpan, either with 0.1 or 0.5 minsup, I get no output.
Re: CM-Spade/PrefixSpan do not identify frequent sequence with major support
Date: July 05, 2017 07:22PM
Yes, items are encoded as integers. Normally in Java an integer is signed and on 32 bits, which means no values greater than 2,147,483,647 are accepted.