<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
    <channel>
        <title>Forum</title>
        <description>This is a forum for anyone interested in Artificial Intelligence and Data Mining!</description>
        <link>http://forum.ai-directory.com/index.php</link>
        <lastBuildDate>Thu, 17 May 2012 05:24:12 -0400</lastBuildDate>
        <generator>Phorum 5.2.10</generator>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,572,574#msg-574</guid>
            <title>Re: Data format for Sequential Patterns with time-series</title>
            <link>http://forum.ai-directory.com/read.php?5,572,574#msg-574</link>
            <description><![CDATA[ By the way, there is some algorithms that are specialised for a single sequence like WINEPI or MINEPI for what they call &quot;episode mining&quot;.<br />
<br />
But this is a different problem than sequential pattern mining]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Wed, 16 May 2012 23:18:02 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,572,573#msg-573</guid>
            <title>Re: Data format for Sequential Patterns with time-series</title>
            <link>http://forum.ai-directory.com/read.php?5,572,573#msg-573</link>
            <description><![CDATA[ Hello Yogi,<br />
<br />
Welcome to the forum.<br />
<br />
Sequential pattern mining like GSP  take as input : (1) a sequence database and (2) a parameter called &quot;minsup&quot;.<br />
<br />
A sequence database is a set of sequences.<br />
<br />
The goal of sequential pattern mining is to find subsequences that are common to several sequences.<br />
<br />
In your case, I think that you have only a single sequence.<br />
<br />
<br />
I will give you an example  of a sequence database.<br />
<br />
For example, here are four sequences named s1, s2 and s3:<br />
<br />
s1: (a), (b, c), (d), (e)<br />
s2: (b), (d), (e), (f)<br />
s3: (b), (d), (e), (f)<br />
<br />
The first sequence means that item &quot;a&quot; occured, and was followed by &quot;b&quot; and &quot;c&quot; at the same time, then followed by &quot;d&quot; and then by &quot;e&quot;<br />
<br />
The second sequence means that &quot;b&quot; was followed by &quot;d&quot;, followed by &quot;e&quot; followed by &quot;f.<br />
<br />
The third...<br />
<br />
So if you have a sequence database, you can find some subsequences that are common to several sequences with algorithms like GSP and PrefixSpan.<br />
<br />
For example, those algorithms could find that the subsequence (b), (e) appears in s1, s2 and s3.<br />
<br />
Another example is that (e), (f) appears in s1 and s2.<br />
<br />
So I just write this quickly to give you an idea of what those algorithms do.<br />
<br />
If you have only a single sequence these algorithm may not be appropriate unless you can divide your sequence in several sequences and that it make sense for your application to do that.<br />
<br />
Best,<br />
<br />
Philippe]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Wed, 16 May 2012 23:15:31 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,572,572#msg-572</guid>
            <title>Data format for Sequential Patterns with time-series</title>
            <link>http://forum.ai-directory.com/read.php?5,572,572#msg-572</link>
            <description><![CDATA[ Hello all,<br />
<br />
I am newbie in the sequential patterns mining. Right now, i am doing my internship and i will implement this method.<br />
<br />
I need help for the datasets that will fit for the sequential pattern with time-series.<br />
<br />
I have prepared an example of datasets like below : <br />
<br />
Time	           M_T_ambiante	DeltaT	M_T_Ext	M_Rayonnement<br />
20/06/2011 00:00	Medium	Normal	Medium	Low<br />
20/06/2011 00:05	Medium	Normal	Medium	Low<br />
20/06/2011 00:10	Medium	Normal	Medium	Low<br />
20/06/2011 00:15	Medium	Normal	Medium	Low<br />
20/06/2011 00:20	Medium	Normal	Medium	Low<br />
20/06/2011 00:25	Medium	Normal	Medium	Low<br />
20/06/2011 00:30	Medium	Normal	Medium	Low<br />
<br />
Is this datasets already can be used with sequential patterns like GSP or else?<br />
<br />
Thanks before your help.<br />
<br />
Regards,<br />
Yogi]]></description>
            <dc:creator>YougyZ</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Wed, 16 May 2012 05:36:54 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,322,571#msg-571</guid>
            <title>Re: 2012 data mining conference list</title>
            <link>http://forum.ai-directory.com/read.php?5,322,571#msg-571</link>
            <description><![CDATA[ The 15th International Conference on Discovery Science (DS-2012),<br />
Lyon, France, on 29-31 October 2012.<br />
Deadline: 25th May 2012<br />
Notification:  	8th July, 2012<br />
Format: (up to) 15 pages in Springer LNCS format]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Mon, 14 May 2012 08:59:14 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,33,570#msg-570</guid>
            <title>Re: Sequential pattern mining datasets</title>
            <link>http://forum.ai-directory.com/read.php?5,33,570#msg-570</link>
            <description><![CDATA[ You have to pass in a correct parameters to generator.<br />
<br />
Try run with these for help:<br />
<br />
ibm_gen_file.exe seq -help]]></description>
            <dc:creator>tisonet</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Sat, 12 May 2012 05:14:30 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,33,569#msg-569</guid>
            <title>Re: Sequential pattern mining datasets</title>
            <link>http://forum.ai-directory.com/read.php?5,33,569#msg-569</link>
            <description><![CDATA[ The IBM generator should work.  I have tried it yesterday and generated some datasets.  <br />
<br />
What kind of error did you get?<br />
<br />
Philippe]]></description>
            <dc:creator>Philippe</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Fri, 11 May 2012 10:21:47 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,33,568#msg-568</guid>
            <title>Re: Sequential pattern mining datasets</title>
            <link>http://forum.ai-directory.com/read.php?5,33,568#msg-568</link>
            <description><![CDATA[ but The IBM Quest Synthetic Data Generator cannot work at all.neither at windows xp or windows7. is there anything wrong?]]></description>
            <dc:creator>doris</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Fri, 11 May 2012 10:05:50 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,322,567#msg-567</guid>
            <title>Re: 2012 data mining conference list</title>
            <link>http://forum.ai-directory.com/read.php?5,322,567#msg-567</link>
            <description><![CDATA[ KDIR 2012 ( 4th International Conference on Knowledge Discovery and Information Retrieval), <br />
[<a href="http://www.kdir.ic3k.org" rel="nofollow" >www.kdir.ic3k.org</a>]<br />
October 4 - 7, 2012, Barcelona, Spain<br />
Format: see webpage]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Wed, 09 May 2012 18:36:51 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,520,566#msg-566</guid>
            <title>Re: [JAVA] how to calculate the maximum memory usage of a data mining algorithm</title>
            <link>http://forum.ai-directory.com/read.php?5,520,566#msg-566</link>
            <description><![CDATA[ Thanks you both<br />
<br />
i got so many things now and clear with it.<br />
<br />
yes philippe sir i also feel that algorithm run slow when memory reach at the highest point. like you mention 1 GB]]></description>
            <dc:creator>Dvijesh88</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Wed, 09 May 2012 07:55:01 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,546,565#msg-565</guid>
            <title>Re: Ph.D project in data mining</title>
            <link>http://forum.ai-directory.com/read.php?5,546,565#msg-565</link>
            <description><![CDATA[ ^^  <br />
<br />
These are some good topics!<br />
<br />
Graph mining are a very good topic in my opinion. It may need a little bit more mathematics than some other topics. But it should be interesting. In particular, social network mining is very popular now.<br />
<br />
Multimedia is also very promising. For example, mining video stream from camera to detect abnormal behavior, clustering similar music, discovering patterns in satellite pictures....<br />
<br />
You could also work on medical data.<br />
<br />
Also, temporal data is interesting to discover trends, repetitive patterns, etc.  It could be applied for example to stock market data, etc.<br />
<br />
Good luck!<br />
<br />
Phil]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 08 May 2012 21:45:47 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,520,564#msg-564</guid>
            <title>Re: [JAVA] how to calculate the maximum memory usage of a data mining algorithm</title>
            <link>http://forum.ai-directory.com/read.php?5,520,564#msg-564</link>
            <description><![CDATA[ Hello Dvijesh,<br />
<br />
&gt;&gt;&gt;&gt; this memory show that how much RAM used by JVM?<br />
<br />
The method <i>checkMemory</i>() is for checking how much memory is used by the program running inside the JVM <u>at a particular moment</u>.<br />
<br />
&gt;&gt;&gt; And if it is RAM than How it show that max memory is used by program because if program execute all the steps then RAM will be realesed.....<br />
<br />
To know what is the max memory, the trick is, as tisonet said, to check the memory several times during the execution.  So in my code, it means to call <i>checkMemory</i>() several time during the execution.<br />
<br />
For example, consider the PrefixSpan algorithm.  You could call the method checkMemory() after each database projection, for example.  Then, when the algorithm terminates the variable <i>maxMemory</i> will contain the maximum memory used. It may not be 100 % exact. But it will be very close to the real value.<br />
<br />
A better way may be to do as tisonet said and to check the memory before the algorithm starts check regularly an<br />
<br />
Beside that, I have one last thing to say about the memory. You probably know that Java use a garbage collector to free some memory when all the memory is used. The garbage collector is &quot;lazy&quot;. That means that it will only free memory when all the memory has been used. For example, if you have 1 GB of memory, the garbage collector will start to clean when the 1 GB is full. Why i'm talking about this? It is that if your program reach the point where the garbage collector need to free some memory, it will start to slow down your algorithm and if you are measuring the speed of your algorithm, it will give you some wrong measurements. So, I just want to say to be careful if you reach the limit of memory available on your computer because the garbage collector may interfere with your measurements.<br />
<br />
Philippe]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 08 May 2012 21:33:21 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,520,563#msg-563</guid>
            <title>Re: [JAVA] how to calculate the maximum memory usage of a data mining algorithm</title>
            <link>http://forum.ai-directory.com/read.php?5,520,563#msg-563</link>
            <description><![CDATA[ My implemenation in C#:<br />
<br />
 public class MemoryCrawler<br />
  {<br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
<br />
    private static long _maxMemoryInBytes = 0;<br />
    private static long _memoryBeforeRunInBytes = 0;<br />
    private static readonly object _mutex = new object();<br />
    private readonly Timer _memoryCrawler = new Timer(500);<br />
<br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
<br />
    public MemoryCrawler()<br />
    {<br />
      _memoryCrawler.AutoReset = true;<br />
      _memoryCrawler.Elapsed += MemoryCrawlerOnElapsed;<br />
    }<br />
  <br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
  <br />
    private static void MemoryCrawlerOnElapsed(object sender, ElapsedEventArgs elapsedEventArgs)<br />
    {<br />
      lock (_mutex)<br />
      {<br />
        long actualMemory = GC.GetTotalMemory(false);<br />
        _maxMemoryInBytes = actualMemory &gt; _maxMemoryInBytes ? actualMemory : _maxMemoryInBytes;<br />
      }<br />
    }<br />
<br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
  <br />
    public void Start()<br />
    {<br />
      _maxMemoryInBytes = 0;<br />
      _memoryBeforeRunInBytes = GC.GetTotalMemory(true);<br />
<br />
      _memoryCrawler.Start();<br />
    }<br />
<br />
    public void Stop()<br />
    {<br />
      _memoryCrawler.Stop();<br />
    }<br />
<br />
    public long GetMaxMemoryInMBytes()<br />
    {<br />
      return (_maxMemoryInBytes - _memoryBeforeRunInBytes) / 1048576;<br />
    }<br />
<br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
    /////////////////////////////////////////////////////////////////////////////////////////////<br />
  <br />
  }]]></description>
            <dc:creator>tisonet</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 08 May 2012 10:37:55 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,520,562#msg-562</guid>
            <title>Re: [JAVA] how to calculate the maximum memory usage of a data mining algorithm</title>
            <link>http://forum.ai-directory.com/read.php?5,520,562#msg-562</link>
            <description><![CDATA[ Thats true.<br />
<br />
I measure max. memory little bit different. <br />
Before starting mining I take memory snapshot. So I got variable memoryBeforeRun.<br />
Then Iam getting snapshot regularly during mining every x ms. So i got max. memory usage. <br />
<br />
Finally max. memory is equal to: memoryBeforeRun - maxMemory.]]></description>
            <dc:creator>tisonet</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 08 May 2012 10:33:14 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,520,561#msg-561</guid>
            <title>Re: [JAVA] how to calculate the maximum memory usage of a data mining algorithm</title>
            <link>http://forum.ai-directory.com/read.php?5,520,561#msg-561</link>
            <description><![CDATA[ one question i have in my mind sir.<br />
please let me clear<br />
<br />
this memory show that how much RAM used by JVM?<br />
<br />
And if it is RAM than How it show that max memory is used by program because if program execute all the steps then RAM will be realesed.....]]></description>
            <dc:creator>Dvijesh88</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 08 May 2012 07:22:24 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,546,560#msg-560</guid>
            <title>Re: Ph.D project in data mining</title>
            <link>http://forum.ai-directory.com/read.php?5,546,560#msg-560</link>
            <description><![CDATA[ I wanna do something but i dont know what.(:P) Thats funny.<br />
<br />
Do you wanna interesting data mining topic?<br />
What about data mining in astronomic data, graphs, multimedia or spatio-temporal data? Thats are good enough topic for get Ph. D. at my univerzity.]]></description>
            <dc:creator>tisonet</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 08 May 2012 03:47:42 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,546,559#msg-559</guid>
            <title>Re: Ph.D project in data mining</title>
            <link>http://forum.ai-directory.com/read.php?5,546,559#msg-559</link>
            <description><![CDATA[ Please also tell me good data mining projects.<br />
<br />
I would like to make Ph.D. in data mining. maybe clustering research. <br />
<br />
It is good topic?<br />
<br />
Dawn]]></description>
            <dc:creator>Dawn</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Mon, 07 May 2012 23:37:22 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,322,558#msg-558</guid>
            <title>Re: 2012 data mining conference list</title>
            <link>http://forum.ai-directory.com/read.php?5,322,558#msg-558</link>
            <description><![CDATA[ IEEE ICDM 2012 : <br />
[<a href="http://icdm2012.ua.ac.be" rel="nofollow" >icdm2012.ua.ac.be</a>]<br />
Brussels, Belgium<br />
Conference full paper submissions: June 18, 2012<br />
Paper notifications: September 18, 2012<br />
Format : 6 - 10 pages, IEEE format]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Sun, 06 May 2012 08:37:03 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,549,557#msg-557</guid>
            <title>Re: SPAM vs PrefixSpan</title>
            <link>http://forum.ai-directory.com/read.php?5,549,557#msg-557</link>
            <description><![CDATA[ yes sir<br />
you can ask <br />
give me mail ID, i will sent it to u.]]></description>
            <dc:creator>Dvijesh88</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Wed, 02 May 2012 03:16:23 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,549,556#msg-556</guid>
            <title>Re: SPAM vs PrefixSpan</title>
            <link>http://forum.ai-directory.com/read.php?5,549,556#msg-556</link>
            <description><![CDATA[ Dvijesh88 Wrote:<br />
-------------------------------------------------------<br />
... i sent you that paper.<br />
<br />
Can I ask for that paper?<br />
<br />
I think that solution for avoiding database scan is use ITEM_IS_EXIST_TABLE which was defined in LAPIN algorithm.]]></description>
            <dc:creator>tisonet</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 01 May 2012 15:55:31 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,322,555#msg-555</guid>
            <title>Re: 2012 data mining conference list</title>
            <link>http://forum.ai-directory.com/read.php?5,322,555#msg-555</link>
            <description><![CDATA[ MLG-2012: Tenth workshop on Mining and Learning with Graphs (MLG-2012).<br />
[<a href="http://dtai.cs.kuleuven.be/events/mlg2012/" rel="nofollow" >dtai.cs.kuleuven.be</a>]<br />
Edinburg, July 1st, 2012, Colocated with ICML 2012<br />
Deadline: May 7th<br />
Format: 8 pages (ICML format) or 3 pages (ICML format)]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 01 May 2012 13:18:16 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,549,554#msg-554</guid>
            <title>Re: SPAM vs PrefixSpan</title>
            <link>http://forum.ai-directory.com/read.php?5,549,554#msg-554</link>
            <description><![CDATA[ Hi Dvijesh,<br />
<br />
In my opinion, it would not make sense to use only a vertical database with prefixspan because PrefixSpan need to scan the sequences, which cannot be made efficiently with a vertical database.  Vertical databases are better for candidate generation like in ECLAT, SPAM or APRIORITID.<br />
<br />
But perhaps that it would be possible to do something like FPGrowth.  FPGrowth uses a horizontal database stored into a FPTree. But it also has a header table that allows to find all the sequences that contain an item. This header table could be considered as kind of vertical database. <br />
<br />
Perhaps that something like that could be made or has been already made for PrefixSpan. But i'm not sure how this information would be useful... or if it could be used to make the algorithm faster.  I did not think a  lot about this. :P<br />
<br />
Best,<br />
<br />
Phlippe]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 01 May 2012 13:15:20 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,23,553#msg-553</guid>
            <title>Re: Apriori algorithm in C# or Java</title>
            <link>http://forum.ai-directory.com/read.php?5,23,553#msg-553</link>
            <description><![CDATA[ Hi Priyanka,<br />
<br />
Please check <a href="http://www.philippe-fournier-viger.com/spmf/" rel="nofollow" >this page</a> for Java source code for Apriori.<br />
<br />
Philippe]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 01 May 2012 13:05:35 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,23,552#msg-552</guid>
            <title>Re: Apriori algorithm in C# or Java</title>
            <link>http://forum.ai-directory.com/read.php?5,23,552#msg-552</link>
            <description><![CDATA[ can u please send me apriori algorithm in c++ or java]]></description>
            <dc:creator>priyanka</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 01 May 2012 09:21:32 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,549,551#msg-551</guid>
            <title>Re: SPAM vs PrefixSpan</title>
            <link>http://forum.ai-directory.com/read.php?5,549,551#msg-551</link>
            <description><![CDATA[ hello sir,<br />
you are right about PrefixSpan as far i know.<br />
 <br />
yes that both problem has been solved by one algorithm atleast it claim that they over come both the problem which mention here. i sent you that paper.<br />
<br />
so sir cant we represent the dataset in vertical format and apply prefixspan? yeah we have to do some changes in algorithm....<br />
this is just question. and idea i can be wrong.]]></description>
            <dc:creator>Dvijesh88</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Sun, 29 Apr 2012 10:27:34 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,549,550#msg-550</guid>
            <title>Re: SPAM vs PrefixSpan</title>
            <link>http://forum.ai-directory.com/read.php?5,549,550#msg-550</link>
            <description><![CDATA[ Hello Dvijesh,<br />
<br />
Here are my quick observations about SPAM and PrefixSpan. Maybe I forgot some elements.<br />
<br />
What is good about SPAM:<br />
- it uses a  bitmap representation which is memory efficient. There is several optimizations that are posible with bitmaps.<br />
- it uses a vertical representation of the database so that the database only need to be scanned once to create the vertical representation.<br />
- it is very fast to calculate the intersection of two sids sets (sets of sequence ids) by doing a kind of logical AND with two bitmaps.<br />
-. ...<br />
<br />
The weak points of SPAM: <br />
- if the sequences are very long, the memory usage will go up because each bitmap take more memory space.  If there is a lot of frequent items, more bitmaps will need to be stored into memory.<br />
- SPAM generate candidates and then compute the sids set of the candidate to calculate its support. It may generate many candidates that are not frequent, therefore wasting time. It can also generate candidates that do not appear in the database.<br />
- it is difficult to extend SPAM with additional constraints compared to PrefixSpan<br />
- the database has to be stored in memory<br />
-. ...<br />
<br />
What is good about PrefixSpan<br />
- a pattern growth approach. It will not generate candidates that do not appear in the database.<br />
- it includes some optimizations like pseudo-projection, etc.<br />
- it is easy to extend PrefixSpan with additional constraints, for mining closed patterns, etc.<br />
-. ...<br />
<br />
What is weak about PrefixSpan<br />
- unlike SPAM, it does not use a vertical representation. So it may need to scan the database several times (even if it uses pseudo-projection).<br />
- the database has to be stored in memory (could be stored on disk, but would be slower)<br />
- ...]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Sat, 28 Apr 2012 18:55:49 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,549,549#msg-549</guid>
            <title>SPAM vs PrefixSpan</title>
            <link>http://forum.ai-directory.com/read.php?5,549,549#msg-549</link>
            <description><![CDATA[ I just want to know which algorithm better? please tell me with all points.<br />
where to use these algorithms?]]></description>
            <dc:creator>Dvijesh88</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Sat, 28 Apr 2012 14:08:02 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,546,548#msg-548</guid>
            <title>Re: Ph.D project in data mining</title>
            <link>http://forum.ai-directory.com/read.php?5,546,548#msg-548</link>
            <description><![CDATA[ Tenjiku Wrote:<br />
-------------------------------------------------------<br />
&gt; Hi. I'm a student. I want to do a Ph.D. in data<br />
&gt; mining but i don't know which topic is<br />
&gt; interesting. Could you recommend me research<br />
&gt; topics about data mining?  I have heard of doing<br />
&gt; decision tree but i think that it is too simple.<br />
&gt; Could you give me a good research topics for doing<br />
&gt; my phd in data mining????]]></description>
            <dc:creator>Azrs</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Sat, 28 Apr 2012 02:45:18 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,322,547#msg-547</guid>
            <title>Re: 2012 data mining conference list</title>
            <link>http://forum.ai-directory.com/read.php?5,322,547#msg-547</link>
            <description><![CDATA[ A french conference:<br />
<br />
BDA 2012 (Bases de Données Avancées 2012)<br />
[<a href="http://bda2012.isima.fr/" rel="nofollow" >bda2012.isima.fr</a>]<br />
23 au 26 oct 2012, Clermont-Ferrand, France<br />
Format: 20 pages A4<br />
Deadline: 7 may 2012<br />
Notification: 12 july 2012]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Fri, 27 Apr 2012 20:53:31 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,546,546#msg-546</guid>
            <title>Ph.D project in data mining</title>
            <link>http://forum.ai-directory.com/read.php?5,546,546#msg-546</link>
            <description><![CDATA[ Hi. I'm a student. I want to do a Ph.D. in data mining but i don't know which topic is interesting. Could you recommend me research topics about data mining?  I have heard of doing decision tree but i think that it is too simple. Could you give me a good research topics for doing my phd in data mining????]]></description>
            <dc:creator>Tenjiku</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Thu, 26 Apr 2012 00:05:20 -0400</pubDate>
        </item>
        <item>
            <guid>http://forum.ai-directory.com/read.php?5,322,545#msg-545</guid>
            <title>Re: 2012 data mining conference list</title>
            <link>http://forum.ai-directory.com/read.php?5,322,545#msg-545</link>
            <description><![CDATA[ MMDS 2012. Workshop on Algorithms for Modern Massive Data Sets<br />
Stanford, USA<br />
Date: July 10–13, 2012<br />
Deadline: unknown<br />
Format: unknown<br />
[<a href="http://www.stanford.edu/group/mmds/" rel="nofollow" >www.stanford.edu</a>]]]></description>
            <dc:creator>webmasterphilfv</dc:creator>
            <category>The Data Mining Forum</category>
            <pubDate>Tue, 24 Apr 2012 19:31:16 -0400</pubDate>
        </item>
    </channel>
</rss>

