Tomorrow

webmasterphilfv

1. CFP [Deadline extended] - CFP: International Conference on Big Data and Cybersecurity Intelligence

*** We apologize for the unavoidable cross-posting ***
We are pleased to invite contributions for BDCSIntell’2018, taking place in Beirut, Lebanon, organized by Lebanese University in collaboration with Université Versailles St-Quentin (UVSQ, France) and Cognitus (a French R&D accelerator). See the details of the conference below.
Conference Website: http://www.cognitus.fr/BDCSIntell201

Today

webmasterphilfv

Hi all,
Just to let everyone know that I have posted a video presentation of the MinFHM algorithm:
http://www.philippe-fournier-viger.com/spmf/videos/MinFHM.php
It gives an overview of the paper but does not discuss all the details.
Best,
Philippe

3 days ago

webmasterphilfv

IEA/AIE-2019 CALL FOR PAPERS
Sponsored by: International Society of Applied Intelligence (ISAI)
In Cooperation with: Association for the Advancement of Artificial Intelligence (AAAI), Association for Computing Machinery (ACM/SIGART), Austrian Association for Artificial Intelligence (OEGAI), Catalan Association for Artificial Intelligence (ACIA), Graz University of Technology, Austria, Italian

4 days ago

webmasterphilfv

In pattern mining, there are many possible topics:
- new applications of algorithms
- designing faster algorithms for an existing pattern mining problem (you can choose a problem and find a way of improving the performance)
- designing a new problem for pattern mining (you need to think about some limitations of the current pattern mining problems and think about something new that you can

4 days ago

webmasterphilfv

You can check the new paper of my collaborator in TKDE:
Truong, T., Duong, H., Le, B., Fournier-Viger, P. (2018). Efficient Vertical Mining of High Average-Utility Itemsets based on Novel Upper-Bounds. IEEE Transactions on Knowledge and Data Engineering (TKDE), to appear.
He presents a new average utility mining algorithm which is faster than the most recent algorithms for this problem usin

4 days ago

webmasterphilfv

Welcome to the forum.
There are a lot of ways that pattern mining or itemset mining can be used to analyze text.
For example, itemset mining has been used in information retrieval to search for documents. In that case, we can extract itemsets representing frequent words common to a group of similar documents.
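As a rough illustration of the idea above, here is a minimal Python sketch that treats each document as a transaction (its set of distinct words) and keeps the word sets shared by enough documents. The function name `frequent_word_sets`, the sample documents, and the `min_support` and `max_size` parameters are assumptions made for this example. It uses brute-force enumeration, not Apriori or any algorithm from SPMF, so it is only suitable for tiny inputs.

```python
from collections import Counter
from itertools import combinations


def frequent_word_sets(documents, min_support, max_size=2):
    """Return word sets that occur in at least min_support documents."""
    # Each document becomes a transaction: its set of distinct lowercase words.
    transactions = [set(doc.lower().split()) for doc in documents]
    counts = Counter()
    for words in transactions:
        # Enumerate every word set of size 1..max_size in this document.
        for size in range(1, max_size + 1):
            for itemset in combinations(sorted(words), size):
                counts[itemset] += 1
    # Keep only the word sets that reach the support threshold.
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


# Hypothetical sample documents.
docs = [
    "pattern mining finds frequent patterns",
    "frequent pattern mining in text",
    "graph mining finds subgraphs",
]
result = frequent_word_sets(docs, min_support=2)
```

With these sample documents, a word set such as ("frequent", "pattern") is kept because it appears in two documents. For real collections, a proper itemset mining algorithm would be needed, because the number of candidate word sets grows very quickly.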
We can also use pattern mining to discover hidden features in text such as the w

6 days ago

webmasterphilfv

Hi Nima,
I see.
I remember that a few years ago I saw a paper about an algorithm called FPGraph or something like that. Someone contacted me by e-mail to ask me to implement this algorithm. But if I remember correctly, it was not published in a good journal, and the algorithm may even have been incorrect, because the paper lacked details.
After that I did not

6 days ago

webmasterphilfv

Hello,
Yes, there are many algorithms to discover patterns in graphs.
For example, there are several algorithms to discover frequent subgraphs in a graph or in multiple graphs, such as gSpan and FSG. For an overview of frequent subgraph mining, here is an introduction to frequent subgraph mining. Besides that, people also discover other types of patterns such as cliques in grap

8 days ago

webmasterphilfv

5. The 2nd International Workshop on Big Data Analytics for Cyber Intelligence and Defense (BDA4CID 2018)
6. 6th International Workshop on Distributed Storage and Blockchain Technologies for Big Data
7. The 2nd International Workshop on Big Data Analytic for Cybercrime Investigation and Prevention
8. 4th IEEE Workshop on Big Data Analytics in Supply Chains and Transportation

8 days ago

webmasterphilfv

I have spent one hour updating the list with a few more conferences. If you think I missed some other important conferences, please post below.

10 days ago

webmasterphilfv

SIAM International Conference on Data Mining (SDM19)
Sponsored by the SIAM Activity Group on Data Mining and Analytics.
This conference is held in cooperation with the American Statistical Association.
Location:
Hyatt Regency Calgary
Calgary, Alberta, Canada
Dates:
May 2-4, 2019
Conference General Co-chairs:
Tanya Berger-Wolf, University of Illinois, U.S.
Nitesh Chawla, Universi

10 days ago

webmasterphilfv

CALL FOR PAPERS
The Second IEEE International Workshop on Benchmarking, Performance Tuning and Optimization for Big Data Applications (BPOD 2018)
Collocated with IEEE BigData 2018
One day during December 10-13, 2018, Seattle, WA, USA
Website: https://userpages.umbc.edu/~jianwu/BPOD/
=============================================
Users of big data are often not computer scientists. On t

14 days ago

webmasterphilfv

Have you tried to contact the corresponding author of that paper?
Best,
Philippe

20 days ago

webmasterphilfv

Hi all,
This is to let you know that a new version of SPMF has been released.
The main changes are as follows:
3 new algorithm implementations by Siddharth Dawar, Vikram Goyal et al.:
- HMDS algorithms for mining the top-k high utility itemsets in a data stream
- HMiner for high utility itemset mining
- UP-Hist for high utility itemset mining
1 algorithm implementation by Cheng W

20 days ago

webmasterphilfv

Dear Hung,
Just to let you know that I updated SPMF a few minutes ago. I have included your bug fix for MinFHM. Besides that, I have added 5 new algorithms!
Also, as promised, you are now on the list of contributors. ;-)
Thanks for your help!
Best,
Philippe

21 days ago

webmasterphilfv

Dear Yao,
Thanks for your interest in our paper. Yes, these algorithms are indeed very fast, as shown in the paper.
If it were up to me, I would put the code in SPMF, but although I am a co-author, I am not the main author of this paper. Please contact Tin Truong Chi or Bac Le to request the source code.
Best regards,
Philippe

24 days ago

webmasterphilfv

Hi,
Thanks for using SPMF. There are some datasets on the SPMF website, but you are right that they do not provide the names of the webpages.
The original FIFA dataset contained this information: http://ita.ee.lbl.gov/html/contrib/WorldCup.html
But it is not in SPMF format. So you could convert it to SPMF format again and keep the page label. That would be a solution to obtain click streams

27 days ago

webmasterphilfv

AIAI 2019 @ Crete, Greece – 1st Call for Papers
Dear colleagues
We would like to invite you to submit your work to the 15th International Conference on Artificial Intelligence Applications and Innovations - AIAI 2019 http://www.aiai2019.eu.
15th AIAI 2019 is supported by the IFIP (International Federation for Information Processing, WG12.5)
In 2019 the 15th AIAI Conference will

4 weeks ago

webmasterphilfv

Dear all,
Just to let you know that the PDFs of articles from the UDM 2018 workshop on utility driven mining are online at:
http://philippe-fournier-viger.com/utility_mining_workshop_2018/program.php
Best regards,
Philippe

4 weeks ago

webmasterphilfv

Hi,
Yes, I think you are right. There seems to be a bug in the implementation. Thanks for reporting it. I should release a new version of SPMF in about a week and a half because I will have a week of holiday. I will then fix the bug, and also add several new algorithms related to high utility itemset mining that some people have sent to me recently.
By the way, I will also add your name to th

5 weeks ago

webmasterphilfv

Thanks Dang,
I see. An XML format. Personally, I don't like XML-based formats very much. They waste a lot of space with all these tags. The text-based format of SPMF already takes a lot of space because it is a text file. This format might make the output files 10 times larger or more. Just my opinion. But I understand that it can be useful for interoperability with other software.
Is it w

5 weeks ago

webmasterphilfv

Looks like an interesting concept. Wish you good luck with your product.
Philippe

5 weeks ago

webmasterphilfv

IEEE Big Data 2018 Call for Workshop Papers & Posters
2018 IEEE International Conference on Big Data (BigData 2018)
http://cci.drexel.edu/bigdata/bigdata2018/index.html
Dec 10-13 2018, Seattle, WA, USA
The IEEE Big Data 2018 has received more than 600 full papers in the main conference and industry and government program. If you miss the submission deadline, there are still chances f

5 weeks ago

webmasterphilfv

Hi,
Thanks for using SPMF. I do not know what the PMML format is. But if you are comfortable with Java, you could modify the code for writing the rules to the file. This should not be hard. In SPMF, each algorithm is in a separate package. So you could first find the code of the algorithm that you want to modify and then change the code.
But what is PMML? Can you give me a link to a websit

6 weeks ago

webmasterphilfv

Hi,
There are many possible topics. You can choose to work on something more fundamental like algorithm design or something more applied such as how to best solve a given applied problem.
A good way of choosing a research problem is to look at some recent papers and find something that you are interested in. Personally, I am quite interested in pattern mining problems and algorithm desi

7 weeks ago

webmasterphilfv

Hi all,
It is my pleasure to announce that my data mining blog is now also available in Chinese:
The data mining blog (Chinese).
About every week, some articles will be translated into Chinese and posted on this Chinese version of the blog. I will not translate all the content of my English blog, but the most important posts will be translated. Besides, some guest authors may also write blog p

7 weeks ago

webmasterphilfv

You may export the data from the database to a text file in the proper format and then apply Apriori to the text file to obtain the result.
But it depends on your implementation of Apriori. If you are using the implementation from the SPMF software, then you should read the documentation to see which format is required as input.
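To make the export step more concrete, here is a minimal Python sketch. It assumes a hypothetical SQLite table named `purchases` with columns `transaction_id` and `item_id`, and it assumes the target layout is one transaction per line, with items written as integers separated by single spaces and sorted in ascending order. The table, the column names, and the helper functions are all made up for this example, and the layout should be checked against the documentation of the Apriori implementation that you use.

```python
import sqlite3


def transactions_as_lines(conn):
    """Return one text line per transaction, items as space-separated integers."""
    # DISTINCT removes repeated items; ORDER BY sorts items inside a transaction.
    rows = conn.execute(
        "SELECT DISTINCT transaction_id, item_id FROM purchases "
        "ORDER BY transaction_id, item_id"
    )
    grouped = {}
    for transaction_id, item_id in rows:
        grouped.setdefault(transaction_id, []).append(str(item_id))
    # Dictionaries keep insertion order, so lines follow transaction_id order.
    return [" ".join(items) for items in grouped.values()]


def write_lines(lines, path):
    """Write the transaction lines to a text file, one per line."""
    with open(path, "w") as out:
        out.write("\n".join(lines) + "\n")


# Hypothetical in-memory database standing in for the real one.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE purchases (transaction_id INTEGER, item_id INTEGER)")
conn.executemany(
    "INSERT INTO purchases VALUES (?, ?)",
    [(1, 3), (1, 1), (2, 2), (2, 5), (2, 2), (3, 1)],
)
lines = transactions_as_lines(conn)
```

Calling `write_lines(lines, "transactions.txt")` would then produce the text file to give to Apriori. If the items in your database are names rather than integers, you would also need to map each name to an integer first.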
Best

8 weeks ago

webmasterphilfv

If your data has time information then yes.

2 months ago

webmasterphilfv

Hello,
Thanks for reading our papers ;-) It is a little bit late, so I will answer the easy questions first, and answer other questions maybe tomorrow.
Philippe
> after I read the article of EFIM and I'm lost at
> certain page.
> correct me if my understanding is not correct.
> The FHM(you created) is able to
> accelerate(improve) the performance of MUI-MINER,
>

