The Data Mining Forum                             open-source data mining software open-source data mining software data science journal data mining conferences
This forum is about data mining, data science and big data: algorithms, source code, datasets, implementations, optimizations, etc. You are welcome to post call for papers, data mining job ads, link to source code of data mining algorithms or anything else related to data mining. The forum is hosted by P. Fournier-Viger. No registration is required to use this forum!.  

Current Page: 1 of 1
Results 1 - 16 of 16
3 months ago
tassieTom
Hi Philippe, I thought I would share some interesting data/results. Some German casinos put their nightly roulette results on line, spin by spin in time order. You can get years of results if you want. Here is one casino: https://www.spielbank-wiesbaden.de/index.php?id=105&view=archiv&table=TB2AS&date=02.01.2019# This is table by table with full statistics. Now if roulette
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
Hi Philippe, Thanks for taking the time to answer this. Your response has been very helpful. By consistent I meant the same output with the same input. It looks a bit like a monte carlo run where there is variation in output, but you explained it very well. I will post what I am using it for a little later today, then you will see what I mean. Regards, Tom Berger
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
Arthur, Leighton Vaughan Williams is based in the UK and used to have a blog.I think he called himself the Betfair Professor. He had access to some studies. Another tip is also to get Google Scholar and type in various search terms such as tennis market efficiency, favourite longshot bias, and market bias. Tennis has a few great papers on the favourite longshot bias. Good luck. Tom Berger
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
Hi Philippe, I did work on the horses and greyhounds in the 80's and 90's with fairly primitive software and computers compared with what is available today. The problem is that the market is SEMI efficient, meaning that not all the information is in the prices, but MOST is. This makes the public fairly accurate in predicting winners. So to be more accurate than the public, a LOT of data is
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
I had an additional question in this post, but solved it--ignore my post here please. Tom Berger
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
Hi Philippe, I tested your new web site and it loads fine on my end...very quick. It has a simple, more modern look. Fonts etc are all good and present well over various computers. Regards, Tom Berger
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
Information Efficiency in Financial and Betting Markets Hardcover – 23 Nov 2005 by Leighton Vaughan Williams. This consists of various papers and has terrific information on tennis and horse racing, market efficiency, arbitrage etc.About 380 pages, recommended. If you want to know anything about the horse racing markets from around the world, various papers have been compiled into a whopp
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
Philippe, thank you very much for your help, I appreciate it. This is fantastic, it lets you see what is going on by adding a visual aspect. It's all working well! I am testing some interesting things, will post results in the near future if it turns out worthwhile. Many thanks again! Regards, Tom
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
Philippe, sorry to be a pain, but the java doesn't compile, I get the following error: C:\Users\Administrator\Desktop>javac MainTestConvertToDotFormat.java MainTestConvertToDotFormat.java:75: error: incompatible types: Object cannot be converted to String for(String otherVertex : edge.getValue()) { ^ Note: MainTestConv
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
This is brilliant, thank you Philippe. Keep up your wonderful work. Many thanks, Tom
Forum: The Data Mining / Big Data Forum
3 months ago
tassieTom
Hi Philippe, Thank you for writing an amazingly useful article on visualising the output of SPMF using graphviz. http://data-mining.philippe-fournier-viger.com/discovering-visualizing-sequential-patterns-web-log-data-using-spmf/ I was able to replicate your FIFA results using your Java script. However, what I dont understand is what it is doing. It is "condensing" or shrinking t
Forum: The Data Mining / Big Data Forum
8 months ago
tassieTom
Thank you very much Philippe. Regards, Tom
Forum: The Data Mining / Big Data Forum
8 months ago
tassieTom
Hi Phillipe I have read your papers on authorship attribution. I am interested in the output of TKS -- how you rank the most likely candidate. It seems you are using a script or customised software to apply the output of TKS to create percentages, discard most common sequences and rank the most likely candidates. Is this script/program available or is it propriety? I have output TKS from 8
Forum: The Data Mining / Big Data Forum
8 months ago
tassieTom
Hi Philippe, Here is the basic version of a BAT file that will process multiple .text files with TKS in a folder. For simplicity there is no directory checking, so SMPF and the .text files need to be in the same folder. for %%f in (*.text) do ( echo %%~nf java -jar spmf.jar run TKS "%%~nf.text" "%%~nfOutput.txt" 50 1 4 "" 1 0 ) That's it! Hope it
Forum: The Data Mining / Big Data Forum
8 months ago
tassieTom
Thanks for your response, Philippe. You gave me a clue by mentioning a Bat file for windows. I did a bit of research and managed to get one running which will process all the files in a folder. It works like a charm! Thanks again.
Forum: The Data Mining / Big Data Forum
8 months ago
tassieTom
Good morning, I am experimenting with skipgrams and TKS. The results are very impressive in text files that I am using. I am wondering if it is possible to batch process a whole folder of txt files instead of entering the input files one at at time in SPMF. Many thanks, Tom
Forum: The Data Mining / Big Data Forum
Current Page: 1 of 1

This forum is powered by Phorum and provided by P. Fournier-Viger (© 2012).
Terms of use.