The Data Mining Forum                             open-source data mining software data science journal data mining conferences high utility mining workshop
This forum is about data mining, data science and big data: algorithms, source code, datasets, implementations, optimizations, etc. You are welcome to post call for papers, data mining job ads, link to source code of data mining algorithms or anything else related to data mining. The forum is hosted by P. Fournier-Viger. No registration is required to use this forum!.  
contentious in Decision Tree
Posted by: Nail
Date: July 30, 2019 11:52AM

In Decision Tree how we get to know the range of contentious attribute .
As in the image below it is written >80 nand <80


Options: ReplyQuote
Re: contentious in Decision Tree
Date: July 31, 2019 05:11AM

Let say that you want to build a decision tree using a purity measure such as GINI.

If you want to split using some ranges of values for some continuous attribute such as age > 80 in a decision tree, a simple way is to calculate the GINI measures for different split:
age > 80 / age <= 80
age > 75 / age <= 75
age > 70 / age <= 70
...

Then, you choose the best split. For example if age > 70 / age <= 70 allows to obtain the best GINI value, then you choose this.

HOpe this helps

Options: ReplyQuote


This forum is powered by Phorum and provided by P. Fournier-Viger (© 2012).
Terms of use.