数据挖掘导论英文chap4-basic-classification.ppt

上传人:99****p 文档编号:1420385 上传时间:2019-02-25 格式:PPT 页数:101 大小:1.74MB
下载 相关 举报
数据挖掘导论英文chap4-basic-classification.ppt_第1页
第1页 / 共101页
数据挖掘导论英文chap4-basic-classification.ppt_第2页
第2页 / 共101页
数据挖掘导论英文chap4-basic-classification.ppt_第3页
第3页 / 共101页
数据挖掘导论英文chap4-basic-classification.ppt_第4页
第4页 / 共101页
数据挖掘导论英文chap4-basic-classification.ppt_第5页
第5页 / 共101页
点击查看更多>>
资源描述

1、Data Mining Classification: Basic Concepts, Decision Trees, and Model EvaluationLecture Notes for Chapter 4Introduction to Data MiningbyTan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 1 Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 2 Classification: Defi

2、nitionlGiven a collection of records (training set ) Each record contains a set of attributes, one of the attributes is the class.lFind a model for class attribute as a function of the values of other attributes.lGoal: previously unseen records should be assigned a class as accurately as possible. A

3、 test set is used to determine the accuracy of the model. Usually, the given data set is divided into training and test sets, with training set used to build the model and test set used to validate it. Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 3 Illustrating Classification Task Tan,

4、Steinbach, Kumar Introduction to Data Mining 4/18/2004 4 Examples of Classification TasklPredicting tumor cells as benign or malignantlClassifying credit card transactions as legitimate or fraudulentlClassifying secondary structures of protein as alpha-helix, beta-sheet, or random coillCategorizing

5、news stories as finance, weather, entertainment, sports, etc Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 5 Classification TechniqueslDecision Tree based MethodslRule-based MethodslMemory based reasoninglNeural NetworkslNave Bayes and Bayesian Belief NetworkslSupport Vector Machines Ta

6、n,Steinbach, Kumar Introduction to Data Mining 4/18/2004 6 Example of a Decision TreecategoricalcategoricalcontinuousclassRefundMarStTaxIncYESNONONOYes NoMarried Single, Divorced80KSplitting AttributesTraining Data Model: Decision Tree Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 7 Ano

7、ther Example of Decision Treecategoricalcategoricalcontinuousclass MarStRefundTaxIncYESNONONOYes NoMarried Single, Divorced80KThere could be more than one tree that fits the same data! Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 8 Decision Tree Classification TaskDecision Tree Tan,Ste

8、inbach, Kumar Introduction to Data Mining 4/18/2004 9 Apply Model to Test DataRefundMarStTaxIncYESNONONOYes NoMarried Single, Divorced80KTest DataStart from the root of tree. Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 10 Apply Model to Test DataRefundMarStTaxIncYESNONONOYes NoMarried Single, Divorced80KTest Data

展开阅读全文
相关资源
相关搜索

当前位置:首页 > 教育教学资料库 > 课件讲义

Copyright © 2018-2021 Wenke99.com All rights reserved

工信部备案号浙ICP备20026746号-2  

公安局备案号:浙公网安备33038302330469号

本站为C2C交文档易平台,即用户上传的文档直接卖给下载用户,本站只是网络服务中间平台,所有原创文档下载所得归上传人所有,若您发现上传作品侵犯了您的权利,请立刻联系网站客服并提供证据,平台将在3个工作日内予以改正。