
Classification problems aim to identify the characteristics that indicate the group to which each case belongs. The resulting pattern can be used both to understand the existing data and to predict how new instances will behave. For example, you may want to predict whether an individual is likely to respond to a direct mail solicitation, is vulnerable to switching to a competing long-distance phone service, or is a good candidate for a surgical procedure.


Data mining creates classification models by examining already classified data (cases) and inductively finding a predictive pattern. These existing cases may come from a historical database, such as people who have already undergone a particular medical treatment or moved to a new long-distance service. They may come from an experiment in which a sample of the entire database is tested in the real world and the results are used to create a classifier. For example, a sample of a mailing list would be sent an offer, and the results of the mailing used to develop a classification model to be applied to the entire database. Sometimes an expert classifies a sample of the database, and this classification is then used to create the model that will be applied to the entire database.
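
The mailing-list experiment can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the database, the `prior_purchases` attribute, the `send_offer` stand-in for the real mailing, and the every-7th-customer sample are all invented so that the example runs, and the "model" is simply the majority outcome observed for each customer segment.

```python
from collections import Counter, defaultdict

# Hypothetical database: each customer has an id and a count of prior purchases.
database = [{"id": i, "prior_purchases": i % 5} for i in range(1000)]

def send_offer(customer):
    """Stand-in for the real-world mailing: True if the customer responded.
    The rule is invented purely so the example runs."""
    return customer["prior_purchases"] >= 3

# 1. Test a sample of the database in the real world (every 7th customer here).
sample = database[::7]
outcomes = [(c, send_offer(c)) for c in sample]

# 2. Build a classifier from the results: the majority outcome per segment.
votes = defaultdict(Counter)
for customer, responded in outcomes:
    votes[customer["prior_purchases"]][responded] += 1
model = {segment: counts.most_common(1)[0][0] for segment, counts in votes.items()}

# 3. Apply the classifier to the entire database.
def predict(customer):
    # A segment never seen in the sample is assumed not to respond.
    return model.get(customer["prior_purchases"], False)

targets = [c["id"] for c in database if predict(c)]
print(len(sample), "customers sampled;", len(targets), "selected for the full mailing")
```

In practice the sample would be drawn at random and the responses would come from the actual mailing rather than from a function.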

Classification, perhaps the most commonly applied data mining technique, employs a set of pre-classified examples to develop a model that can classify the population of records at large. Fraud detection and credit-risk applications are particularly well suited to this type of analysis. This approach frequently employs decision-tree or neural-network-based classification algorithms. The use of classification algorithms begins with a training set of pre-classified example transactions. For a fraud detection application, this would include complete records of both fraudulent and valid activities, determined on a record-by-record basis. The classifier training algorithm uses these pre-classified examples to determine the set of parameters required for proper discrimination. The algorithm then encodes these parameters into a model called a classifier.
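
To make the training step concrete, here is a minimal sketch of the simplest possible decision tree, a one-level "stump", written in plain Python. It is not a production algorithm: the transaction records, the two features, and the function names `train_stump` and `classify` are assumptions made for illustration. The "parameters" it determines are a feature, a threshold, and the class to predict on each side of that threshold.

```python
from collections import Counter

def train_stump(records, labels):
    """Learn a one-level decision tree: one (feature, threshold) test
    plus the class to predict on each side of it."""
    best = None
    for f in range(len(records[0])):
        values = sorted({r[f] for r in records})
        # Candidate thresholds lie midway between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for r, y in zip(records, labels) if r[f] <= t]
            right = [y for r, y in zip(records, labels) if r[f] > t]
            left_class = Counter(left).most_common(1)[0][0]
            right_class = Counter(right).most_common(1)[0][0]
            # Training errors if each side predicts its majority class.
            errors = (sum(y != left_class for y in left)
                      + sum(y != right_class for y in right))
            if best is None or errors < best["errors"]:
                best = {"feature": f, "threshold": t, "left": left_class,
                        "right": right_class, "errors": errors}
    return best

def classify(model, record):
    """Apply the encoded parameters to a single record."""
    side = "left" if record[model["feature"]] <= model["threshold"] else "right"
    return model[side]

# Hypothetical pre-classified transactions: (amount, transactions in the last hour).
training_records = [(20, 1), (35, 2), (15, 1), (900, 9), (1200, 7), (50, 1), (700, 8)]
training_labels = ["valid", "valid", "valid", "fraud", "fraud", "valid", "fraud"]

model = train_stump(training_records, training_labels)
print(model)
print(classify(model, (1000, 8)))
```

A full decision tree repeats this split search recursively on each side, and real implementations usually score splits with an impurity measure rather than raw error counts.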


The choice of algorithm affects the explanation capability of the system: a decision tree yields rules that a person can read, whereas a neural network's reasoning is much harder to inspect. Once an effective classifier is developed, it is used in a predictive mode to classify new records into these same predefined classes. For example, a classifier capable of identifying risky loans could be used to aid in the decision of whether to grant a loan to an individual.
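
As a rough sketch of predictive mode, the snippet below applies a decision-tree-style classifier to new loan applications and reports which rule produced each prediction, which is one simple form of explanation capability. The field names, thresholds, and applicants are hypothetical and hand-written here; in a real system the rules would be learned from pre-classified loans.

```python
# Hypothetical decision-tree-style classifier for loan applications.
# Thresholds and field names are invented for illustration, not learned from data.
def classify_loan(applicant):
    """Return (predicted class, the rule that produced it)."""
    if applicant["debt_to_income"] > 0.45:
        return "risky", "debt_to_income > 0.45"
    if applicant["missed_payments"] >= 2:
        return "risky", "debt_to_income <= 0.45 and missed_payments >= 2"
    return "safe", "debt_to_income <= 0.45 and missed_payments < 2"

# New, unclassified records arriving after the classifier was built.
new_applications = [
    {"name": "A", "debt_to_income": 0.52, "missed_payments": 0},
    {"name": "B", "debt_to_income": 0.30, "missed_payments": 3},
    {"name": "C", "debt_to_income": 0.20, "missed_payments": 0},
]

for app in new_applications:
    label, reason = classify_loan(app)
    print(f"{app['name']}: {label} ({reason})")
```

The prediction is meant to aid the loan decision, not replace it; the reported rule gives the reviewer something to check.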