DATA MINING TOOLS FOR THE SIX SIGMA PRACTITIONER




 Training materials:  XL Miner software (trial version);  Data Mining Tools student participant guide.  Recommended (but not required) texts:   The Elements of Statistical Learning by T. Hastie, R. Tibshirani,  J. H. Friedman;  Principles of Data Mining (Adaptive Computation and Machine Learning) by David J. Hand, Heikki Mannila, Padhraic Smyth.

Course agenda

            I. OVERVIEW OF DATA MINING
                    • Definitions and tasks 

                    • Steps in the data mining process 

                    • Facts versus myths 

                    • Data mining applications

            II. PREPATORY STEPS
                    • Data preparation (missing data, outlier analysis, data types) 

                    • Data visualization 

                    • Data dictionaries

            III. BACKGROUND ON MODELING
                    • The curse of dimensionality     

                    • Notation and terms 

                    • Bias-variance tradeoff 

                    • Control of the bias-variance tradeoff 

                    • Error functions

            IV. TRADITIONAL MODELS
                    • Linear regression procedures 

                    • Logistic regression 

                    • Discriminant analysis 

                    • Nearest neighbors 

                    • Clustering algorithms

            V. MODERN MODELS
                    • Classification and Regression Trees 

                    • Neural networks 

                    • Bump hunting 

                    • Association rules 

                    • Evaluating and combining models (Bagging, boosting, MART) 

                    • Survey of recent developments (as time permits)