Machine Learning –I

Paper Code: 
MBB 323
Credits: 
4
Contact Hours: 
90.00
Max. Marks: 
100.00
Objective: 

Course outcomes

Learning and teaching

Strategies

Assessment

Strategies

On completion of this course, the students will be able to;

CO 1. Formulate a problem for business analytics.

CO 2. Install python and orange tool for machine learning implementation on business problem.

CO 3. Prepare the dataset for computation after collected it from the business domain based data source.

CO 4. Select suitable machine learning technique for designing a model.

CO 5. Develop a machine learning model for business problems.

CO 6. Evaluate and compare the performance

of machine learning models.

Approach in teaching: Interactive Lectures, GroupDiscussion, Tutorials, Case Study

Learning activities for thestudents: Self-learning assignments, presentation

Class test, Semester end examinations, Quiz, Assignments, Presentation

 

18.00
Unit I: 

Introduction to Data Mining and machine learning: Basic Data Mining Tasks, Data Mining versus Knowledge Discovery in Databases, Applications of Machine Learning, Machine Learning vs AI , Types of Machine Learning, Metrics, Accuracy Measures: Precision, recall, F-measure, confusion matrix, cross-validation, bootstrap, Probability and likelihood, probability distribution. Data Mining tool Orange.

 

18.00
Unit II: 

Understand the Problem by Understanding the Data, unbalanced data, Unsupervised Learning: Association rules, Apriori algorithm, FP tree algorithm, and their implementation in python and Orange tool, Market Basket Analysis and Association Analysis.

 

18.00
Unit III: 

Clustering: k-means and implementation of k-means using python and Orange tool, Concept of other clustering algorithms: Expectation Maximization (M) algorithm, Hierarchical clustering, and DBSCAN.

 

18.00
Unit IV: 

Classification & Prediction: model Construction, performance, attribute selection Issues: under, Over-fitting, cross validation, tree pruning methods, missing values, Information Gain, Gain Ratio, Gini Index, continuous classes. Classification and Regression Trees (CART) and C 5.0 .Implementation of decision tree in python and Orange tool.

 

18.00
Unit V: 

Classification & Prediction: Linear Regression, Multiple Linear Regression, Logistic Regression, Naïve Bayes and Support Vector Machines(SVM), Implementation of Linear Regression, Logistic Regression, Naïve Bayes and SVM in python and Orange tool.

*Case studies related to entire topics are to be taught.

 

Essential Readings: 

Essential readings

  • Jiawei Han & Micheline Kamber, “Data Mining: Concepts & Techniques”, Morgan Kaufmann Publishers, Third Edition.
  • Sebastian Raschka & Vahid Mirjalili,” Python Machine Learning”, Second Edition,Packt>.
  • McKinney ,Python for Data Analysis. O’ Reilly Publication,2017.
  • Curtis Miller, ”Hands-On Data Analysis with NumPy and Pandas"
  • (Latest editions of the above books are to be referred)
Suggested readings
  • Curtis Miller,” Hands-On Data Analysis with NumPy and Pandas"
  • (Latest editions of the above books are to be referred)

 

 

References: 
E resources

 

Journals

 

Academic Year: