Machine Learning with Mahout Certification Training
This Machine Learning with Mahout Certification Training course covers the fundamentals of machine learning techniques ranging from various algorithms of Support Vector Machines, k-means clustering, Random Forests, Collaborative filtering to recommendation system, Mahout on Hadoop and Amazon EMR, etc.
In the modern information age of exponential data growth, the success of companies and enterprises depends on how quickly and efficiently they turn vast amounts of data into actionable information. Whether it’s for processing hundreds or thousands of personal e-mail messages a day or driving user intent from petabytes of weblogs, the need for tools that can organize and enhance data has never been greater. Therein lies the premise and the promise of the field of machine learning and Apache Mahout.
- Gain an insight into the Machine Learning techniques.
- Understand the algorithms of SVM, Naive Bayes, Random Forests,etc.
- Implement these using 'Apache Mahout'
- Understand the recommendation system
- Learn Collaborative filtering, Clustering and Categorization
- Analyse Big Data using Hadoop and Mahout
- Implementing a recommender using MapReduce
- Introduction to tools like Weka, Octave, Matlab, SAS
1. Analytics Professionals
2. Data Scientists looking to hone their machine learning skills
3. Software Developers and Architects
4. Business Analysts wanting to learn Mahout for ML implementation
5. Professionals working with R, Matlab, Python, etc.
6. Statisticians looking to learn machine learning techniques
7. Graduates aspiring to take a leap in analytics domain
Module: 1 - Introduction to Machine Learning and Apache Mahout
This module will give you an insight about what ‘Machine Learning’ is and How Apache Mahout algorithms are used in building intelligent applications.
Topics – Machine Learning Fundamentals, Apache Mahout Basics, History of Mahout, Supervised and Unsupervised Learning techniques, Mahout and Hadoop, Introduction to Clustering, Classification.
Module: 2 - Mahout and Hadoop
In this module you will learn how to set up Mahout on Apache Hadoop. You will also get an understanding of Myrrix Machine Learning Platform.
Topics – Mahout on Apache Hadoop setup, Mahout and Myrrix.
Module: 3 - Recommendation Engine
In this module you will get an understanding of the recommendation system in Mahout and different filtering methods.
Topics – Recommendations using Mahout, Introduction to Recommendation systems, Content Based (Collaborative filtering, User based, Nearest N Users, Threshold, Item based), Mahout Optimizations.
Module: 4 - Implementing a recommender and recommendation platform
In this module you will learn about the Recommendation platforms and implement a Recommender using MapReduce.
Topics – User based recommendation, User Neighbourhood, Item based Recommendation, Implementing a Recommender using MapReduce, Platforms: Similarity Measures, Manhattan Distance, Euclidean Distance, Cosine Similarity, Pearson’s Correlation Similarity, Loglikihood Similarity, Tanimoto, Evaluating Recommendation Engines (Online and Offline), Recommendors in Production.
Module: 5 - Clustering
This module will help you in understanding ‘Clustering’ in Mahout and also give an overview of common Clustering Algorithms.
Topics – Clustering, Common Clustering Algorithms, K-means, Canopy Clustering, Fuzzy K-means and Mean Shift etc., Representing Data, Feature Selection, Vectorization, Representing Vectors, Clustering documents through example, TF-IDF, Implementing clustering in Hadoop, Classification.
Module: 6 - Classification
In this module you will get a clear understanding of Classifier and the common Classifier Algorithms.
Topics – Examples, Basics, Predictor variables and Target variables, Common Algorithms, SGD, SVM, Navie Bayes, Random Forests, Training and evaluating a Classifier, Developing a Classifier.
Module: 7 - Mahout and Amazon EMR
At the end of this module, you will get an understanding of how Mahout can be used on Amazon EMR Hadoop distribution.
Topics – Mahout on Amazon EMR, Mahout Vs R, Introduction to tools like Weka, Octave, Matlab, SAS.
Module: 8 - Project
In this module you will develop an intelligent application using Mahout on Hadoop.
Topics – A complete recommendation engine built on application logs and transactions.