Advanced Data Mining with Weka

Learn how to use popular packages that extend Weka's functionality and areas of application. Use them to mine your own data!

Advanced Data Mining with Weka

Course Description

Extend your repertoire of data mining scenarios and techniques This course will bring you to the wizard level of skill in data mining, following on from Data Mining with Weka and More Data Mining with Weka, by showing how to use popular packages that extend Weka’s functionality. You’ll learn about forecasting time series and mining data streams. You’ll connect up the popular R statistical package and learn how to use its extensive visualisation and preprocessing functions from Weka. You’ll script Weka in Python – all from within the friendly Weka interface. And you’ll learn how to distribute data mining jobs over several computers using Apache SPARK. This course is aimed at anyone who deals in data. You should have completed Data Mining with Weka and More Data Mining with Weka – or be an experienced Weka user. Altho... Read More »

Extend your repertoire of data mining scenarios and techniques

This course will bring you to the wizard level of skill in data mining, following on from Data Mining with Weka and More Data Mining with Weka, by showing how to use popular packages that extend Weka’s functionality. You’ll learn about forecasting time series and mining data streams. You’ll connect up the popular R statistical package and learn how to use its extensive visualisation and preprocessing functions from Weka. You’ll script Weka in Python – all from within the friendly Weka interface. And you’ll learn how to distribute data mining jobs over several computers using Apache SPARK.

This course is aimed at anyone who deals in data. You should have completed Data Mining with Weka and More Data Mining with Weka – or be an experienced Weka user. Although the course includes some scripting with Python, you need no prior knowledge of the language. You will have to install and configure some software components; we provide full instructions.

Before the course starts, download the free Weka software. It runs on any computer, under Windows, Linux, or Mac. It has been downloaded millions of times and is being used all around the world.

(Note: Depending on your computer and system version, you may need admin access to install Weka.)

Read Less
Course Outcomes:
  • Discuss the use of lagged variables in time series forecasting
  • Explore the use of overlay data in time series forecasting
  • Identify several different applications of data mining with Weka
  • Compare incremental and non-incremental implementations of classifiers
  • Evaluate the performance of classifiers under conditions of concept drift
  • Classify tweets using various techniques
  • Calculate optimal parameter values for non-linear support vector machines
  • Demonstrate the use of R classifiers in Weka
  • Develop R commands and R scripts from Weka
  • Explain how distributed Weka runs Weka on a cluster of machines
  • Experiment with distributed implementations of Weka classifiers and clusterers
  • Explain how “map” and “reduce” tasks are used to distribute Weka
  • Design Python and Groovy scripts for Weka operations
  • Apply Python libraries to produce sophisticated visualizations of Weka output
  • Describe how Weka can be invoked from within a Python environment

DON'T HAVE TIME?

We can send you everything you need to know about this course through email.
We respect your privacy. Your information is safe and will never be shared.