A hand holding a compass to symbolize data exploration

Exploratory Data Analysis

In this domain the data is analysed so it can be understood and cleaned up. It comprises 24% of the exam marks. Domain 2 is Exploratory Data Analysis, there are three subdomains:

Analysing and visualising the data (subdomain 2.3) overlaps with the other two sub-domains which use these techniques. The techniques include graphs, charts and matrices. Before data can be sanitized and prepared (subdomain 2.1) it has to be understood. This is done using statistics that focus on specific aspects of the data and graphs and charts that allow relationships and distributions to be seen. The data can then be cleaned using techniques to remove distortions and fill in gaps. Feature Engineering (subdomain 2.2) is about creating new features from existing ones to make the ML algorithms more powerful. Techniques are used to reduce the number of features and categorise the data.

When the data is understood and has been cleaned it is ready for the next stage, modeling.


AWS Certified Machine Learning Study Guide: Specialty (MLS-C01) Exam

This study guide provides the domain-by-domain specific knowledge you need to build, train, tune, and deploy machine learning models with the AWS Cloud. The online resources that accompany this Study Guide include practice exams and assessments, electronic flashcards, and supplementary online resources. It is available in both paper and kindle version for immediate access. (Vist Amazon books)


Sample Exploratory Data Analysis questions

This test is five questions randomly taken from the questions in the tests of the three subdomains.

4

2 Exploratory Data Analysis

Five questions from a test bank of 30 questions about domain 2, Exploratory Data Analysis.

1 / 5

<–?–> can represent values as colors?

2 / 5

What is Data Augmentation?

3 / 5

The number of variables displayed in a bar chart is <–number–>

4 / 5

What is Amazon SageMaker Ground Truth?

5 / 5

What attribute statistics allow numeric fields to be described?

Study guides for exploratory data analysis

static image of cv library ad showing a blue owl and the text looking for you next job? Register cv
Reviews

CV Library

If you want to land your dream AWS job you have to do more than just dream about it you need a CV. Agents may call, email or text and job ads pop up on every site you visit but the first thing they will ask for is a copy of your CV. A CV…

Whizlabs AWS certified machine learning course with a robot hand
Reviews

Whizlabs review – AWS Certified Machine Learning Specialty

Need more practice with the exams? Check out Whizlab’s free test with 15 questions. They also have three practice tests (65 questions each) and five section tests (10-15 questions each). Money off promo codes are below. For the AWS Certified Machine Learning Specialty Whizlabs provides a practice tests, a video course and hands-on labs. These…

Amazon Study Guide for the AWS Machine Learning Speciality exam
Reviews

Amazon Study Guide review – AWS Certified Machine Learning Specialty

This Amazon Study Guide review is a review of the official Amazon study guide to accompany the exam. The study guide provides the domain-by-domain specific knowledge you need to build, train, tune, and deploy machine learning models with the AWS Cloud. The online resources that accompany this Study Guide include practice exams and assessments, electronic…

Two gloved hands holding a antibacterial hand sanitizer gel dispenser symbolising data cleansing
Exploratory Data Analysis (Domain 2)

Data cleansing and preparation for modeling

Understanding data, cleansing data and dataset generation are important first steps in exploratory data analysis. Every other phase in the Machine Learning process relies on the data being cleaned and prepared. This Study Guide starts with statistical techniques used to help understand the data. Once data is understood it has to be cleaned up so…

Pluralsight AWS Certified Machine Learning web page screen shot
Reviews

Pluralsight review – AWS Certified Machine Learning Specialty

Contains affiliate links. If you go to Pluralsight’s website and make a purchase I may receive a small payment. The purchase price to you will be unchanged. Thank you for your support. The AWS Certified Machine Learning Specialty learning path from Pluralsight has six high quality video courses taught by expert instructors. Two are introductory…

Credits: Photo by Jamie Street on Unsplash