Cleaning Data for Effective Data Science: Data Ingestion, Anomaly Detection, Value Imputation, and Feature Engineering
With Pearson and David Mertz
Liked by 1 user
Duration: 4h 49m
Skill level: Intermediate
Released: 7/11/2025
Course details
The course introduces the tools and techniques needed for data ingestion, anomaly detection, value imputation, and feature engineering. Numerous ingested formats are addressed, including JSON, CSV, SQL RDBMS, HDF5, NoSQL databases, and binary serialized data structures. Instructor David Mertz outlines why some problems are peculiar to data representation, while others link to the data in itself. To address untidiness in data, learn how and when to impute missing values, detect unreliable data and statistical anomalies, and generate synthetic features that are necessary for successful data analysis and visualization goals. By the end of this course, you’ll be equipped with highly marketable and in-demand skills in data analysis, machine learning, and data integrity troubleshooting.
Skills you’ll gain
Earn a sharable certificate
Share what you’ve learned, and be a standout professional in your desired industry with a certificate showcasing your knowledge gained from the course.
LinkedIn Learning
Certificate of Completion
-
Showcase on your LinkedIn profile under “Licenses and Certificate” section
-
Download or print out as PDF to share with others
-
Share as image online to demonstrate your skill
Meet the instructors
Contents
What’s included
- Learn on the go Access on tablet and phone