From the course: Automating Data Quality in Dev Environments
Unlock the full course today
Join today to access over 24,600 courses taught by industry experts.
Confirm your data's source system(s)
From the course: Automating Data Quality in Dev Environments
Confirm your data's source system(s)
- [Instructor] Every algorithm has a source of truth. No matter how large your data pipeline is, it all originates from somewhere. As you start building out the roadmap for your high priority project, your job is to figure out what that single source of data truth is. It could be an Excel file that lives on a colleague's laptop. It could be an on-premise database with no consistent upkeep. Whatever it is, you have to find it and account for it in your data product planning. You can use the latest tools and frameworks to build an algorithm that takes your company to new heights. But if your source system's data isn't accounted for, it's all for nothing. You'll bring that bad quality data throughout your architecture. Finding your source system involves knowing who to speak with. If you can confirm who's in charge of that system, you'll gain more insight into how it operates, which system it connects to, how it's maintained, who has access to it and more. Once you find that person, set…
Contents
-
-
-
-
(Locked)
Write data requirements for your roadmap1m 51s
-
(Locked)
Confirm your data's source system(s)2m 17s
-
(Locked)
Establish the right data system integrations2m 34s
-
(Locked)
Define your source data's minimum acceptance criteria (MAC)1m 52s
-
(Locked)
Set up data lineage tracking3m 2s
-
(Locked)
Define levels of access per user2m 36s
-
(Locked)
Draft a to-be process map2m 33s
-
(Locked)
Define areas of data transformation2m 35s
-
(Locked)
Choose some super users to validate your product1m 44s
-
(Locked)
Give your data team room to fail2m 56s
-
(Locked)
-
-