From the course: Automating Data Quality in Dev Environments

Unlock the full course today

Join today to access over 24,600 courses taught by industry experts.

Confirm your data's source system(s)

Confirm your data's source system(s)

- [Instructor] Every algorithm has a source of truth. No matter how large your data pipeline is, it all originates from somewhere. As you start building out the roadmap for your high priority project, your job is to figure out what that single source of data truth is. It could be an Excel file that lives on a colleague's laptop. It could be an on-premise database with no consistent upkeep. Whatever it is, you have to find it and account for it in your data product planning. You can use the latest tools and frameworks to build an algorithm that takes your company to new heights. But if your source system's data isn't accounted for, it's all for nothing. You'll bring that bad quality data throughout your architecture. Finding your source system involves knowing who to speak with. If you can confirm who's in charge of that system, you'll gain more insight into how it operates, which system it connects to, how it's maintained, who has access to it and more. Once you find that person, set…

Contents