From the course: Microsoft Fabric Data Flows and Data Storage
Unlock the full course today
Join today to access over 24,600 courses taught by industry experts.
Data organization - Microsoft Fabric Tutorial
From the course: Microsoft Fabric Data Flows and Data Storage
Data organization
- [Instructor] It is important to have a consistent, well-applied structure to your data. As otherwise, it's very hard to locate different elements or apply consistent security rules. In big data solutions, this is sometimes referred to as a cake in the lake problem. As you know, the good stuff is in the lake, you just don't know how to access it. And this problem can be mitigated with good data lake organization. To make data organization easier, the data should be classified as to the quality and the source. All the raw data should go in the raw area. Raw data should be a copy of the data from the source. The data should not be modified at all, just written in red. Underneath the high level raw folder, the folders beneath it should describe the folders for the data source. The next area is staging, where you would store any data that had to be modified. I highly recommend creating a reference area for information…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.