From the course: Fundamentals of Apache Iceberg

Unlock this course with a free trial

Join today to access over 24,600 courses taught by industry experts.

Partitioning in Apache Iceberg

Partitioning in Apache Iceberg - Apache Tutorial

From the course: Fundamentals of Apache Iceberg

Partitioning in Apache Iceberg

- Now first off, what is partitioning? Because basically today, we're going to talk about what Apache Iceberg brings to the table when it comes to partitioning, which is actually pretty cool. But first we got to understand what it is and how it used to be done to really sort of appreciate what Apache Iceberg brings to the table. So, partitioning is just a practice of physically breaking up the table into different pieces. So, this is different than clustering or sorting the table, where you take the data files and you might sort it, let's say, by date. So that way, all the files kind of have all the records sorted by a particular column. In this case, what you're doing is you're saying, "Hey, there's a particular field." So in this case, this example we're saying the age field and any files with... Any records with a unique value should belong in the same files. So every, let's say, user who has the age of 19 are going to be physically separate or physically separated from everybody…

Contents