From the course: Manage and Optimize Big Data with Apache Iceberg
Setting up your Iceberg environment on Databricks - SQL Tutorial
Setting up your Iceberg environment on Databricks
- [Instructor] Let's log into our Databricks account. I've already logged in. To use Apache Iceberg, we need to install the Apache Iceberg JAR on our Databricks cluster, because by default Databricks uses Delta Lake. To find out which JAR you need, go to Google and search for "Iceberg Spark compatibility." You will find a page that lists which Iceberg versions each Spark version supports. This is the compatibility matrix. Here, based on your Spark version, you can download the matching Iceberg runtime JAR. Let's go back to Databricks, go to my cluster, and check the cluster version. My cluster runs Apache Spark 3.3. Let's scroll down, and for Spark 3.3, this is the specific Iceberg JAR. I can click on this JAR to download it. The download has now started. Once the JAR has downloaded, go to your cluster, click Libraries, and then click Install New. Now here we…
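The version-matching step described above can be sketched in a few lines of Python. This is only an illustration, not part of the course: the helper name `iceberg_runtime_coordinate` is hypothetical, the default Scala version `2.12` is an assumption about the cluster, and the Iceberg version `1.4.3` in the usage example is a placeholder that should be checked against the compatibility matrix. The sketch relies on the published naming pattern of the Iceberg Spark runtime artifacts, `iceberg-spark-runtime-<spark major.minor>_<scala version>`.

```python
def iceberg_runtime_coordinate(
    spark_version: str,
    iceberg_version: str,
    scala_version: str = "2.12",  # assumption: the cluster uses Scala 2.12
) -> str:
    """Build the Maven coordinate of the Iceberg Spark runtime JAR.

    Hypothetical helper for illustration. Only the major.minor part of the
    Spark version appears in the artifact name, so "3.3.2" becomes "3.3".
    """
    parts = spark_version.split(".")
    if len(parts) < 2 or not all(p.isdigit() for p in parts[:2]):
        raise ValueError(f"Unexpected Spark version: {spark_version!r}")
    major_minor = ".".join(parts[:2])
    artifact = f"iceberg-spark-runtime-{major_minor}_{scala_version}"
    return f"org.apache.iceberg:{artifact}:{iceberg_version}"


if __name__ == "__main__":
    # Placeholder Iceberg version; confirm it against the compatibility matrix.
    print(iceberg_runtime_coordinate("3.3.2", "1.4.3"))
    # prints: org.apache.iceberg:iceberg-spark-runtime-3.3_2.12:1.4.3
```

A coordinate in this form identifies the same artifact as the JAR file downloaded from the compatibility page, so it can serve as a quick check that the file name matches the cluster's Spark version.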
Contents
- Apache Iceberg introduction (2m 42s)
- Role of Iceberg in modern data architecture (2m 28s)
- Key features and advantages (3m 23s)
- Setting up your Iceberg environment on Databricks (4m 28s)
- Creating your first Iceberg table: Practical (8m 3s)
- Challenge: Create an Iceberg table (34s)
- Solution: Create an Iceberg table (1m 47s)