Start Here
Search & Browse the Databricks Knowledge Base.Help Topics
Whether it's a very specific question, or a vague one, we have it covered in our company help center.Popular Articles
SSO server redirects to original URL, not to vanity Databricks URL
Problem When you log into Databricks using a vanity URL (such as mycompany.cloud....
How to improve performance of Delta Lake MERGE INTO queries using partition pruning
This article explains how to trigger partition pruning in Delta Lake MERGE INTO (...
How to improve performance with bucketing
Bucketing is an optimization technique in Apache Spark SQL. Data is allocated amo...
Job failure due to Azure Data Lake Storage (ADLS) CREATE limits
Problem When you run a job that involves creating files in Azure Data Lake Storag...
Spark job fails with Driver is temporarily unavailable
Problem When running notebooks or jobs on a cluster, they run successfully multip...
ABFS client hangs if incorrect client ID or wrong path used
Problem You are using Azure Data Lake Storage (ADLS) Gen2. When you try to access...
Create a DataFrame from a JSON string or Python dictionary
This article explains how you can create an Apache Spark DataFrame from a variabl...
Distinguish active and dead jobs
Problem On clusters where there are too many concurrent jobs, you often see some ...
Cluster-named and cluster-scoped init script migration notebook
On Dec 1, 2023, Databricks will disable cluster-named init scripts for all worksp...
Install and compile Cython
This document explains how to run Spark code with compiled Cython code. The steps...