Hey everyone,
We’ve been working on running CloudQuery syncs directly within Databricks, and after hitting a few walls, we’ve got a solid setup that’s working well in production. Figured I’d share the complete walkthrough since several folks have asked about this integration.
Our team needed centralized cloud asset inventory across AWS accounts, but didn’t want to spin up separate infrastructure just for CloudQuery. Since we’re already heavy Databricks users, running everything as Jobs made sense.
The setup breakdown:
Secrets management (this part’s crucial): We use the Databricks CLI to create a secret scope (see the sketch after the key list). You’ll need these keys:
AWS_ACCESS_KEY_ID / AWS_SECRET_ACCESS_KEY / AWS_SESSION_TOKEN
CLOUDQUERY_API_KEY
DATABRICKS_CATALOG / DATABRICKS_SCHEMA
DATABRICKS_ACCESS_TOKEN / DATABRICKS_HOSTNAME / DATABRICKS_HTTP_PATH
DATABRICKS_STAGING_PATH
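The walkthrough does this with the Databricks CLI; if you’d rather script it, here’s a rough equivalent using the Databricks Python SDK (`databricks-sdk`). The scope name `cloudquery` and the idea of sourcing values from local env vars are our choices for the sketch, not requirements:

```python
# Sketch: create the secret scope and load the keys via the Databricks
# Python SDK (equivalent to the CLI commands in the walkthrough).
# Assumes `pip install databricks-sdk` and that SDK auth is already
# configured (e.g. DATABRICKS_HOST / DATABRICKS_TOKEN in your env).
import os

from databricks.sdk import WorkspaceClient

SCOPE = "cloudquery"  # scope name is our choice; use whatever fits your workspace

KEYS = [
    "AWS_ACCESS_KEY_ID",
    "AWS_SECRET_ACCESS_KEY",
    "AWS_SESSION_TOKEN",
    "CLOUDQUERY_API_KEY",
    "DATABRICKS_CATALOG",
    "DATABRICKS_SCHEMA",
    "DATABRICKS_ACCESS_TOKEN",
    "DATABRICKS_HOSTNAME",
    "DATABRICKS_HTTP_PATH",
    "DATABRICKS_STAGING_PATH",
]

w = WorkspaceClient()

# Create the scope once; this call raises if the scope already exists.
w.secrets.create_scope(scope=SCOPE)

# Here we copy values out of local env vars; source them however you like.
for key in KEYS:
    w.secrets.put_secret(scope=SCOPE, key=key, string_value=os.environ[key])
```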
You can check out the complete walkthrough here: How to Work with CloudQuery Syncs within Databricks | CloudQuery Blog
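To give a feel for what the sync job actually does, here’s a stripped-down sketch of the entry point: pull the secrets into the environment, then shell out to the CloudQuery CLI. The binary path, spec filename, and the assumption that an earlier job task installed the `cloudquery` binary are ours for illustration; see the main sync job repo below for the real thing:

```python
# Sketch of the sync task: read secrets with dbutils and run `cloudquery sync`.
# Assumes an earlier job task downloaded the CloudQuery CLI to /tmp/cloudquery
# (hypothetical path) and that sync_spec.yml references these env vars via
# CloudQuery's ${VAR} expansion.
import os
import subprocess

SCOPE = "cloudquery"  # must match the secret scope created earlier

env = dict(os.environ)
for key in [
    "AWS_ACCESS_KEY_ID",
    "AWS_SECRET_ACCESS_KEY",
    "AWS_SESSION_TOKEN",
    "CLOUDQUERY_API_KEY",
    "DATABRICKS_CATALOG",
    "DATABRICKS_SCHEMA",
    "DATABRICKS_ACCESS_TOKEN",
    "DATABRICKS_HOSTNAME",
    "DATABRICKS_HTTP_PATH",
    "DATABRICKS_STAGING_PATH",
]:
    # dbutils is a built-in global inside Databricks notebooks/jobs.
    env[key] = dbutils.secrets.get(scope=SCOPE, key=key)

# `cloudquery sync <spec>` is the standard CLI invocation; the spec file
# pulls its connection details from the env vars set above.
subprocess.run(["/tmp/cloudquery", "sync", "sync_spec.yml"], env=env, check=True)
```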
Additional Resources:
- Main sync job: https://github.com/cloudquery/databricks-simple-cli-job
- Transformation code: https://github.com/cloudquery/databricks-sync-transformations (private repo, ping us if you need access)
- Visualization app: https://github.com/cloudquery/databricks-sample-app
