Configure dynamic GCS destination based on postgres schema and table

helping-wahoo · November 1, 2023, 1:16pm

Hey all, I’m looking at setting up a PostgreSQL → GCS extract. I have both pieces working, but I was wondering if there’s a way to configure the GCS destination based on the table/schema of the PostgreSQL?

I would like the path to look something like:

path/to/file/<postgres_schema>/<postgres_table>/*.csv

for that table. It looks like I could do this if I made a source/destination for each table, but that seems rather cumbersome.

yevgenyp · November 1, 2023, 1:18pm

Hi! I believe that right now the pg source only works by syncing one schema only, so you would need to set up a source and destination per schema, and then you can have the above structure.

helping-wahoo · November 1, 2023, 1:20pm

I’m currently only syncing the public schema. What’s the configuration to change the path for each table?

yevgenyp · November 1, 2023, 1:21pm

Check out this. The path can be postgre_schema/, and then it will create each table in its own file if this is what you are looking for.

helping-wahoo · November 1, 2023, 1:24pm

I don’t think so, I’m looking to have each table be in its own directory in GCS. As in: if I have two tables: ‘actions’ and ‘orders’. The path in GCP for actions would be:

gs://bucket-name/path/to/file/actions/actions.csv

and for orders:

gs://bucket-name/path/to/file/orders/orders.csv

Is that possible?

yevgenyp · November 1, 2023, 1:25pm

I see. I believe this is not possible right now. Can you open an issue please? What is the reason that gs://bucket-name/path/to/file/actions.csv wouldn’t work?

helping-wahoo · November 1, 2023, 1:36pm

Sure! I use the directories as a basis for organization currently within our data lake. I suppose it’s mostly convenience and organization. But in general, some mild templating for the destination could be helpful, especially if it includes a timestamp of when the sync runs.

And thanks for your help!
GitHub Issue #15097

Topic		Replies	Views
Syncing specific schema tables in PostgreSQL with CloudQuery CloudQuery Plugins	4	57	October 18, 2023
Cloudquery duplicate destination configuration clarification needed CloudQuery Plugins	5	35	December 15, 2023
Issue with CloudQuery affecting multiple AWS RDS parameter tables CloudQuery Plugins	28	194	June 13, 2024
CloudQuery table name prefixing and lowercase formatting options CloudQuery Plugins	14	97	August 22, 2024
CloudQuery AWS config troubleshooting for PostgreSQL connection issues CloudQuery Plugins	1	38	December 15, 2023

Configure dynamic GCS destination based on postgres schema and table

Related topics