Issue with double creation of _cq_sync_time when using PG and S3 plugins

comic-pup · April 10, 2024, 6:40pm

When running CloudQuery with PostgreSQL to S3 using the PostgreSQL and S3 plugin, I noticed that it double creates _cq_sync_time and _cq_sync_time as the PostgreSQL database is populated with CloudQuery data as well.

ben · April 10, 2024, 7:31pm

I am not clear about the use case. You are syncing data from a Postgres database to an S3 bucket and are seeing duplicated columns, specifically _cq_sync_time. How does the data get into Postgres initially?

comic-pup · April 11, 2024, 6:33am

The data is being synced via CQ into Postgres. I then export to S3 with the S3 plugin, and I’m seeing double keys of _cq_source_name since the S3 plugin also creates the same column names that already exist in Postgres.

For example, Plugin 1, 2, and 3 sync to Postgres. Then the Postgres Source plugin is used to export with the S3 destination plugin. This second process then doubles the columns, which causes a HIVE_CANNOT_OPEN_SPLIT.

ben · April 11, 2024, 12:35pm

What version of the CLI and plugins are you using?

comic-pup · April 11, 2024, 3:25pm

CLI 4.4 and Source Postgres 3.0.1 Destination S3 4.10.0

ben · April 11, 2024, 3:35pm

Your issue has already been fixed in the latest version of the CLI (v5.15.0). Also, I would suggest upgrading your other plugins as they are pretty old, and we have improved performance and fixed a lot of issues since they were released.

Topic		Replies	Views
Sync runs duplicating data temporarily in CloudQuery need a solution CloudQuery Plugins	1	4	March 21, 2024
Error executing CloudQuery sync due to duplicate key constraint violation CloudQuery Plugins	3	12	March 13, 2024
CloudQuery sync time for AWS data in PG database on EKS CloudQuery Plugins	24	41	October 23, 2023
Multi-client sync configuration guidance for CloudQuery with PostgreSQL CloudQuery Plugins	5	28	January 5, 2024
CloudQuery sync fails on one server but works on another with same config CloudQuery Plugins	6	13	November 24, 2023

Issue with double creation of _cq_sync_time when using PG and S3 plugins

Related topics