CloudQuery GCP plugin sync issue with detached disk data inconsistency

Hi, I am testing the CQ GCP plugin and I ran into an issue with data :frowning_with_open_mouth:

I am using the latest version of the GCP plugin, CloudQuery, and Neo4j for the destination. I am using the neo4j:5.22.0 Docker image. I tried to sync the gcp_compute_disks table. In my GCP project, I have 2 instances: the first one has 2 disks and the second has 3 disks.

The sync of the gcp_compute_disks table completed without any error. Then, I detached the third disk from the second instance and I ran the sync again. The sync completed without any error, but when I looked at the data in Neo4j, I discovered an issue: the detached disk is shown as attached to the instance (checked the column users), and the boot disk of the second instance is shown as detached (the column users is empty).

Another issue is that the ā€œdescriptionā€ column value of the detached disk is assigned to the boot disk of the first instance. I donā€™t know what is going wrong. Maybe I missed something? (I used just the basic config as in the documentation).

Any help, please? Are you facing the same issue or able to reproduce it on your side? Thank you!

Hi @alert-wildcat,

This is the API we use to get the disks: Google Cloud Compute Disks API.

One way to verify the data is to run the following command:

gcloud compute disks list

and see what that gives you.

Hi @erez,

Thanks for your reply. The API is returning correct data. Even when I did a sync using the plugin file as a destination, the data is correct. This behavior occurs only with Neo4j. I am wondering if the Neo4j plugin has an issue when some data is nil (as the column ā€œusersā€ was empty during testing)?

Got it, thanks for the added context. That helps.
Can you open a bug report for the Neo4j plugin via this link?
Please share a full reproduction if possible, including the spec file, plugins versions, and CLI version. That will help us solve the issue faster.