This repository has been archived by the owner on May 17, 2022. It is now read-only.

Plugin request: Snowflake #13

Closed
timgl opened this issue Feb 26, 2021 · 8 comments

Comments

@timgl
Contributor

timgl commented Feb 26, 2021

I want to dump all my events and person information in Snowflake.

I've actually had a look at this. It seems the only supported way to bulk-load is to create CSV files of events, upload them to S3, and then make a call to Snowflake to ingest them. I'm trying to get a direct contact there to ask if there's an easier way, and to get added to their PartnerConnect program.

For now, I think the easiest way to do this is to use their Node connector, then do something like this to insert events one at a time.
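
To make the one-at-a-time idea concrete, here is a minimal sketch, not the plugin's actual code. The `ExecutableConnection` interface is declared locally and only mirrors the callback style of the Node connector's `connection.execute({ sqlText, binds, complete })`; the `ExportedEvent` shape, the column names, and storing properties as a JSON string are all assumptions for illustration.

```typescript
// Minimal shape of the connection this sketch relies on. It mirrors the
// callback style of the Node connector's `connection.execute`, but is declared
// locally so the example does not depend on the package being installed.
interface ExecutableConnection {
    execute(options: {
        sqlText: string
        binds: (string | number | null)[]
        complete: (err: Error | undefined, stmt: unknown, rows: unknown[] | undefined) => void
    }): void
}

// Hypothetical event shape; a real PostHog event carries more fields.
interface ExportedEvent {
    uuid: string
    event: string
    distinctId: string
    timestamp: string
    properties: Record<string, unknown>
}

// Insert a single event. The table name is interpolated into the SQL, so it is
// validated first; the values themselves go through bind parameters.
function insertEvent(conn: ExecutableConnection, table: string, e: ExportedEvent): Promise<void> {
    if (!/^[A-Za-z_][A-Za-z0-9_.]*$/.test(table)) {
        return Promise.reject(new Error(`Unsafe table name: ${table}`))
    }
    return new Promise<void>((resolve, reject) => {
        conn.execute({
            // Column names are assumptions; properties are stored as a JSON string.
            sqlText: `INSERT INTO ${table} (uuid, event, distinct_id, timestamp, properties) VALUES (?, ?, ?, ?, ?)`,
            binds: [e.uuid, e.event, e.distinctId, e.timestamp, JSON.stringify(e.properties)],
            complete: (err) => (err ? reject(err) : resolve()),
        })
    })
}
```

One statement per event keeps the code simple, but every event costs a round trip, so this is likely to be slow at high volume compared with the CSV-and-S3 route.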

@mariusandra
Contributor

Depends on #12 then

@jamesefhawkins
Contributor

jamesefhawkins commented Mar 4, 2021

I've had no response from the contact I got referred to. There's a third person I've been told I can get connected to through an investor (support = world networking championships, clearly), and I've asked him to intro! I've also sent this question directly.

@mariusandra
Contributor

This is built: https://github.com/PostHog/snowflake-export-plugin

image

image

Some points:

  • The following needs to be merged for it to work: "Snowflake and crypto" (plugin-server#291)
  • Snowflake supports a few different methods of authentication. Currently only "password" is implemented. Adding the "private key" option shouldn't be that much extra work.
  • Finally, we have the same case as with the BigQuery export: there's no retry support in this plugin. If Snowflake or your network is down at some point, the events sent during that time are lost. Good enough for now, but we need to find a universal solution to it (see "Kafka last mile batching & retrying processEvent", plugin-server#273)
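
For illustration only, this is roughly what an in-process retry with exponential backoff could look like. The plugin does not do this, and it is not the universal solution discussed in plugin-server#273: events held in memory are still lost if the process dies. The names `backoffDelayMs` and `withRetry` and the default delays are hypothetical.

```typescript
// Delay before retry number `attempt` (1-based): base, 2x base, 4x base, ...
// capped at maxMs. The defaults are arbitrary illustration values.
function backoffDelayMs(attempt: number, baseMs = 500, maxMs = 30000): number {
    return Math.min(maxMs, baseMs * Math.pow(2, attempt - 1))
}

const defaultSleep = (ms: number): Promise<void> =>
    new Promise<void>((resolve) => setTimeout(resolve, ms))

// Run `task` up to `maxAttempts` times, sleeping between failed attempts.
// Rethrows the last error once the attempts are exhausted.
async function withRetry<T>(
    task: () => Promise<T>,
    maxAttempts = 5,
    sleep: (ms: number) => Promise<void> = defaultSleep
): Promise<T> {
    let lastError: unknown
    for (let attempt = 1; attempt <= maxAttempts; attempt++) {
        try {
            return await task()
        } catch (error) {
            lastError = error
            if (attempt < maxAttempts) {
                await sleep(backoffDelayMs(attempt))
            }
        }
    }
    throw lastError
}
```

A caller would wrap the export call, for example `withRetry(() => sendToSnowflake(event))`, where `sendToSnowflake` stands in for whatever function performs the insert.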

@timgl
Contributor Author

timgl commented Mar 29, 2021

I think we can safely ignore $snapshot and $feature_flag_called events for now (and maybe even autocapture). Other than that, great work!

@mariusandra
Contributor

I can add some option in the plugin config to select what types of events to send. Who knows, someone might actually want to send their $snapshots somewhere for archival. The BigQuery plugin theoretically suffers from the same situation as it also just sends everything.
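
A minimal sketch of how such an option might work, assuming the config value is a comma-separated string of event names to skip. The function names and the config format are hypothetical, not taken from the plugin.

```typescript
// Parse a comma-separated config string such as "$snapshot, $feature_flag_called"
// into a set of event names. Empty entries and surrounding whitespace are dropped.
function parseIgnoredEvents(raw: string | undefined): Set<string> {
    const names = (raw || '')
        .split(',')
        .map((name) => name.trim())
        .filter((name) => name.length > 0)
    return new Set<string>(names)
}

// True if the event should be sent to the export destination.
function shouldExport(eventName: string, ignored: Set<string>): boolean {
    return !ignored.has(eventName)
}

// Usage: build the set once at plugin setup, then check every incoming event.
const ignoredEvents = parseIgnoredEvents('$snapshot, $feature_flag_called')
const exportPageview = shouldExport('$pageview', ignoredEvents) // true
const exportSnapshot = shouldExport('$snapshot', ignoredEvents) // false
```

Leaving the option empty exports everything, which keeps the archival use case for $snapshot events possible.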

@mariusandra
Contributor

I added this (default values in screenshot):

image

@timgl
Contributor Author

timgl commented Mar 29, 2021

yeah i think that's fine for now. would be nice to standardise that somehow but not urgent

@Twixes
Member

Twixes commented Apr 15, 2021

Snowflake contact woes aside, I would say this can be considered done #18

Twixes closed this as completed Apr 15, 2021