Writings.

  1. Connect Snowplow to Snowflake

    I have already mentioned that we are using Snowplow as our backbone system to collect our events. It was not mentioned that the last time setup Snowplow in pair Redshift was a long and awful ride. Mostly because of quite a bad Snowplow documentation and some mess with existing Snowplow…

    Snowplow AWS Snowflake Iglu

  2. Extracting data from Bigquery into Redsfhit

    Another morning, another challenge! :) This time I have to integrate data collected into Google Ads into our data warehouse based on Redshift. I started with Google and find several tricky ways to get data into Redshift. Some paid ETL services. Self-hosted script to capture data from Google Ads directly via…

    redshift bigquery ETL AWS GCS

  3. How do we build analytics system in Scentbird

    Couple of months ago Scentbird has celebrated 2 years of building internal analytics system. I considered it is a time to start sharing our experience with the community and it was a topic of my HighLoad 2019 (the biggest conference in Russia regarding high load systems) speech. Here it is…

    Analytics Looker AWS Glue redshift DBT architecture

  4. Jenkins: How to start Jenkins build using commit id of last successful build on test enviroment?

    Last week I faced with a quite not trivial case in CI practice. There is production build running once a day by schedule and it is using master branch as a source. The main challenge here that the development team is not using branches in everyday practice (why? it is…

    jenkins CI groovy continuous-integration

  5. Generating series in Redshift

    Redshift in 99% of cases means data warehouse, where generating series (especially date series) is quite a common case. You will be disappointed to find out no information about series in official documentation (the only thing that you will be able to find - "generate_series function is not…

    redshift AWS Analytics sql