Hightouch (YC S19) – Sync data from data warehouses to SaaS tools

Hey HN! Kashish, Tejas, and Josh here. We’re building Hightouch (https://www.hightouch.io/), a reverse ETL platform— that is, software that gets your data back out of your data warehouse and into the SaaS tools that people at your company are familiar with (like Salesforce). We enable you to Bring Your Own (BYO) database so that all your SaaS tools run off of the same dataset. You specify what data you want and where, and we take care of the rest.

We were exposed to the data integration space as early engineers at Segment. Segment and other CDPs (Customer Data Platforms) were built on an older model that hits a wall once you reach a certain level of complexity. You don’t have access to your own data, there isn’t a great way to express business logic, and you don’t have flexibility to transform data to your needs.

Cloud-based data warehouses like Snowflake and tools like dbt solved part of this problem. Where Segment/CDPs require you to store data in their format, warehouses let you store your data in any format. They let you store it in your own cloud for privacy and security. And where Segment/CDPs only have event data, warehouses have all your data—things like a full replica of Salesforce data and a full replica of Postgres data.

The problem is, all this data tends to get stuck in the warehouse and only get used for reports and dashboards. In our experience, business teams don’t want another BI dashboard. They want their data in their primary tools—the SaaS applications where they spend their days—so they can use it to actually operate their business.

Because of this mismatch, a lot of engineers are doing busywork writing scripts to get data from warehouses into CRMs like Salesforce, Hubspot, Customer.io, and so on. Such scripts are brittle—they need changing when users request more columns, when an API changes, etc. And that’s only if the business people are lucky enough to get engineers’ time in the first place. There are also a lot of teams downloading CSVs and manually uploading them to various platforms because they can’t get their work in front of engineers who are busy with a hundred other priorities.

We decided to build something that would appeal to both sides: business people who know what they want from their data and just need access to it, and data engineers who want to help but can’t build and maintain every integration as the marketing team buys an endless number of SaaS tools. That’s how we came up with Hightouch.

Hightouch is a platform that makes it easy to take models and views from your warehouse and sync them into your SaaS apps, using only SQL to express your logic. Mapping between columns and application fields is done through a declarative UI. You write a SQL query to pull the data you need, map columns from that query to fields in your SaaS tool, and set how often you want data to sync. We handle the rest. See a demo here: https://www.youtube.com/watch?v=kDhHWG9hwj0.

No more hard-coding database columns to a Salesforce field in Python or Javascript, only to have a sales team ask for a ‘quick change’. We handle all the annoying complexities of moving data around: type-casting, error handling, authentication, retries, debugging, observability, notifications/alerts, and changing APIs—freeing up your engineering time to work on problems specific to your business rather than syncing data into a CRM.

We’ve also built integrations we think data engineers will love. We integrate directly with dbt and dbt Cloud, we offer git sync for version control of your models and syncs, we have an Airflow operator, as well as a public API, and we’d love your ideas on what you think is missing.

Hightouch doesn’t store anything. We connect directly to your existing warehouse/database and SaaS tools. As data changes in the warehouse, it changes in the SaaS tool. You get full control and your data is always owned by you.

Our customers use Hightouch to do things like: sending a feed of new leads and customers to Slack; syncing product usage data into CRMs like Hubspot and Salesforce; and syncing user cohorts into marketing systems, such as all users who abandoned their shopping cart, or users with a high “churn risk” score.

We’ve grown from 4 people to almost 30 now, and work with amazing customers like CircleCI, Plaid, Retool, Ramp, Lucid Chart, Nando’s, Grafana, Kong, Autotrader, Blend, and Imperfect Foods. We’re also hiring—we have over 15 positions open at https://hightouch.io/careers/, and we would love to meet you and have you join the team.

We’d love to hear your thoughts, feedback and experiences on data warehouses, building integrations, ETL, workflow orchestration, and anything data related!



Get Top 5 Posts of the Week



best of all time best of today best of yesterday best of this week best of this month best of last month best of this year best of 2023 best of 2022 yc w24 yc s23 yc w23 yc s22 yc w22 yc s21 yc w21 yc s20 yc w20 yc s19 yc w19 yc s18 yc w18 yc all-time 3d algorithms animation android [ai] artificial-intelligence api augmented-reality big data bitcoin blockchain book bootstrap bot css c chart chess chrome extension cli command line compiler crypto covid-19 cryptography data deep learning elexir ether excel framework game git go html ios iphone java js javascript jobs kubernetes learn linux lisp mac machine-learning most successful neural net nft node optimisation parser performance privacy python raspberry pi react retro review my ruby rust saas scraper security sql tensor flow terminal travel virtual reality visualisation vue windows web3 young talents


andrey azimov by Andrey Azimov