Safary Logo Safary
⏩ Safary Logo

Data Engineer - Infrastructure

πŸ“… 06/10/2023

Apply

Data Engineer

πŸ’° $150,000 - $175 🌍 San Francisco πŸ“… 05/26/2025

Apply

Job Description

As employee #2 of our data engineering team, you will lay the groundwork for
the crypto industry's first Customer Data Platform. You'll manage data and
solve problems that have never been done before. Here are the missions you'll
be working on:

* **AI patterns recognition:** You'll be developing multiple AI features, from user categorization to autofilled descriptions, market and conversations sentiment analysis and AI insights to help crypto marketers.
* **SDK, User Graph improvement and reliability:** We're constantly improving our proprietary user graph by matching wallets to social profiles. Your mission will be supporting new crypto wallets and networks, automating a system to match more users to their identities, and applying data checks to ensure reliability.
* **Integrations** : As a customer data platform, we're expanding the number of data sources our customers can import from Web2 and Web3. You'll manage multiple API endpoints and integrate new third-party tools like Mixpanel, Amplitude, Segment, Dune Analytics, and DeFi Llama. This involves not only integration work but also data modeling and architectural design.
* **Social Data analysis:** You'll work with social data APIs like Twitter to analyze Key Opinion Leaders' performance and trends.

You will mainly work with [Eliott](https://www.linkedin.com/in/eliott-
mogenet-405984105), cofounder & CPO of Safary, and Ricardo, our founding data
engineer who holds a PhD in privacy and has laid the groundwork for our entire
data infrastructure.

### Requirements πŸ—“οΈ

* Proven track record as a Data Engineer delivering complex data solutions
* Advanced **SQL** skills and expertise with complex queries
* Mastery in **Python** development and strong experience with **PySpark**
* Extensive proficiency managing cloud services, including **AWS Redshift, RDS Postgres** , S3, Lambda, Kinesis, SQS, ECS, EC2.
* Strong competency implementing and supporting various data models, such as highly normalized, star schema and **Data Vault**
* Practical experience with orchestration tools like **Airflow** , Dagster or Prefect
* Demonstrated proficiency consuming and automating interactions with **APIs**
* Hands-on experience creating data pipelines using **dbt** for various platforms (ideally AWS Redshift and Postgres)
* Experience in SaaS analytics, marketing, or crypto companies is a plus