Senior Data Engineer - Postgres, SQL, R



Data Science
Posted on Thursday, May 23, 2024
About Hone

Hone offers a platform for live, online, instructor-led training classes. It allows organizations of all sizes to source, manage, and deliver leadership, management, and people-skills training, and to measure its impact on their organization. The business is located in Poway, California.

About The Role

We are seeking a seasoned Senior Data Engineer to join our dynamic team. This role involves designing, building, and maintaining sophisticated ETL pipelines using tools like Fivetran and dbt, and managing a robust data warehouse built on Postgres. The ideal candidate will bring a deep understanding of database administration, including monitoring, tuning, and troubleshooting, to ensure high performance, reliability, and scalability of our data infrastructure. In this position, you will collaborate closely with stakeholders to understand and meet data requirements, integrating source systems smoothly and supporting downstream data consumers effectively. Your expertise will be crucial in documenting ETL processes and data models, and sharing knowledge across teams to foster a data-driven culture within the organization.

What You’ll Do

  • ETL Pipeline Development: Design, build, and maintain robust ETL pipelines using Fivetran and dbt to extract and load data from various source systems into our data warehouse.
  • Data Warehouse Management: Manage and optimize our data warehouse built on Postgres, ensuring high performance, reliability, and scalability. Implement best practices for data modeling and schema design.
  • Database Administration: Perform database administration tasks including monitoring, tuning, backup, and recovery. Troubleshoot and resolve database-related issues in a timely manner.
  • Stakeholder Collaboration: Work closely with stakeholders to understand data requirements, gather feedback, and ensure alignment between data engineering efforts and business objectives.
  • Source System Integration: Collaborate with owners of source systems to ensure smooth data integration, troubleshoot data quality issues, and implement changes as needed.
  • Downstream Data Consumer Support: Provide support to downstream data consumers, including destination systems for reverse ETL processes. Ensure timely delivery of reliable and accurate data.
  • Documentation and Knowledge Sharing: Document ETL processes, data models, and system configurations. Share knowledge and best practices with the engineering team and other stakeholders.

What You’ll Bring

  • Proficiency with ETL tools (e.g., Fivetran, SSIS, Talend) for data integration.
  • Expert SQL skills, particularly with Postgres.
  • Experience with dbt (data build tool) for data transformation and modeling.
  • Proficiency with Git, including branching strategies, triage, and best practices.
  • Familiarity with data visualization tools such as Tableau.
  • Knowledge of database administration tasks including monitoring, tuning, and backup.
  • Problem-Solving Skills: Ability to troubleshoot complex data issues, identify root causes, and implement effective solutions.

Bonus Skills

  • Familiarity with segment.io or other robust Customer Data Platforms (CDPs), and with Salesforce or other enterprise CRMs.
  • Statistical Analysis: Knowledge of distributions, statistical testing, and regression analysis.
  • Machine Learning: Familiarity with machine learning algorithms, including supervised and unsupervised learning, and model deployment.
  • Programming Skills: Expertise in Python or R, beyond SQL and database scripting.
  • Advanced Analytics: Skills in time series analysis, NLP, and complex event processing.
  • Experimentation and A/B Testing: Ability to design and interpret A/B tests to support data-driven decisions.