[Remote] Founding Data Engineer, AI Platform

Remote Full-time Hiring now

Note: This is a remote role open to candidates in the USA. GC AI, the fastest-growing legal AI platform for in-house legal teams, is seeking a Founding Data Engineer to own the entire data stack. As the first dedicated data hire, you will consolidate data from multiple sources into a single warehouse, build internal data tools, and define data engineering practices.

Responsibilities

Take ownership of the data warehouse in BigQuery: modeling, pipeline development, data quality, and performance
Build pipelines that consolidate product usage data, CRM data, billing, customer contract data, and user analytics into a single source of truth
Design and build internal data tools using applied AI, including natural-language query interfaces and automated reporting, so the rest of the company can self-serve without waiting on an analyst
Set up the warehouse so business teams can run their own queries and pull their own numbers without filing a ticket
Build toward a data lake architecture that supports personalization and model fine-tuning for the GC AI product
Keep the stack lean: use what's available in BigQuery and the broader GCP ecosystem, and reduce complexity and cost without introducing tool sprawl
Define data engineering practices, tooling, and standards as the first hire on what will become a team

Skills

5+ years of experience in data engineering, with hands-on experience building and maintaining data warehouses and pipelines
Strong SQL skills and deep experience with BigQuery or comparable analytical databases
Proficiency in Python for pipeline development, scripting, and tooling
Experience building ETL/ELT pipelines that consolidate data from multiple source systems (SaaS APIs, event streams, databases)
Experience working within GCP or a comparable cloud ecosystem
Ability to design data models that are clean, performant, and usable by non-engineers
Experience building internal data tools or agents using LLMs (text-to-SQL, natural language interfaces, automated reporting); this is a strong differentiator
Experience as the first or early data hire at a startup, where you owned the full stack
Familiarity with legaltech, legal operations, or SaaS product analytics
Experience setting up self-serve analytics layers (semantic layers, BI tool configuration, data documentation)
Experience with data infrastructure that supports ML workflows (feature stores, training data pipelines, data lakes)
Experience with infrastructure as code, especially Terraform, for managing GCP data infrastructure

Company Overview

GC AI is the legal AI platform built for in-house teams, supporting the workflows that in-house lawyers and legal professionals face every day. It was founded in 2023 and is headquartered in San Francisco, California, USA, with a workforce of 51-200 employees. Its website is https://gc.ai.
