Question 1

Lakehouse or warehouse: what do you recommend?

Accepted Answer

Both, often together. We use a lakehouse for the raw and transformed analytical plane, and a warehouse where SQL-first reporting needs its indexing and concurrency. We choose based on workload shape, not vendor preference.

Question 2

How do you handle data contracts between teams?

Accepted Answer

We write them as code in the producer's repo, validate them in CI, and fail the build when a schema change would otherwise break a consumer silently. Producers own the contract, consumers depend on it, and breakage is loud.
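As a rough illustration of the CI check described above, here is a minimal sketch in Python. It assumes a contract is a plain mapping from field name to a spec with a "type" and an optional "nullable" flag. That representation, and the function name breaking_changes, are hypothetical and chosen for the example; a real contract would usually live in a schema format such as JSON Schema, Avro, or Protobuf.

```python
from typing import Dict, List

# Hypothetical contract shape: field name -> {"type": ..., "nullable": ...}
Schema = Dict[str, Dict[str, object]]


def breaking_changes(old: Schema, new: Schema) -> List[str]:
    """List the changes in `new` that could break a consumer of `old`.

    Added fields are treated as safe. Removed fields, changed types, and
    fields that become nullable are reported as breaking.
    """
    problems: List[str] = []
    for name, spec in old.items():
        if name not in new:
            problems.append(f"removed field: {name}")
            continue
        if new[name]["type"] != spec["type"]:
            problems.append(
                f"type changed: {name} {spec['type']} -> {new[name]['type']}"
            )
        # A consumer that never expected nulls may fail on them.
        if not spec.get("nullable", False) and new[name].get("nullable", False):
            problems.append(f"became nullable: {name}")
    return problems
```

In CI, the job would load the contract from the main branch and from the pull request, call breaking_changes, and exit non-zero if the returned list is not empty.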

Question 3

Do you do governance and lineage, or just the pipes?

Accepted Answer

Both. Lineage is wired in from day one, starting with the first domain, rather than bolted on later, and the governance policy (PII handling, retention, masking) ships as runnable policy, not a PDF.
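To make "runnable policy" concrete, here is a minimal sketch in Python of a policy expressed as data plus the code that enforces it. The column names, the action names ("hash", "mask", "drop", "keep"), and the 365-day retention window are all assumptions made for the example, not a description of any particular engagement or tool.

```python
import hashlib
from datetime import date, timedelta

# Hypothetical policy: per-column PII handling plus a retention window.
POLICY = {
    "columns": {"email": "hash", "phone": "mask", "ssn": "drop"},
    "retention_days": 365,
}


def apply_policy(record: dict, policy: dict) -> dict:
    """Return a copy of `record` with the column actions applied."""
    out = {}
    for key, value in record.items():
        action = policy["columns"].get(key, "keep")
        if action == "drop":
            continue
        if action == "hash":
            # One-way digest: the column stays joinable but not readable.
            out[key] = hashlib.sha256(str(value).encode("utf-8")).hexdigest()
        elif action == "mask":
            # Keep only the last four characters visible.
            text = str(value)
            out[key] = "*" * max(len(text) - 4, 0) + text[-4:]
        elif action == "keep":
            out[key] = value
        else:
            raise ValueError(f"unknown policy action for {key}: {action}")
    return out


def is_expired(created: date, today: date, policy: dict) -> bool:
    """True when the record is older than the retention window."""
    return today - created > timedelta(days=policy["retention_days"])
```

Because the policy is a data structure checked into the repo, a change to PII handling becomes a reviewable diff with tests, which is the practical difference from a policy document.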

Question 4

What does a feature store actually buy us?

Accepted Answer

Point-in-time correctness for training, offline-to-online parity for inference, and a reusable layer so the same feature does not get reimplemented by three teams. We ship one when ML use cases need it, not as a vanity install.
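Point-in-time correctness means a training row only sees feature values that were already known at the moment of the labelled event, so nothing leaks in from the future. The sketch below shows the core lookup in Python, assuming each feature's history is a list of (timestamp, value) pairs sorted by timestamp. The function name and data layout are illustrative, not the API of any specific feature store.

```python
from bisect import bisect_right
from typing import List, Optional, Tuple


def point_in_time_lookup(
    history: List[Tuple[int, float]], as_of: int
) -> Optional[float]:
    """Return the latest feature value with timestamp <= as_of.

    `history` must be sorted by timestamp ascending. Returns None when
    no value had been recorded yet at `as_of`.
    """
    timestamps = [ts for ts, _ in history]
    # Index of the first entry strictly after `as_of`.
    idx = bisect_right(timestamps, as_of)
    if idx == 0:
        return None
    return history[idx - 1][1]
```

Offline training would call this once per label timestamp, while online inference would call it with the current time. Serving both from the same lookup is one way to keep offline and online values in parity.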

Question 5

Who owns the platform when you leave?

Accepted Answer

Your data engineers. We pair on the first domain, take a back seat by the third, and transfer the platform with runbooks, governance, and a written upgrade path. The default is a full handoff.

Make your data ready for the models you want to ship.

Three concrete deliverables.

Lakehouse and ingest plane

Feature store and model inputs

Observability and lineage

From kickoff to production.

Source and use-case audit

Platform and contract design

First domain in production

Domain expansion and ownership transfer

The stack we build on.

One we shipped.

Questions buyers ask first.
