The Role
Your job is to build and maintain the data backbone that powers everything our agents do from campaign strategy to creative generation. You’ll design systems that make our agents smarter every week. Think pipelines, embeddings, stream sync, monitoring, all the stuff that makes the rest of the stack actually work.
If you’ve spent time in the weeds of ML infra or shipped real data products that scale, this is your chance to go deep on AI-powered marketing, but with real data, real clients, and real feedback loops.
You Will:
• Design and deploy ETL and ELT pipelines for both structured and unstructured data sources.
• Manage databases, data lakes, and vector stores so our agents can retrieve what they need instantly.
• Build event-driven streaming systems to keep everything in real-time sync across teams and services.
• Work directly with our AI teams to improve embedding generation, caching layers, and prompt workflows.
• Own monitoring and performance tuning across our entire AWS stack tracking cost, latency, and reliability from end to end.
You Bring:
• 5+ years of hands-on experience in data engineering or ML Ops, ideally involving AI agent systems or model integration.
• Expert Python and SQL — plus working knowledge of NoSQL tools like MongoDB or DynamoDB.
• Production-level experience with RAG architectures and semantic search systems.
• A proven track record building scalable, fault-tolerant pipelines that hold up in production.
• Experience working with AWS services like Lambda, Step Functions, Redshift, or Snowflake.
• Familiarity with observability tooling like OpenTelemetry or Prometheus, and you know how to keep things visible when they go sideways.