Job Description
<h3>📋 Description</h3> • Develops self-service infrastructure components (e.g., Terraform modules, GitLab CI templates, deployment scaffolds) to eliminate manual bottlenecks.
• Designs, implements, and maintains CI/CD infrastructure to support the ML models development lifecycle.
• Implements robust monitoring and logging solutions to ensure the reliability of ML systems.
• Represents ML requirements in infrastructure and security decisions.
• Stays up-to-date with the latest trends and technologies in ML and DevOps. <h3>🎯 Requirements</h3> • Proven experience in building infrastructure to support ML development and deployment workflows
• Proficiency with cloud services (AWS), infrastructure-as-code (Terraform), and workflow orchestration (Airflow).
• Strong understanding of CI/CD tools such as GitLab CI
• Familiarity with ML platforms and tools (e.g., Databricks, MLflow) and data platforms like Snowflake
• Excellent problem-solving skills and attention to detail
• Strong communication and collaboration skills
• Prior work in cross-functional platform roles or on enablement teams (preferred)
• Experience with monitoring tools like Prometheus, Grafana, DataDog, and Splunk (preferred) <h3>🏖️ Benefits</h3> • Medical/Prescription drug insurance
• Dental
• Vision
• Health Care/Dependent Care Flexible Spending Account
• Health Savings Account
• Pre-Tax and Roth 401(k)
• Short and Long-Term Disability Insurance
• Life/AD&D Insurance
• Commuter Benefits
• Student Loan Repayment Program
• Educational Assistance
• generous paid time off