Job Description
<h3>π Description</h3> β’ We are seeking an experienced and pragmatic Machine Learning Engineer to join our growing Cloud Operations team.
β’ This role will focus on building and maintaining the pipelines and infrastructure that make machine learning and analytics possible in a security-focused, production-grade environment.
β’ As a core contributor, you will work closely with data scientists, data engineers, and platform engineers to deploy and monitor ML models, build reusable pipelines, manage feature stores, and ensure robust, scalable, and secure data operations for our cybersecurity platform.
β’ Youβll play a key role in operationalizing machine learning in a way that directly enhances our AI-powered pentesting and analytics capabilities.
β’ Design, build, and maintain reliable MLOps pipelines that support versioned, testable, and reproducible model training and deployment.
β’ Develop CI/CD pipelines for model promotion, validation, canary testing, and rollback.
β’ Automate model performance monitoring, logging, and alerting to maintain model health in production.
β’ Collaborate with Data Engineering and Data Science teams to build and maintain data pipelines, feature stores, and high-quality training datasets.
β’ Support the creation of ML-friendly data assets that meet latency, freshness, and accuracy requirements.
β’ Integrate robust data validation, lineage tracking, and quality checks throughout the pipeline.
β’ Define and manage scalable infrastructure for model training and inference using container orchestration platforms (e.g., Kubernetes).
β’ Apply infrastructure-as-code (IaC) principles to build reproducible environments for experimentation and production.
β’ Ensure compliance with security and privacy best practices in model and data handling.
β’ Work side-by-side with data scientists to enable fast experimentation while maintaining production-grade standards.
β’ Facilitate efficient use of GPU/TPU resources, experiment tracking tools, and model registries.
β’ Participate in planning, postmortems, and optimization of our ML platform to improve velocity and reliability. <h3>π― Requirements</h3> β’ 3+ years of experience in software engineering or DevOps roles, with 1+ years focused on MLOps or ML infrastructure.
β’ Strong background in deploying machine learning models to production, including model versioning, rollback, and performance tracking.
β’ Advanced proficiency in Python, including common ML libraries (e.g., scikit-learn, MLflow, PyTorch, TensorFlow).
β’ Strong skills in building and maintaining data pipelines using tools like Apache Airflow, dbt, or similar.
β’ Experience working with cloud platforms (preferably GCP) and infrastructure tools like Docker, Kubernetes, Terraform, or Pulumi.
β’ Solid understanding of data engineering concepts such as batch and streaming ETL, data partitioning, and schema evolution.
β’ Familiarity with cybersecurity, penetration testing workflows, or secure data handling practices is a plus.
β’ Comfort working in an agile, fast-paced, and mission-driven startup environment <h3>ποΈ Benefits</h3> β’ Earn competitive compensation and an attractive equity plan
β’ Save for the future with a 401(k) program (US)
β’ Benefit from medical, dental, vision and life insurance (US)
β’ Leverage stipends for:
β’ Wellness
β’ Work-from-home equipment & wifi
β’ Learning & development
β’ Make the most of our flexible, generous paid time off and paid parental leave