Job Description
<h3>π Description</h3> β’ Join phData, a dynamic and innovative leader in the modern data stack.
β’ We partner with major cloud data platforms like Snowflake, AWS, Azure, GCP, Fivetran, Pinecone, Glean and dbt to deliver cutting-edge services and solutions.
β’ phData is a remote-first global company with employees based in the United States, Latin America and India.
β’ We celebrate the culture of each of our team members and foster a community of technological curiosity, ownership and trust.
β’ Machine Learning Engineers are the Swiss army knives of machine learning.
β’ They own the infrastructure and deployment planβfrom making sure data science models can actually be built using customer data to deploying them into a production environment.
β’ They provide thought leadership by recommending the right technologies and solutions for a given use case, from the application layer to infrastructure.
β’ Machine Learning Engineers have the team leadership and coding skills (e.g. Python, Java, and Scala) to get their solutions into production β and to help ensure performance, security, scalability, and robust data integration.
β’ Create operational testing strategies, validate and test the model in QA, and implementation, testing, and deployment. <h3>π― Requirements</h3> β’ At least 4 years experience as a Machine Learning Engineer, Software Engineer, or Data Engineer
β’ 4-year Bachelor's degree in Computer Engineering or a related field
β’ Experience deploying data science models in a production setting.
β’ Expertise in Python, Scala, Java, or another modern programming language
β’ The ability to build and operate robust data pipelines using a variety of data sources, programming languages, and toolsets
β’ Strong working knowledge of SQL and the ability to write, debug, and optimize distributed SQL queries
β’ Experience working with Data Science/Machine Learning software and libraries such as h2o, TensorFlow, Keras, scikit-learn, etc.
β’ Experience with Docker, Kubernetes, or some other containerization technology
β’ Familiarity with multiple data source systems (e.g. JMS, Kafka, RDBMS, DWH, MySQL, Oracle, SAP)
β’ Systems-level knowledge in network/cloud architecture, operating systems (e.g., Linux), storage systems (e.g., AWS, Databricks, Cloudera)
β’ Production experience in core data technologies (e.g. Spark, Pandas)
β’ Development of APIs and web server applications (e.g. Flask, Django, Spring)
β’ Complete software development lifecycle experience including design, documentation, ong analytical abilities; ability to translate business requirements and use cases into a solution, including ingestion of many data sources, ETL processing, data access, and consumption, as well as custom analytics
β’ Excellent communication and presentation skills; previous experience working with internal or external customers <h3>ποΈ Benefits</h3> β’ Remote-First Work Environment
β’ Casual, award-winning small-business work environment
β’ Collaborative culture that prizes autonomy, creativity, and transparency
β’ Competitive comp, excellent benefits, generous PTO plus 10 Holidays (and other cool perks)
β’ Accelerated learning and professional development through advanced training and certifications