Job Description
<h3>Description</h3>
• We're partnering with three fast-growing AI startups, all backed by top-tier investors and based in the Bay Area.
• We're looking for Senior AI Engineers to join their technical founding teams and play a pivotal role in shaping and scaling their AI systems.
• If you're passionate about deploying LLMs in production, working on Retrieval-Augmented Generation (RAG) systems, and building scalable, real-time AI solutions, this role is for you.
• Build and optimize RAG agents using modern LLM frameworks
• Design scalable APIs and services using Python and FastAPI
• Work with vector databases (e.g., Pinecone, Weaviate, FAISS) and embedding models to enable semantic search and knowledge retrieval
• Query and manipulate structured data using SQL
• Integrate and fine-tune Large Language Models (LLMs) for product use cases
• Collaborate with product and design teams to ship AI-powered features in rapid development cycles
• Contribute to MLOps pipelines and best practices for deploying and monitoring models in production
<h3>Requirements</h3>
• 3+ years of experience in backend engineering or applied ML, preferably at a startup or in a fast-moving environment
• Strong experience with Python, FastAPI, and RESTful APIs
• Hands-on experience with RAG architectures, vector databases, and LLM APIs (OpenAI, Anthropic, Mistral, etc.)
• Solid understanding of embedding models, prompt engineering, and semantic search
• Comfort working with SQL and structured datasets
• Strong product intuition and the ability to work autonomously on 0-to-1 projects
• Bonus: Experience with LangChain, Haystack, or similar frameworks