<h3>π Description</h3> β’ What βGreatβ Looks Like
β’ Prompt & RAG Engineering
β’ Design, tune, and A/B-test prompt chains that lift instruction-following accuracy, session-completion detection, and overall Buddy engagement.
β’ Implement RAG pipelines to power both support bots and in-product answers.
β’ Multi-language Scale-out
β’ Modularise Buddy prompt chains so every language pair uses a single, well-structured template with language-specific slots.
β’ Add capabilities to compare model performance for different languages.
β’ Recommendations Engine
β’ Prototype and productionise a Kotlin- or Python-based service that recommends the next best learning feature, measuring success via lift in feature completions and balanced usage.
β’ Evaluation & Observability
β’ Build an end-to-end prompt-quality harness (unit + offline + in-prod metrics) and integrate it with our custom model-serving layer.
β’ Collaboration & Ownership
β’ Partner daily with Sonia (Head of Content), Luis (Dir. Product Strategy & Innovation) and Elizaveta (Engineering Manager), proactively taking ownership of high-leverage deliverables that move the KPIs. <h3>π― Requirements</h3> β’ Must-Have
β’ Prompt-engineering best practices
β’ Proactiveness & strong ownership mindset
β’ Retrieval-Augmented Generation (RAG)
β’ Nice-to-Have
β’ Recommender-systems foundations
β’ A/B testing at scale
β’ Python (LLM tooling) & Kotlin fundamentals
β’ ML-ops tooling (Airflow, Feature Store)
β’ Basic statistics / experimentation design