Research Scientist Intern, Comms and Language (PhD) Computer Vision Research at Meta

Required Skills

Python, C++, natural language processing, computer vision, machine learning, deep learning, multimodal AI, research

About the Role

Meta is seeking PhD Research Interns to join its Fundamental AI Research (FAIR) Multimodal Foundations teams. The role involves advancing AI through research in NLP, speech, computer vision, and multimodal learning. Interns will develop novel solutions, conduct experiments, and contribute to Meta's product development.

Key Responsibilities

  • Perform research to learn semantics of multimodal data (text, audio, images, video)
  • Brainstorm with mentors and review literature on challenging research problems
  • Develop novel solutions, implement prototypes, and perform extensive experiments
  • Present research outcomes to internal and/or external audiences
  • Contribute research applicable to Meta product development

Required Skills & Qualifications

Must Have:

  • Currently pursuing or holding a PhD in NLP, Speech Processing, Computer Vision, ML, AI, or equivalent
  • Research/work experience in NLP, Speech Processing, Computer Vision, ML, or Deep Learning
  • Experience in Python, C++, or other related programming languages
  • Must obtain work authorization in country of employment and maintain it during employment

Nice to Have:

  • Proven track record with publications at leading conferences (ACL, EMNLP, CVPR, NeurIPS, etc.)
  • Experience advancing AI techniques in NLP or Computer Vision/ML
  • Experience manipulating and analyzing complex, large-scale, high-dimensional data
  • Experience utilizing theoretical and empirical research to solve problems
  • Experience working and communicating cross-functionally in teams
  • Intent to return to degree program after internship completion