Research Intern, Machine Learning for Biology

Internship
London
Salary: £39K
No remote work
Apply

InstaDeep
InstaDeep

Interested in this job?

Apply
Questions and answers about the job

The position

Job description

InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.

Join us to be a part of the AI revolution!

We are seeking Research Interns to join our London-based Research Team, working on developing machine learning applications for the life sciences.  Generative foundation models like ESM-3 [1], AlphaFold [2], RFDiffusion [3], ProteinMPNN [4] are revolutionising computational biology and in silico drug design; and InstaDeep is leading the way in developing the next generation of technologies and applications (see as examples, Nucleotide Transformer [6] and ProtBFN [7]).  This requires pairing fundamental ML innovations with domain expertise in computational biology who are able to identify, develop and validate models to specific applications. 

In this role you will work closely with our Research Scientists and Research Engineers to help deliver on our ambitious goals.  You will be able to understand fundamental ML concepts and be excited to support the development and high-quality validation of novel generative models across key biological modalities.  You will be equally comfortable discussing new applications at the whiteboard as you are implementing the technical aspects of necessary data, modelling and validation to see these realised.

Role Responsibilities:

  • Support the efforts of the Science team through the development of novel methods and applications under the guidance of our Research Scientists and Engineers.
  • Design and implement experiments for proof of concept and benchmarking.
  • Contribute to team research and publications.
  • Report and present experimental results and research findings, both internally and externally, verbally and in writing.
  • Upon request, collaborate with other groups’ activities, including but not limited to presenting the company to new prospective clients, participating in calls and meetings, and representing InstaDeep in conferences/events.
  • Requirements

  • Currently enrolled in a PhD programme (or recent graduate) in a related STEM discipline.
  • Theoretical and practical knowledge in machine learning and deep learning, including experience with a deep learning framework such as Jax, PyTorch or Tensorflow.
  • Excellent communication skills and collaborative spirit.
  • Relevant experience in the application of deep learning to life science application. 
  • Proven ability to contribute to research communities and/or efforts, as evidenced by publishing scientific papers in leading journals or conferences (JMLR, ICLR, NeurIPS, ICML, etc.).
  • Work permit for the UK for the duration of the internship.
  • References

    [1] Simulating 500 million years of evolution with a language model

    [2] Accurate structure prediction of biomolecular interactions with AlphaFold 3

    [3] De novo design of protein structure and function with RFdiffusion

    [4] Robust deep learning–based protein sequence design using ProteinMPNN

    [5] Bayesian Flow Networks

    Selected InstaDeep publications and releases

    [6] The Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics

    [7] Protein Sequence Modelling with Bayesian Flow Networks

    [8] A foundational large language model for edible plant genomes

    [9] Kyber is InstaDeep’s in-house cluster and ranks among the world’s top 100 most powerful clusters and is among the top 20 H100 GPU clusters.

    Our commitment to our people

    We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.

    Right to work: Please note that you will require the legal right to work in the location you are applying for.

    Want to know more?

    These job openings might interest you!

    These companies are also recruiting for the position of “Data / Business Intelligence”.

    Apply