Kaggle and the OpenVaccine Competition
Kaggle is an online community of data scientists, ML engineers, and MLOps champions who build models and technical solutions for real-world problems. Kaggle competitions pose such problems to the community in order to advance its collective knowledge and capabilities. The notebook you will work with is based on a Kaggle project, the OpenVaccine competition, that used data science to develop models and design rules for RNA degradation. The model predicts the likely degradation rate at each base of an RNA molecule. It is trained on a subset of an Eterna dataset comprising over 3000 RNA molecules, which span a wide range of sequences and structures, together with their degradation rates at each position. This course is a self-service exploration of the problem, solved using a Jupyter notebook and Kubeflow Pipelines.
- More details on this project can be found here.
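To make "a prediction at each base" concrete, here is a minimal sketch of how one molecule might be turned into per-position inputs. It assumes each molecule comes with a sequence string and a dot-bracket structure string of the same length. The vocabularies, the `encode_molecule` helper, and the example molecule are hypothetical illustrations, not the notebook's actual preprocessing or real Eterna data.

```python
# Hypothetical vocabularies; the real notebook may encode inputs differently.
SEQUENCE_VOCAB = {"A": 0, "C": 1, "G": 2, "U": 3}
STRUCTURE_VOCAB = {"(": 0, ")": 1, ".": 2}


def encode_molecule(sequence, structure):
    """Map a sequence and its dot-bracket structure to per-base integer pairs."""
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have the same length")
    return [
        (SEQUENCE_VOCAB[base], STRUCTURE_VOCAB[symbol])
        for base, symbol in zip(sequence, structure)
    ]


# Made-up example molecule, not taken from the Eterna dataset.
example_sequence = "GGAAAACC"
example_structure = "((....))"
features = encode_molecule(example_sequence, example_structure)

# One feature pair per base: a per-base model would emit one
# degradation-rate prediction for each of these positions.
print(len(features))
```

The point of the sketch is the shape of the problem: the input has one entry per base, so the training targets and the model's output are also sequences with one value per position.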