Module 3 [1 ECTS] - Bioinformatics and Metagenomics

Start date: Week 37 (10th September 2024)

Note: This module contains two lectures held on two different dates: September 10th Tuesday at 9:00-10:00 CEST and September 11th Wednesday at 10:00-11:00 CEST.

Speaker 1: Nima Rafati (Data Scientist at NBIS, SciLifeLab, Uppsala University; Tuesday 10th September 9:00-10:00 CEST)

Nima Rafati is a passionate bioinformatician focused on uncovering patterns in biological data through the analysis of sequencing data across various living systems.

Title: Decoding Cell-Level Biology with scRNA-Seq Data

The lecture will cover the scRNA-Seq workflow, including sample preparation, sequencing, data preprocessing, and result interpretation. Key topics such as quality control and clustering will be discussed. An example of scRNA-Seq data analysis will be demonstrated to illustrate these concepts.

Speaker 2: Laura Carroll (DDLS Fellow; Pre-recorded)

Dr. Laura Carroll is an Assistant Professor and DDLS Fellow in the Department of Clinical Microbiology at Umeå University. As a microbial bioinformatician, her research focuses on developing bioinformatic approaches to leverage whole-genome sequencing (WGS) data for improving pathogen surveillance, source tracking, outbreak detection, and risk evaluation. Learn more about her work on her institutional page or visit her research group website. Find her publications on Google Scholar.

Title: (Meta)genome mining approaches for microbial secondary metabolite discovery

Microbes can produce a wide range of secondary metabolites, which allow them to interact with each other, as well as with their environment more broadly. Beyond their ecological roles, some microbial secondary metabolites have found uses in medicine or industry (e.g., as first-in-class drugs), while others can have negative impacts on human health (e.g., as toxins or carcinogens). To discover novel microbial secondary metabolites, (meta)genome mining approaches make use of the fact that the enzymatic machinery responsible for secondary metabolite production is encoded by physically clustered groups of genes in microbial genomes, called biosynthetic gene clusters (BGCs). Here, we will discuss state-of-the-art (meta)genome mining approaches for BGC discovery, with a focus on machine learning (ML) approaches. We’ll walk through how to train and evaluate a sequence segmentation model for BGC detection tasks, step-by-step, and we’ll discuss common ML pitfalls. Finally, we’ll discuss how these methods can be applied to (meta)genomic data to gain insight into microbe-microbe and microbe-host interactions.

Computer Lab: The lab will cover tools in bioinformatics and metagenomics data analysis with practical exercises, providing hands-on experience in applying these techniques to real-world datasets.