📊 HT2024: Data-driven Life Sciences

Table of Contents

Welcome to the Data-driven Life Sciences 2024 course, where you will explore the intersection of data science, artificial intelligence, and life sciences to drive innovation and discovery. This fully online course culminates in an in-person hackathon, fostering a vibrant community that gathers the DDLS and SciLifeLab members.

The 6 modules aim to introduce learners to computer-driven life sciences, covering application areas in data-driven life sciences. Guest lecturers (DDLS Fellows, SciLifeLab fellows, and SciLifeLab facility training providers) will teach topics including technologies and analysis of data sets from proteomics, transcriptomics, biomolecular structure, molecular dynamics simulations, and various imaging techniques. These modules present, analyze, and discuss models of biological phenomena and related scientific breakthroughs based on such data analysis.

The course is open to all the master’s, PhD students and anyone who interested in Sweden,

The registration is closed!

You can either register for all the modules or individual modules.

Specifically students at KTH, you should register for the course at the KTH course selection system:

Intended Learning Outcomes

  • Describe the field of data-driven life sciences

  • Present an overview of various application areas

  • Provide examples of applications and their associated analysis methods

  • Apply statistical and machine learning analysis to biological data sets

  • Formulate models of biological phenomena

  • Present and review scientific literature in computer-driven life sciences

  • Reflect on the ethical consequences of data-driven life sciences

  • Practice good data management, including collection, handling, sharing, and analysis

Course Format and Credits

The course consists of 6 modules: a basic module and 5 elective modules. Each module is a stand-alone short-form course with 3 sessions per module. Completing all 6 modules earns a total of 7.5 ECTS.

Modules and Certification

Each course module contains 3 main activities: a lecture on Tuesday, giving background information about the topic(s) of the week; a computer lab, usually on Wednesday, where you will practically explore data exploration using Python in Jupyter notebooks; and a journal club on Friday where you will collectively explain and discuss an assigned research paper.

Both the computer lab and the journal club are mandatory and graded activities!

  • For the labs, you be given a Jupyter notebook with tutorials and coding exercises. You will be running and practicing coding by running the Jupyter notebook in Google Colab. During the lab session, you can discuss your answers with the lab teachers. You will submit your final notebook with answers to the questions to be graded.

  • For the journal club, you will have a list of questions (the same every week) which we will answer during the journal club. Your participation and answer to these questions will be graded during the journal club.

In the final project week/hackathon, course participants will choose a project to be conducted in pairs (Master’s students) or individually (PhD students), followed by an oral presentation. The project and presentation will be graded by peers and the examiner. Please note that the oral presentation is only required for Master’s students; PhD students are exempt from this requirement.

You can sign up partially or for the whole course. Each module is worth a certain number of ECTS credits, and you can choose to participate in the modules that interest you most. However, to obtain a certification for participation, you must complete all modules.

Module 1 (basic module) is a prerequisite for obtaining certification for attendance in any of the subsequent elective modules. To receive a certificate, you must:

  • Apply for and participate in Module 1.

  • Actively participate in all registered modules by attending and engaging in all sessions.

You will receive 0.5 ECTS for the basic module and 1 ECTS for each elective module. Completing a final project earns an additional 2 ECTS. For example, if you are a PhD student interested in Module 2, you must complete both Module 1 and Module 2 to receive 0.5 + 1 ECTS. Master’s students from KTH or other universities must complete the entire course, including the basic and elective modules, plus the final project.

The first module covers course basics and prerequisites. The subsequent modules each run for a week and cover various topics in data-driven life sciences (DDLS). Each week includes three main activities:

  • One or two 2-hour lectures on Tuesday providing background information about the week’s topic(s)

  • A computer lab on Wednesday for practical data exploration using Python in Jupyter notebooks (Google Colab)

  • A journal club on Friday for collective explanation and discussion of an assigned research paper

You can either register for the full course or selected modules.

Self-directed Learning

One of the principal objectives of this course is to instill in you a data-driven mindset essential for life science research. More importantly, we aim to teach you the art of “learning how to learn.” This involves honing your ability to independently acquire new knowledge and skills. ChatGPT serves as an excellent tool to facilitate this self-directed learning process. Instead of focusing solely on the course content, we encourage you to embrace this broader, skill-centric approach to education. We will use this self-directed learning approach extensively in the computer lab and journal club sessions.

Course Modules

Note: The exact content for each module may be subject to change. Please refer to the updated schedule for the most current information.

Module 1

  • Lecture by Wei Ouyang: Introduction to Data-Driven Life Science and AI
  • Computer Lab and Journal Club will be announced one week before the module starts

Module 2

  • Lecture by Patrick Bryant: Protein structure prediction and design
  • Lecture by Johan Bengtsson-Palme: Large-scale predictions of antibiotic resistance in microbial communities
  • Computer Lab and Journal Club will be announced one week before the module starts

Module 3

  • Lecture by Nima Rafati: Decoding Cell-Level Biology with scRNA-Seq Data
  • Lecture by Laura Carroll: Metagenomics and Microbiome Analysis
  • Computer Lab and Journal Club will be announced one week before the module starts

Module 4

  • Lecture by Estibaliz Gómez de Mariscal: Deep learning enabled image-driven scientific discovery
  • Lecture by Wei Ouyang: Advances in Generative AI for Conditional Microscopy Cell Images
  • Computer Lab and Journal Club will be announced one week before the module starts

Module 5

  • Lecture by Daniel Fürth: From Single Molecules to Tissue Atlases
  • Computer Lab and Journal Club will be announced one week before the module starts

Module 6

  • Lecture by Cemal Erdem: System Biology and computational modeling
  • Lecture by Gabriel Reder: Accelerating Scientific Discovery with Large Language Models
  • Computer Lab and Journal Club will be announced one week before the module starts

Final Project

Grades

The final grade on the course is determined as follows:

  • The computer labs are graded P/F. To pass, you need to attend all the labs and answer the questions to a satisfactory degree.

  • Participation in the journal club is graded as P/F. To pass, you need to read the assigned papers, attend all the journal club and participate in discussions.

  • The project is graded P/F. To pass, you need to carry out the project, and get a passing grade on the project report.

  • The oral exam (project presentation) at the end of course will be graded A-F scale.

  • For master students, the grade on the oral exam determines the final grade on the course provided the other three activities have received a passing grade.

  • For PhD students, the oral exam is not mandatory, but you are welcome to join the oral exam if you want to practice your presentation skills.

The submission of the computer lab, the project report and the oral exam will be first reviewed and graded by your peers, and then the final grade and feedbacks will be given by the teachers and examiner.

Communication and groups

You can find all announcements and posts at here.

For questions, please email the course responsible, see contact here.

Meet your instructor

In this course, you will meet:

  • Wei Ouyang, assistant professor in biophysics who will be holding the lectures, journal club and grading them (weio@kth.se). Wei is also the course responsible.
  • Songtao Cheng, PhD student in biophysics, who will be teaching assistant for the course, chairing lectures, holding the computer labs and journal clubs songtao.cheng@scilifelab.se)
  • Nils Mechtel, PhD student in biophysics, who will be teaching assistant for the course, chairing lectures, holding the computer labs and journal clubs(nils.mechtel@scilifelab.se)

Course schedule

See the course schedule for the detailed schedule.

FAQs

Are there prerequisites?

The students are expected to have basic knowledge of biology and programming in Python. If you are not familiar with Python, it will be helpful if you can go through the prerequisites.

How often do the courses run?

We run the course once a year, in Period 1 from August to October.

How do I register the course?

Note: The registration is closed!

  • For KTH master students, please register at the KTH course selection system.
  • For PhD students or anyone from other universities, please sign up at here.
  • You can also sign up for individual module(s) here.

How do I get access to the course material?

The course material will be made available on each module’s page. You can find the module pages under the course modules section. We won’t be using KTH Canvas.

Can I attend the course remotely?

Yes, the entire course is conducted online. All lectures and sessions will be held via Zoom. You can join the lectures, computer labs and journal clubs using this Zoom link.

The hackathon will take place in person at SciLifeLab, Tomtebodavägen 23, 171 65 Solna. While physical attendance at the hackathon is optional and not required for participating in the final project, you will need to arrange your own accommodation if you choose to attend in person. Please note that accommodation costs are not covered by the course. Further details about the hackathon and its schedule will be communicated via email and updated on the course page.

What if I cannot attend the computer labs or journal club?

For all selected modules, it is mandatory to attend all the associated computer labs and journal club in order to receive credit. If you anticipate missing a computer lab or journal club, please notify the course responsible as soon as possible. Exceptions will be considered on a case-by-case basis.

If an exception is granted, you will still be required to submit the notebook for the computer lab or write a report to address the journal club question sheet to ensure you meet the module requirements.

Will the lecture be recorded?

The recording of lectures may vary depending on the lecturer, and it cannot be guaranteed. However, the slides will be made available after each lecture.

Can I use ChatGPT / Claude or other generative AI tools in the computer labs, journal club, assignments or the final project?

Yes, you are encouraged to use these AI tools to facilitate your learning, and perform tasks. However, you need to make sure that you are using them in a responsible way. Importantly, if you used it for submitting any graded activities, please make sure to also attach the exported conversation history with the AI tool.