[DL Edition] T036: Uncertainty estimation #286

mbackenkoehler · 2022-12-06T11:45:59Z

Details

Talktorial ID: 036
Title: Uncertainty estimation
Original authors: Michael Backenköhler
Reviewer(s): TBD
Date of review: TBD

Content

One line summary: Illustrate basic uncertainty estimation in ML on small molecules
Potential labels or categories (e.g. machine learning, small molecules, online APIs): machine learning, small molecules, ensemble methods
Time it took to execute (approx.): TBD
I have used the talktorial template and followed the content and formatting suggestions there
Packages must be open-sourced and should be installable from conda-forge. If you are adding new packages to the TeachOpenCADD environment, please check if already installed packages can perform the same functionality and if not leave a sentence explaining why the new addition is needed. If the new package is not on conda-forge, please list them and their intended usage here.
- package1: Already in TeachOpenCADD
- package2 (conda-forge): I use it for XXX
- package3 (pip only): I use it for XXX
Data must be publicly available, preferably accessible via a webserver or downloadable via a URL. Please list the data resources that you use and how to access them:
- EGFR binding affinities: Talktorial 022

Content style

Talktorial includes cross-references to other talktorials if applicable
The table of contents reflects the talktorial story-line; order of #, ##, ### headers is correct
URLs are linked with meaningful words, instead of pasting the URL directly or linking words like here.
I have spell-checked the notebook
Images have enough resolution to be rendered with quality, without being too heavy.
All figures have a description
Markdown cell content is still in-line with code cell output (whenever results are discussed)
I have checked that cell outputs are not incredibly long (this applies also to DataFrames)
Formatting looks correctly on the Sphinx render (bold, italics, figure placing)

Code style

Website

We present our talktorials on our TeachOpenCADD website (https://projects.volkamerlab.org/teachopencadd/), so we have to check as well if the Jupyter notebook renders nicely there.

If this PR adds a new talktorial, please follow these steps:
- Add your talktorial to the complete list of talktorials here (at the end).
- Add your talktorial to one or multiple of the collections here. Or propose a new collection section in your PR.
- Add your talktorial's nblink file by running python generate_nblinks.py from within the directory teachopencadd/docs/talktorials.
- Please complile the website following the instructions here.
Check the rendering of the talktorial of this PR.
Is your talktorial listed in the talktorial list?
Is your talktorial listed in the talktorial collections?
- Add a picture for your talktorial in the collection view by following these instructions.

review-notebook-app · 2022-12-06T12:04:39Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

gerritgr · 2023-02-07T11:50:45Z

Intro (or discussion): maybe be more critical of the claims of UE in general/clarify the scope of UE.
Calibration: maybe start with an example or, at least, clarify the setting like: "Assume we have a ML model that incorporates uncertainty, how do we evaluate and improve the predicted uncertainty" or even put this after the methods part.
"varying the model's architecture explicitely or via a Bayesian network with probabilistic dropout." -> explain a little bit more, explicitely -> explicitly
"we can compute confidence intervals based on the standard deviations, we get out of our model ensemble. According to the definition of the confidence interval..." Can we get an actual confidence interval here? Maybe ref to a rigorous definition. Why use the STD and not the samples in a non-parametric way?

Start branch

0824751

mbackenkoehler changed the title ~~[DL Edition] Uncertainty estimation~~ [DL Edition] T0036: Uncertainty estimation Dec 6, 2022

mbackenkoehler changed the title ~~[DL Edition] T0036: Uncertainty estimation~~ [DL Edition] T036: Uncertainty estimation Dec 6, 2022

Start talktorial using the T000 template

42aeed4

mbackenkoehler added 3 commits December 6, 2022 16:46

First work on uncertainty est. talktorial

bb293b7

add bagging (resampling training data)

841652b

some more structure

004c1d6

AndreaVolkamer added new talktorial New talktorial labels Dec 8, 2022

mbackenkoehler added 3 commits December 15, 2022 14:07

continue writing and structuring

620844f

ensemble image

76dfeb8

text on ensemble methods

5e9edc6

dominiquesydow mentioned this pull request Dec 27, 2022

[2023.05.2-base] DL edition #285

Merged

9 tasks

mbackenkoehler and others added 8 commits December 28, 2022 13:42

start transition to pytorch

3954a1e

re-structuring; initial ensemble code

ce33469

more code cleanups and writing

f8c7d48

some more writing

7fe3558

some re-ordering

678819c

readme; clean up redundant parts

db56055

clean up for CI

b08d91a

Review of Uncertainty Estimation talktorial T036

e91e548

mbackenkoehler added 4 commits February 7, 2023 14:30

quiz questions

616dffc

incorporate Romans feedback

829b7a3

readme typo

2d046b7

incorporate some feedback

092f9cf

gerritgr merged commit 1fb5b6e into DL_edition Apr 11, 2023

mbackenkoehler deleted the mb-036-uncertainty-estimation branch January 29, 2024 10:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DL Edition] T036: Uncertainty estimation #286

[DL Edition] T036: Uncertainty estimation #286

mbackenkoehler commented Dec 6, 2022 •

edited

Loading

review-notebook-app bot commented Dec 6, 2022

gerritgr commented Feb 7, 2023

[DL Edition] T036: Uncertainty estimation #286

[DL Edition] T036: Uncertainty estimation #286

Conversation

mbackenkoehler commented Dec 6, 2022 • edited Loading

Details

Content

Content style

Code style

Website

review-notebook-app bot commented Dec 6, 2022

gerritgr commented Feb 7, 2023

mbackenkoehler commented Dec 6, 2022 •

edited

Loading