Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DL Edition] T035: SMILES based property prediction #320

Merged
merged 7 commits into from
Apr 11, 2023
Merged

Conversation

azmtag
Copy link
Collaborator

@azmtag azmtag commented Jan 18, 2023

Talktorial review

Details

  • Talktorial ID: T035
  • Title: [DL Edition] T035: SMILES based property prediction
  • Original authors: Azat Tagirdzhanov
  • Reviewer(s): XXX
  • Date of review: DD-MM-YYYY

Content

  • One line summary: SMILES based property prediction using recurrent neural networks
  • Potential labels or categories (e.g. machine learning, small molecules, online APIs): XXX
  • Time it took to execute (approx.):
  • I have used the talktorial template and followed the content and formatting suggestions there
  • Packages must be open-sourced and should be installable from conda-forge. If you are adding new packages to the TeachOpenCADD environment, please check if already installed packages can perform the same functionality and if not leave a sentence explaining why the new addition is needed. If the new package is not on conda-forge, please list them and their intended usage here.
    • numpy, pandas, matplotlib: Already in TeachOpenCADD
    • pytorch 1.12.1 (conda-forge): I use it for implementing neural networks
  • Data must be publicly available, preferably accessible via a webserver or downloadable via a URL. Please list the data resources that you use and how to access them:
    • QM9 dataset: Added to the repository (12M)

Content style

  • Talktorial includes cross-references to other talktorials if applicable
  • The table of contents reflects the talktorial story-line; order of #, ##, ### headers is correct
  • URLs are linked with meaningful words, instead of pasting the URL directly or linking words like here.
  • I have spell-checked the notebook
  • Images have enough resolution to be rendered with quality, without being too heavy.
  • All figures have a description
  • Markdown cell content is still in-line with code cell output (whenever results are discussed)
  • I have checked that cell outputs are not incredibly long (this applies also to DataFrames)
  • Formatting looks correctly on the Sphinx render (bold, italics, figure placing)

Code style

  • Variable and function names follow snake case rules (e.g. a_variable_name vs aVariableName)
  • Spacing follows PEP8 (run Black on the code cells if needed)
  • Code line are under 99 characters each (run black-nb -l 99)
  • Comments are useful and well placed
  • There are no unpythonic idioms like for i in range(len(list)) (see slides)
  • All 3rd party dependencies are listed at the top of the notebook
  • I have marked all code cell with output referenced in markdown cells with the label # NBVAL_CHECK_OUTPUT
  • I have identified potential candidates for a code refactor / useful functions
  • All import ... lines are at the top (practice part) cell, ordered by standard library / 3rd party packages / our own (teachopencadd.*)
  • I have used absolute paths instead of relative paths
    HERE = Path(_dh[-1])
    DATA = HERE / "data"

Website

We present our talktorials on our TeachOpenCADD website (https://projects.volkamerlab.org/teachopencadd/), so we have to check as well if the Jupyter notebook renders nicely there.

  • If this PR adds a new talktorial, please follow these steps:
    • Add your talktorial to the complete list of talktorials here (at the end).
    • Add your talktorial to one or multiple of the collections here. Or propose a new collection section in your PR.
    • Add your talktorial's nblink file by running python generate_nblinks.py from within the directory teachopencadd/docs/talktorials.
    • Please complile the website following the instructions here.
  • Check the rendering of the talktorial of this PR.
  • Is your talktorial listed in the talktorial list?
  • Is your talktorial listed in the talktorial collections?
    • Add a picture for your talktorial in the collection view by following these instructions.

@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@AndreaVolkamer AndreaVolkamer mentioned this pull request Feb 8, 2023
9 tasks
@Old-Shatterhand
Copy link
Collaborator

@azmtag Did you implement Paula's feedback? I could review it then.

@@ -0,0 +1,1144 @@
{
Copy link
Collaborator

@Old-Shatterhand Old-Shatterhand Mar 15, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Put a link to the Talktorial introducing one-hot encoding.


Reply via ReviewNB

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One thing, I forgot: There's a review section on the top of the notebook, when finalizing, I'd remove/resolve that one as well.
But overall, a very nice and good notebook!

@gerritgr gerritgr merged commit cf5f76c into DL_edition Apr 11, 2023
@mbackenkoehler mbackenkoehler deleted the at-035-rnns branch January 29, 2024 10:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants