2020-IA-bonnes-pratiques-notebook

The goal is to show how we can improve a very simple, notebook-based, project into a very readable, reusable and changeable one.

Steps go as follows :

0 : Initial project. Base code from janakiev made a lot worse on purpose.
1 : Improve the notebook itself : Add markdown, better and more pythonic code, etc.
2 : Add some extra files : README.md and requirements.txt and use an isolated environment.
3 : Separate notebooks as a DAG.
4 : Externalise some of the code for better readability.
5 : Unit test most of the externalised code.
6 : Using Papermill for parametrized execution.

The overall final project architecture is a free interpretation of the Cookie Cutter Data Science project.

Extra interesting documentation on this matter :

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.vscode		.vscode
0-Initial-Project		0-Initial-Project
1-Better-Notebook		1-Better-Notebook
2-Extra-Files		2-Extra-Files
3-Separate-Notebooks		3-Separate-Notebooks
4-Separate-Code		4-Separate-Code
5-Unit-Tests		5-Unit-Tests
6-Papermill		6-Papermill
Extra		Extra
.gitignore		.gitignore
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback