About

I first created this repository to tackle the Kaggle Competition called Jane Street Market Prediction.

However, as I was solving it, I realised I am building a much broader framework for experimentation and ML development in PyTorch.

Hence, I decided to publish this framework together with my solution so that others can benefit from my efforts.

Installation

git clone https://github.com/TomasDavidYe/kaggle-jane-street-market-prediction;
cd kaggle-jane-street-market-prediction;
pip3 install -r requirements.txt;

How to use this package?

There are 2 ways this package can be used.

To get inspired by my solution to the Jane Street Market Prediction competition.
To reuse some of my utility functions for your own ML development in PyTorch

Jane Street Kaggle Competition Usage

First, pull data from the original Kaggle Data Source.

Try to run simple_mlp_experiment.py script to make sure the workspace is setup correctly. If that works, you can use ExperimentContext framework the to either improve the SimpleMLPModel originally presented, or you can build your own model which implements the AbstractModel interface.

Example

from entities.ExperimentContext import ExperimentContext

context = ExperimentContext(
    model=model, # Your own model
    train_data=train_data,
    test_data=test_data
)

ML Framework Usage

You can also use this package to conduct your own ML experiments with PyTorch.

For example, checkout my EllipseClassificationExperiment where I use PyTorch to define a logistics regression model and use it to classify points generated by a 2 dimensional Gaussian distribution.

For me, these experiments helped me a lot to gain intuition about how different models work.

from lab.EllipseClassificationExperiment import EllipseClassificationExperiment
EllipseClassificationExperiment(model=model,  # any class implementing the AbstractModel interface
                                feature_transform=feature_transform,  # any function transforming input vectors to different vectors, make sure the output size is pluggable into your model
                                num_of_points=300,
                                buffer_coefficient=3,
                                ellipse_a=2,
                                ellipse_b=1).run()

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
data_generation		data_generation
deep_learning		deep_learning
entities		entities
lab		lab
models		models
pytorch_tutorials		pytorch_tutorials
utils		utils
visualisation		visualisation
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
logistics_regression_experiment.py		logistics_regression_experiment.py
requirements.txt		requirements.txt
simple_mlp_experiment.py		simple_mlp_experiment.py
utility_function_experiment.py		utility_function_experiment.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Installation

How to use this package?

Jane Street Kaggle Competition Usage

Example

ML Framework Usage

About

Releases

Packages

Languages

TomasDavidYe/kaggle-jane-street-market-prediction

Folders and files

Latest commit

History

Repository files navigation

About

Installation

How to use this package?

Jane Street Kaggle Competition Usage

Example

ML Framework Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages