Skip to content

Project made in Jupyter Notebook with Kaggle Titanic dataset, which aims at detailed data analysis and prediction of which passengers survived the sinking of the Titanic.

Notifications You must be signed in to change notification settings

golecalicja/titanic-survivor-prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Titanic Survivor Prediction

Table of contents

Introduction

Project made in Jupyter Notebook with Kaggle Titanic dataset, which aims at detailed data analysis and prediction of which passengers survived the sinking of the Titanic.

Data description

image

pclass: A proxy for socio-economic status (SES)
1st = Upper
2nd = Middle
3rd = Lower

age: Age is fractional if less than 1. If the age is estimated, is it in the form of xx.5

sibsp: The dataset defines family relations in this way...
Sibling = brother, sister, stepbrother, stepsister
Spouse = husband, wife (mistresses and fiancés were ignored)

parch: The dataset defines family relations in this way...
Parent = mother, father
Child = daughter, son, stepdaughter, stepson
Some children travelled only with a nanny, therefore parch=0 for them.

Methods used

  • Cleaning Data
  • Statistical Inference
  • Exploratory Data Analysis (EDA)
  • Data Visualization
  • Supervised Machine Learning Algorithms: Logistic Regression, Random Forest, Naive Bayes, K-nearest Neighbors, SVC

Technologies used

  • Python 3.8.8
  • Pandas 1.2.4
  • Matplotlib 3.3.4
  • Seaborn 0.11.1
  • Sklearn 0.24.1

About

Project made in Jupyter Notebook with Kaggle Titanic dataset, which aims at detailed data analysis and prediction of which passengers survived the sinking of the Titanic.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published