JennyYu2017

Jenny YU JennyYu2017

Extensive experience as a business analyst complemented with a Master Degree in Applied Data Science

0 followers · 1 following

https://www.linkedin.com/in/jenny-y-aaa69b166/

Popular repositories Loading

ML-Regression-House-Price-Prediction-with-Random-Forest-Regressor ML-Regression-House-Price-Prediction-with-Random-Forest-Regressor Public

In this project, I use the Random forest algorithm to build the house price prediction model on a dataset with 16 features and 4600 samples from Kaggle. Random Forest Regressor will be an optimal a…

Jupyter Notebook 1
ML-Classification-Breast-Cancer-Diagnosis-via-PCA-Linear-Regression-and-Logistic-Regression ML-Classification-Breast-Cancer-Diagnosis-via-PCA-Linear-Regression-and-Logistic-Regression Public

The goal of this project is to compare the performance of Principle Component Analysis, Linear Regression, and Logistic Regression models on the automatic diagnosis of breast cancer.

MATLAB
Big-Data-and-NLP-Inverted-Index-Database-for-19-000-Reuters-News-Articles Big-Data-and-NLP-Inverted-Index-Database-for-19-000-Reuters-News-Articles Public

This project is about implementing an inverted index using Apache Spark（PySpark）to build a relational database (SQLite) for 19,000 Reuters News Articles.Storing the index in a database offers the b…

Jupyter Notebook
NLP-and-Visualization-Sentiment-Analysis-with-300MM-Russsian-Troll-Tweets NLP-and-Visualization-Sentiment-Analysis-with-300MM-Russsian-Troll-Tweets Public

This report presents text mining data analyses in Python on 3 million tweets associated with a Russian Internet Research Agency. Python libraries(nltk, bs4 and re) is applied to clean the text cont…

Jupyter Notebook
Hypothesis-Test-and-Correlation-Analysis-Rental-Subsidy-Rate-across-PHC-in-BC Hypothesis-Test-and-Correlation-Analysis-Rental-Subsidy-Rate-across-PHC-in-BC Public

Applied KS test and T-test to check whether rental subsidy rate’s distributions are different across different PHAs and implemented Pearson-correlation analysis to explore the linear correlation be…

R
ML-Clustering--Hierarchical-Clustering-and-K-means-Clustering ML-Clustering--Hierarchical-Clustering-and-K-means-Clustering Public

Implemented two types of clustering methods, Lloyd’s algorithm (“k-means”) and hierarchical agglomerative clustering, on two datasets to compare their performance based on their clustering results.

Jupyter Notebook 1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jenny YU JennyYu2017

Block or report JennyYu2017

Popular repositories Loading