Popular repositories
-
ML-Regression-House-Price-Prediction-with-Random-Forest-Regressor (Public)
In this project, I use the random forest algorithm to build a house price prediction model on a Kaggle dataset with 16 features and 4,600 samples. Random Forest Regressor will be an optimal a…
Jupyter Notebook 1
-
ML-Classification-Breast-Cancer-Diagnosis-via-PCA-Linear-Regression-and-Logistic-Regression (Public)
The goal of this project is to compare the performance of Principal Component Analysis, Linear Regression, and Logistic Regression models on the automatic diagnosis of breast cancer.
MATLAB
-
Big-Data-and-NLP-Inverted-Index-Database-for-19-000-Reuters-News-Articles (Public)
This project implements an inverted index using Apache Spark (PySpark) to build a relational database (SQLite) for 19,000 Reuters news articles. Storing the index in a database offers the b…
Jupyter Notebook
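As a rough illustration of the inverted-index idea described above, the sketch below maps each term to the documents that contain it and stores the postings in SQLite. It is not the repository's code: it swaps PySpark for plain Python so it runs standalone, and the table name `postings`, the helper names, the tokenizer, and the three sample documents are all assumptions made for the example.

```python
import re
import sqlite3
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercase token to the sorted list of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        # Naive tokenizer (assumption): runs of ASCII letters and digits.
        for token in re.findall(r"[a-z0-9]+", text.lower()):
            index[token].add(doc_id)
    return {token: sorted(ids) for token, ids in index.items()}


def store_index(index, conn):
    """Write one (term, doc_id) row per posting into a hypothetical 'postings' table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS postings ("
        "term TEXT, doc_id INTEGER, PRIMARY KEY (term, doc_id))"
    )
    conn.executemany(
        "INSERT OR IGNORE INTO postings VALUES (?, ?)",
        [(term, doc_id) for term, ids in index.items() for doc_id in ids],
    )
    conn.commit()


def lookup(conn, term):
    """Return the ids of documents containing the term, in ascending order."""
    rows = conn.execute(
        "SELECT doc_id FROM postings WHERE term = ? ORDER BY doc_id",
        (term.lower(),),
    )
    return [row[0] for row in rows]


# Made-up sample documents, standing in for the news articles.
docs = {1: "Oil prices rise", 2: "Oil exports fall", 3: "Grain prices fall"}
conn = sqlite3.connect(":memory:")
store_index(build_inverted_index(docs), conn)
```

With the index in a table, a term lookup such as `lookup(conn, "oil")` becomes a single indexed SQL query rather than a scan over every document.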
-
NLP-and-Visualization-Sentiment-Analysis-with-300MM-Russsian-Troll-Tweets (Public)
This report presents text mining data analyses in Python on 3 million tweets associated with the Russian Internet Research Agency. Python libraries (nltk, bs4, and re) are applied to clean the text cont…
Jupyter Notebook
-
Hypothesis-Test-and-Correlation-Analysis-Rental-Subsidy-Rate-across-PHC-in-BC (Public)
Applied the KS test and t-test to check whether rental subsidy rate distributions differ across PHAs, and implemented Pearson correlation analysis to explore the linear correlation be…
R
-
ML-Clustering--Hierarchical-Clustering-and-K-means-Clustering (Public)
Implemented two clustering methods, Lloyd’s algorithm (“k-means”) and hierarchical agglomerative clustering, on two datasets and compared their performance based on the clustering results.
Jupyter Notebook 1
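For readers unfamiliar with Lloyd's algorithm, a minimal pure-Python sketch of its two alternating steps (assign each point to its nearest centroid, then move each centroid to the mean of its cluster) might look like the following. This is an illustration only, not the repository's implementation; the function name `kmeans`, the random initialization, the fixed seed, and the toy points are assumptions.

```python
import random


def kmeans(points, k, iters=100, seed=0):
    """Lloyd's algorithm on a list of equal-length numeric tuples."""
    rng = random.Random(seed)
    # Initialization (assumption): pick k distinct input points as starting centroids.
    centroids = rng.sample(points, k)
    clusters = [[] for _ in range(k)]
    for _ in range(iters):
        # Assignment step: attach each point to the centroid with the
        # smallest squared Euclidean distance.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster;
        # an empty cluster keeps its previous centroid.
        new_centroids = [
            tuple(sum(col) / len(cl) for col in zip(*cl)) if cl else centroids[i]
            for i, cl in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break  # assignments are stable, so the algorithm has converged
        centroids = new_centroids
    return centroids, clusters


# Toy data (made up): two well-separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, clusters = kmeans(points, 2)
```

The result depends on the initial centroids in general, which is one reason comparisons with hierarchical clustering often rerun k-means from several seeds.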