There's a lot of hype in the news about cryptocurrency, so we want to take stock of the latest news descriptions regarding Bitcoin and Ethereum to get a better feel for the current public sentiment around each coin.
In this assignment, I have applied natural language processing to understand the sentiment in the latest news articles featuring Bitcoin and Ethereum. I have also applied fundamental NLP techniques to better understand the other factors involved with the coin prices such as common words and phrases and organizations and entities mentioned in the articles.
This project requires a newsapi key and uses the nltk and spaCy NLP libraries.
Overall, sentiment towards Bitcoin and Ethereum are mixed. Bitcoin's mean compound score is -0.3 indicating a mixed negative sentiment. Ethereum's mean compound score is 0.1 indicating a mixed positive sentiment. As can be seen in the top 10 n-grams frequency analysis below, it is evident why Bitcoin sentiment is worse than Ethereum given Bitcoin's bigrams listed from 7-10. Ethereum doesn't have such negative associations in its top 10 bigrams list.
- bitcoin, mining, 7
- international, monetary, 6
- el, salvador, 6
- legal, tender, 4
- mining, system, 4
- jack, dorsey, 4
- heather, morgan, 4
- six, fears, 4
- fears, russian, 4
- russian, attack, 4
- nft, marketplace, 6
- york, city, 4
- ethereum, blockchain, 4
- sequoia, capital , 4
- capital, india, 4
- nonfungible, token, 4
- million, ethereum, 3
- bitcoin, ethereum, 3
- led, sequoia, 3
- blockchain, bridge, 3
Wordclouds help present the most common words throughout the texts analyzed. When working through the analysis, it is also a helpful tool for identifying frequent stopwords that are not included in NLTKs standard stopwords module.