ChangeMyView-GPT3.5

Based on Chenhao Tan's 2016 paper Winning Arguments: Interaction Dynamics and Persuasion Strategies in Good-faith Online Discussions, this project leverages the semantic analysis capabilities of GPT-3.5 to identify tonal attributes that influence the persuasiveness of textual content.

After reading the original paper, I was very fascinated by the overarching goal: finding subtle linguistic trends that hold predictive power for persuasiveness. Given that the features extracted in the original paper were of a lexical nature, I was interested to use GPT-3.5 to perform some semantic classification and see how they relate to both GPT-3.5’s predicted persuasiveness and their actual delta vote. Due to a lack of processing power, I used around 500 pairs of data from the training set, totaling to 1086 data points.

With some prompt engineering I asked GPT-3.5 to label the formality, subjectivity, optimistic vs. cynical tone, extremity, and lexical density from -1.0 to 1.0. The choice of these tonal labels and the wording of these criteria is from my personal knowledge of textual features. Based on these tonal observations, I asked it to make a judgement from 0.0 to 1.0 on the overall persuasiveness.

Plotting a violin graph gave a more intuitive visualization of the data distribution. Here the darker contour represents the positive data and the lighter contour the negative. In accordance with reddit most posts were informal, subjective with a heavy dose of cynicism. More positive posts were optimistic and high in lexical density. Notably, the posts getting the delta votes seemed to be more polarized on the extremity scale.

While I was confident that GPT-3.5 could do a pretty good job extracting semantic data, I was curious to see how good the persuasiveness predictions are since they involved higher level reasoning. Hence, I employed a logistic regression model to predict the likelihood of an argument being classified as 'positive,' which, in this context, corresponds to receiving a delta vote on a forum. This roughly approximates to the persuasiveness score. Below is a 2D kernel density estimation (KDE) graph overlaid with a regression trend line. There is a weakly positive correlation between the persuasiveness scores generated by GPT 3.5 and the predicted probability generated by the logistic regression model, suggesting that the concept of persuasiveness demands a richer interpretation and more comprehensive extraction of textual data.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.DS_Store		.DS_Store
.gitignore		.gitignore
Prompt_Template.txt		Prompt_Template.txt
README.md		README.md
attribute_violin.png		attribute_violin.png
correlation_matrix.png		correlation_matrix.png
extract_to_json.py		extract_to_json.py
heldout_op_data.json		heldout_op_data.json
heldout_pair_data.jsonlist		heldout_pair_data.jsonlist
logistic_regression.py		logistic_regression.py
logreg_KDE_scatter.png		logreg_KDE_scatter.png
main.py		main.py
output.json		output.json
plotting.py		plotting.py
processing.log		processing.log
semantic_classification.py		semantic_classification.py
update_json.py		update_json.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ChangeMyView-GPT3.5

About

Releases

Packages

Languages

JoNeedsSleep/ChangeMyView-GPT3.5

Folders and files

Latest commit

History

Repository files navigation

ChangeMyView-GPT3.5

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages