`tweet-sentiment-analysis`

Sentiment Analysis of Tweets from Kaggle Twitter Dataset

Statement: Given Tweet Content and an Entity, the task is to judge sentiment of Tweet Content about entity. There are 3 classes in this dataset: Positive, Negative and Neutral (messages not relevant to the entity, i.e. Irrelevant) classified as Neutral.

Steps involved in Preprocessing of Raw Data (w/ `regex`)

Delete nans
Lower Text
Remove urls
Remove punctuation
Remove contractions (why'd -> why would)
Remove mentions (@user hey -> hey)
Remove hashtags (#sometrend -> sometrend)
Remove double spaces
Decode emojis
Remove stopwords (how is the weather -> weather)
Remove numbers (my id 882244 -> my id)
Delete nans
Lemmatize (the boy's cars are different colors -> the boy car be differ color)
Texts vectorized with TF-IDF vectorizer
Categorical features one-hot-encoded

Neural-Net Architecture

NNSentimentClassifier(
	(softmax): Softmax(dim=1)
	(dropout): Dropout(p=0.2, inplace=False)
	(model): Sequential(
		(0): Linear(in_features=8032, out_features=1000, bias=True)
		(1): ReLU()
		(2): Dropout(p=0.2, inplace=False)
		(3): Linear(in_features=1000, out_features=100, bias=True)
		(4): Tanh()
		(5): Dropout(p=0.2, inplace=False)
		(6): Linear(in_features=100, out_features=1000, bias=True)
		(7): ReLU()
		(8): Dropout(p=0.2, inplace=False)
		(9): Linear(in_features=1000, out_features=10, bias=True)
		(10): ReLU()
		(11): Dropout(p=0.2, inplace=False)
		(12): Linear(in_features=10, out_features=4, bias=True)
	)
)

Reports

`Accuracy of Implemented Neural Network: 94%`

classes = {
	'Irrelevant': 0, 
	'Negative': 1,
 	'Neutral': 2,
 	'Positive': 3
}

Saved NN Model

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
README.md		README.md
net_94acc.pt		net_94acc.pt
tweet-sentiment-analysis.ipynb		tweet-sentiment-analysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

`tweet-sentiment-analysis`

Steps involved in Preprocessing of Raw Data (w/ `regex`)

Neural-Net Architecture

Reports

`Accuracy of Implemented Neural Network: 94%`

`References`

About

Releases

Packages

Languages

lilithfactor/tweet-sentiment-analysis

Folders and files

Latest commit

History

Repository files navigation

tweet-sentiment-analysis

Steps involved in Preprocessing of Raw Data (w/ regex)

Neural-Net Architecture

Reports

Accuracy of Implemented Neural Network: 94%

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`tweet-sentiment-analysis`

Steps involved in Preprocessing of Raw Data (w/ `regex`)

`Accuracy of Implemented Neural Network: 94%`

`References`

Packages