Machine Learning Engineer Intern

Organization: InterGlobe Technologies
Duration: Dec 2018 - Mar 2019
Location: Gurgaon, Haryana, India
Language: Python
Certificate: Certificate
Github: Sentiment Analyzer and Email Classifier

Overview: 1-2 minute read

I Started by doing a literature survey on various Natural Language Processing libraries like Spacy, Nltk, TextBlob, Textacy and documented the pros and cons of each library; hence making it easy for me to select the preferred library.

After selecting a library, I moved on to creating a sentiment analyzer using the Twitter Sentiment140 dataset which contains 1.6 million sentences classified as good, bad or neutral sentiment. Using the bag of words methodology I featurized my data points by normalizing, lemmatizing, stemming and removing the stopwords. I Then used various Machine Learning algorithms to build a predictive model and got an overall accuracy of 83%.

For the last step, I used similar techniques to classify airways emails according to it's content into classes like "Passport", "Complaint", "Boarding Pass" etc and received an overall accuracy of 87%; crushing the 74% accuracy of the previously deployed email classifier.

Google Sites

Report abuse

Machine Learning Engineer Intern

Organization: InterGlobe Technologies

Duration: Dec 2018 - Mar 2019

Location: Gurgaon, Haryana, India

Language: Python

Certificate: Certificate

Github: Sentiment Analyzer and Email Classifier

Overview: 1-2 minute read

Get In Touch

jaskiratsinghbhatia4@gmail.com | (226) 507-2637

© Jaskirat Singh Bhatia, 2020