Project Title: Building a Humor Detection Model
BASIS Advisor: Dr. Ryan Grove
Internship Location: Clarabridge Inc.
Onsite Mentor: Sandesh Sanjay Gade
For my Senior Project, I will be researching artificial intelligence (AI) at Clarabridge, a leading software company in advanced text analytics. The purpose of my project is to build a humor detection model. Identifying humor adds another skill to Clarabridge’s wide range of offerings, which already includes sentiment analysis, emotion recognition, intent detection, and predictive analytics. Adding a model that can detect humor would enhance Clarabridge’s understanding of customers’ feedback in surveys, customer service calls, emails, and social media platforms, helping create a better customer experience for users. Moreover, key concepts and findings from building this model could be extraordinarily useful in related natural language processing classification models, including irony and sarcasm detection. To complete this project, I will first implement a Flask application service that can be deployed using Docker to train machine learning models, make predictions, and test trained models using k-folds cross validation. Through experimentation as well as individual research in this rapidly advancing field of machine learning, I aim to explore the strengths and weaknesses of each approach. By the end of the project, I hope to have gained a thorough understanding of humor detection using AI. I also hope my humor classification model achieves a high performance and helps Clarabridge better analyze the feelings and emotions of customers.
My Posts
Week 11: Final Presentation & Deliverable
Final Presentation Final Deliverable Below is the link to my GitHub page, where you can find all the code needed to run my Humor Detection model on your computer: https://github.com/ETH88/Humor_Detection. The instructions to run my code are on the GitHub page. Feel free to ask me any questions! Special Thanks This project would […]
Week 10: Using my Rules Together – a Success!
Intro Hi everyone! Welcome back to my blog! After three months of hard work, my senior research project has come to an end. In this blog, I will share the pipeline architecture I built for this project, as well as my final results. Pipeline Architecture Using My Rules Together – Results! What I’ve Learned Future […]
Week 9: Training & Testing a Fasttext Humor Detection Model
Intro Hi everyone! Welcome back to my blog! This week, I’ve been able to train a Fasttext model to classify humor! In this blog post, I will talk about the following: The Power of Fasttext How Does Fasttext Work? Preprocessing Training Fasttext Test Results Hyperparameter Tuning Next Steps The Power of Fasttext Fasttext is […]
Week 8: Training & Testing a Humor Detection Machine Learning Model
Intro Hi everyone! Welcome back to my blog! This week has been quite eventful. I’ve been able to train a machine learning model using a Random Forest Classifier to classify humor! In this blog post, I will talk about the following: Preprocessing Why Random Forest? How Does Random Forest Work? Challenges Training Random Forest Test […]
Week 7: Improving Rule 1 (Contain_Funny) and Exploring Text Vectorization
Intro Hi everyone! Welcome back to my blog! In this blog post, I will talk about the following: Improving Humor Phrases Rules Results Text Vectorization One-hot Encoding Bag of Words (Count Vectorizer) Term Frequency – Inverse Document Frequency (TF-IDF) Next Steps Improving Humor Phrases While examining the results of rule 1 (contain_funny), which classified […]
Week 6: Testing my Rule-Based Approach for Humor Detection
Intro Hi everyone! Welcome back to my blog! This week has been quite eventful. I’ve been able to find a large humor dataset and test my five rules! In this blog post, I will talk about the following: Finding a Dataset What Makes a Word “Funny” or “Not Funny?” Preprocessing Test Results Test Metrics: Accuracy, […]
Week 5: Researching Humor & Developing a Rules-Based Humor Detection Model
Intro Hi everyone! Welcome back to my blog! I hope everyone is well and staying safe during these difficult times. Just to recap a bit, during the first two weeks, I followed various tutorials online and got acquainted with the applications I would be using for my project, which include Python (using VSCode), Docker, Flask, […]
Weeks 3 & 4: Improving my Project’s Framework
Intro Hi everyone! Welcome back to my blog! Week 3 and 4 of my senior project have been quite eventful. I know that my blogs (including this one) have been quite technical and long (sorry!). If you’re confused at all, feel free to ask any questions you have in the comments below. In my last […]
Week 2: Creating the Architecture for My Project
Intro Hi everyone. Welcome back to my blog! This week, Clarabridge instituted a work from home policy, so I’ve been working in the comfort of my bedroom. Although working from home has been nice, I really miss buzzing into Clarabridge’s spacious headquarters, munching on their free snacks, and drinking as much soda as I want. […]
Week 1: Getting My Feet Wet
Intro Hi everyone! Welcome back to my blog! My first week interning at Clarabridge has been quite eventful. I’ve had the opportunity to meet my mentor, Mr. Gade, as well as so many other awesome co-workers. I’ve also had the chance to learn about the fascinating projects they are working on and get started on […]
Introduction: Humor Detection
Hi everyone, welcome to my first blog post! I’m Ethan, a senior at BASIS Independent McLean. For my senior research project, I will be researching AI at Clarabridge, a leading software company in advanced text analytics. The purpose of my project is to build a humor detection model. Identifying humor adds another skill to Clarabridge’s […]