This repository provides a simplified implementation of a variation of OpenAI's CLIP, with Gensim Doc2Vec / Hugging Face DistilBERT and Facebook DINOv2 / Google EfficientNet as the text and image encoders, respectively.
The repository also includes notebooks for training the models to fulfill the following tasks:
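CLIP learns a joint embedding space by contrasting matched image-caption pairs against mismatched ones within a batch. Below is a minimal NumPy sketch of the symmetric contrastive (InfoNCE) objective; the array shapes and temperature value are illustrative assumptions, not taken from this repository's code:

```python
import numpy as np

def clip_contrastive_loss(image_emb, text_emb, temperature=0.07):
    """Symmetric InfoNCE loss over a batch of paired embeddings.

    image_emb, text_emb: (batch, dim) arrays; row i of each is a matched pair.
    """
    # L2-normalize so dot products become cosine similarities
    image_emb = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    text_emb = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)

    logits = image_emb @ text_emb.T / temperature  # (batch, batch) similarity matrix

    def cross_entropy(logits, targets):
        # numerically stable log-softmax, then pick the target-class log-prob
        logits = logits - logits.max(axis=1, keepdims=True)
        log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
        return -log_probs[np.arange(len(targets)), targets].mean()

    targets = np.arange(logits.shape[0])  # the matched pair sits on the diagonal
    # average the image->text and text->image directions
    return (cross_entropy(logits, targets) + cross_entropy(logits.T, targets)) / 2
```

The loss is small when each image embedding is closest to its own caption's embedding and large otherwise.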
Objective 1: Read the sarcasm! Is this meme sarcastic or not? -- A classifier
Objective 2: Build a ranking system for MEMES
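Objective 2 can be served by the same embedding space: candidate memes are ranked by cosine similarity between a query caption's embedding and each meme image's embedding. A hedged sketch (the embeddings here are stand-ins for actual encoder outputs):

```python
import numpy as np

def rank_by_similarity(query_emb, image_embs):
    """Return meme indices sorted from most to least similar to the query.

    query_emb: (dim,) text embedding; image_embs: (n_memes, dim) image embeddings.
    """
    query_emb = query_emb / np.linalg.norm(query_emb)
    image_embs = image_embs / np.linalg.norm(image_embs, axis=1, keepdims=True)
    scores = image_embs @ query_emb  # cosine similarity of each meme to the query
    return np.argsort(-scores)       # indices in descending order of similarity
```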
Here is a flow chart drawn by an awesome artist. (ME.)
We used the Memotion 7k dataset as our training and testing dataset.
Dataset Class: Datasets/MemeDataset.py
Features: Images and Captions.
Dataset characterization
- Dataset size: 7000 (6931 after cleaning); train-test split = 8:2
- Text format: CSV file
- Image format: JPG
- Testing set size: 2000
- Text preprocessing: strip all special characters, watermarks, dates, and stop words; lemmatization
- Image preprocessing: remove corrupted files, resize
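The text-preprocessing steps above can be sketched with the standard library alone. The stop-word list and cleaning pattern below are illustrative stand-ins (a real pipeline would typically use NLTK's stop-word corpus and WordNetLemmatizer for the lemmatization step, which is omitted here):

```python
import re

# Illustrative subset of English stop words, not the full list used in the repo
STOP_WORDS = {"a", "an", "the", "is", "are", "was", "were", "in", "on", "of", "to"}

def clean_caption(text):
    """Strip special characters, digits (e.g. dates), and stop words from a caption."""
    text = text.lower()
    text = re.sub(r"[^a-z\s]", " ", text)  # drop punctuation, digits, symbols
    tokens = [t for t in text.split() if t not in STOP_WORDS]
    return " ".join(tokens)
```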
Three classes are included in MemeDatasets.py.
The implementation of the CLIP model is in the "custom_models" folder.
The trainer module "CLIP_Classifier2.py" and the model's training notebook are in the root folder.
The Datasets folder includes a sample of images and texts, and the Dataset class in CLIP_Datasets.py.
Results: accuracy 0.7436, AUROC 0.7969, F1 0.6392
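For context, accuracy and F1 are standard binary-classification metrics computed from predicted versus true labels; a minimal plain-Python sketch is below (AUROC requires sweeping score thresholds and is omitted). The labels here are toy values, not the repository's data:

```python
def binary_metrics(y_true, y_pred):
    """Accuracy and F1 for binary labels (1 = sarcastic), plain Python."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    accuracy = sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return accuracy, f1
```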