blue white and yellow abstract painting

ChiaraBi

Software Developer | Data Science MSc graduate

EU


Skills

Machine Learning

65%

Python

85%

Git

95%

Languages

Italian

English

Spanish



Projects

Sentiment Analysis, Emotion Recognition and Hate Speech Detection on English tweets
Jul 2022

Natural Language Processing tasks have been widely studied in the context of big corpora, but, when applied to social media content, they have been proven to be harder. This is mainly due to the following reasons:

  • Social media have a high-paced, conversational and idiosyncratic nature;
  • There might be restriction in the number of characters allowed (this is the case on Twitter);
  • Short texts contains a limited amount of contextual cues;

The lack of a unified evaluation framework makes it hard to compare different models.

In this project I used BERTweet, a large-scale language model pretrained on English tweets, and I evaluated its performances on three different NLP tasks:

  • Emotion Recognition, which consists in recognizing the emotion evoked in a text (a tweet in this case);
  • Hate Speech Detection, which consists in understanding whether a tweet is hateful or not towards certain target communities;
  • Sentiment Analysis, i.e. the task of recognizing if the content of a text is positive, negative or neutral.

Automatic Image Colorization: a comparative overview
Aug 2021 - Jan 2022

The colorization of greyscale images is an ill-posed problem that was approached in different ways in literature. This project provides a comparative analysis concerning five pre-trained colorization models and a cartoonization-based baseline of our invention. The performances are assessed through both quantitative and qualitative metrics, with a final evaluation of the results with respect to image filtering.

Colorization comparison on two images from ImageNet Church (first row) and Bird Species Flamingo (second row)

Universal Adversarial Perturbation starring Frank-Wolfe
Jul 2021

The main goal of the project is to analyze three different Stochastic Gradient Free Frank-Wolfe algorithms for producing Universal Adversarial Perturbations. These perturbations are designed to fool advanced Convolutional Neural Networks, such as LeNet-5 and AlexNet, on the classification task performed over the MNIST dataset.

MNIST dataset + universal adversarial perturbation = perturbated sample

Work Experience

Software Developer - CERN
Feb 2018 - Jan 2020

I worked on Zenodo, a digital repository for scientific publications, and on Invenio, an open-source framework for large-scale digital repositories.

Education

Data Science
2020 - 2024

Master's Degree


Computer Science
2012 - 2015

Bachelor's Degree