Senior data analyst with experience in all stages of the data pipeline. Adept at using various tools and Python libraries (Azure Databricks, PySpark, Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn) to clean and analyse data as well as train and evaluate models.
An avid learner, always keen to work with new technologies, especially those involving complex data analysis, data science and machine learning.
Identified countries most in need of aid by clustering them based on the most recent data from the CIA World Factbook.
An analysis of 20 features for 227 countries, along with an app that predicts GDP per capita.
A comparison of different years of the Economic Freedom Index (EFI) and the Economic Freedom of the World (EFW) dataset.
Imbalanced classification using data on opinions regarding the decision to suspend free travel for under-18s in London.
A multi-class classification project utilising NLP techniques to classify hotel reviews from the popular travel booking site TripAdvisor.
An analysis of abilities, skills and tech skills data from the O*NET database as well as classification of around 500 random LinkedIn job titles.
An analysis of the self-reported salaries of individuals working in the UK publishing industry.