+31620794515
Amsterdam
CV

Gergely Papp

Machine Learning Engineer

Expert in deep learning, computer vision and NLP.

Technical Skills

Languages

Python
SQL
JavaScript
Java
C#

Libraries & Frameworks

PyTorch
TensorFlow
Keras
Huggingface
Scikit-Learn
Pandas
NumPy
OpenCV
Django
Flask
NodeJS

DevOps & Tools

Docker
Git
Cloud
Azure
Databricks
REST API
SLURM
Neptune.ai
Wandb
Jenkins
CI/CD

I deliver end-to-end ML projects in computer vision and NLP, and my work is backed by publications at top conferences. Grounded in ML theory and practice, I am also actively expanding my knowledge of industry-standard tools and platforms to improve implementation efficiency and deployment.

Professional Experience

Deep Learning Research Engineer

Alfréd Rényi Institute of Mathematics

2020 - 2024

Teaching Assistant

University of Amsterdam

2024

Machine Learning Consultant

Asura Technologies Ltd.

2020 - 2024

Machine Learning Engineer

Asura Technologies Ltd.

2018 - 2020

Risk Analyst (AI Team)

Morgan Stanley

2017 - 2018

Education

Master of Artificial Intelligence

University of Amsterdam

Grade: 8.1, Cum Laude

Bachelor of Computer Science

University of Manchester

Grade: Upper Second-Class

Projects

Prisma (Present)

Extracting psychological data, resilience, and player communication statistics from e-sport footage for further analysis.

Python, Huggingface, Docker, Pygame, Databricks

Linux Bash Agent (Present)

Developing a natural language interface for terminal operations that can execute complex tasks through prompting, with optimization for deployment on resource-constrained devices.

Python, Azure, Huggingface

RAG (2024)

Participated in a Retrieval-Augmented Generation project involving vector databases, knowledge graphs, and text generation with LLMs.

Python, Huggingface, PyTorch

MSc Thesis (2024)

Investigated Vision Transformers' ability to generalize across object properties (shape, texture, color, count) on CLEVR-4.

Python, Huggingface

Stitch-BERT (2023)

Analyzed how NLP transformers fine-tuned for different languages and tasks relate geometrically and functionally, revealing potential for cross-task insights.

PyTorch, Python

Gaming Bot (2023)

Developed a rule-based AI in NodeJS that automates gameplay in a browser game, handling attack timing, reactions to reports, and logging to an HTML dashboard.

NodeJS, HTML, JavaScript

Energy Consumption Prediction (2023)

Developed a time series forecasting model to predict energy consumption based on historical data through collection, cleaning, preprocessing, and model training.

Python, PyTorch, Docker

Self-Supervised Learning (2022)

Explored innovative self-supervised image classification methods competing with state-of-the-art approaches, achieving promising results on smaller datasets.

Python, PyTorch, Wandb, Docker

Image Interpretation of CNNs (2021)

Visualized the role of neurons in CNNs using Lucid and GANs, revealing what images best represent specific classes in a CelebA-trained classifier.

Python, Lucid, TensorFlow

MRZ Extraction (2020)

Developed a system to extract Machine Readable Zones from passport images through comprehensive data collection, preprocessing, and model training.

Python, TensorFlow, OpenCV, Tesseract, NumPy

Wheel Counter (2020)

Developed a real-time system to count wheels through comprehensive data collection, preprocessing, and model training.

Python, TensorFlow, Keras, OpenCV, Neptune.ai, Jenkins, CI/CD

Car + License Plate Detection (2019)

Implemented a real-time YOLO-based system for detecting vehicles and license plates, overcoming challenges with fish-eye camera distortion through specialized preprocessing.

Python, TensorFlow, Keras, SQL, OpenCV, REST API, Neptune.ai, Jenkins, CI/CD

People Counter (2019)

Developed a tool to count people entering a shopping mall in real-time with 90%+ accuracy, providing reliable customer traffic estimates.

Python, TensorFlow, Keras, OpenCV, Jenkins

Make & Model Recognition (2019)

Categorized car images into their make and model with 90%+ accuracy through comprehensive data collection, preprocessing, and model training.

Python, TensorFlow, Keras, OpenCV, Jenkins, CI/CD

Watermeter Reader (2019)

Built an end-to-end OCR solution that automatically cleans, rotates, and extracts readings from watermeter images, deployed to the cloud for seamless integration.

Python, TensorFlow, Docker, REST API

AlphaZero (2018)

Reimplemented AlphaZero to compare temporal-difference learning with Monte Carlo methods. The study revealed distinctive in-game strategies learned through reinforcement learning.

Python, TensorFlow

Balance Sheet Reconciliation (2018)

Engineered a tool using hierarchical clustering to reconcile balance sheets between different accounting systems, automating complex financial comparisons.

Python, Pandas, NumPy, Excel, Scikit-learn

Invoice OCR (2018)

Architected an OCR system that translates invoices into structured formats and extracts relevant information using a shallow neural network for efficient data processing.

Python, TensorFlow, Tesseract, Scikit-learn

Structured VAE Latent (2018)

Investigated latent space properties of structured VAEs by forcing a torus shape while reconstructing clock images, advancing understanding of controlled generative models.

Python, TensorFlow, OpenCV

Gun Detection (2018)

Tackled the challenge of detecting small firearms in high-resolution images in real-time, discovering the critical importance of contextual information for accurate detection.

Python, TensorFlow, OpenCV, NumPy

Time Series Forecasting (2017)

Built a predictive model using linear regression to forecast future values based on historical time series data, enabling data-driven decision making.

Python, Scikit-learn, Pandas

Chess Engine (2017)

Designed a Java-based neural chess engine from scratch, without tree search, that achieves entry-level play.

Java

Research & Publications

Neural Networks (2023)

"Mode combinability: Exploring convex combinations of permutation aligned models"

ReScience (2023)

"Reproducibility study of 'Label-Free Explainability for Unsupervised Models'"

NeurIPS (2021)

"Similarity and Matching of Neural Network Representations"

AITP (2021)

"Ordering Subgoals in a Backward Chaining Prover"

Achievements & Distinctions

European Champion in Pool Billiard (2010)

27th place in National Secondary School Mathematics Competition

24th place in National Secondary School Programming Competition

Participated in multiple hackathons

  • Ultrahack (Helsinki, 2016)
  • MLH (Manchester, 2015)
  • MLH (Sheffield, 2016)
  • HackPrague (Prague, 2018)
  • LikeABosch (Budapest, 2019)

Hobbies & Interests

Gym

4 times a week

Piano

8 years experience

Video Games

When time permits

Board Games

Every 2 weeks

Pool Billiard

Champion in 2010