Welcome to my GitHub profile! I'm a currently working on my PhD in Machine Learning, passionate about Sustainable Machine Learning with a particular focus on Tensor Networks based methods and Uncertainty Estimation in ML. I love solving problems, collaborating on open-source projects, and learning new technologies in my free time.
- π Iβm currently a PhD candidate at Delft University of Technology, working on Probabilistic Tensor Methods.
- π My research interests include: Tensor Networks, Deep Learning, Probabilistic ML, Recommender Systems, Reasoning Models.
- π― Iβm looking to collaborate on any topic connected to my research interests!π
- π« How to reach me: Linkedin, Telegram
- β‘ Fun fact: Learning Deep Learning with CS231n has been an exciting challenge! Batch normalization still keeps me on my toes, but I'm tackling it one layer at a time!
Here are some of the tools and technologies I frequently use:
- π» Languages: Python, SQL, C/C++.
- π§ Frameworks & Libraries: Numpy, Jax, PyTorch, Numba; Pandas, Matplotlib; Scikit-Learn, Hugging Face.
- π Tools & Platforms: Git, Linux, Docker, CI/CD, Cloud, MLOps.
Here are some of my open-source projects:
- Tensor Network Based Feature Learning Model - The Feature Learning (FL) model introduces a learnable Canonical Polyadic Decomposition (CPD) for tensor-product features, enabling efficient learning of hyperparameters alongside model parameters using Alternating Least Squares (ALS). Experiments show that FL consistently trains 3-5 times faster than standard cross-validation approaches while achieving comparable prediction quality.
- Dynamic Collaborative Filtering - TIRecA is a novel collaborative filtering model designed for sequential recommender systems that efficiently updates its parameters with only new data, allowing for incremental addition of users and items. Experiments on four datasets show that TIRecA achieves comparable prediction quality to baseline models while being 10-20 times faster in training.
- Federated Privacy-Preserving Collaborative Filtering for on-device Next App Prediction - We propose a novel SeqMF model for predicting the next app launch in mobile device usage, addressing the challenges of sequential data, distributed user feedback, and data privacy. By modifying matrix factorization and incorporating federated learning, our model ensures privacy protection while processing user data across devices. Evaluations in static and dynamic environments show that SeqMF provides comparable quality to other methods, with a superior privacy-utility trade-off in dynamic environments that simulate real-world usage.
- MEKER: Memory Efficient Knowledge Embedding Representation for Link Prediction and Question Answering - We propose MEKER, a memory-efficient Knowledge Graph (KG) embedding model that represents the KG as a 3rd-order binary tensor and utilizes a generalized CP decomposition to optimize memory usage without backpropagation. Our model achieves state-of-the-art performance on link prediction tasks and KG-based Question Answering while reducing memory costs during training.
- Image Caption Generator - In this rather fun project, we compare encoder-decoder models with different backbones, including CNN, RNN, and Transformers, for the task of image content description generation. Experiments on the Flickr8k dataset and evaluation on COCO show that the best model, DenseNet161 + Transformer, achieves a BLEU@1 score of 62.57, while CNN + LSTM configurations perform faster but lack the quality of Transformer-based models.
Feel free to explore my GitHub repositories for more research-related projects!
- LinkedIn: Linkedin
- Email: Analystalb@gmail.com