Machine Learning Engineer &
Windows Application Developer
Specializing in Speech Synthesis, Transformers, and C++ software development.
About Me
I started my journey with Visual Basic and C# as a child, moving to C++ in 2013. My early projects involved video game modding (2017-2019) using C++, Qt, and WinAPI.
In 2020, I pivoted to Machine Learning, driven by a passion for speech synthesis. I went from zero Python experience to becoming a maintainer of TensorFlowTTS in 2021, where I contributed the first open-source TTS model capable of 44.1 kHz audio output.
I briefly attended university (2019-2021) but dropped out to focus entirely on ML research and development. I built TensorVox, a C++ ML TTS application, and freelanced training and modifying models. In 2023, I joined a startup and began working in depth on model development.
Today, I focus on Transformers for GenAI (Speech, Image, Text). I fine-tuned GPT-J-6B on TPUs in 2021, and I hold a ROCm Star certificate for my work with AMD hardware.
"I am not designed to come second or third. I am designed to win."
Buenos Aires, Argentina
Technical Arsenal
Core & Systems
Machine Learning & AI
Hardware & Acceleration
Selected Projects
MQGAN / MusicLSTM
Experimental audio tokenization using a spectrogram codec. Implemented a ResNet compressor and U-Net refiner for high-quality reconstruction.
WinDiffusion
A lightweight C++/Qt frontend for Stable Diffusion. No Python dependencies required for the end-user. Supports txt2img, img2img, and inpainting.
Tacotron 2 Conv Attention
A novel approach to solving attention stability issues in Tacotron 2 using Convolutional Attention Consistency.
TensorVox
High-performance local TTS application written in C++/Qt. Supports PyTorch and TensorFlow models. Used extensively for my freelance deliverables.
ZMapCharter
Statistical map charting tool built with C++ and SFML. Supports custom shapefiles and CSV/XLSX imports.
ZDEditorRS
Modern C++ rewrite of the database modding tool for SuperPower 2. Became the community standard for modding.
Open Source Contributions
44.1 kHz Pretrained Model for TensorFlowTTS
Trained and released the first 44.1 kHz open-source TTS model for the repo, introducing Multi-Band MelGAN with a HiFi-GAN discriminator.
C++ Support for TensorFlowTTS
Implemented C++ support for exporting and using TTS models via the TensorFlow SavedModel C API.