ryouze/README.md

ryouze

About Me

I hold an M.A. in Language, Mind, Technology from Adam Mickiewicz University. This interdisciplinary program combines linguistics with computer science, cognitive science, and AI to study language using both theoretical and experimental methods.

During a 3-year paid research internship on the PRODIS project, I built and maintained the full machine learning and data processing stack. This included a first-of-its-kind phoneme-level GPT model for Polish, CI pipelines for survey processing, transcription QA tools, a batch ASR wrapper, and a custom web interface for data collection. I also developed an internal GUI tool to track the progress of data collection.

Outside of research, I build cross-platform tools, games, and backend infrastructure in Python and C++. My projects emphasize automation, reproducibility, and performance. I've used Linux since 2016 and maintain a self-hosted server.


Tech Stack

Languages

Python C++ JavaScript Go

Machine Learning

PyTorch NumPy Pandas

Dev Tools

CMake GitHub Actions Docker

Platforms

Linux macOS


Projects

Highlights include:

  • model – Python CLI pipeline for training a phoneme-level GPT model on Polish IPA, with a custom tokenizer, TOML-based configs, and multithreaded scripts for formant extraction, surprisal prediction, alignment, and stress annotation.
  • survey – Python CI-based tool for cleaning and standardizing survey exports, including translation and structural validation.
  • asr – Python CLI wrapper around Whisper for batch ASR with stereo-to-mono conversion and model/language selection.
  • header-warden – C++ CLI multithreaded static analysis tool that reports missing standard library headers in C++ code.
  • fattura – C++ GUI app for editing a CSV of transcription verification statuses, with autosave.
  • aegyo – C++ GUI app for learning Korean Hangul with full mouse and keyboard input.
  • vroom – In-progress C++ GUI 2D racing game with arcade drift physics, procedurally generated tracks, and waypoint AI.

More: ryouze.net/projects


Contact

Pinned

  1. vroom Public

    Cross-platform 2D racing game with arcade drift physics, procedurally generated tracks, and waypoint AI.

    C++ 2 1

  2. aegyo Public archive

    Cross-platform GUI app for learning Korean Hangul.

    C++ 1

  3. py-template Public

    Barebones Python project template (poetry + pytest).

    Python

  4. header-warden Public archive

    Cross-platform multithreaded CLI tool that identifies and reports missing standard library headers in C++ code.

    C++ 1

  5. yt-table Public archive

    Cross-platform CLI tool for managing YouTube subscriptions locally through a shell-like interface.

    C++

  6. applefetch Public archive

    macOS CLI system information tool, inspired by neofetch.

    C++