Hello, I'm Cat.

I'm a Computer Scientist. I focus on building intelligence.

My interests are:

  • Reinforcement Learning
  • Deep Learning
  • Mathematics
  • Large-scale Software Engineering
  • Distributed Systems
  • and more...

My experiences

My Experiences

Here's a timeline of my professional journey and the projects I've worked on.

May 2025 - Aug 2025

Cohere

Member of Technical Staff

Cohere

Designed and deployed scalable data pipelines using PySpark on GCP Dataproc to ingest, preprocess, and store millions of multi-domain documents daily for LLM training

Developed and benchmarked a high-precision RAG framework, leveraging retrieval outputs to contextualize and curate long-context training datasets

Trained and conducted ablation studies on LLMs across data and architecture configurations using distributed GPU clusters on CoreWeave with Kubernetes and Grafana

May 2024 - May 2025

University of Cincinnati

Undergraduate Researcher

University of Cincinnati

Improved Patch-based DDPM framework by incorporating conditional diffusion, achieving a 15% boost in SSIM for PET image reconstruction

Developed two meta-learning algorithms for image classification: DG-SharpMAML and AS-MAML, incorporating gradient-matching and sharpness-aware minimization

Demonstrated performance through ablation studies and theoretical analyses, beating state-of-the-art by 2% on accuracy benchmarks

Aug 2023 - Dec 2023

Kinetic Vision

Machine Learning Intern

Kinetic Vision

Enhanced Nvidia's NeMo ASR into a speaker diarization pipeline, achieving word error rate below 5% on videos over 1 hour in length

Built a Streamlit app for video transcription with OpenAI Whisper, integrating Longformer and BART for summarization

Proposed robust training pipeline for YOLOv8 to detect pharmaceutical products using 21,500 synthetic images, achieving 98% mAP@50 score

Jan 2023 - May 2023

FPT Software

AI Engineer Intern

FPT Software

Optimized PaddleOCR for information extraction from 40,000+ multilingual invoices for SAP clients

Automated data labeling and information extraction with Python, reducing processing time per batch by 16%

Jan 2022 - Dec 2022

Digital Scholarship Center

Data Science Intern

Digital Scholarship Center

Standardized 31 datasets and integrated 106K documents via New York Times API to create 3 new datasets

Streamlined dataset mapping with Python and JavaScript on AWS OpenSearch, reducing upload time by 12%

Categorized 256 application essays using Linear SVC, Naive Bayes, and KNN, achieving 88% accuracy

Check out my projects

My Projects

Here are some of my recent projects. I'm always working on something new, so check back often!

Fable Dyslexia-friendly reading app Chandra Project Agentic robots simulation BardTales Audiobook generator DESigmoid Image enhancement with evolutionary algorithm CHEER-Ekman Embodied emotion research SpacePal RAG for NASA documents
View All My Projects

My publications

My Publications

Research publications and academic contributions.

Anatomy of a Feeling

Mohammad Saim, Phan Anh Duong, Cat Luong, Aniket Bhanderi, Tianyu Jiang

EMNLP 2025

Narrating Embodied Emotions via Large Vision-Language Models

CHEER-Ekman

Phan Anh Duong, Cat Luong, Divyesh Bommana, Tianyu Jiang

ACL 2025

Fine-grained Embodied Emotion Classification

AS-MAML

Usman Anjum, Chris Stockman, Cat Luong, Felix Zhan

Springer Nature 2025

Using adaptive learning and momentum to improve generalization

DGS-MAML

Usman Anjum, Chris Stockman, Cat Luong, Justin Zhan

ArXiv preprint 2025

Improve Learning in Meta-Learning Algorithms

Σ Cat

© 2025 Cat Luong

LinkedIn GitHub Email Google Scholar