Skip to content
View EwenCheung's full-sized avatar

Highlights

  • Pro

Block or report EwenCheung

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
EwenCheung/README.md

Waving hand About Me

I am Ewen Cheung Yi Wen, a Computer Science undergraduate at the National University of Singapore focused on software engineering, AI systems, and full-stack product engineering. My work sits at the intersection of backend systems, machine learning, LLM infrastructure, agentic workflows, data platforms, and production-grade user-facing applications.

I have built and deployed systems across RAG, text-to-SQL, vLLM inference optimisation, AI weather forecasting, private equity intelligence pipelines, multi-agent orchestration, academic planning systems, and pathfinding agents with CNN-based perception. I am also deepening my ML foundation across computer vision, diffusion models, generative modelling, and hallucination evaluation. I care about systems that are not only intelligent, but also reliable, auditable, secure, measurable, and usable by real people.

Engineering Focus What I Build
Software Engineering Full-stack applications, backend APIs, database schemas, system design, testing workflows
AI / ML Engineering RAG, text-to-SQL, model evaluation, inference optimisation, CNNs, GNNs, Transformers
Product Engineering Responsive interfaces, user workflows, automation tools, end-to-end deployed products
Enterprise AI Secure LLM-to-database workflows, Zero Trust-aligned access, monitoring, observability

Open To

AI Engineering Software Engineering ML Research Agentic AI Full Stack AI Products



Animated AI systems flow from data to retrieval, agents, evaluation, product, and impact

Tech Stack

Languages

Languages

SQL

Frontend

Frontend

Backend & Databases

Backend and Databases

Cloud, DevOps & Tooling

Cloud DevOps Tooling

PyTorch Hugging Face LangChain LangGraph vLLM Langfuse LangSmith


AI / ML Expertise

Domain Proficiency Details
RAG Systems Advanced Hybrid retrieval, lexical/vector search, grounding, document/data pipelines, enterprise query answering
Text-to-SQL Advanced Real-time SQL generation, enterprise data grounding, database access controls, query evaluation
Agentic AI Advanced Tool-using agents, multi-agent orchestration, RBAC, approval-gated execution, workflow monitoring
LLM Infrastructure Advanced vLLM serving, batching, latency optimisation, Langfuse/LangSmith observability
Computer Vision Strong CNN-based tile recognition, 21-class classification, 210K+ augmented samples, constrained model deployment
Forecasting ML Strong GNN encoders, Transformer latent rollouts, observation-to-forecast modelling, meteorology workflows
Deep Learning Foundations Strong Transformer internals, attention mechanisms, encoder-decoder training, optimisation loops, PyTorch implementation
Generative AI Research Developing Diffusion models, hallucination evaluation, generative model reliability, multimodal AI systems
ML Evaluation Strong Evaluation pipelines, model quality measurement, investment analysis consistency, performance benchmarking
Full-Stack AI Products Strong React/Next.js frontends, FastAPI/REST backends, PostgreSQL/Supabase, responsive product workflows

Experience

Animated experience timeline across production AI, weather research, and finance intelligence

Private Equity Quantitative Strategist Intern - GIC

Jun 2026 - Dec 2026

Built an end-to-end private equity intelligence pipeline that transforms deal data into structured investment insights, accelerating underwriting analysis and improving evaluation consistency.

Scope of Work

  • Developed evaluation workflows for private equity investment analysis.
  • Automated repetitive underwriting and deal analysis processes.
  • Generated structured, data-driven insights for investment decision-making.
  • Connected software engineering, data processing, and evaluation consistency in a finance context.

Evaluation Private Equity Data Pipelines Investment Intelligence

ML Research Intern, Meteorology - NEA / Climate Research (CCRS)

May 2026 - Jun 2026

Developed an Aardvark-inspired end-to-end AI weather forecasting pipeline using GNN encoders and Transformer latent rollouts, converting raw meteorological observations into forecast-ready latent states for seconds-to-minutes inference.

Scope of Work

  • Built observation-to-forecast modelling workflows for meteorology.
  • Used GNN encoders and Transformer latent rollouts for forecast-ready representation learning.
  • Reduced forecast initialisation latency from 3-6 hours to 5 minutes.
  • Replaced traditional initialisation bottlenecks with end-to-end AI inference design.

GNNs Transformers Forecasting PyTorch

AI/ML Engineer Intern - V-Key Pte Ltd

May 2025 - Dec 2025

Engineered production-grade RAG and text-to-SQL systems for enterprise data analysis, with hybrid retrieval, real-time SQL generation, inference optimisation, and Zero Trust-aligned LLM-to-database access controls.

Scope of Work

  • Achieved 90%+ accuracy on enterprise information queries.
  • Built hybrid retrieval pipelines with lexical/vector search and enterprise grounding.
  • Reduced LLM response latency by 30% through vLLM inference optimisation, batching, and serving workflow improvements.
  • Improved auditability and access safety for enterprise data workflows.

RAG Text-to-SQL vLLM Zero Trust Langfuse


Featured Projects

One-Man Business Multi-Agent Orchestrator - Enterprise Agentic AI System

A supervisor-driven, role-aware multi-agent system for solo business operations. It combines a FastAPI backend, Next.js dashboard, Supabase/PostgreSQL data layer, Telegram/web messaging, LangGraph orchestration, retrieval, policy grounding, memory, external research, risk checks, and human approval gates.

Category Details
Stack Python, FastAPI, Next.js, React, Supabase, PostgreSQL, LangGraph, Docker, Telegram Webhooks
Scale 20+ relational tables, owner/stakeholder dashboards, multi-channel messaging, seeded business data, tests and evaluation workflows
Performance Supervisor pipeline delegates work to retrieval, policy, research, memory, and reply agents instead of relying on one unconstrained LLM pass
Security RBAC, role-aware tool/data access, policy grounding, risk evaluation, approval-gated execution, human-in-the-loop controls
Impact Won Best Project Award with 24/25; demonstrates enterprise-grade agent harness engineering and operational AI system design
Repository github.com/EwenCheung/One-Man-Business-Multi-Agent-Orchestrator-System

This is my strongest project because it shows production-style software engineering around AI: orchestration, permissions, persistence, evaluation, UI, integrations, deployment, and safety boundaries.

AI-DOP - Aardvark-Inspired AI Weather Forecasting Pipeline

An AI weather forecasting research project inspired by Aardvark-style observation-to-forecast modelling. The repository includes encoder, processor, decoder, evaluation, end-to-end fine-tuning, WeatherBench-style evaluation, PBS training jobs, notebooks, and generated research outputs.

Category Details
Stack Python, PyTorch, Transformers, NumPy, pandas, xarray, Matplotlib, Jupyter, PBS, MLflow
Scale Encoder/processor/decoder training workflows, cached tensor pipelines, station evaluation, baseline preparation, and forecast evaluation scripts
Performance Targets seconds-to-minutes AI inference by replacing slow forecast initialisation stages with learned observation-to-forecast components
Security Local research pipeline with environment-based paths, reproducible scripts, and separated training/evaluation entrypoints
Impact Bridges ML research and climate/weather infrastructure; directly aligns with NEA / CCRS meteorology work
Repository github.com/EwenCheung/AI-DOP

This is worth featuring because it is not just a concept repo: it contains real model-training, evaluation, notebook, and HPC-style workflow assets.

NUS Planner - Auto + AI-Powered Academic Planning Platform

An AI-powered academic planning platform that generates optimised 4-year study plans in seconds. The system combines automated module data pipelines, prerequisite enforcement, exchange mapping, mutually exclusive module handling, backend validation services, structured planning endpoints, and a responsive React frontend.

Category Details
Stack TypeScript, React, Next.js, Supabase, PostgreSQL, pgvector, RAG, REST APIs
Scale Multi-year academic planning across modules, prerequisites, semester availability, and exchange mappings
Performance Generates optimised 4-year plans in seconds using a constraint-aware planning engine
Security Environment-based configuration, private data support, backend validation
Impact Turns manual academic planning into an automated, explainable, interactive workflow
Repository github.com/EwenCheung/NUSPlanner

This project represents my product engineering style: algorithmic planning, AI assistance, structured data, frontend UX, backend validation, and deployment readiness in one product.

CS2109S Path-Finding Agent with CNN and A* Search

A hybrid AI agent for Grid Universe tasks that combines classical A* search, CNN/RNN perception, and ML-based decoding. The system handles dynamic hazards, interactive elements, power-ups, multiple objectives, and constrained model deployment.

Category Details
Stack Python, PyTorch, CNNs, RNNs, A* Search, Jupyter, custom evaluation scripts
Scale 210K+ augmented image samples, 21-class tile recognition, multi-objective grid environments
Performance 97/100 overall score, Top 5%, 100% tile-recognition accuracy under a 2 MB model constraint
Security Offline local execution, reproducible environment, contained model artefacts
Impact Demonstrates integration of search, planning, perception, ML modelling, and constrained AI deployment
Repository github.com/EwenCheung/CS2109S-Path-Finding-Agent-with-CNN-and-A-Search

This project is strong public proof of my ability to combine classical AI, deep learning, and practical engineering constraints into one working agent system.

Train a Transformer from Zero to Hero - English-Chinese NMT

A from-scratch Transformer implementation for English-Chinese neural machine translation. The project implements embeddings, positional encoding, scaled dot-product attention, multi-head attention, encoder/decoder blocks, training, checkpointing, BLEU evaluation, tokenizer preparation, and interactive translation.

Category Details
Stack Python, PyTorch, custom Transformer modules, SentencePiece-style tokenization, BLEU evaluation
Scale Web-crawled English-Chinese corpus, train/dev/test datasets, tokenizer pipeline, training and translation entrypoints
Performance Tracks validation loss and BLEU score, saves best checkpoints, supports inference through an interactive translation script
Security Local model training/inference pipeline with explicit configs and reproducible entrypoints
Impact Demonstrates first-principles understanding of Transformer internals rather than only using high-level model APIs
Repository github.com/EwenCheung/Train-a-transformer-from-zero-to-hero

This project is worth mentioning because it shows model-level engineering: attention, encoder-decoder architecture, optimization, training loops, and inference mechanics.

SuperConfig - No-Code Agentic AI Configuration Platform

A no-code agentic AI configuration project built for the SimplifyNext x AWS hackathon. The platform focuses on rapid setup of personalised AI agents and workflow configuration for users who need agent capabilities without manually engineering the system from scratch.

Category Details
Stack Python, AWS, agent configuration, workflow automation
Scale Demo-oriented agent setup flow with configurable behaviour and deployment workflow
Performance Designed for rapid configuration and fast agent setup
Security Configuration-driven access patterns and controlled setup flow
Impact Lowers the barrier to deploying personalised agentic AI workflows
Repository github.com/EwenCheung/SuperConfig

This project shows my interest in turning agentic AI from a research idea into a configurable product experience.

Dual Defence v3 - Python Tower Defense Game

A comprehensive Python/Pygame tower-defense game featuring campaign modes, progression systems, user management, custom game mechanics, and replayable gameplay loops.

Category Details
Stack Python, Pygame, custom game logic
Scale Multiple modes, progression mechanics, user management, custom gameplay systems
Performance Local game loop with responsive interaction and state-driven mechanics
Security Local user state handling and contained runtime
Impact Demonstrates software design, game state management, event systems, and end-to-end project completion
Repository github.com/EwenCheung/Dual-Defence-v3-latest

This project demonstrates my foundation in software construction before moving deeper into AI and enterprise systems.


๐ŸŽ“ Education & Achievements

๐Ÿซ Education

NUS

B.Sc. (Honours), Computer Science โ€” Aug 2025 โ€“ May 2028

Focus Area: Artificial Intelligence & Algorithms

NTU

Data Science & Artificial Intelligence โ€” Aug 2024 โ€“ Jul 2025

Completed Year 1 ยท Credits transferred to NUS (entered as Year 2)

๐Ÿ† Achievements

๐Ÿฅˆ 2x First Runner-Up โ€” National Hackathons

๐Ÿ… 2x Finalist โ€” National Hackathons

๐ŸŽ–๏ธ Google AI CTO Bootcamp โ€” Selected Representative

๐ŸŽฏ Top 1-5% โ€” NUS CS2109S (Score: 97/100)

๐Ÿ† Best Project Award โ€” Multi-Agent Business Orchestrator (24/25)

๐Ÿซ NUSSU CommIT โ€” Training Cell Head


GitHub Analytics

GitHub Stats GitHub Streak
Top Languages

Contribution Activity

Contribution Activity Graph

Contribution Snake

GitHub contribution snake animation

Current Focus

Animated current focus radar for ML, agentic AI, finance AI, and hallucination evaluation

Learning:
  - Advanced LLM evaluation and agent reliability
  - Secure AI systems for enterprise data workflows
  - Forecasting models with GNNs and Transformer latent rollouts
  - Computer vision beyond classification: detection, segmentation, and representation learning
  - Diffusion models and generative modelling fundamentals

Building:
  - Private equity intelligence and evaluation pipelines at GIC
  - AI weather forecasting pipelines with NEA / CCRS
  - Full-stack AI products with reliable backend infrastructure

Exploring:
  - Applied AI for finance and investment intelligence
  - Agentic AI orchestration and tool-use reliability
  - Hallucination detection, grounding, and evaluation
  - Machine learning systems for CV, generative AI, and forecasting
  - RAG quality measurement and text-to-SQL auditability

Open To:
  - AI / ML engineering roles
  - Software engineering internships
  - Applied ML research collaborations
  - Full-stack AI product projects

Connect

Gmail LinkedIn GitHub Portfolio


๐Ÿ’ก "Coding is the process of building things from 0 to visible."

Pinned Loading

  1. Dual-Defence-v3-latest Dual-Defence-v3-latest Public

    ๐ŸŽฎ Comprehensive tower defense game featuring Stick of War campaign mode and Bokemon vs Stick survival mode. Built with Python and Pygame, showcasing complete user management, progression mechanics,โ€ฆ

    Python 3

  2. PathForger-PSA-Hackathon PathForger-PSA-Hackathon Public

    Demo Video with Explanation on this Project

    TypeScript 2 2

  3. SC2006-InternLink SC2006-InternLink Public

    JavaScript 2 1

  4. SuperConfig SuperConfig Public

    Video Demo with how to Setup your Personalised Agent

    Python 3 2

  5. CS2109S-Path-Finding-Agent-with-CNN-and-A-Search CS2109S-Path-Finding-Agent-with-CNN-and-A-Search Public

    Run the demo.ipynb or check the video link.

    Jupyter Notebook 1

  6. NUSPlanner NUSPlanner Public

    Try the live demo for a quick preview. โญ โ€” Star the project if you find it useful. โ€” For full AI features and private data support, clone the repo, configure your .env, and run the data seeding scrโ€ฆ

    TypeScript 4 1