
Hi, I'm Fandi Yi (ʘ‿ʘ)╯

Senior ML Engineer | LLM Specialist | Builder

I turn ML ideas into production-ready systems — LLMs, agents, prediction models, and everything in between.

... and occasionally lose track of time exploring new frameworks at 2am.

6+ Years Experience
15+ Projects
1 Publication
20+ Tech Stack
01 // Interactive

Talk to Digital Me (Beta)

Please note: this is a simulation of an LLM inference engine that I made for fun, not a real LLM. ฅ(^•ﻌ•^ฅ)

Suggested Questions

fandi-inference-v2.0
[*] FANDI_INFERENCE_ENGINE V2.0 initialized...
[*] Model loaded fandi-fancy-embeddings-v2 | Ready for inference
>> Hello! Ask me anything about Fandi. Try the suggestions below or type your own question.

02 // Introduction

About Me

👋

Hello!

I am a builder and problem solver with over 6 years of experience in machine learning. I am skilled in multimodal LLMs, agentic frameworks, and RAG, with end-to-end expertise from model design to deployment and optimization. At RBC, I lead a team developing an LLM-driven agentic chatbot that streamlines enterprise operations, with a projected 30% reduction in operational time for the entire enterprise change management team. Outside of work, I co-authored a MODELS 2023 paper on LLM taxonomy generation and contribute to Sherpa, an open-source LLM agentic framework.

Languages

English: Fluent
Chinese: Native
French: Beginner

Technical Skills

Languages & Tools

Python (8+ yrs), Java, SQL, JavaScript, MATLAB, C

Frameworks

PyTorch, TensorFlow, HuggingFace, LangGraph, LangChain, LangFuse

ML Focus

LLM, RAG, Reinforcement Learning, Model Quantization, A/B Testing

Cloud & Infra

GCP (Vertex AI, BigQuery), Azure, Docker, OCP, Kafka, Jenkins, CI/CD

Data Systems

ElasticSearch, Neo4j, MongoDB, MySQL

Specialties

Multimodal Agents, Conversational AI, Model Deployment, Performance Optimization

Skill Distribution

ML/AI: 95%
Backend: 85%
Cloud: 80%
Frontend: 60%
03 // Career Path

Work & Education

Work Experience

01

Senior Machine Learning Engineer

RBC

2024/10 - Current
  • Won the 2025 RBC employee performance award.
  • Lead a 5-member team that developed and deployed an agentic RAG-based chatbot using FastMCP, LangGraph, Elasticsearch (vector DB), and self-hosted LangFuse (monitoring). The chatbot is used by 100+ employees on the RBC change management team, with a projected 30% reduction in operational time by the end of 2026.
  • Deployed and optimized 4 enterprise-level models on on-prem OCP. Designed the models' architecture to optimize performance, reducing the alert triage model's memory consumption by 80% by refactoring its script.
  • Delivered a real-time incident alerting system using Java Spring Boot, Python FastAPI, and Kafka. It is used by two RBC business units and helped reduce their incident processing time by 30%.
02

Machine Learning Research Assistant (Part-time)

McGill University

2022/12 - 2023/05
  • Worked with Prof. Emine Sarigollu's team to explore customer interactions around circular economy (CE) topics on Twitter.
  • Scraped 500k+ tweets and built NLP pipelines for emotion, sentiment, and topic analytics using BERT-based classifiers fine-tuned in PyTorch with CUDA acceleration.
03

NLP & ML Engineering

CIBC

2022/01 - 2024/10
  • Developed an entity linking API using scikit-learn and spaCy for daily contract analytics, used to match entities in contract documents against CIBC's internal supplier names.
  • Created a hierarchical structure for contracts' parent-child relationships, helping identify each contract's properties.
  • Enabled large-scale contract document search (>100k files) by building an automated metadata extraction pipeline that cleans, extracts, and updates data from unstructured text files using Python and SQLite.
04

ML Engineering Intern

Alibaba Group

2021/10 - 2021/12
  • Worked on the Alibaba Brain team to analyze and develop a product for an Objectives and Key Results (OKR) project management tool.
  • Partnered with product managers and developers from other teams to define indicators measuring the synergy effect among business units, using OKR data (10 million+ records) from the AlibabaCloud database.
  • Applied Neo4j to build a graph database for OKR data to visualize collaboration among departments.
05

Data Scientist Intern

CIBC

2021/05 - 2021/09
  • Developed an end-to-end generative system that automates supplier profile slide generation by querying Microsoft Access databases with VBA.
  • Created a customized named entity recognition (NER) model to extract key information from contract documents.
06

Data Scientist Coop

Allianz SE Insurance

2020/11 - 2021/05
  • Built a semi-supervised model to predict Canadian small businesses' intent to buy insurance products, and an LSTM model to forecast Google Trends for insurance products, providing insights for the marketing team.
  • Deployed a real-time ML application for social media analytics with Google Cloud Functions and Google Data Studio.

Education

🎓

Master of Management in Analytics

McGill University

2020/08 - 2021/12
Entrance Scholarship | GPA: 3.89/4.00

Key Courses

Deep Learning, Database Distrib. Syst., NLP, Large Language Model, A/B Testing
🎓

Bachelor of Electrical Engineering

McGill University

2016/09 - 2020/05
GPA: 3.35/4.00, Final Year: 3.93/4.00

Key Courses

Algorithm Design, Computer Vision, Applied Machine Learning, Numerical Methods
Started Journey: 2016
6+ Years in ML
04 // Portfolio

Project & Research

T_00 | arXiv

MODELS Conference 2023

Published at the MODELS conference as a co-first author: Prompting or Fine-tuning? A Comparative Study of Large Language Models for Taxonomy Construction.

Compared the prediction performance of different LLMs under prompting and fine-tuning across various taxonomy datasets.

2023/02 - 2023/09
T_01 | GitHub

LLM Framework – Sherpa

Joined the Sherpa developer community as a contributor.

Built search refinement and chain-of-action tools for this agentic LLM framework.

2024/05 - Current
T_02 | GitHub

Dream Journal App

Practicing my vibe coding skills by building a dream journal application from scratch and publishing it to the Apple App Store.

Vibe coded the frontend and backend with Cursor IDE, set up the database and edge functions in Supabase, and built and submitted the app with Expo Application Services.

2025/10 - Current

Stock Signal Bot

Created a taxonomy dataset specific to the computing classification system domain.

Deployed a real-time Discord bot on Heroku that sends stock signal alerts based on MACD/RSI analytics.

2023/01 - Present
T_04 | GitHub

Advanced AI Analytics for Airbnb Hosts

Built an application powered by a polynomial regression model to help hosts adjust their prices.

Applied AutoML with MLflow on Databricks to select the best-performing ML model and tune hyperparameters.

Used Docker to containerize the ML models and the application, orchestrated with Kubernetes.

2021/02 - 2021/04
T_05 | GitHub

Continuous Testing And Validation of Jamscript

Worked in Prof. M. Maheswaran's lab to test and validate a programming language for edge-oriented mobile IoT.

Developed a continuous integration pipeline for an open-source programming language: Jamscript (a polyglot language that combines C and JavaScript) with Travis CI.

2019/09 - 2020/05
T_06 | GitHub

Face Recognition and Tagging

Developed a face recognition system from scratch in a team of 5 using Python and scikit-learn.

Compared face recognition performance between PCA and bag-of-words methods.

2019/09 - 2019/12
T_07 | McGill University

McGill Rocket Club

Aero-Structure and Propulsion Sub-Team Member.

Participated in designing different parts of the rocket model; won the Spaceport America Cup 2018 championship.

2018/10 - 2020/04
05 // Connect

Get In Touch

Have a project in mind or want to collaborate? Let's talk!

Contact Details

Email
fandiyi2333 [at] gmail • com
LinkedIn
Connect with me
GitHub
View my code
Location
Toronto, ON M9A 0C9
