Hi, I'm Mincan πŸ‘‹πŸ»

I'm a software engineer with 4+ years of experience building AI infrastructure at AWS β€” specifically the capacity layer that decides how GPU resources get reserved, allocated, and delivered to ML training and inference workloads at scale.

I build the systems behind products like Capacity Blocks for ML (reserved GPU scheduling) and UltraServers (multi-instance GPU supercomputers connected via high-bandwidth accelerator interconnects for trillion-parameter model training). My work involves distributed workflow orchestration, capacity reservation lifecycle management, and the scheduling primitives that bridge capacity planning with GPU utilization.

Before AWS, I interned at Apple on strategic data infrastructure and did ML research at Boston University Department of Medicine. I studied CS & Math at Boston University and Entertainment Technology at Carnegie Mellon University.

This blog is where I write about GPU scheduling, capacity planning for AI workloads, distributed systems patterns, and the infrastructure that makes large-scale ML possible. Bilingual (δΈ­ζ–‡/English), depending on the topic.

Latest posts
See all posts
Work Experience
  • Aug2022 - Current
    Amazon Web Services
    Software Engineer β€” EC2 AI Infrastructure

    Building state-of-the-art AWS AI infrastructure focusing on GPU capacity:

    • Capacity Blocks for ML β€” Reserved GPU scheduling that guarantees compute availability for training workloads weeks in advance
    • UltraServers β€” Multi-instance GPU supercomputers connected via high-bandwidth accelerator interconnects for trillion-parameter model training
  • May2021 - Dec2021
    Apple
    Software Engineer Intern β€” Strategic Data Solutions

    Backend development in Apple’s Strategic Data Solutions group.

  • Oct2019 - Aug2020
    Boston University Department of Medicine
    Machine Learning Research Assistant β€” Kolachalama Lab

    Developed a deep learning framework for detecting, segmenting, and mapping glomeruli from kidney biopsy images across multiple staining protocols.

Education
  • 2020 – 2022
    Carnegie Mellon University
    Master's β€” Entertainment Technology
  • 2016 – 2020
    Boston University
    B.A. β€” Computer Science & Mathematics
Let's Connect

Interested in compute capacity, AI infrastructure, or distributed systems? Let's chat. Reach out on LinkedIn or drop me an email.