DUSQ (formerly InnerGize)
ML Engineer Intern
- Cut end-to-end inference latency by 88.9% (7.7 s to 0.86 s) and raised throughput 9x via parallel and async execution.
- Built a 27-keypoint placement API with visibility-aware training for robust side-view guidance at sub-second latency.
- Optimised preprocessing to 26 ms mean latency (64% lower; P99 37 ms, 66% lower), then packaged the low-latency pipelines for production.