BLOGS.

SYSTEMS JOURNAL
thoughts, tutorials, and insights I share.
AI & DEEP LEARNING

Optimizing NLP Transformer Inference Latency on Intel Hardware

A deep dive into exporting transformer models to ONNX and using Intel® Extension for PyTorch (IPEX) and TensorRT to achieve speed improvements of 35% or more in production workloads.

MAY 15, 2025
6 MIN READ
SYSTEMS

Building Low-Latency Trading Infrastructure with Node.js and Rust

An architectural review of trading microservices designed to process real-time order-book data in under 15 ms. We cover network configuration, memory safety, and thread-safety patterns.

MARCH 28, 2025
8 MIN READ
WEB DEVELOPMENT

Architecting a Resilient Caching Strategy in Next.js App Router

How to manage incremental static regeneration (ISR) alongside dynamic edge caching to scale web platforms that serve high-frequency telemetry logs.

FEBRUARY 10, 2025
5 MIN READ