AI/ML Systems
ML Inference Optimization
Designing efficient scheduling strategies for (distributed) inference of large AI models across heterogeneous compute clusters.
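Purely as an illustrative toy for the kind of problem this topic covers (the function name, request costs, and device speeds below are invented, not part of any specific project): a greedy earliest-finish-time scheduler that assigns inference requests to heterogeneous devices, placing each request on the device projected to finish it soonest.

```python
import heapq

def schedule_requests(request_costs, device_speeds):
    """Greedy earliest-finish-time scheduling sketch: assign each
    inference request (cost in FLOPs) to the device that would finish
    it soonest, accounting for work already queued on that device.
    device_speeds are in FLOPs/sec."""
    # Min-heap of (projected finish time, device index).
    heap = [(0.0, i) for i in range(len(device_speeds))]
    heapq.heapify(heap)
    assignment = []
    # Longest-processing-time-first ordering tightens the greedy bound.
    for req_id, cost in sorted(enumerate(request_costs),
                               key=lambda x: -x[1]):
        finish, dev = heapq.heappop(heap)
        finish += cost / device_speeds[dev]
        assignment.append((req_id, dev))
        heapq.heappush(heap, (finish, dev))
    makespan = max(t for t, _ in heap)
    return dict(assignment), makespan
```

For example, four requests of cost [4, 4, 2, 2] on devices with speeds [2, 1] yield a makespan of 4.0, with the slower device taking one large request and the faster device the rest.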
AI/ML Systems
Developing optimal partitioning strategies for edge inference of transformer models across heterogeneous compute devices.
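A minimal sketch of one formulation of this problem (all names and numbers invented): partition a transformer's layers into contiguous pipeline stages, one per device in a fixed order, and use dynamic programming to minimize the slowest stage's compute time, i.e. the pipeline bottleneck. Communication costs are omitted for brevity.

```python
import math

def partition_layers(layer_costs, device_speeds):
    """DP sketch over contiguous layer ranges: assign the first i layers
    to the first j devices, minimizing the bottleneck stage time.
    Assumes len(layer_costs) >= len(device_speeds); every device gets
    at least one layer. Returns (bottleneck_time, cut_points)."""
    n, m = len(layer_costs), len(device_speeds)
    prefix = [0.0]
    for c in layer_costs:
        prefix.append(prefix[-1] + c)
    best = [[math.inf] * (n + 1) for _ in range(m + 1)]
    best[0][0] = 0.0
    cut = [[-1] * (n + 1) for _ in range(m + 1)]
    for j in range(1, m + 1):
        for i in range(j, n + 1):
            for k in range(j - 1, i):
                # Layers k..i-1 run on device j-1.
                stage = (prefix[i] - prefix[k]) / device_speeds[j - 1]
                cand = max(best[j - 1][k], stage)
                if cand < best[j][i]:
                    best[j][i], cut[j][i] = cand, k
    # Recover the cut points by walking the table backwards.
    cuts, i = [], n
    for j in range(m, 0, -1):
        cuts.append(cut[j][i])
        i = cut[j][i]
    return best[m][n], cuts[::-1]
```

With layer costs [1, 1, 2, 2] and device speeds [1, 2], the optimal split puts the first two layers on the slow device and the last two on the fast one, for a bottleneck of 2.0.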
Networking
Workload-aware, low-latency transport protocols for distributed compute.
Data Centers
Configuration and resource management strategies for achieving optimal energy-performance tradeoffs in large-scale data center networks.
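One toy instance of an energy-performance tradeoff (a sketch under simplifying assumptions, not a method from any project above): with DVFS, dynamic power scales roughly with f^3, so energy per unit of work scales with f^2, and the lowest frequency that still meets the deadline minimizes energy.

```python
def pick_frequency(freqs, work, deadline):
    """DVFS sketch: since energy per operation grows with frequency
    (roughly f^2 for dynamic power ~ f^3), pick the slowest frequency
    that still finishes `work` ops within `deadline` seconds.
    freqs are in ops/sec; returns None if no setting is feasible."""
    feasible = [f for f in freqs if work / f <= deadline]
    return min(feasible) if feasible else None
```

For example, with available frequencies [1, 2, 4], 4 ops of work, and a 2.5 s deadline, frequency 2 is the energy-minimal feasible choice.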
Cloud Computing
Revenue-maximizing pricing schemes for networked compute nodes serving large-scale AI training and inference jobs in public and private clouds.
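As a stylized single-resource example of the pricing question (linear demand is an assumption made here for illustration only): with demand d(p) = a - b*p and marginal cost c, profit (a - b*p)(p - c) is concave in p, so the profit-maximizing price has a closed form, p* = (a + b*c) / (2b).

```python
def optimal_price(a, b, marginal_cost):
    """Toy pricing sketch: with linear demand d(p) = a - b*p, profit
    (a - b*p) * (p - marginal_cost) is concave in p; setting its
    derivative a - 2*b*p + b*marginal_cost to zero gives the optimum."""
    return (a + b * marginal_cost) / (2 * b)
```

For a = 10, b = 1, c = 2 this gives p* = 6.0: demand is 4 units and profit is 16, which beats any neighboring price (e.g. 15 at p = 5 or p = 7).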
Edge Computing
Optimal job scheduling and resource allocation for hierarchical edge-cloud architectures to minimize latency under energy constraints.
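A minimal sketch of one two-tier placement formulation (job parameters and names invented): each job runs either on the edge node, spending edge energy, or in the cloud with a higher latency; choosing the edge subset that minimizes total latency within the edge energy budget is a 0/1 knapsack over latency savings.

```python
def place_jobs(jobs, energy_budget):
    """0/1 knapsack sketch for edge-cloud placement. Each job is a tuple
    (edge_latency, edge_energy, cloud_latency); cloud execution is
    assumed to cost no edge energy. Picks the edge subset maximizing
    latency saved subject to the integer energy budget, and returns
    the resulting total latency."""
    base = sum(c_lat for _, _, c_lat in jobs)   # everything in the cloud
    # best[b] = max total latency saved using at most b units of energy.
    best = [0] * (energy_budget + 1)
    for e_lat, e_cost, c_lat in jobs:
        saving = c_lat - e_lat
        if saving <= 0:
            continue  # cloud is already at least as fast for this job
        for b in range(energy_budget, e_cost - 1, -1):
            best[b] = max(best[b], best[b - e_cost] + saving)
    return base - best[energy_budget]
```

For jobs [(1, 2, 5), (2, 3, 4), (3, 1, 3)] and an energy budget of 3, only the first job is worth placing on the edge, reducing total latency from 12 to 8.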