Muhammad Uzair — Islamabad, Pakistan

I build intelligent systems — shipping voice AI and agentic LLMs in production, and publishing research on multi-agent reinforcement learning.

IEEE publications
Mitacs
Globalink 2025
B.E.
Software, NUST
3.61
CGPA / 4.00

About

A short introduction.

§ 01

I'm a software engineer by training (NUST, 2025) and an AI engineer by trade. My work sits at the intersection of shipped systems and published research — voice AI pipelines, agentic LLMs, and reinforcement-learning policies for wireless networks.

I've spent time benchmarking PPO in cart-pole and pendulum environments, integrating human advisory signals into RL training loops at McMaster, and training multi-agent DRL frameworks for CR-NOMA IoT — work that turned into two IEEE publications.

Day-to-day, I build: voice agents on VAPI + Deepgram, RAG retrieval layers with pgvector, and LLM orchestration services in FastAPI.

Selected work

Projects, shipped and studied.

§ 02

Peer-reviewed

Publications.

§ 03
  1. [1]

    Multiagent Reinforcement Learning for Joint Spectrum and Energy Optimization in CR-NOMA Enabled Internet of Unmanned Agents

    Saleha Ahmed, Muhammad Uzair, Syed Asad Ullah, et al.

    IEEE Internet of Things Journal, 2025

    A cooperative multi-agent DRL framework for CR-NOMA IoT, where distributed agents jointly learn spectrum access and power-control policies under partial observability.

    PDF DOI →
  2. [2]

    Energy Efficient Uplink Communications for Wireless Powered Networks with EH Diversity: A DRL-driven Strategy

    Saleha Ahmed, Muhammad Uzair, Syed Asad Ullah, et al.

    IEEE International Conference on Communications (ICC), 2025

    DRL-driven transmit-power control for energy-harvesting uplink nodes, evaluated against MRC, SC, and EGC diversity-combining schemes under Rayleigh fading.

Timeline

Experience & education.

§ 04
  1. Industry · Islamabad, PK

    Nov 2025 — present

    AI Engineer — Adept Tech Solutions

    • Built end-to-end voice AI pipelines on VAPI + Deepgram with sub-400ms transcription latency.
    • Engineered a multi-agent LLM orchestration system over FastAPI microservices and PostgreSQL.
    • Shipped a RAG retrieval layer with MPNet (768-dim) embeddings and pgvector.
    • Deployed email intelligence agents with intent detection, processing 500+ messages/day.
  2. Research · Hamilton, ON

    Jun 2025 — Aug 2025

    Research Intern — Mitacs Globalink — McMaster University

    • Fully funded Mitacs internship on guided policy optimization in sequential decision-making.
    • Benchmarked REINFORCE and PPO in Gymnasium Pac-Man; tuned reward shaping and entropy regularization.
    • Integrated human advisory signals via subjective-logic belief modeling — accelerated convergence over baseline PPO.
  3. Research · Islamabad, PK

    Jun 2024 — Sep 2025

    Research Collaborator — Information Processing & Transmission Lab

    • Benchmarked DDPG, TD3, and PPO for continuous-action transmit-power control under stochastic fading.
    • Developed a multi-agent DRL framework for joint spectrum access + power control in CR-NOMA IoT.
    • Analyzed MRC / SC / EGC diversity-combining under Rayleigh fading.
  4. Education · Islamabad, PK

    Nov 2021 — Jun 2025

    B.E. Software Engineering — National University of Sciences and Technology

    • CGPA 3.61 / 4.0.
    • Coursework: Machine Learning, Reinforcement Learning, Large Language Models, Probability & Statistics.
    • 4× FBISE HSSC merit scholarship recipient.

Get in touch

Open to conversations.

§ 05

Currently open to AI engineering roles and PhD opportunities in reinforcement learning, multi-agent systems, and applied ML.

The fastest way to reach me is email — I usually respond within a day. You can also grab my CV below if you're evaluating me for a role or program.