Abdul Wahab

wahab1 (at) ualberta (dot) ca

I am a graduate student working with Dr. Martha White at the University of Alberta. I work on exploration and representation learning for reinforcement learning.

Previously, I worked as a machine learning engineer at Veeve.io, with Dr. Faisal Shafait and Dr. Ahmad Salman. I developed vision-based solutions for different modules of a smart shopping cart, like cart-state tracking, hand-motion analysis, trajectory analysis, gesture prediction, shrinkage control, item classification, and barcode-through-vision.

Before that I did my undergraduate from the National University of Sciences & Technology (NUST), Islamabad, Pakistan, working in TUKL-NUST R&D Lab with Dr. Faisal Shafait, and Dr. Arsalan Ahmad. I have had the pleasure of visiting the Augmented Vision lab DFKI Kaiserslautern, Germany, as a research associate intern to work on analyzing line-scan camera inputs, with Dr. Gerd Reis and Dr. Didier Stricker.

CV / LinkedIn / Google Scholar / Github

Research interests

My research interests include:

Exploration in reinforcement learning
Representation learning
Continual learning

Current research projects

Image from RL book

Effective Exploration with Sample Efficient Architectures
Abdul Wahab, David Janz, Martha White

In this work, we aim to combine theoretically backed exploration algorithms with sample-efficient architectures.

Understanding the role of representations in Offline-Online reinforcement learning
Abdul Wahab, Muhammad Kamran Janjua, Wesley Chung, Maryam Hashemzadeh, Martha White

In this project, we use Two-timescale networks (TTN) in the Offline-Online setting, in which an agent is trained on offline data, and is then allowed to update the representation and the policy networks online. We empirically show that TTN is well suited for the Offline-Online setting as the online updates are more stable and the model converges relatively faster. We also study, analyze and compare different representation learning approaches like sparsity (FTA), input transformations (sparse, random, random-sparse), augmentation, and self-supervised contrastive losses in different combinations, to identify the best combination of these representation learning approaches.

Research papers

Short-term load forecasting using bi-directional sequential models and feature-engineering
Abdul Wahab, MA Tahir, N Iqbal, A Ul-Hasan, F Shafait, SMR Kazmi
IEEE Access, 2021
code

We propose a general method based on bi-directional sequential models (LSTMs) for short-term energy load forecasting suitable for individual household energy forecasting and aggregated energy forecasting for a region.

Capacity sharing approaches in multi-tenant, multi-service PONs for low-latency fronthaul applications based on cooperative-DBA
Arsalan Ahmad, Abdul Wahab, Marco Ruffini, Frank Slyne, Sanwal Zeb, Rana Azhar Khan
Optical Fiber Communications Conference and Exhibition (OFC), 2020
poster

We propose and compare algorithms to allocate upstream PON capacity, where multiple virtual operators generate independent frame-level allocation over a shared infrastructure. Our fragmentation-aware approach shows the ability to limit latency increase to a few microseconds.

Other projects

	A strong Clobber 2D Player using CGT MCTS-Solver Abdul Wahab, Muhammad Kamran Janjua, Saqib Ameen report, code, presentation In this project, our goal is to couple combinatorial game theory (CGT) enhancements such as endgame database, and subgame simplifications with the Monte-Carlo Tree Search (MCTS) Solver to design a strong 2D-Clobber player.
	Learning Perceptual and Quasi-Perceptual Representations from fMRI Abdul Wahab, Muhammad Kamran Janjua report, code, presentation Human visual system depends on imagery representations in addition to the visual input to build a concrete holistic picture of the object of interest. These representations are correlated and shared in the human brain and help provide a crisp understanding of the environment. In this work, we study if a similar correlation exists in Artificial Neural Networks (ANNs) and if the correlation is strong enough to use the corresponding fMRI data of both perceptual and quasi-perceptual experiences interchangeably for downstream tasks such as object category prediction.
	Line-scan camera-input processing with ConvLSTMs for Road Condition Monitoring Abdul Wahab, Gerd Reis, Didier Stricker demo code In this work, we explored the advantages of using a line-scan camera in industrial applications, where the point of observation is a single plane, for monitoring, classification, and segmentation of moving objects. A ConvLSTM-based architecture was proposed, which extracts temporal-correlation from the sequence of line-scans to compensate for the loss of spatial-information in the individual line-scan (single row of pixels). The methodology evaluated on road-crack segmentation datasets outperforms the current benchmark on four public datasets. The methodology was also evaluated on a novel dataset collected on the roads of Islamabad, Pakistan. The methodology was also implemented for a seed-sorting system.

Abdul Wahab. Design inspired by this website.