Interests

Probabilistic Regression • Optimisations for Neural Networks • Fully-convolutional Neural Network • Neural Network Compression




Cross-modality Learning using Fully-Convolutional Neural Networks

In this work, we worked on designing, implementing (Python/Tensorflow) and evaluating a fully-convolutional neural network that enabled cross-modality learning for Magnetic Resonance Imaging (MRI). The proposed method generated T2-weighted MRI images from T1. T2-weighted MRI is particularly challenging to acquire due to time, cost constraints as well as significant imaging artefacts.

Published as workshop paper: XmoNet: A Fully Convolutional Network for Cross-Modality MR Image Inference at MICCAI 2018 Workshops.

XmoNet: A Fully Convolutional Network for Cross-Modality MR Image Inference.
XmoNet: A Fully Convolutional Network for Cross-Modality MR Image Inference.




Efficient Hand Orientation and Pose Estimation for Uncalibrated Cameras

For my doctorate thesis, I explored advance non-linear regression methods for estimating hand orientation and pose from uncalibrated colour images. The work aimed to propose machine learning techniques that can generalise well for variations in hand shape, size, style and be independent of camera calibration - hence easily deployable on mobile devices.

I made contributions in terms of novel probabilistic regression methods that enabled to learn from ambiguous datasets where existing regression methods struggled to generalise.

A significant part of my thesis was published as a journal paper titled SPORE: Staged Probabilistic Regression for Hand Orientation Inference at CVIU 2017.

Efficient hand orientation and pose estimation for uncalibrated cameras.
Efficient hand orientation and pose estimation for uncalibrated cameras.




Optical Biopsy of Brain Tumours

During my doctorate degree, I worked on research and implementation (Matlab) of a supervised learning method for image-based categorisation of brain tumours. The proposed method characterised brain tissue (normal, low-grade or high-grade tumour) using Magnetic Resonance Spectroscopy (MRS) imaging. We addressed the problem of partial volume effect.

Published as conference paper: Supervised Partial Volume Effect Unmixing for Brain Tumor Characterization using Multi-voxel MR Spectroscopic Imaging at ISBI 2016.

The left two images show a patient’s brain using different MR modalities, and the right image has our technique overlaid, as heatmaps indicating tissue grades.
The left two images show a patient’s brain using different MR modalities, and the right image has our technique overlaid, as heatmaps indicating tissue grades.




Hand Gesture Recognition using Microsoft Kinect

My dissertation project for my Master’s degree at The University of Sheffield explored hand gesture recognition using depth images from Microsoft Kinect v1. It involved research, design and implementation (C++/OpenCV/OpenNI) on utilising Kinect depth images for dynamic hand gesture recognition. The work proposed a distance-invariant hand cropping method that used the distance of hand to the sensor for adjusting the cropping window. Additionally, a method for addressing quantisation noise in depth images through morphological post-processing steps was proposed. The dynamic hand gestures were divided into pose stages, which were detected using a Multi-layered Perceptron (MLP) network on post-processed projections of depth images on x-y, y-z, x-z planes. A simple Hidden Markov model (HMM) was used on the output of MLP to recognise the dynamic gesture sequences.

Published as conference paper: Kinect Depth Stream Pre-processing for Hand Gesture Recognition at ICIP 2013.

Distance Invariant Hand Cropping:

Distance

Input

Fixed

Distance Invariant


0.7 meters


1.7 meters


Post-processing for quantisation noise in Kinect depth images:

Original

space space space space space space

Proposed

space space space space space space




Traffic Flow Estimation for Edge Devices

During my Master’s degree at The University of Sheffield, I worked on a coursework project for Computer Vision module building a traffic flow estimation algorithm. This project proposed an optimised algorithm targetted for resource constraint hardware, such as BeagleBoard-xM. I implemented (C++/OpenCV) a method using background subtraction, morphological post-processing and image moments to detect cars and estimate traffic congestion.




Smartphone based Guidance System for Visually Impaired

During my Bachelor’s degree at National University of Computer and Emerging Sciences, I worked on my dissertation project building a guidance system for visually impaired. This project involved research, design and implementation (Matlab/Android/OpenCV) of a method that used morphological edge detectors, Hough transform and RANSAC for detecting vanishing points from 2D uncalibrated colour images of a straight path. The detected vanishing points were used to provide guidance on the deviation of a visually impaired person from a straight path.

Published as conference paper: Smartphone based Guidance System for Visually Impaired Person at IPTA 2012.