Bimsara Pathiraja

Logo

CV | LinkedIn | GitHub

I am pursing BSc. (Hons) degree in Engineering at the University of Moratuwa, Sri Lanka.

I am currently a part-time Machine Learning Engineer at PromiseQ.

My main interests are Computer Vision, Autonomous Driving and 3D Deep Learning.

Portfolio


Work Experience

Machine Learning Engineer (Part-time remote) - promiseQ

promiseQ uses advanced real-time video analysis, object detection and tracking to reduce the cost and time wasted associated to false alarms.

My contributions


[Self-Driving-Car-Stage-II] Multi-Sensor based Dynamic Object Detection, Tracking, and Trajectory Prediction

The final year project of the degree program and our project is based on dynamic object detection, tracking, trajectory prediction, signal light identification and data collection using LiDAR and camera.

3D object Detection and Tracking


Trajectory Prediction

Paper submitted to IEEE-ITSC - Class-Aware Attention for Multimodal Trajectory Prediction

Open Research Poster

Abstract

Abstract—Predicting the possible future trajectories of the surrounding dynamic agents is an essential requirement in autonomous driving. These trajectories mainly depend on the surrounding static environment, as well as the past movements of those dynamic agents. Furthermore, the multimodal nature of agent intentions makes the trajectory prediction problem more challenging. All of the existing models consider the target agent as well as the surrounding agents similarly, without considering the variation of physical properties. In this paper, we present a novel deep-learning based framework for multimodal trajectory prediction in autonomous driving, which considers the physical properties of the target and surrounding vehicles such as the object class and their physical dimensions through a weighted attention module, that improves the accuracy of the predictions. Our model has achieved the highest results in the nuScenes trajectory prediction benchmark, out of the models which use rasterized maps to input environment information. Furthermore, our model is able to run in real-time, achieving a high inference rate of over 300 FPS.


Sample Results

Quantitative results for nuScenes dataset

MinADE_5 - 1.67m MinFDE_1 - 8.43m

Leaderbaord: 12th rank


Computer Vision

CS231n: Convolutional Neural Networks for Visual Recognition

My complete implementation of assignments and projects in CS231n: Convolutional Neural Networks for Visual Recognition by Stanford (Spring, 2021).

View on GitHub

Implementing CNN image classification module using Numpy: An image classification model implementing with fully connected networks, non linear activations, batch normalization, dropout and convolutional networks including back propagation (GitHub).

Image Captioning: An image captioning model with vanilla RNNs, LSTM and Transformer network. RNN and LSTM were implemented from scratch using numpy including backpropagation. Attention, Multi-head attention and Transformer were implemented using Pytorch (GitHub).

GAN: Implementing Vanilla GAN, Least Square GAN and Deep Convolutional GAN (DCGAN).

Network Visualization: Visualizing a pretrained model using saliency maps, fooling images and class visualization.


Natural Language Processing

CS224n: Natural Language Processing with Deep Learning

My complete implementation of assignments and projects in CS224n: Natural Language Processing with Deep Learning by Stanford (Winter, 2019).

View on GitHub

Neural Machine Translation: An NMT system which translates texts from Spanish to English using a Bidirectional LSTM encoder for the source sentence and a Unidirectional LSTM Decoder with multiplicative attention for the target sentence (GitHub).

Dependency Parsing: A Neural Transition-Based Dependency Parsing system with one-layer MLP (GitHub).


Internship Projects

Company: Creative Software

Corrosion Detection using Semantic Segmentation

Corrosion Detection for industrial environment using semantic segmentation. I used U-Net model for semantic segmentation. I completed writing the model, testing and all the training. Using a combination of focal loss and dice loss increased the accuracy significantly and using lot of augmentations reduced false positives.

Synthetic data generation is also done using Unity 3D since the real image dataset was not enough.

Object Detection in Industrial Environment

Object detection model was trained using Detectron2 for idenitifying industrial objects like gauges, motors, valves, pumps etc.


Other Projects

Garment ReConstruction - NeurIPS Challenge

3D Texture garment reconstruction using CLOTH3D dataset and SMPL body parameters. PyMesh, Open3d, Meshlab, MeshlabXML, Pytorch Geometric libraires were used. Only the data preprocessing part is done. The model is yet to be implemented.

Subsampling points

Non-rigid Iterative Closest Point (ICP)

Custom maxpooling

FPGA processor for Matrix Multiplication

The project included designing an Instruction Set Architecture (ISA) for FPGA processor for Matrix Multiplication and implementing with all the necessory components using Verilog HDL. A Python simulator was written to test the performance of the processor. We used Intel Quartus Prime, ModelSim and Xillinx to implement and simulate the processor.


Deep Surveilance System (DSS) - SLIOT Challenges

View on GitHub

Deep Surveillance System, an IoT device which is triggered by threatening sounds to activate the camera. The product included hardware, sensors, ML model, web based UI as well. Urban 8K sound dataset and TensorFlow were used for model training. Implemented using Raspberry Pi, OpenCV and Azure. I involved in model wrting, training and hardware implementation. DSS won 2nd place in the open category of Sri Lanka IoT competition (SLIOT).

Interesting Reads

Besides Machine Learning and Computer Vision I have a great passion on reading books. Below is a list of the best picks from my past year reading.



© 2020 Khanh Tran. Powered by Jekyll and the Minimal Theme.