Junaid Baber

About Me

I am an experienced machine learning engineer and data scientist with a strong track record of delivering end-to-end AI/ML solutions across diverse domains. With expertise spanning deep learning, computer vision, natural language processing, and robotics, I excel at transforming complex business requirements into production-ready intelligent systems. My hands-on experience includes developing real-time object detection systems using LiDAR sensors, building sentiment analysis platforms for multilingual social media data, implementing human-robot collaboration frameworks, and creating cloud-based face recognition systems for large-scale deployments. I thrive in fast-paced environments where I can leverage cutting-edge technologies like TensorFlow, PyTorch, and ROS to solve challenging real-world problems. My technical foundation is complemented by strong software engineering practices and the ability to work effectively across the full ML lifecycle—from data collection and model development to deployment and monitoring.

Education

  • Ph.D. in Multimedia Processing and Retrieval, AIT, Thailand (2011-2013)
  • M.Sc. in Computer Science, AIT, Thailand (2008-2010)

Professional Experience

  • Research Engineer, LIG, UGA, France (Dec 2024 – Present)
    • Developing LLM and SLM-based models for software engineering applications
    • Researching and implementing advanced language models for code generation and analysis
    • Working on innovative AI solutions for improving software development processes
  • Machine Learning Engineer, GIPSA-LAB, France (Dec 2023 – Nov 2024)
    • Developed advanced ML models for signal processing and biomedical applications
    • Implemented deep learning solutions for Raman spectroscopic analysis
    • Collaborated with interdisciplinary teams on real-time data analysis systems
  • AI/ML Engineer, LIG, UGA, France (Dec 2021 – Nov 2023)
    • Built production-grade human-robot collaboration systems using ROS and 3D sensors
    • Developed efficient 3D LiDAR object detection framework for real-time applications
    • Implemented motion prediction models achieving significant accuracy improvements
    • Created custom datasets and optimization pipelines for robotics applications
  • ICT Expert & Solutions Architect, UNDP Balochistan & GIL (Mar 2017 – Dec 2021)
    • Designed and deployed cloud-based face recognition system for smart surveillance
    • Led development of NLP systems for multilingual sentiment analysis (English, Urdu, Roman Urdu)
    • Built scalable web applications using Flask, Django, and modern frontend frameworks
    • Managed end-to-end ML project lifecycle including deployment and monitoring
  • Visiting Researcher, National Institute of Informatics, Tokyo, Japan (June 2010 - Dec 2011)
    • Developed multimedia retrieval systems and video analysis algorithms
    • Implemented computer vision solutions for content-based image retrieval

Technical Skills

Programming Languages

Python (Expert), C++ (Proficient), MATLAB, SQL, JavaScript, LaTeX

Machine Learning & Deep Learning

TensorFlow, PyTorch, Keras, Scikit-learn, XGBoost, LightGBM

CNNs, RNNs, Transformers, GANs, AutoEncoders, Transfer Learning, Object Detection (YOLO, Faster R-CNN)

Computer Vision & NLP

OpenCV, PIL, Image Processing, Video Analysis, Face Recognition

BERT, GPT, Word2Vec, LSTM, Sentiment Analysis, Text Classification, Named Entity Recognition

Robotics & 3D Processing

Robot OS (ROS), RVIZ, PCL (Point Cloud Library), LiDAR Processing, 3D Object Detection

Data Engineering & Databases

MySQL, PostgreSQL, SQLAlchemy, Pandas, NumPy, Data Pipeline Development

Web Development & Deployment

Flask, Django, REST APIs, HTML/CSS/JavaScript, Apache, Docker

Cloud Services, Model Deployment, CI/CD Pipelines

Tools & Platforms

Git, Jupyter, Linux/Unix, AWS/Cloud Computing, Distributed Computing

Key Projects & Technical Implementations

Human-Robot Collaboration System with Motion Prediction

Technologies: Python, ROS (Robot Operating System), PyTorch, ANNOY, 3D Sensors, Real-time Processing

Description: Developed an end-to-end framework for predicting human hand motion in shared workspaces to prevent robot-human collisions. Created custom dataset by simulating collision scenarios and collecting 3D trajectory data. Implemented 3D cell quantization for spatial discretization and developed vector embedding models for motion prediction. Utilized ANNOY (Approximate Nearest Neighbors Oh Yeah) for efficient nearest neighbor search enabling real-time predictions. System achieved superior accuracy compared to baseline methods while maintaining sub-100ms latency requirements.

Impact: Enhanced safety in collaborative robotics environments, enabling faster and safer human-robot interaction workflows.

Real-Time 3D Object Detection in LiDAR Point Clouds

Technologies: Python, PCL (Point Cloud Library), OpenCV, Scikit-learn, KITTI Dataset, ROS/RVIZ

Description: Designed and implemented 3D-PSH (3D Point Spatial Histogram), a lightweight object detection framework optimized for resource-constrained devices. Developed adaptive clustering algorithms for point cloud segmentation, computed 3D spatial histograms as feature descriptors, and applied Bag of Visual Words (BoVW) approach with SVM for classification. System was validated on KITTI dataset and real-world LiDAR sensors, achieving competitive accuracy with 3-5x faster inference compared to deep learning approaches.

Impact: Enabled real-time 3D object detection on edge devices without GPU requirements, suitable for autonomous vehicles and robotics applications.

Enterprise Smart Surveillance Platform

Technologies: Python, Flask/Django, OpenCV, Face Recognition APIs, MySQL, JavaScript, IP Camera Integration

Description: Architected and deployed a production-grade smart surveillance solution for government premises. Integrated multiple IP cameras into a unified system performing real-time face detection, recognition, and tracking. Built RESTful APIs for camera management and data retrieval. Developed real-time web dashboard for monitoring and analytics. Implemented database schemas for efficient storage and querying of recognition events. System handles 10+ concurrent camera streams with sub-second recognition latency.

Impact: Automated visitor tracking reducing manual check-in time by 80%, enhanced security with real-time alerts for unauthorized access.

Biomedical Classification with Deep Learning on Raman Spectroscopy

Technologies: Python, TensorFlow/Keras, Pandas, NumPy, Scikit-learn, Data Augmentation Techniques

Description: Developed deep neural network models for classifying biomedical samples based on Raman spectroscopic signatures. Implemented data preprocessing pipeline handling noisy spectral data including normalization, baseline correction, and augmentation. Designed CNN and fully-connected architectures optimized for 1D spectral data. Performed extensive hyperparameter tuning and cross-validation achieving 95%+ classification accuracy across multiple biomedical classes.

Impact: Automated biomedical sample classification reducing analysis time from hours to seconds, enabling high-throughput screening applications.

Multilingual Sentiment Analysis Platform for Social Media

Technologies: Python, PyTorch, Transformers (BERT), LSTM, Flask, PostgreSQL, Data Scraping Tools

Description: Built end-to-end sentiment analysis system for multilingual social media content (English, Urdu, Roman Urdu). Created benchmark dataset for Roman Urdu (low-resource language) through web scraping and manual annotation. Developed custom tokenizer and embedding layer for Roman Urdu text. Implemented attention-based BiLSTM and fine-tuned multilingual BERT models. Built REST API for real-time sentiment prediction and batch processing. Deployed system analyzing 100K+ tweets daily during COVID-19 for public opinion monitoring.

Impact: Enabled real-time public sentiment tracking for government decision-making during critical periods, achieved F1-score of 0.89 on Roman Urdu dataset.

LLM-Based Rhetorical Strategy Classification for Crowdfunding

Technologies: Python, OpenAI API, Hugging Face Transformers, GPT models, Prompt Engineering, Statistical Analysis

Description: Developed automated system for identifying persuasive rhetorical strategies in crowdfunding campaigns using large language models. Designed prompt engineering strategies for zero-shot and few-shot classification. Compared multiple LLM architectures (GPT-3.5, GPT-4, BERT variants) against ground truth annotations. Performed correlation analysis between rhetorical strategies and funding success metrics. Built data pipeline processing 10K+ campaign texts with statistical validation.

Impact: Provided actionable insights for campaign optimization, identified key rhetorical elements correlated with 40% higher funding success rates.

Contact

Email: junaidbaber.cs@gmail.com

Google Scholar: Google Scholar

Selected publications available upon request