Portfolio

ABOUT

Maroof

Maroof Abdul Aziz

M.Sc. Robotic Systems Engineering @ RWTH Aachen
AI/LLM Developer | Internship & Thesis @ Audi | Ex-Mercedes-Benz

I’m a Master's student in Robotic Systems Engineering at RWTH Aachen, specializing in AI for language and vision. My work focuses on developing and optimizing large language models (LLMs), speech processing pipelines, and computer vision systems.

With industry experience at Audi and Mercedes-Benz, and a first-author IEEE publication, I bridge academic research and real-world AI deployment—building scalable, efficient, and impactful machine learning systems.

EXPERIENCE

Audi AG

Master Thesis – LLM Optimization

Audi AG

Oct 2024 – Present

  • Researched and fine-tuned small language models.
  • Applied PEFT, LoRA, PTQ, ONNX acceleration.
Audi AG

Machine Learning Intern

Audi AG

Apr 2024 – Sep 2024

  • Integrated ChatGPT into speech processing.
  • Automated evaluation and CI/CD integration.
RWTH Aachen University

Research Assistant – Medical Imaging

RWTH Aachen University

Aug 2023 – Mar 2024

  • Developed DL models for tumor detection.
  • Published in IEEE ICIP 2024.
Mercedes-Benz R&D India

Senior Product Design Engineer

Mercedes-Benz R&D India

Dec 2015 – Jul 2022

  • Developed Python tools for automation.
  • Led automotive exterior design projects.

PROJECTS

LangGraph Agent Deployment

Feb 2024 – Apr 2024

  • Full-stack RAG application using FastAPI, LangGraph, and Streamlit for tool-using LLM agents.
  • Document ingestion from websites, PDFs, and SQL into a Qdrant vector store using LlamaIndex.
  • OpenAI and Groq models with session memory, tool calls, and LLM-based reranking.
  • CI/CD pipeline with GitHub Actions to automate testing and deploy Docker containers to AWS EC2.
  • User-friendly chat interface for uploading data, switching models, and visualizing agent reasoning.
View on GitHub ↗
python.png
langgraph.png
llamaindex.png
aws.png
docker.png
qdrant.png
chatgpt.png
groq.png
cohere.png

Master Thesis: Optimizing Small Language Models

Jan 2024 – May 2024

  • Optimized small language models for CPU-only embedded systems and in-vehicle voice assistants.
  • Designed and fine-tuned models using QLoRA with special tokens for tool-call accuracy.
  • Applied structured pruning and quantization (GPTQ, GGUF) for efficient model compression.
  • Built synthetic datasets simulating realistic vehicle assistant queries and tool usage.
  • Benchmarked models using real-world metrics: latency, memory, accuracy, and on-device inference speed.
python.png
pytorch.png
huggingface.png
ollama.jpg
azure.png
mlflow.png
mlstudio.png

LangChain RAG Agent Suite

Nov 2023 – Jan 2024

  • Modular RAG framework using LangChain + LlamaIndex.
  • RAG pipelines with multi-source ingestion: web, PDFs, and SQL databases.
  • Wikipedia, Arxiv, and Tavily tools for agent-based question answering.
  • Used FAISS for vector storage and LLM-based reranking for response relevance.
  • FastAPI backend and Streamlit frontend for interactive multi-modal user experience.
View on GitHub ↗
python.png
langgraph.png
llamaindex.png
chatgpt.png
groq.png
cohere.png

Cancer Detection on Whole Slide Images

Jul 2023 – Oct 2023

  • Cancer detection and subtyping using whole-slide histopathology images.
  • Attention-based heat maps to highlight tumor regions for interpretability and patch selection.
  • Integrated patch-level annotations and center loss to improve classification accuracy and feature separation.
  • Improved preprocessing with white-patch filtering and magnification normalization for consistent patch quality.
  • Enhanced RCC subtype classification, achieving better AUC, bACC, and F1 than baseline models.
View on GitHub ↗
python.png
pytorch.png
tensorflow.png
scikit-learn.png
numpy.png
git.png

Driver Drowsiness Detection

May 2023 – Jul 2023

  • Built drowsiness detection model using CNNs and facial landmarks on 40K+ driver images.
  • Achieved 90% accuracy with custom CNN and 86% with ResNet50 transfer learning.
  • Used OpenCV for facial landmark detection to identify drowsy behavior.
  • Compared traditional CNN, OpenCV, and transfer learning approaches for performance.
  • Collaborated with TechLabs Aachen team during the Digital Shaper Program.
View on GitHub ↗
python.png
tensorflow.png
opencv.png
scikit-learn.png
numpy.png
git.png

PUBLICATIONS

Deep Learning Approach for Renal Cell Carcinoma Detection

IEEE ICIP 2024

A deep learning method for detecting renal cell carcinoma using histopathological images.

View Document ↗

Optimization of Small Language Models for Embedded Voice Assistance

Master's Thesis, 2024

My Master’s thesis focused on usage of SLMs on edge devices.

View Document ↗
Page 1Page 2Page 3

CONTACT

I’m always open to discussing new projects or opportunities. Let's connect!