Automatically Detecting Numerical Instability in Machine Learning Applications via Soft Assertions
Shaila Sharmin, Anwar Hossain Zahid, et al. A method for finding hidden numerical instability bugs in ML applications.
Works
A focused view of my work across AI reliability, numerical debugging, LLM evaluation, GPU numerics, and production software engineering.
Publications
Peer-reviewed and preprint work spanning reliable ML, model evaluation, and numerical correctness.
Shaila Sharmin, Anwar Hossain Zahid, et al. A method for finding hidden numerical instability bugs in ML applications.
Anwar Hossain Zahid, Monoshi Kumar Roy, et al. Evaluates context-sensitive LLM behavior for hate speech detection.
Anwar Hossain Zahid, Ignacio Laguna, Wei Le. Cross-platform GPU numerical analysis for heterogeneous systems.
Raihan Mia, Anwar Hossain Zahid, et al. A software engineering education and benchmarking platform concept.
Projects
Systems and applied projects from research, national lab work, industry, and advanced coursework.
FSE 2025 paper introducing Soft Assertions to detect and trigger numerical instability bugs in ML applications.
Contributed to the LLNL Varity project by implementing HIP backend generation for GPU kernel testing, enabling cross-platform numerical consistency evaluation.
Extended a class project to evaluate hate speech detection models using geographical metadata. Later adapted for large language model testing and published in 2025.
Developed a real-time face recognition and spoof detection system for FinTech applications, enhancing security for remote banking verification.
Implemented Single Sign-On authentication for the PRP system, enabling secure unified access across parliamentary resource modules.
Implemented and evaluated ML algorithms for natural language processing tasks. Later extended for LLM experiments that led to a published paper.
Developed an intelligent Othello game-playing agent using adversarial search algorithms and heuristic evaluation functions.
Created a Mancala game engine with an AI opponent using minimax and heuristic strategies.
Built a ray tracing engine from scratch to render 3D scenes with reflection, refraction, and shading.
Extended the Nachos instructional OS to implement thread scheduling, virtual memory, and file system operations.
Implemented and tested channel equalization algorithms to improve signal quality in noisy communication environments.