∞failed experiments - every metric above is what survived them.
I am a Lead AI Engineer at
The KOW Company,
where I work across AI research, software engineering, and product development. I lead and collaborate
with teams to design, build, and deploy production AI systems, while also taking research ideas
from problem formulation to publication. My research is accompanied by open-source code, datasets,
and models published on GitHub
and Hugging Face.
My work includes writing research proposals and funding requests, collaborating with universities,
and working directly with clients and stakeholders to turn real-world challenges into practical
AI solutions.
Before specializing in AI, I spent five years as a software engineer building large-scale
production systems. This background allows me to bridge research and engineering, helping
transform promising ideas into scalable, reliable, and impactful technologies.
Education
B.Sc. in Computer Science & Engineering
American International University-Bangladesh (AIUB) · 2011 - 2015
CGPA 3.87/4.00 (WES 3.94/4.00) · Magna Cum Laude, Top 3%
Experience
The KOW CompanyDhaka, Bangladesh
Lead, Artificial IntelligenceCurrent Jan 2023 - Present
Leadership & Management
Lead a 20+ member AI engineering team across enterprise delivery, applied research, and product development.
Own execution across AI workstreams by aligning priorities, architecture, timelines, and delivery outcomes.
Run 8-10 monthly engineering meetings to resolve blockers, technical dependencies, and implementation decisions.
Reduced delivery time by 15% through improved engineering workflows, sprint planning, and review processes.
Maintain 90-95% on-time delivery across concurrent AI projects, client commitments, and product milestones.
Review ML and software engineering work to improve correctness, scalability, reliability, and maintainability.
Translate client requirements across European and US time zones into feasible AI system designs.
Mentor junior ML engineers on research methods, model development, evaluation, and production standards.
Engineering Practices
Design secure AI systems with input validation, least-privilege access, secrets management, and dependency hygiene.
Lead system design reviews covering scalability, reliability, data flow, integration patterns, and trade-offs.
Define engineering standards that improve code quality, maintainability, extensibility, and system scalability.
Research & Technical
Lead research on AI hallucination, prompt safety, and vision-language model reliability.
Prepare manuscripts for peer-reviewed journals and AI conference submissions.
Develop evaluation frameworks for hallucination, grounding, and safety behavior in AI models.
Build experimental pipelines for multimodal and language model evaluation.
Develop AI systems using vision-language models, LLM fine-tuning, 3D reconstruction, and pose estimation.
Publish reproducible code, datasets, and models on GitHub and Hugging Face.
Collaborate with East West University faculty on linear algebra and differential geometry for AI research.
Drive architecture decisions for enterprise AI systems across performance, reliability, security, and deployment.
Senior Machine Learning EngineerJul 2021 - Dec 2022
Improved object detection and segmentation accuracy by 20-35% through model optimization and algorithm tuning.
Delivered customer segmentation and territory mapping models for data-driven marketing decisions.
Contributed to a 15% improvement in marketing strategy through deployed ML solutions.
Built A/B testing frameworks to compare model architectures and validate performance gains.
Led 6+ client ML engagements from business requirements to technical delivery.
Converted business requirements into ML solutions with clear timelines, scope, and deliverables.
Supported engineering teams on model selection, data pipelines, evaluation, and implementation.
Machine Learning EngineerJul 2020 - Jun 2021
Built deep learning models for production object recognition and image segmentation.
Developed scalable preprocessing pipelines for repeatable ML training workflows.
Created training workflows to support consistent model experimentation and iteration.
Ran A/B tests across model variants and algorithms to compare performance and guide improvements.
Helped move ML models from experimentation into production-ready systems.
Smart Technologies (BD) LtdDhaka, Bangladesh
Senior Software EngineerSep 2016 - Dec 2019
Enterprise Software Development
Developed ERP modules in .NET across HR, inventory, procurement, audit, sales, and predictive analytics.
Led end-to-end delivery of the supply chain module, including system architecture, domain modeling, business logic, and database integration.
Automated the full supply chain workflow, reducing manual paperwork by 90%.
Achieved 70-75% process automation across core ERP workflows and increased report visualization coverage by 45%.
Designed a modular and scalable architecture that reduced new-feature development time by 30-40%.
Implemented role-based access control, validation rules, reporting workflows, and approval flows across ERP modules.
Data Engineering & Performance Optimization
Reduced report generation time from approximately 20 minutes to 40-54 seconds on a 4TB SQL Server ERP database through index restructuring, query optimization, and execution-plan analysis.
Optimized complex SQL queries, stored procedures, joins, indexes, and reporting views for high-volume enterprise data.
Built an offline-capable synchronization scheduler with a 99.9% success rate, including conflict detection, retry handling, and automated recovery with no data loss.
Reduced overall database load by 35% through targeted caching, query-level optimization, and improved data access patterns.
ProggasoftDhaka, Bangladesh
Software EngineerMar 2015 - Aug 2016
Developed and maintained production features for DevSkill.com using ASP.NET MVC.
Built backend services, database integrations, and web interfaces for user-facing platform features.
Applied SOLID principles, design patterns, and clean-code practices to improve maintainability and extensibility.
Worked across application logic, UI updates, and relational database operations in a production web environment.
Selected Projects
CognixGeneralized Multi-Reference Image Fusion and Editing System
Lead AI EngineerActive R&D / In Development
Started and architected Cognix, an ongoing generalized image fusion and editing system that takes a prompt and 1-3 reference images to generate a single coherent output image.
Designed the system for prompt-guided multi-image conditioning tasks such as product visualization, identity-preserving edits, scene composition, style transfer, garment replacement, and object-level image manipulation.
Building the model strategy around Qwen-Image-Edit-2509, using its native multi-image input, Qwen2.5-VL encoder for image understanding, MMDiT transformer for image generation and fusion, and frozen VAE for decoding.
Implemented a config-driven fine-tuning pipeline using QLoRA with 4-bit NF4 quantization, enabling a ~20B model to train on a 24 GB RTX 4090, with bf16/H100 support through configuration changes.
Developed variable-length multi-reference dataset support for 1, 2, and 3 reference images, using semantic instructions, attribute-rich prompts, and preservation clauses.
Built training, validation, evaluation, and inference workflows, including dataset validation, model loading, LoRA training, checkpoint evaluation, and prompt-based generation.
Verified end-to-end training on an RTX 4090 with a successful micro-train run, 94M trainable LoRA parameters, and checkpoint generation.
Integrated inference through ComfyUI using FP8 model execution, API-based graph submission, multi-image upload, VRAM controls, and LoRA-enabled generation.
Currently evaluating base-model behavior across multi-reference editing tasks before building the full fine-tuning dataset.
Conceived and led the end-to-end development of a unified virtual try-on platform where shoppers can search across brands, try products on with AI, and purchase from one place.
Fine-tuned Qwen with LoRA for garment understanding, catalog matching, and product recommendation across diverse apparel categories.
Built AI-powered 2D virtual try-on workflows for apparel such as t-shirts, jeans, dresses, shoes, and sarees.
Onboarded 170+ brands, including major international names, into the platform’s searchable product catalog.
Reached a 16% purchase conversion rate, reflecting strong shopper confidence from try-before-buy experiences.
Led development of Omnimage.ai, an AI-powered image generation and editing engine for scalable product-image workflows.
Fine-tuned LLaMA-based components to support high-volume image-processing requests with 10K+ daily requests and sub-18-second inference latency.
Built production inference APIs for image editing, background processing, and integration with downstream platforms.
Integrated Omnimage.ai into automated image-processing workflows, including background removal and enhancement tasks used by enterprise e-commerce operations.
Stack
LLaMA
PyTorch
FastAPI
AthingAI-Powered Product and Catalog Intelligence Platform
Lead AI EngineerThe KOW Company
Led development of an AI-powered catalog intelligence platform for product understanding, search, and structured attribute extraction.
Built workflows for product classification, metadata enrichment, and semantic matching across large e-commerce catalogs.
Applied computer vision and language-model pipelines to improve product discovery, catalog quality, and downstream recommendation workflows.
Designed API-driven services for integration with product catalogs, content systems, and client-facing applications.
Stack
Computer Vision
LLMs
PyTorch
FastAPI
Vector Search
HoloSnap.aiMulti-View 3D Reconstruction Platform
Lead AI Engineer
Built a reconstruction pipeline that converts 360° video and multi-view image sequences into high-quality 3D meshes in GLB and OBJ formats.
Implemented multi-view geometry, camera pose estimation, and structure-from-motion workflows for 3D asset generation.
Reduced manual 3D editing effort by 40% through improved automated reconstruction.
Enabled scalable 3D asset creation for e-commerce clients, lowering operational costs by 35%.
Stack
PyTorch
Open3D
COLMAP
Docker
Retouched.aiObject Detection and Segmentation Platform
Lead AI Engineer
BASIS National ICT Award 2020 Champion · APICTA 2021 Finalist
Built a salient-object-detection model reaching 96.23% accuracy using the HCE metric.
Improved segmentation accuracy by 17% and processing speed by 30% compared with existing commercial tools.
Optimized inference to process 257 MB images in 2.27 seconds using RSU blocks and advanced pooling.
Scaled the platform to 5.7M+ paid images processed globally, with 8K-11K images processed per day for enterprise e-commerce clients.
Stack
PyTorch
TensorFlow
U-Net variants
FastAPI
Redis
Docker
GCP
Enterprise AI Site Audit PlatformAI-Powered Catalog, Image, and Content Quality Audit System
Lead AI EngineerMajor US Retail Client · In Development
Leading development of an AI site-audit platform for a major US retail client, automating quality checks across product imagery, catalog data, and backend content health.
Built image-audit workflows to detect inappropriate content, sheer-garment visibility, offensive tattoos, vulgar copy, crop inconsistencies, retouching defects, face-crop issues, and duplicate or near-duplicate images.
Developed data-audit capabilities to validate product copy, color and style attribution, product-type metadata, image origination tags, and DAM/CMS integration requirements.
Designed backend audit checks for inventory gaps, broken image and swatch links, alt-text quality, and pages requiring updates or removal.
Stack
Computer Vision
NLP
PyTorch
FastAPI
Contextual Key-Phrase Spotting from AudioAudio Intelligence and Customer-Service QA System
Technical LeadThe KOW Company
Deployed SeamlessM4T for audio-to-text transcription in customer-service workflows.
Integrated Llama 2 to extract contextual key phrases from transcribed conversations.
Reduced manual transcription effort by 80%, enabling faster quality assurance and review.
Built API workflows for transcription, phrase extraction, and downstream QA integration.
Stack
SeamlessM4T
Llama 2
PyTorch
FastAPI
KTMOrder Management and Image Processing Pipeline
System Design LeadThe KOW Company
Production client: leading European luxury resale marketplace
Designed and built KTM, a .NET and SQL Server platform that automates the full image-processing workflow from client FTP ingest to final delivery.
Architected a 12-step production pipeline covering FTP upload, order creation, editor auto-assignment, AI background removal through Omnimage.ai, editor review, QC approval, and automated FTP delivery.
Processed 4.5M+ images to date at approximately 200K images per month and 5K-8K images per day.
Achieved an average batch turnaround of approximately 65 minutes through automated workflow routing and reduced manual handoffs.
Built the order-management core to distribute files across active editors, collect outputs, and support client-facing QC approval.
Stack
.NET
SQL Server
FTP
Omnimage.ai
Retouched.ai
Publications
Stroke-Level Connectivity Verification: Grounding Vision-Language Models Against Topology Hallucination in Diagram Understanding (ICDAR 2026)
Abdullah Ibne Hanif Arean, Niamul Hassan Samin, Md Arifur Rahman, Renu Akter Suity, Juena Ahmed Noshin, and Md Ashikur Rahman*