Md Ashikur Rahman
Lead AI Engineer @ The KOW Company

Md Ashikur Rahman

Building production AI systems from research to real-world deployment, including vision-language models, LLM fine-tuning, and 3D reconstruction.

  • AI Interpretability
  • Multimodal Learning
  • Computer Vision
  • Large Language Models

I am a Lead AI Engineer at The KOW Company, where I work across AI research, software engineering, and product development. I lead and collaborate with teams to design, build, and deploy production AI systems, while also taking research ideas from problem formulation to publication. My research is accompanied by open-source code, datasets, and models published on GitHub and Hugging Face.

My work includes writing research proposals and funding requests, collaborating with universities, and working directly with clients and stakeholders to turn real-world challenges into practical AI solutions.

Before specializing in AI, I spent five years as a software engineer building large-scale production systems. This background allows me to bridge research and engineering, helping transform promising ideas into scalable, reliable, and impactful technologies.

Education

B.Sc. in Computer Science & Engineering

American International University-Bangladesh (AIUB) · 2011 - 2015

CGPA 3.87/4.00 (WES 3.94/4.00) · Magna Cum Laude, Top 3%

Experience

The KOW Company Dhaka, Bangladesh
Lead, Artificial Intelligence Current Jan 2023 - Present

Leadership & Management

  • Lead a 20+ member AI engineering team across enterprise delivery, applied research, and product development.
  • Own execution across AI workstreams by aligning priorities, architecture, timelines, and delivery outcomes.
  • Run 8-10 monthly engineering meetings to resolve blockers, technical dependencies, and implementation decisions.
  • Reduced delivery time by 15% through improved engineering workflows, sprint planning, and review processes.
  • Maintain 90-95% on-time delivery across concurrent AI projects, client commitments, and product milestones.
  • Review ML and software engineering work to improve correctness, scalability, reliability, and maintainability.
  • Translate client requirements across European and US time zones into feasible AI system designs.
  • Mentor junior ML engineers on research methods, model development, evaluation, and production standards.

Engineering Practices

  • Design secure AI systems with input validation, least-privilege access, secrets management, and dependency hygiene.
  • Lead system design reviews covering scalability, reliability, data flow, integration patterns, and trade-offs.
  • Define engineering standards that improve code quality, maintainability, extensibility, and system scalability.

Research & Technical

  • Lead research on AI hallucination, prompt safety, and vision-language model reliability.
  • Prepare manuscripts for peer-reviewed journals and AI conference submissions.
  • Develop evaluation frameworks for hallucination, grounding, and safety behavior in AI models.
  • Build experimental pipelines for multimodal and language model evaluation.
  • Develop AI systems using vision-language models, LLM fine-tuning, 3D reconstruction, and pose estimation.
  • Publish reproducible code, datasets, and models on GitHub and Hugging Face.
  • Collaborate with East West University faculty on linear algebra and differential geometry for AI research.
  • Drive architecture decisions for enterprise AI systems across performance, reliability, security, and deployment.
Senior Machine Learning Engineer Jul 2021 - Dec 2022
  • Improved object detection and segmentation accuracy by 20-35% through model optimization and algorithm tuning.
  • Delivered customer segmentation and territory mapping models for data-driven marketing decisions.
  • Contributed to a 15% improvement in marketing strategy through deployed ML solutions.
  • Built A/B testing frameworks to compare model architectures and validate performance gains.
  • Led 6+ client ML engagements from business requirements to technical delivery.
  • Converted business requirements into ML solutions with clear timelines, scope, and deliverables.
  • Supported engineering teams on model selection, data pipelines, evaluation, and implementation.
Machine Learning Engineer Jul 2020 - Jun 2021
  • Built deep learning models for production object recognition and image segmentation.
  • Developed scalable preprocessing pipelines for repeatable ML training workflows.
  • Created training workflows to support consistent model experimentation and iteration.
  • Ran A/B tests across model variants and algorithms to compare performance and guide improvements.
  • Helped move ML models from experimentation into production-ready systems.
Smart Technologies (BD) Ltd Dhaka, Bangladesh
Senior Software Engineer Sep 2016 - Dec 2019

Enterprise Software Development

  • Developed ERP modules in .NET across HR, inventory, procurement, audit, sales, and predictive analytics.
  • Led end-to-end delivery of the supply chain module, including system architecture, domain modeling, business logic, and database integration.
  • Automated the full supply chain workflow, reducing manual paperwork by 90%.
  • Achieved 70-75% process automation across core ERP workflows and increased report visualization coverage by 45%.
  • Designed a modular and scalable architecture that reduced new-feature development time by 30-40%.
  • Implemented role-based access control, validation rules, reporting workflows, and approval flows across ERP modules.

Data Engineering & Performance Optimization

  • Reduced report generation time from approximately 20 minutes to 40-54 seconds on a 4TB SQL Server ERP database through index restructuring, query optimization, and execution-plan analysis.
  • Optimized complex SQL queries, stored procedures, joins, indexes, and reporting views for high-volume enterprise data.
  • Built an offline-capable synchronization scheduler with a 99.9% success rate, including conflict detection, retry handling, and automated recovery with no data loss.
  • Reduced overall database load by 35% through targeted caching, query-level optimization, and improved data access patterns.
Proggasoft Dhaka, Bangladesh
Software Engineer Mar 2015 - Aug 2016
  • Developed and maintained production features for DevSkill.com using ASP.NET MVC.
  • Built backend services, database integrations, and web interfaces for user-facing platform features.
  • Applied SOLID principles, design patterns, and clean-code practices to improve maintainability and extensibility.
  • Worked across application logic, UI updates, and relational database operations in a production web environment.

Selected Projects

Cognix Generalized Multi-Reference Image Fusion and Editing System
Lead AI Engineer Active R&D / In Development
  • Started and architected Cognix, an ongoing generalized image fusion and editing system that takes a prompt and 1-3 reference images to generate a single coherent output image.
  • Designed the system for prompt-guided multi-image conditioning tasks such as product visualization, identity-preserving edits, scene composition, style transfer, garment replacement, and object-level image manipulation.
  • Building the model strategy around Qwen-Image-Edit-2509, using its native multi-image input, Qwen2.5-VL encoder for image understanding, MMDiT transformer for image generation and fusion, and frozen VAE for decoding.
  • Implemented a config-driven fine-tuning pipeline using QLoRA with 4-bit NF4 quantization, enabling a ~20B model to train on a 24 GB RTX 4090, with bf16/H100 support through configuration changes.
  • Developed variable-length multi-reference dataset support for 1, 2, and 3 reference images, using semantic instructions, attribute-rich prompts, and preservation clauses.
  • Built training, validation, evaluation, and inference workflows, including dataset validation, model loading, LoRA training, checkpoint evaluation, and prompt-based generation.
  • Verified end-to-end training on an RTX 4090 with a successful micro-train run, 94M trainable LoRA parameters, and checkpoint generation.
  • Integrated inference through ComfyUI using FP8 model execution, API-based graph submission, multi-image upload, VRAM controls, and LoRA-enabled generation.
  • Currently evaluating base-model behavior across multi-reference editing tasks before building the full fine-tuning dataset.
Stack
  • Qwen-Image-Edit-2509
  • Qwen2.5-VL
  • MMDiT
  • QLoRA
  • LoRA
  • NF4 Quantization
  • PyTorch
  • Diffusers
  • ComfyUI
  • FP8 Inference
  • RTX 4090
  • H100 Config Support
The Fitting Room Unified Virtual Try-On Platform
Ideation to Implementation Lead The KOW Company
  • Conceived and led the end-to-end development of a unified virtual try-on platform where shoppers can search across brands, try products on with AI, and purchase from one place.
  • Fine-tuned Qwen with LoRA for garment understanding, catalog matching, and product recommendation across diverse apparel categories.
  • Built AI-powered 2D virtual try-on workflows for apparel such as t-shirts, jeans, dresses, shoes, and sarees.
  • Onboarded 170+ brands, including major international names, into the platform’s searchable product catalog.
  • Reached a 16% purchase conversion rate, reflecting strong shopper confidence from try-before-buy experiences.
Stack
  • Qwen
  • LoRA
  • PyTorch
  • FastAPI
  • Redis
  • Docker
  • Google Cloud VM
  • Amazon S3
  • DynamoDB
Omnimage.ai AI Image Generation and Editing Engine
Lead AI Engineer The KOW Company
  • Led development of Omnimage.ai, an AI-powered image generation and editing engine for scalable product-image workflows.
  • Fine-tuned LLaMA-based components to support high-volume image-processing requests with 10K+ daily requests and sub-18-second inference latency.
  • Built production inference APIs for image editing, background processing, and integration with downstream platforms.
  • Integrated Omnimage.ai into automated image-processing workflows, including background removal and enhancement tasks used by enterprise e-commerce operations.
Stack
  • LLaMA
  • PyTorch
  • FastAPI
Athing AI-Powered Product and Catalog Intelligence Platform
Lead AI Engineer The KOW Company
  • Led development of an AI-powered catalog intelligence platform for product understanding, search, and structured attribute extraction.
  • Built workflows for product classification, metadata enrichment, and semantic matching across large e-commerce catalogs.
  • Applied computer vision and language-model pipelines to improve product discovery, catalog quality, and downstream recommendation workflows.
  • Designed API-driven services for integration with product catalogs, content systems, and client-facing applications.
Stack
  • Computer Vision
  • LLMs
  • PyTorch
  • FastAPI
  • Vector Search
HoloSnap.ai Multi-View 3D Reconstruction Platform
Lead AI Engineer
  • Built a reconstruction pipeline that converts 360° video and multi-view image sequences into high-quality 3D meshes in GLB and OBJ formats.
  • Implemented multi-view geometry, camera pose estimation, and structure-from-motion workflows for 3D asset generation.
  • Reduced manual 3D editing effort by 40% through improved automated reconstruction.
  • Enabled scalable 3D asset creation for e-commerce clients, lowering operational costs by 35%.
Stack
  • PyTorch
  • Open3D
  • COLMAP
  • Docker
Retouched.ai Object Detection and Segmentation Platform
Lead AI Engineer

BASIS National ICT Award 2020 Champion · APICTA 2021 Finalist

  • Built a salient-object-detection model reaching 96.23% accuracy using the HCE metric.
  • Improved segmentation accuracy by 17% and processing speed by 30% compared with existing commercial tools.
  • Optimized inference to process 257 MB images in 2.27 seconds using RSU blocks and advanced pooling.
  • Scaled the platform to 5.7M+ paid images processed globally, with 8K-11K images processed per day for enterprise e-commerce clients.
Stack
  • PyTorch
  • TensorFlow
  • U-Net variants
  • FastAPI
  • Redis
  • Docker
  • GCP
Enterprise AI Site Audit Platform AI-Powered Catalog, Image, and Content Quality Audit System
Lead AI Engineer Major US Retail Client · In Development
  • Leading development of an AI site-audit platform for a major US retail client, automating quality checks across product imagery, catalog data, and backend content health.
  • Built image-audit workflows to detect inappropriate content, sheer-garment visibility, offensive tattoos, vulgar copy, crop inconsistencies, retouching defects, face-crop issues, and duplicate or near-duplicate images.
  • Developed data-audit capabilities to validate product copy, color and style attribution, product-type metadata, image origination tags, and DAM/CMS integration requirements.
  • Designed backend audit checks for inventory gaps, broken image and swatch links, alt-text quality, and pages requiring updates or removal.
Stack
  • Computer Vision
  • NLP
  • PyTorch
  • FastAPI
Contextual Key-Phrase Spotting from Audio Audio Intelligence and Customer-Service QA System
Technical Lead The KOW Company
  • Deployed SeamlessM4T for audio-to-text transcription in customer-service workflows.
  • Integrated Llama 2 to extract contextual key phrases from transcribed conversations.
  • Reduced manual transcription effort by 80%, enabling faster quality assurance and review.
  • Built API workflows for transcription, phrase extraction, and downstream QA integration.
Stack
  • SeamlessM4T
  • Llama 2
  • PyTorch
  • FastAPI
KTM Order Management and Image Processing Pipeline
System Design Lead The KOW Company

Production client: leading European luxury resale marketplace

  • Designed and built KTM, a .NET and SQL Server platform that automates the full image-processing workflow from client FTP ingest to final delivery.
  • Architected a 12-step production pipeline covering FTP upload, order creation, editor auto-assignment, AI background removal through Omnimage.ai, editor review, QC approval, and automated FTP delivery.
  • Processed 4.5M+ images to date at approximately 200K images per month and 5K-8K images per day.
  • Achieved an average batch turnaround of approximately 65 minutes through automated workflow routing and reduced manual handoffs.
  • Built the order-management core to distribute files across active editors, collect outputs, and support client-facing QC approval.
Stack
  • .NET
  • SQL Server
  • FTP
  • Omnimage.ai
  • Retouched.ai

Publications

Stroke-Level Connectivity Verification: Grounding Vision-Language Models Against Topology Hallucination in Diagram Understanding (ICDAR 2026)

Abdullah Ibne Hanif Arean, Niamul Hassan Samin, Md Arifur Rahman, Renu Akter Suity, Juena Ahmed Noshin, and Md Ashikur Rahman*

Automated Detection of Diabetic Retinopathy using Deep Residual Learning (IJCA, 2020)

Md Ashikur Rahman, Md Arifur Rahman, Juena Noshin Ahmed

Preprints

Step-Level Visual Grounding Faithfulness Predicts Out-of-Distribution Generalization in Long-Horizon Vision-Language Models (arXiv, 2026)

Md Ashikur Rahman, Md Arifur Rahman, Niamul Hassan Samin, Abdullah Ibne Hanif Arean, Juena Ahmed Noshin

Beyond Dominant Patches: Spatial Credit Redistribution for Grounded Vision-Language Models (arXiv, 2026)

Niamul Hassan Samin, Md Arifur Rahman, Abdullah Ibne Hanif Arean, Juena Ahmed Noshin, Md Ashikur Rahman

Under Review

Manuscripts on prompt safety in vision-language and language models

2 submissions currently under peer review

Full publication list on Google Scholar →

Technical Skills

Core AI
PyTorch · Vision-Language Models · Large Language Models · Computer Vision · Multimodal Learning · LoRA / QLoRA Fine-tuning · 3D Reconstruction
Generative AI & VLMs
Qwen / Qwen-VL · LLaMA · Diffusion Models · Image Editing & Generation · Multi-Reference Image Conditioning · Prompt Safety · Hallucination Evaluation · Visual Grounding
Computer Vision
Object Detection · Semantic / Instance Segmentation · Salient Object Detection · Pose Estimation · Multi-View Geometry · Structure-from-Motion · NeRF · COLMAP · OpenCV
Model Training & Evaluation
Fine-tuning Pipelines · A/B Testing · Model Evaluation · Dataset Validation · Optimization · Transfer Learning · Weights & Biases
Backend, MLOps & Cloud
FastAPI · Docker · Redis · CI/CD · Model Serving · Google Cloud Platform · AWS · Azure · ComfyUI
Software & Data
Python · C# / .NET · SQL Server · PostgreSQL · MongoDB · DynamoDB · Vector Databases

Honors and Awards