AI Engineer · Frontend Engineer · Dhaka, Bangladesh
I’m Saumik Saha Kabbya, a CSE graduate crafting LLM and computer vision systems, then shaping them into polished web experiences. I care about speed, clarity, and accessibility from prototype to launch.
Focus
LLMs & Vision
Stack
FastAPI · Next.js
Signature
Fast, tactile, human-friendly UX.

Local-first vision assistance, LLM-powered recommendation systems, and multi-conversation chat platforms.
Syncing with GitHub…
Core stack
Live
Real-time public GitHub activity and runtime signal from this deployed site.
Active repo
Loading...Last push
Loading...
Pushes (7d)
Loading...
Push streak
Loading...
Public repos
Loading...
Site RTT
Loading...
Syncing telemetry...
Experience
I work across backend intelligence and frontend polish, turning models into clear, responsive products.
AI Engineer & Frontend Engineer · Mar 2025 · Sept 2025
Built AI and computer vision solutions using Remote Python, FastAPI, Groq AI, Ultralytics YOLO, and PyTorch, paired with Next.js for production-grade UIs.
Education & Capstone
Balancing core CS fundamentals with hands-on AI systems built for real-world impact.
BSc in Computer Science & Engineering · Jan 2022 · Present
Focused on intelligent systems, full-stack engineering, and applied AI research.
Jan 2025 · Present · Vision assistant for the visually impaired
Local-first vision assistant with ESP32-CAM wearable, YOLO26s and MediaPipe Active Guidance, BLIP/LLM scene narration, fall detection with guardian alerts, and a hands-free Web Speech API interface—87% object retrieval success rate.
Skills
From systems and models to UI and product polish, I keep the stack tight and the delivery fast.
Projects
AI-driven tools, recommendation systems, and accessibility-first products that ship with clarity and speed.
Local-first vision assistant with ESP32-CAM wearable, YOLO26s and MediaPipe Active Guidance, BLIP/LLM scene narration, fall detection with guardian alerts, and a hands-free Web Speech API interface—87% object retrieval success rate.
AI-powered legal document understanding and grounded drafting with OCR, RAG, evidence-grounded draft generation, and iterative improvement from operator edits.
Personalized recommendations for games, anime, TV series, and movies using Groq LLMs, TVDB for up-to-date metadata, and Supabase.
Document study tool that transforms documents into interactive guides with summaries, QA, chat, and TTS via Groq and ElevenLabs.
Multi-conversation AI chatbot platform with contextual sub-conversations and modular backend design.
Website for Hazel Studio BD with a modern frontend, Supabase-backed services, and a secure Postgres data layer.
Content-based game recommendation engine using Bag of Words, TF-IDF, and SVM to suggest similar video games with likelihood predictions.
Contact
Open to collaborations in AI, computer vision, and product-focused web builds. Share a brief and I’ll reply fast.
Vibe With The Dev
Put this on while you scroll or write. It’s the playlist that keeps the build focused and the ideas flowing.