Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,346 lessons
Skills in this topic
CV Basics (beginner): Classify images with a pre-trained CNN
Modern CV Models (intermediate): Run YOLO for real-time object detection
Generative CV (advanced): Build a Stable Diffusion inference pipeline

Showing 225 reads from curated sources

Traffic Light Recognition (TLR) Architecture: 2D Bounding Box Detection
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 1h ago
The TLR model is a Fully Convolutional Network (FCN) + FPN + Header model, utilizing an “anchor-free” approach. Instead of guessing… Continue reading on Medium
2D Gaussian Splatting: when removing a dimension makes 3D better
Medium · AI 👁️ Computer Vision ⚡ AI Lesson 3h ago
Why 3D Gaussians fail at surfaces, and how flat disks fix it Continue reading on Medium »
“Mastering Digital Logic Counters with C++ OOP: A Hands-On Guide”
Dev.to · Abdullah Fiaz 👁️ Computer Vision ⚡ AI Lesson 12h ago
Introduction Digital logic counters are fundamental in electronics and computing. They track events,...
How computational thinking helped me structure my deliverables
Medium · Programming 👁️ Computer Vision ⚡ AI Lesson 2d ago
For quite a while now I have been trying to enter, little by little, the world of programming. Continue reading on Tatiane Marina »
Manchester Code Made Bits Behave
IEEE Spectrum 👁️ Computer Vision ⚡ AI Lesson 2d ago
In the late 1940s—when computer engineers were grappling with unreliable hardware and noisy transmission environments—a team of engineers inside a modest lab at
Dev.to AI 👁️ Computer Vision ⚡ AI Lesson 2d ago
Why Your Image Upload Pipeline Should Check for Physically Impossible Lighting
If you're building user-generated content platforms, marketplace verification sys
Rasterization Using Bresenham Algorithm and Scanline Algorithm
Dev.to · yubin yang 👁️ Computer Vision ⚡ AI Lesson 2d ago
1. Overview Bresenham algorithm is the fastest algorithm for drawing straight lines on a...
Intelligent OCR for Business Documents: Architecture and Lessons from the Field
Dev.to · Alessandro Binda 👁️ Computer Vision ⚡ AI Lesson 4d ago
OCR (Optical Character Recognition) for modern printed text has been a solved problem for decades....
Computer Vision Journey, Day 2: Drawing on Frames with OpenCV
Medium · AI 👁️ Computer Vision ⚡ AI Lesson 4d ago
In computer vision projects, capturing images from the camera is only the first step. In real systems, the truly important point concerns the captured frames… Continue reading on Medium
Who Really Deserves To Be Called The Father Of The Internet
Medium · Programming 👁️ Computer Vision ⚡ AI Lesson 4d ago
From ARPANET to the World Wide Web, the Internet was built by a network of pioneers, not one inventor. Continue reading on IT Chronicles »
Why Your Computer Reads Numbers Backwards: Byte Order Explained
Dev.to · hassaan-syed 👁️ Computer Vision ⚡ AI Lesson 4d ago
What is Byte Order? Before understanding byte order, we need to understand one thing: A byte = 8...
Dev.to AI 👁️ Computer Vision ⚡ AI Lesson 4d ago
High Speed and Performance
The C language is very fast because it is a compiled language. It converts code directly into machine language, so programs run quickly a
Dev.to AI 👁️ Computer Vision ⚡ AI Lesson 4d ago
Building a License Plate Recognition Engine in C++ — Part 2: Grayscale Image Preprocessing and Local Contrast Edge Detection
In the previous article, we loaded an image, converted it into grayscale, and introduced the core data structures used by the recognition engine. In this part,
Inside SAM 3D: how Meta turns a single image into 3D
Medium · Machine Learning, Deep Learning 👁️ Computer Vision ⚡ AI Lesson 6d ago
For about forty years, “3D” in the practical sense meant one thing: triangle meshes. Every game shipped, every animated film rendered… Continue reading on Mediu
Demystifying CNNs: How Convolutional Filters and Max-Pooling Actually Work
Medium · Data Science 👁️ Computer Vision ⚡ AI Lesson 6d ago
If you’ve ever wondered how a computer can look at a photo of a car and instantly know it’s a car, you’re looking at the magic of… Continue reading on Medium »
Dev.to AI 👁️ Computer Vision ⚡ AI Lesson 1w ago
Your "Biometric Age Check" Isn't Verifying Identity — And Defense Lawyers Know It
Understanding the distinction between biometric age estimation and identity verification. For developers in the computer vision and biometrics space, the nuance
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 1w ago
This project speaks for itself. It covers three crucial steps in one go: motion tracking, skeleton reconstruction, and 3D animation. What… Continue reading on M
How I Built a Perceptual Color Quantization Engine for LEGO Mosaics
Dev.to · BMBrick 👁️ Computer Vision ⚡ AI Lesson 1w ago
The Problem Converting a photo into a LEGO mosaic sounds simple: resize the image, find...
Computer Vision Is Rebuilding the Fitting Room
Medium · AI 👁️ Computer Vision ⚡ AI Lesson 1w ago
The models, the stack, the ROI — no fluff Continue reading on Medium »
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago
Intelligent CCTV for Urban Design: AI-Based Analysis of Soft Infrastructure at Intersections
arXiv:2605.05402v1. Abstract: Artificial intelligence (AI) and computer vision are transforming transportation data collection. This study int
A Practical Guide to Optimizing Digital Image Lighting with Python
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1w ago
Why Is Lighting Crucial? Have you ever taken a photo in low-light conditions and found the result so dark that… Continue reading on Medium »
Efficiency vs. Precision: A Python Deep Dive into Faster R-CNN and SSD PyTorch
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1w ago
In the rapidly evolving landscape of artificial intelligence, selecting the optimal architecture for computer vision is rarely a simple… Continue reading on Obj
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1w ago
From Pixels to AI: How Computers Understand an Image
“A simple exploration of how digital images are transformed into information that Artificial Intelligence can understand.”… Continue reading on M
Teaching a Random Forest to Tell Walking from Running: A Computer Vision Pipeline with Hand-Built...
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1w ago
How a 56-feature baseline became a 240-feature classifier at 86% accuracy, with per-class SHAP guiding every feature engineering decision. Continue reading on M
A Closer Look at Canny Edge Detection in Digital Image Processing with Python and OpenCV
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1w ago
In the world of digital image processing, detecting the boundaries of an object is very important. Continue reading on Medium »
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 4
Medium · Machine Learning, Deep Learning 👁️ Computer Vision ⚡ AI Lesson 2w ago
Add multi-camera support to a Python face recognition system using threaded OpenCV capture for faster, non-blocking real-time video Continue reading on Medium »
Building an AI-Based Exam Monitoring System Using Computer Vision, YOLO, and OpenCV
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 2w ago
Introduction Continue reading on Medium »
Deconstructing the TikTok Media Stack: Building a High-Performance, No-Watermark Extraction Engine
Dev.to · yqqwe 👁️ Computer Vision ⚡ AI Lesson 2w ago
Introduction As developers, we are often fascinated by how global-scale platforms manage...
Dev.to AI 👁️ Computer Vision ⚡ AI Lesson 2w ago
Light Fields — Deep Dive + Problem: Set Matrix Zeroes
A daily deep dive into CV topics, coding problems, and platform features from PixelBank. Topic Deep Dive: Light Fields From the Image-Based Rendering chapter I
IMPLEMENTING FASTER RCNN FROM SCRATCH IN PYTORCH FOR OBJECT DETECTION — PART ONE
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 2w ago
Learning computer vision has been an exciting journey over the past few weeks. From data preprocessing to model evaluation, every new… Continue reading on Mediu
Dev.to AI 👁️ Computer Vision ⚡ AI Lesson 2w ago
How I Built a High-Precision AI Manga OCR Translator for Hardcore Readers
Most OCR tools are built for clean text. Receipts. Documents. Screenshots. Menus. Maybe a street sign if the lighting is kind. Manga is none of those things. A
What is Camera Calibration? How It Helps in Computer Vision Tasks
Medium · Machine Learning, Deep Learning 👁️ Computer Vision ⚡ AI Lesson 2w ago
A ground truth guide to how cameras distort reality and why calibration is critical for accurate computer vision systems. Continue reading on Medium »
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 2
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 3w ago
Build real-time face recognition in Python with OpenCV, DeepFace, ArcFace embeddings, and live webcam-based identity matching. Continue reading on Medium »
Image Classification for AI: A Practical Guide for 2026
Medium · AI 👁️ Computer Vision ⚡ AI Lesson 3w ago
Practical guide to image classification for AI: learn how to manage datasets, ensure accuracy, and scale your computer vision projects. Continue reading on Medi
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 1
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 3w ago
Build Samaritan, a Python real-time face recognition system using OpenCV, DeepFace, ArcFace, and multi-camera support. Continue reading on Medium »
The First Program Was Not Just Code
Medium · Programming 👁️ Computer Vision ⚡ AI Lesson 3w ago
From algebra to execution: what the first program actually describes Continue reading on Level Up Coding »
Is a career in computer vision engineering a dead end?
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 3w ago
Until the end of last year, despite LLMs being on track to become world-class SWEs, I was still fairly confident about job security as a computer… Continue reading on M
Dev.to AI 👁️ Computer Vision ⚡ AI Lesson 3w ago
AI photo tagging app
Introducing a newly released AI photo tagging app for the iPhone. More details on our website (https://siwave.io) and a link to the Kickstarter project. We we
From Factory Floor to Distributed System: Engineering a Real-Time Computer Vision Backend for…
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 3w ago
Imagine you are on the floor of a battery manufacturing plant. Thousands of battery covers move down a conveyor every shift, each stamped… Continue reading on M
What Re-Learning C Taught Me About the Code I Write Every Day
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 3w ago
Each weekend my younger brothers and I join a Discord call for our weekly game nights. Although the primary activity is gaming, a close… Continue reading on Cof
Dev.to AI 👁️ Computer Vision ⚡ AI Lesson 4w ago
DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection
Revolutionizing Geospatial Data: Architecting Automated and Real-Time GeoAI Pipelines
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 4w ago
Moving beyond static GIS to build predictive, event-driven spatial systems using advanced Computer Vision, streaming data, and edge… Continue reading on DataEng
The Computer's Eyes #2 - The Kitchen of Images: Pixels, Matrices, and Channels
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
In the previous part, we made a quick introduction to image processing and displayed our first photo on screen with OpenCV. “The computer … the image… Continue reading on HUAWE
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Beyond Bounding Boxes: Achieving Cinematic Reframing via YOLOv11 Instance Segmentation
The transition from 16:9 landscape to 9:16 vertical video is often treated as a simple cropping problem. In most automated workflows, the… Continue reading on M
Dev.to AI 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Efficient Pipeline for Camera Trap Image Review