Foundations
Computer Vision
Object detection, segmentation, YOLO, CLIP, and vision-language models
Skills in this topic
3 skills — Sign in to track your progress
Showing 225 reads from curated sources

Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
2h ago
Traffic Light Recognition (TLR) Architecture: 2D Bounding Box Detection
The TLR model is a Fully Convolutional Network (FCN) + FPN + Header model, utilizing an “anchor-free” approach. Instead of guessing… Continue reading on Medium

Medium · AI
👁️ Computer Vision
⚡ AI Lesson
4h ago
2D Gaussian Splatting: when removing a dimension makes 3D better
Why 3D Gaussians fail at surfaces, and how flat disks fix it Continue reading on Medium »

Dev.to · Abdullah Fiaz
👁️ Computer Vision
⚡ AI Lesson
12h ago
"Mastering Digital Logic Counters with C++ OOP: A Hands-On Guide”
Introduction Digital logic counters are fundamental in electronics and computing. They track events,...
Medium · Programming
👁️ Computer Vision
⚡ AI Lesson
2d ago
Como o pensamento computacional me ajudou a estruturar minhas entregas
Há um bom tempo venho tentando entrar, bem aos poucos, no mundo da programação. Continue reading on Tatiane Marina »

IEEE Spectrum
👁️ Computer Vision
⚡ AI Lesson
2d ago
Manchester Code Made Bits Behave
In the late 1940s—when computer engineers were grappling with unreliable hardware and noisy transmission environments—a team of engineers inside a modest lab at
Dev.to AI
👁️ Computer Vision
⚡ AI Lesson
2d ago
Why Your Image Upload Pipeline Should Check for Physically Impossible Lighting
Why Your Image Upload Pipeline Should Check for Physically Impossible Lighting If you're building user-generated content platforms, marketplace verification sys

Dev.to · yubin yang
👁️ Computer Vision
⚡ AI Lesson
2d ago
Rasterization Using Bresenham Algorithm and Scanline Algorithm
1. Overview Bresenham algorithm is the fastest algorithm for drawing straight lines on a...

Dev.to · Alessandro Binda
👁️ Computer Vision
⚡ AI Lesson
4d ago
OCR Intelligente per Documenti Aziendali: Architettura e Lezioni dal Campo
L'OCR (Optical Character Recognition) per testo stampato moderno è un problema risolto da decenni....

Medium · AI
👁️ Computer Vision
⚡ AI Lesson
4d ago
Computer Vision Yolculuğu — Gün 2: OpenCV ile Frame Üzerine Çizim Yapmak
Computer Vision projelerinde kameradan görüntü almak yalnızca ilk adımdır. Gerçek sistemlerde asıl önemli nokta, alınan frame’lerin… Continue reading on Medium

Medium · Programming
👁️ Computer Vision
⚡ AI Lesson
4d ago
Who Really Deserves To Be Called The Father Of The Internet
From ARPANET to the World Wide Web the Internet was built by a network of pioneers not one inventor Continue reading on IT Chronicles »

Dev.to · hassaan-syed
👁️ Computer Vision
⚡ AI Lesson
4d ago
Why Your Computer Reads Numbers Backwards: Byte Order Explained
What is Byte Order? Before understanding byte order, we need to understand one thing: A byte = 8...
Dev.to AI
👁️ Computer Vision
⚡ AI Lesson
4d ago
High Speed and Performance
High Speed and Performance C language is very fast because it is a compiled language. It converts code directly into machine language, so programs run quickly a
Dev.to AI
👁️ Computer Vision
⚡ AI Lesson
4d ago
Building a License Plate Recognition Engine in C++ — Part 2: Grayscale Image Preprocessing and Local Contrast Edge Detection
In the previous article, we loaded an image, converted it into grayscale, and introduced the core data structures used by the recognition engine. In this part,

Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
6d ago
Inside SAM 3D: how Meta turns a single image into 3D
For about forty years, “3D” in the practical sense meant one thing: triangle meshes. Every game shipped, every animated film rendered… Continue reading on Mediu

Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
6d ago
Inside SAM 3D: how Meta turns a single image into 3D
For about forty years, “3D” in the practical sense meant one thing: triangle meshes. Every game shipped, every animated film rendered… Continue reading on Mediu

Medium · Data Science
👁️ Computer Vision
⚡ AI Lesson
6d ago
Demystifying CNNs: How Convolutional Filters and Max-Pooling Actually Work
If you’ve ever wondered how a computer can look at a photo of a car and instantly know it’s a car, you’re looking at the magic of… Continue reading on Medium »
Dev.to AI
👁️ Computer Vision
⚡ AI Lesson
1w ago
Your "Biometric Age Check" Isn't Verifying Identity — And Defense Lawyers Know It
Understanding the distinction between biometric age estimation and identity verification For developers in the computer vision and biometrics space, the nuance

Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
1w ago
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
This project speaks for itself. It covers three crucial steps in one go: motion tracking, skeleton reconstruction, and 3D animation. What… Continue reading on M

Dev.to · BMBrick
👁️ Computer Vision
⚡ AI Lesson
1w ago
How I Built a Perceptual Color Quantization Engine for LEGO Mosaics
The Problem Converting a photo into a LEGO mosaic sounds simple: resize the image, find...

Medium · AI
👁️ Computer Vision
⚡ AI Lesson
1w ago
Computer Vision Is Rebuilding the Fitting Room
The models, the stack, the ROI — no fluff Continue reading on Medium »
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1w ago
Intelligent CCTV for Urban Design: AI-Based Analysis of Soft Infrastructure at Intersections
arXiv:2605.05402v1 Announce Type: new Abstract: Artificial intelligence (AI) and computer vision are transforming transportation data collection. This study int

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1w ago
Panduan Praktis Optimasi Pencahayaan Citra Digital dengan Python
Mengapa Pencahayaan Itu Krusial? Pernahkah Anda mengambil foto di kondisi minim cahaya dan mendapati hasilnya sangat gelap hingga… Continue reading on Medium »

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1w ago
Efficiency vs. Precision: A Python Deep Dive into Faster R-CNN and SSD PyTorch
In the rapidly evolving landscape of artificial intelligence, selecting the optimal architecture for computer vision is rarely a simple… Continue reading on Obj
Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1w ago
Dari Pixel ke AI: Bagaimana Komputer Memahami Sebuah Gambar
“Sebuah eksplorasi sederhana tentang bagaimana gambar digital diubah menjadi informasi yang dapat dipahami oleh Artificial Intelligence.”… Continue reading on M

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1w ago
Teaching a Random Forest to Tell Walking from Running: A Computer Vision Pipeline with Hand-Built...
How a 56-feature baseline became a 240-feature classifier at 86% accuracy, with per-class SHAP guiding every feature engineering decision. Continue reading on M

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1w ago
Mengenal Lebih Dekat Deteksi Tepi Canny Pada Pengolahan Citra Digital dengan python dan opencv
Dalam dunia pengolahan citra digital, mendeteksi batas suatu objek merupakan hal yang sangat penting. Continue reading on Medium »

Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
2w ago
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 4
Add multi-camera support to a Python face recognition system using threaded OpenCV capture for faster, non-blocking real-time video Continue reading on Medium »

Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
2w ago
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 4
Add multi-camera support to a Python face recognition system using threaded OpenCV capture for faster, non-blocking real-time video Continue reading on Medium »

Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
2w ago
Building an AI-Based Exam Monitoring System Using Computer Vision, YOLO, and OpenCV
Introduction Continue reading on Medium »

Dev.to · yqqwe
👁️ Computer Vision
⚡ AI Lesson
2w ago
Deconstructing the TikTok Media Stack: Building a High-Performance, No-Watermark Extraction Engine
Introduction As developers, we are often fascinated by how global-scale platforms manage...
Dev.to AI
👁️ Computer Vision
⚡ AI Lesson
2w ago
Light Fields — Deep Dive + Problem: Set Matrix Zeroes
A daily deep dive into cv topics, coding problems, and platform features from PixelBank . Topic Deep Dive: Light Fields From the Image-Based Rendering chapter I

Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
2w ago
IMPLEMENTING FASTER RCNN FROM SCRATCH IN PYTORCH FOR OBJECT DETECTION — PART ONE
Learning computer vision has been an exciting journey over the past few weeks. From data preprocessing to model evaluation, every new… Continue reading on Mediu
Dev.to AI
👁️ Computer Vision
⚡ AI Lesson
2w ago
How I Built a High-Precision AI Manga OCR Translator for Hardcore Readers
Most OCR tools are built for clean text. Receipts. Documents. Screenshots. Menus. Maybe a street sign if the lighting is kind. Manga is none of those things. A

Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
2w ago
What is Camera Calibration? How It Helps in Computer Vision Tasks
A ground truth guide to how cameras distort reality and why calibration is critical for accurate computer vision systems. Continue reading on Medium »

Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
2w ago
What is Camera Calibration? How It Helps in Computer Vision Tasks
A ground truth guide to how cameras distort reality and why calibration is critical for accurate computer vision systems. Continue reading on Medium »

Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
3w ago
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 2
Build real-time face recognition in Python with OpenCV, DeepFace, ArcFace embeddings, and live webcam-based identity matching. Continue reading on Medium »

Medium · AI
👁️ Computer Vision
⚡ AI Lesson
3w ago
Image Classification for AI: A Practical Guide for 2026
Practical guide to image classification for AI: learn how to manage datasets, ensure accuracy, and scale your computer vision projects. Continue reading on Medi

Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
3w ago
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 1
Build Samaritan, a Python real-time face recognition system using OpenCV, DeepFace, ArcFace, and multi-camera support. Continue reading on Medium »

Medium · Programming
👁️ Computer Vision
⚡ AI Lesson
3w ago
The First Program Was Not Just Code
From algebra to execution: what the first program actually describes Continue reading on Level Up Coding »
Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
3w ago
Is career in computer vision engineering a Dead-end ?
Until end of last year, despite LLMs on track for becoming world class SWE, I was still fairly confident about job security as a computer… Continue reading on M
Dev.to AI
👁️ Computer Vision
⚡ AI Lesson
3w ago
AI photo tagging app
Introducing a newly released AI photo tagging app for the iphone. More details on our website ( https://siwave.io ) and a link to the kickstarter project. We we

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
3w ago
From Factory Floor to Distributed System: Engineering a Real-Time Computer Vision Backend for…
Imagine you are on the floor of a battery manufacturing plant. Thousands of battery covers move down a conveyor every shift, each stamped… Continue reading on M

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
3w ago
What Re-Learning C Taught Me About the Code I Write Every Day
Each weekend my younger brothers and I join a Discord call for our weekly game nights. Although the primary activity is gaming, a close… Continue reading on Cof
Dev.to AI
👁️ Computer Vision
⚡ AI Lesson
4w ago
DeepID-Net: multi-stage and deformable deep convolutional neural networks forobject detection

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
4w ago
Revolutionizing Geospatial Data: Architecting Automated and Real-Time GeoAI Pipelines
Moving beyond static GIS to build predictive, event-driven spatial systems using advanced Computer Vision, streaming data, and edge… Continue reading on DataEng

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Bilgisayarın Gözleri #2 — Görüntülerin Mutfağı: Pikseller, Matrisler ve Kanallar
Bir önceki bölümde görüntü işlemeye hızlı bir giriş yapmış ve OpenCV ile ilk fotoğrafımızı ekrana yansıtmıştık. “Bilgisayar görüntüyü… Continue reading on HUAWE
Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Beyond Bounding Boxes: Achieving Cinematic Reframing via YOLOv11 Instance Segmentation
The transition from 16:9 landscape to 9:16 vertical video is often treated as a simple cropping problem. In most automated workflows, the… Continue reading on M
Dev.to AI
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Efficient Pipeline for Camera Trap Image Review
DeepCamp AI