✕ Clear filters
654 lessons

👁️ Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

All ▶ YouTube 176,102📚 Coursera 15,979
How I Built an AI Guitar Teacher | Learn To Use AI with Live Video
Computer Vision
How I Built an AI Guitar Teacher | Learn To Use AI with Live Video
Roboflow Beginner 2d ago
Learn Drone Programming with Python – Tutorial
Computer Vision
Learn Drone Programming with Python – Tutorial
freeCodeCamp.org Beginner 4d ago
Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik
Computer Vision
Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik
Roboflow Beginner 2w ago
AI Powered Surveillance System for India
Computer Vision
AI Powered Surveillance System for India
AI Anytime Beginner 3w ago
Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023
Computer Vision
Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023
Moz Beginner 3w ago
OpenClaw Explained: Create AI Agents Without Coding (Full Intro)
Computer Vision
OpenClaw Explained: Create AI Agents Without Coding (Full Intro)
Muhammad Moin Beginner 3w ago
V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs
Computer Vision
V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs
AI Podcast Series. Byte Goose AI. Beginner 3w ago
Is Benjamin Netanyahu an AI clone?
Computer Vision
Is Benjamin Netanyahu an AI clone?
The TensorFlow Channel Beginner 3w ago
Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step
Computer Vision
Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step
Muhammad Moin Beginner 3w ago
Jueves de Quack con Nerdearla
Computer Vision
Jueves de Quack con Nerdearla
GitHub Beginner 1mo ago
Image Search Engine in Python - Multimodal Embeddings
Computer Vision
Image Search Engine in Python - Multimodal Embeddings
NeuralNine Beginner 1mo ago
Pegasus by TwelveLabs #ai #video #patternrecognition #tech #explained #llm #imagerecognition
Computer Vision
Pegasus by TwelveLabs #ai #video #patternrecognition #tech #explained #llm #imagerecognition
Jessica Wang Beginner 1mo ago
What Is Multimodal AI? Real-World Examples
Computer Vision
What Is Multimodal AI? Real-World Examples
Coursera Beginner 1mo ago
Full Speaking Course 2026: Duolingo English Test
Computer Vision
Full Speaking Course 2026: Duolingo English Test
Teacher Luke - Duolingo English Test Beginner 1mo ago
Music AI Sandbox | AI x Creativity: Wyclef Jean
Computer Vision
Music AI Sandbox | AI x Creativity: Wyclef Jean
Google DeepMind Beginner 1mo ago
One Open AI Model Built My Website, Image & Video
Computer Vision
One Open AI Model Built My Website, Image & Video
Analytics Vidhya Beginner 1mo ago
Page Match lets you quickly sync your spot in a physical or ebook with an audiobook.
Computer Vision
Page Match lets you quickly sync your spot in a physical or ebook with an audiobook.
The Verge Beginner 2mo ago
An image is worth NxN words | Diffusion Transformers (ViT, DiT, MMDiT)
Computer Vision
An image is worth NxN words | Diffusion Transformers (ViT, DiT, MMDiT)
Julia Turc Beginner 2mo ago
Full Duolingo English Test with Answers: January 2026 Format
Computer Vision
Full Duolingo English Test with Answers: January 2026 Format
Teacher Luke - Duolingo English Test Beginner 2mo ago
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks  Learning Canonical Embeddings for Human Heads
Computer Vision
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks Learning Canonical Embeddings for Human Heads
Cohere Beginner 2mo ago
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Computer Vision
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Analytics Vidhya Beginner 3mo ago
What does AI mean for education?
Computer Vision
What does AI mean for education?
Anthropic Beginner 3mo ago
Why Vision Language Models Ignore What They See [Munawar Hayat] - 758
Computer Vision
Why Vision Language Models Ignore What They See [Munawar Hayat] - 758
TWIML AI Podcast Beginner 4mo ago
Why are Transformers replacing CNNs?
Computer Vision
Why are Transformers replacing CNNs?
Julia Turc Beginner 4mo ago
Should AI be introduced to kids early?  #podcast #interview
Computer Vision
Should AI be introduced to kids early? #podcast #interview
Abhishek Thakur Beginner 4mo ago
Multimodal and Multi-model AI in Action
Computer Vision
Multimodal and Multi-model AI in Action
Microsoft 365 Developer Beginner 4mo ago
A no nonsense intro to BM25
Computer Vision
A no nonsense intro to BM25
Abhishek Thakur Beginner 4mo ago
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
Computer Vision
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
PyData Beginner 4mo ago
Choosing Your Path: AI Professional Program Course Selection Guide
Computer Vision
Choosing Your Path: AI Professional Program Course Selection Guide
Stanford Online Beginner 5mo ago
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
Computer Vision
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
The Information Beginner 5mo ago
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
Computer Vision
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
PyTorch Beginner 5mo ago
OneDrive’s AI is scanning your PHOTOS
Computer Vision
OneDrive’s AI is scanning your PHOTOS
David Bombal Beginner 5mo ago
Is YOLO26 Faster Than YOLO11? Full Comparison & Results
Computer Vision
Is YOLO26 Faster Than YOLO11? Full Comparison & Results
Muhammad Moin Beginner 1mo ago
Helping Sports Teams Improve Decision Making with AI: Interview with PlayVision's Marc Zoghby
Computer Vision
Helping Sports Teams Improve Decision Making with AI: Interview with PlayVision's Marc Zoghby
Roboflow Beginner 2mo ago
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Computer Vision
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Analytics Vidhya Beginner 3mo ago
AI for Occupancy Analytics | Building a Smart Parking System
Computer Vision
AI for Occupancy Analytics | Building a Smart Parking System
Roboflow Beginner 4mo ago
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Computer Vision
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Muhammad Moin Beginner 4mo ago
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Computer Vision
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Muhammad Moin Beginner 4mo ago
I Took the Duolingo English Test and Here’s What Happened
Computer Vision
I Took the Duolingo English Test and Here’s What Happened
Teacher Luke - Duolingo English Test Beginner 4mo ago
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Computer Vision
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Muhammad Moin Beginner 4mo ago
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Computer Vision
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Roboflow Beginner 4mo ago
Vibe + VSCode + Codex = Search UI
Computer Vision
Vibe + VSCode + Codex = Search UI
Abhishek Thakur Beginner 4mo ago
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
Computer Vision
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
Roboflow Beginner 4mo ago
Vision AI in a Web Browser: Creating a Scavenger Hunt App with Inference.js
Computer Vision
Vision AI in a Web Browser: Creating a Scavenger Hunt App with Inference.js
Roboflow Beginner 5mo ago
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Computer Vision
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Muhammad Moin Beginner 5mo ago
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Computer Vision
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Roboflow Beginner 5mo ago
How The Field Museum Unlocks New Research Possibilities with Vision AI
Computer Vision
How The Field Museum Unlocks New Research Possibilities with Vision AI
Roboflow Beginner 5mo ago
How Jesai Scored a Perfect 160 on the Duolingo English Test (DET)!
Computer Vision
How Jesai Scored a Perfect 160 on the Duolingo English Test (DET)!
Teacher Luke - Duolingo English Test Beginner 5mo ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Camera and Imaging
📚 Coursera Course ↗
Self-paced
Camera and Imaging
Opens on Coursera ↗
Future of data and technology in football
📚 Coursera Course ↗
Self-paced
Future of data and technology in football
Opens on Coursera ↗
Entendiendo la depresión a lo largo del ciclo vital
📚 Coursera Course ↗
Self-paced
Entendiendo la depresión a lo largo del ciclo vital
Opens on Coursera ↗
Python Project: Software Engineering and Image Manipulation
📚 Coursera Course ↗
Self-paced
Python Project: Software Engineering and Image Manipulation
Opens on Coursera ↗
Deploy & Evaluate Vision Models Effectively
📚 Coursera Course ↗
Self-paced
Deploy & Evaluate Vision Models Effectively
Opens on Coursera ↗
Build Real-Time Face Recognition with OpenCV
📚 Coursera Course ↗
Self-paced
Build Real-Time Face Recognition with OpenCV
Opens on Coursera ↗