Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1189 lessons
As we outsource more to smart home gadgets, have we thought about how we’d react in their place?
Computer Vision
The Verge Intermediate 4mo ago
Real Time AI Video Object Tracking! 💥EdgeTAM - Sam 2 for On-Device 🔥
Computer Vision
1littlecoder Intermediate 4mo ago
How to Create a Profitable Paid Search Strategy for 2026
Computer Vision
Exposure Ninja Intermediate 5mo ago
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Computer Vision
Muhammad Moin Beginner 5mo ago
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
Computer Vision
The Information Beginner 5mo ago
Genomcore impulsa la investigación biomédica con AWS e IA | Amazon Web Services
Computer Vision
Amazon Web Services Advanced 5mo ago
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
Computer Vision
PyTorch Beginner 5mo ago
Behind the scenes yesterday @ an NFL game for a collab with the Detroit Lions for my Pokémon channel
Computer Vision
Pat Flynn Intermediate 5mo ago
Building, learning and teaching with AI (w/ Parul Pandey)
Computer Vision
Abhishek Thakur Beginner 5mo ago
How to Stay Relevant in AI & Data Science (w/ Alexey Grigorev)
Computer Vision
Abhishek Thakur Intermediate 5mo ago
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Computer Vision
Roboflow Beginner 5mo ago
Where Hazel is at and what we've been up to // October 2025 Hazel Dev Log
Computer Vision
The Cherno Intermediate 5mo ago
Multimodal Data Analysis with AI
Computer Vision
Latent Space Intermediate 5mo ago
Generate Image Captions That Focus on What You Need
Computer Vision
NVIDIA Developer Intermediate 5mo ago
How The Field Museum Unlocks New Research Possibilities with Vision AI
Computer Vision
Roboflow Beginner 5mo ago
Meta Engineer on Industrial Computer Vision systems
Computer Vision
MLOps.community Intermediate 5mo ago
Duolingo English Test - NEW Complete Practice Test with Answers
Computer Vision
Teacher Luke - Duolingo English Test Intermediate 5mo ago
Ashmal Vayani - Seeing the World as It Speaks Multilingual, Culturally Aware Multimodal AI
Computer Vision
Cohere Advanced 5mo ago
OneDrive’s AI is scanning your PHOTOS
Computer Vision
David Bombal Beginner 5mo ago
The SECRET to Hyper Segmentation (and Sales)
Computer Vision
Optimum7 Intermediate 5mo ago
How Jesai Scored a Perfect 160 on the Duolingo English Test (DET)!
Computer Vision
Teacher Luke - Duolingo English Test Beginner 5mo ago
Salary of Computer Vision Engineer | How Much does a Computer Vision Engineer Make?
Computer Vision
Simplilearn Beginner 5mo ago
🚨 Smart AI for Wildlife & Traffic Safety! 🐘🚦
Computer Vision
Arivi by HCL GUVI Beginner 5mo ago
The Verge’s Vee Song joins us on The Vergecast to chat about Peloton’s approach to AI. #vergecast
Computer Vision
The Verge Intermediate 6mo ago
Discover Web AI: Client side Agents, Gen AI, and machine learning in the browser
Computer Vision
Chrome for Developers Beginner 6mo ago
Shashanka Venkataramana and Valentinos Pariza - Franca Nested Matryoshka Clustering for Scalable Vi
Computer Vision
Cohere Advanced 6mo ago
Industrial AI Machine Vision in Action with Databricks & Crosser
Computer Vision
Databricks Intermediate 6mo ago
Mistral AI Models on Amazon Bedrock: When to Use Pixtral Large vs Mistral Small 3.0
Computer Vision
AWS Developers Beginner 6mo ago
Qwen3-Omni: The First Open All-in-One AI?
Computer Vision
What's AI by Louis-François Bouchard Advanced 6mo ago
"Smartest" VISION AI in Cars Do Reasoning?
Computer Vision
Discover AI Intermediate 6mo ago
Build an Agentic RAG with LangGraph | Step-by-Step Guide + Code
Computer Vision
Muhammad Moin Beginner 6mo ago
What is multimodality? A deep dive on multimodality in Gemma 3
Computer Vision
Google for Developers Beginner 6mo ago
Everything about MLX (w/ Prince Canuma)
Computer Vision
Abhishek Thakur Intermediate 5mo ago
Journey from Non-tech to Meta Engineer with Andrey Lukyanenko
Computer Vision
Abhishek Thakur Beginner 5mo ago
Amazon debuted the new Echo Show 8 and Echo Show 11 during its fall 2025 hardware event on Tuesday.
Computer Vision
The Verge Intermediate 6mo ago
Meta's Daniel Bolya on Perception Encoder and Improving Visual Understanding
Computer Vision
Roboflow Beginner 6mo ago
This is the GPD Win 5, with a 45-85W AMD Strix Halo chip.
Computer Vision
The Verge Intermediate 6mo ago
Audi Reader: Reinventing the Car User Manual with Vision AI
Computer Vision
Roboflow Beginner 6mo ago
No, Apple isn’t trying to buy up all the 13 Pro Maxes.
Computer Vision
The Verge Advanced 6mo ago
AI for Robotics: How Almond Uses Computer Vision with Manufacturing Robots
Computer Vision
Roboflow Beginner 6mo ago
AI for Food Processing: How FloVision Uses Computer Vision to Reduce Waste and Improve Efficiency
Computer Vision
Roboflow Beginner 6mo ago
Meta Connect 2025 had some of the biggest demo failures we’ve seen live in a while.
Computer Vision
The Verge Intermediate 6mo ago
Victoria Song joins us on The Vergecast to talk about the AirPods Pro 3 and their upgraded fit.
Computer Vision
The Verge Intermediate 6mo ago
The third-gen SE got a massive, wide-ranging glow-up. #vergecast
Computer Vision
The Verge Intermediate 6mo ago
The Verge’s Allison Johnson joins us on The Vergecast to discuss her time with the iPhone Air.
Computer Vision
The Verge Intermediate 6mo ago
How to Automate Quality Inspections with ResNet Classification Models
Computer Vision
Roboflow Beginner 6mo ago
Agentic RAG Explained: The Future of AI Agents & Retrieval Augmented Generation
Computer Vision
Muhammad Moin Beginner 6mo ago
Silksong on Android is now a reality, no port required! #TITW
Computer Vision
The Verge Beginner 6mo ago
📚 Coursera Courses
Opens on Coursera · Free to audit
Implementando modelo Computer Vision en Amazon Sagemaker
📚 Coursera Course
Self-paced
Introduction to Image Processing
📚 Coursera Course
Self-paced
Breastfeeding and Adequate Substitutes
📚 Coursera Course
Self-paced
How to Revitalize Mature Products - Jagdish Sheth
📚 Coursera Course
Self-paced
Market Research and Competitive Analysis
📚 Coursera Course
Self-paced
Modern AI Models for Vision and Multimodal Understanding
📚 Coursera Course
Self-paced