Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

684 lessons
Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik
Roboflow Beginner 1w ago
AI Powered Surveillance System for India
AI Anytime Beginner 2w ago
Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023
Moz Beginner 2w ago
OpenClaw Explained: Create AI Agents Without Coding (Full Intro)
Muhammad Moin Beginner 2w ago
V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs
AI Podcast Series. Byte Goose AI. Beginner 2w ago
Is Benjamin Netanyahu an AI clone?
The TensorFlow Channel Beginner 2w ago
Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step
Muhammad Moin Beginner 2w ago
Jueves de Quack con Nerdearla
GitHub Beginner 3w ago
Image Search Engine in Python - Multimodal Embeddings
NeuralNine Beginner 4w ago
Pegasus by TwelveLabs #ai #video #patternrecognition #tech #explained #llm #imagerecognition
Jessica Wang Beginner 4w ago
What Is Multimodal AI? Real-World Examples
Coursera Beginner 1mo ago
Xiaomi is releasing its Leica Leitzphone outside of Japan for the first time.
The Verge Beginner 1mo ago
Full Speaking Course 2026: Duolingo English Test
Teacher Luke - Duolingo English Test Beginner 1mo ago
Music AI Sandbox | AI x Creativity: Wyclef Jean
Google DeepMind Beginner 1mo ago
One Open AI Model Built My Website, Image & Video
Analytics Vidhya Beginner 1mo ago
Page Match lets you quickly sync your spot in a physical or ebook with an audiobook.
The Verge Beginner 2mo ago
Helping Sports Teams Improve Decision Making with AI: Interview with PlayVision's Marc Zoghby
Roboflow Beginner 2mo ago
Full Duolingo English Test with Answers: January 2026 Format
Teacher Luke - Duolingo English Test Beginner 2mo ago
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks Learning Canonical Embeddings for Human Heads
Cohere Beginner 2mo ago
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Analytics Vidhya Beginner 2mo ago
What does AI mean for education?
Anthropic Beginner 3mo ago
Why Vision Language Models Ignore What They See [Munawar Hayat] - 758
TWIML AI Podcast Beginner 3mo ago
Why are Transformers replacing CNNs?
Julia Turc Beginner 4mo ago
What is reciprocal rank fusion in hybrid search?
Abhishek Thakur Beginner 4mo ago
Should AI be introduced to kids early? #podcast #interview
Abhishek Thakur Beginner 4mo ago
Multimodal and Multi-model AI in Action
Microsoft 365 Developer Beginner 4mo ago
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
PyData Beginner 4mo ago
Choosing Your Path: AI Professional Program Course Selection Guide
Stanford Online Beginner 4mo ago
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
The Information Beginner 5mo ago
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
PyTorch Beginner 5mo ago
OneDrive’s AI is scanning your PHOTOS
David Bombal Beginner 5mo ago
Is YOLO26 Faster Than YOLO11? Full Comparison & Results
Muhammad Moin Beginner 1mo ago
5 Easy Tips to Get a High Score on the Duolingo English Test in 2026
Teacher Luke - Duolingo English Test Beginner 3mo ago
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Analytics Vidhya Beginner 3mo ago
AI for Occupancy Analytics | Building a Smart Parking System
Roboflow Beginner 3mo ago
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Muhammad Moin Beginner 3mo ago
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Muhammad Moin Beginner 4mo ago
I Took the Duolingo English Test and Here’s What Happened
Teacher Luke - Duolingo English Test Beginner 4mo ago
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Muhammad Moin Beginner 4mo ago
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Roboflow Beginner 4mo ago
A no nonsense intro to BM25
Abhishek Thakur Beginner 4mo ago
Vibe + VSCode + Codex = Search UI
Abhishek Thakur Beginner 4mo ago
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
Roboflow Beginner 4mo ago
Vision AI in a Web Browser: Creating a Scavenger Hunt App with Inference.js
Roboflow Beginner 4mo ago
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Muhammad Moin Beginner 5mo ago
Building, learning and teaching with AI (w/ Parul Pandey)
Abhishek Thakur Beginner 5mo ago
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Roboflow Beginner 5mo ago
How The Field Museum Unlocks New Research Possibilities with Vision AI
Roboflow Beginner 5mo ago
📚 Coursera Courses · Opens on Coursera · Free to audit
Process Documents with Python Using the Document AI API
📚 Coursera Course ↗
Self-paced
Fine-Tuning and Evaluating Vision AI Models
📚 Coursera Course ↗
Self-paced
Future of data and technology in football
📚 Coursera Course ↗
Self-paced
Deep Learning for Object Detection
📚 Coursera Course ↗
Self-paced
Positioning: What you need for a successful Marketing Strategy
📚 Coursera Course ↗
Self-paced
Vision Models: Train and Evaluate
📚 Coursera Course ↗
Self-paced