Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,346
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Use Dedicated Deployments with Computer Vision Workflows
Computer Vision
Use Dedicated Deployments with Computer Vision Workflows
Roboflow Intermediate 1y ago
I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.
Computer Vision
I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.
Neil Patel Intermediate 1y ago
Missy Franklin, Angela Ruggiero & Ashton Eaton | Olympic Panel | Talks at Google
Computer Vision
Missy Franklin, Angela Ruggiero & Ashton Eaton | Olympic Panel | Talks at Google
Talks at Google Advanced 1y ago
C4AI Expedition Aya - Most Promising Prize: Maya: Multimodal Aya
Computer Vision
C4AI Expedition Aya - Most Promising Prize: Maya: Multimodal Aya
Cohere Beginner 1y ago
Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum
Computer Vision
Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum
Microsoft Research Advanced 1y ago
Qwen2-VL: The Best Open Source Vision Model for OCR & VQA
Computer Vision ⚡ AI Lesson
Qwen2-VL: The Best Open Source Vision Model for OCR & VQA
AI Anytime Intermediate 1y ago
Football AI | Community Q&A (Aug 29)
Computer Vision ⚡ AI Lesson
Football AI | Community Q&A (Aug 29)
Roboflow Advanced 1y ago
Exploring Robotics and Python Through Electronic Projects | Real Python Podcast #218
Computer Vision
Exploring Robotics and Python Through Electronic Projects | Real Python Podcast #218
Real Python Beginner 1y ago
Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed
Computer Vision ⚡ AI Lesson
Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed
DataCamp Intermediate 1y ago
Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson
Computer Vision
Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson
Latent Space Advanced 1y ago
How to run SAM 2 (Segment Anything AI Model)?
Computer Vision ⚡ AI Lesson
How to run SAM 2 (Segment Anything AI Model)?
AI Anytime Intermediate 1y ago
JETSON AI LAB | Research Group Meeting (8/6/2024)
Computer Vision
JETSON AI LAB | Research Group Meeting (8/6/2024)
NVIDIA Developer Advanced 1y ago
Meta Unveils Segment Anything 2: Revolutionizing Image and 3D Segmentation! #meta #ai #genai
Computer Vision
Meta Unveils Segment Anything 2: Revolutionizing Image and 3D Segmentation! #meta #ai #genai
Deepak Bhaskaran Beginner 1y ago
Boost #WorkplaceSafety with Intenseye, an AI-powered employee health and safety (EHS) platform.
Computer Vision
Boost #WorkplaceSafety with Intenseye, an AI-powered employee health and safety (EHS) platform.
Google Cloud Beginner 1y ago
SAM 2 is going to transform COMPUTER VISION!!!
Computer Vision
SAM 2 is going to transform COMPUTER VISION!!!
1littlecoder Intermediate 1y ago
Audience Segmentation Tips: 3 Ways to Segment Your Email List
3:24
Computer Vision ⚡ AI Lesson
Audience Segmentation Tips: 3 Ways to Segment Your Email List
Klaviyo Advanced 1y ago
An Overview of Object Recognition Tasks
Computer Vision ⚡ AI Lesson
An Overview of Object Recognition Tasks
Machine Learning Studio Beginner 1y ago
Excitement for the Generative AI era: Multi-Modal inputs
Computer Vision
Excitement for the Generative AI era: Multi-Modal inputs
Weights & Biases Intermediate 1y ago
Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - 692
Computer Vision ⚡ AI Lesson
Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - 692
The TWIML AI Podcast with Sam Charrington Advanced 1y ago
Denoising Images with OpenCV in Python
Computer Vision ⚡ AI Lesson
Denoising Images with OpenCV in Python
NeuralNine Beginner 1y ago
Reimagine document processing and understanding with generative AI
Computer Vision
Reimagine document processing and understanding with generative AI
Google Cloud Intermediate 1y ago
Microsoft's Florence 2: Breaking Boundaries in AI Vision Language!
Computer Vision
Microsoft's Florence 2: Breaking Boundaries in AI Vision Language!
Mervin Praison Beginner 1y ago
Florence 2 - The Best Small VLM Out There?
Computer Vision ⚡ AI Lesson
Florence 2 - The Best Small VLM Out There?
Sam Witteveen Beginner 1y ago
New Microsoft Vision Model has AMAZING TRICKS!!!
Computer Vision ⚡ AI Lesson
New Microsoft Vision Model has AMAZING TRICKS!!!
1littlecoder Advanced 1y ago
From Robotics to Recommender Systems // Miguel Fierro // MLOps Podcast #240
Computer Vision ⚡ AI Lesson
From Robotics to Recommender Systems // Miguel Fierro // MLOps Podcast #240
MLOps.community Beginner 1y ago
Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum
Computer Vision ⚡ AI Lesson
Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum
Microsoft Research Advanced 1y ago
OpenAI CLIP model explained
Computer Vision
OpenAI CLIP model explained
Machine Learning Studio Beginner 1y ago
Using PAM EXEC to Log Passwords on Linux
Computer Vision ⚡ AI Lesson
Using PAM EXEC to Log Passwords on Linux
IppSec Beginner 1y ago
Robotics AI for Industrial Applications
Computer Vision
Robotics AI for Industrial Applications
Weights & Biases Advanced 1y ago
Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...
Computer Vision ⚡ AI Lesson
Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...
Cohere Intermediate 1y ago
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
2:29
Computer Vision ⚡ AI Lesson
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
Burned Guitarist Intermediate 2y ago
Getting started With Google's PaliGemma: Open Vision-Language Model
Computer Vision ⚡ AI Lesson
Getting started With Google's PaliGemma: Open Vision-Language Model
Krish Naik Beginner 2y ago
New2Cyber en Espanol | El final de la era del profesional de seguridad
Computer Vision ⚡ AI Lesson
New2Cyber en Espanol | El final de la era del profesional de seguridad
SANS Institute Intermediate 2y ago
How To Fine-tune LLaVA Model (From Your Laptop!)
Computer Vision
How To Fine-tune LLaVA Model (From Your Laptop!)
Brev Intermediate 2y ago
New course with Comet: Prompt Engineering for Vision Models
Computer Vision ⚡ AI Lesson
New course with Comet: Prompt Engineering for Vision Models
DeepLearningAI Beginner 2y ago
It's easy to get stuck in our ways
Computer Vision ⚡ AI Lesson
It's easy to get stuck in our ways
General Musings with Kevin Powell Beginner 2y ago
Analyze documents in BigQuery with Document AI
Computer Vision
Analyze documents in BigQuery with Document AI
Google Cloud Tech Beginner 2y ago
Pose landmark detection - ML on Web with MediaPipe: Episode 8
Computer Vision
Pose landmark detection - ML on Web with MediaPipe: Episode 8
Google for Developers Beginner 2y ago
Build an AI/ML Football Analysis system with YOLO, OpenCV, and Python
Computer Vision
Build an AI/ML Football Analysis system with YOLO, OpenCV, and Python
Code In a Jiffy Beginner 2y ago
Football AI Tutorial: From Basics to Advanced Stats with Python
Computer Vision
Football AI Tutorial: From Basics to Advanced Stats with Python
Roboflow Intermediate 1y ago
Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI
Computer Vision ⚡ AI Lesson
Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI
Roboflow Intermediate 1y ago
AI-Assisted Data Labeling | Weekly Roboflow Product Session
Computer Vision
AI-Assisted Data Labeling | Weekly Roboflow Product Session
Roboflow Beginner 1y ago
Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)
Computer Vision
Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)
Roboflow Advanced 1y ago
Florence-2: Fine-tune Microsoft’s Multimodal Model
Computer Vision
Florence-2: Fine-tune Microsoft’s Multimodal Model
Roboflow Beginner 1y ago
How good is YOLOv10? | Hacking Google's new VLM, PaliGemma | Community Q&A (Jun 6)
Computer Vision
How good is YOLOv10? | Hacking Google's new VLM, PaliGemma | Community Q&A (Jun 6)
Roboflow Beginner 1y ago
PaliGemma by Google: Train Model on Custom Detection Dataset
Computer Vision
PaliGemma by Google: Train Model on Custom Detection Dataset
Roboflow Intermediate 1y ago
What is Document AI?
Computer Vision
What is Document AI?
Google Cloud Beginner 2y ago
Build computer vision applications easily with Roboflow and Google Cloud
Computer Vision
Build computer vision applications easily with Roboflow and Google Cloud
Google Cloud Advanced 2y ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Implementando modelo Computer Vision en Amazon Sagemaker
📚 Coursera Course ↗
Self-paced
Implementando modelo Computer Vision en Amazon Sagemaker
Opens on Coursera ↗
AutoML: Build ML Models without Code
📚 Coursera Course ↗
Self-paced
AutoML: Build ML Models without Code
Opens on Coursera ↗
Unity: Design & Deform Meshes for 3D Geometry Control
📚 Coursera Course ↗
Self-paced
Unity: Design & Deform Meshes for 3D Geometry Control
Opens on Coursera ↗
Implement Hand Gesture Recognition with OpenCV
📚 Coursera Course ↗
Self-paced
Implement Hand Gesture Recognition with OpenCV
Opens on Coursera ↗
Sync CRM Contacts
📚 Coursera Course ↗
Self-paced
Sync CRM Contacts
Opens on Coursera ↗
Azure Practical - Cognitive Services
📚 Coursera Course ↗
Self-paced
Azure Practical - Cognitive Services
Opens on Coursera ↗