Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1189 lessons
Football AI Tutorial: From Basics to Advanced Stats with Python
Roboflow Intermediate 1y ago
Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI
Roboflow Intermediate 1y ago
Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson
Latent Space Advanced 1y ago
How to run SAM 2 (Segment Anything AI Model)?
AI Anytime Intermediate 1y ago
JETSON AI LAB | Research Group Meeting (8/6/2024)
NVIDIA Developer Advanced 1y ago
Meta Unveils Segment Anything 2: Revolutionizing Image and 3D Segmentation! #meta #ai #genai
Deepak Bhaskaran Beginner 1y ago
Boost #WorkplaceSafety with Intenseye, an AI-powered employee health and safety (EHS) platform.
Google Cloud Beginner 1y ago
SAM 2 is going to transform COMPUTER VISION!!!
1littlecoder Intermediate 1y ago
Audience Segmentation Tips: 3 Ways to Segment Your Email List
Klaviyo Advanced 1y ago
An Overview of Object Recognition Tasks
Machine Learning Studio Beginner 1y ago
Excitement for the Generative AI era: Multi-Modal inputs
Weights & Biases Intermediate 1y ago
Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - 692
The TWIML AI Podcast with Sam Charrington Advanced 1y ago
Denoising Images with OpenCV in Python
NeuralNine Beginner 1y ago
Reimagine document processing and understanding with generative AI
Google Cloud Intermediate 1y ago
Microsoft's Florence 2: Breaking Boundaries in AI Vision Language!
Mervin Praison Beginner 1y ago
Florence 2 - The Best Small VLM Out There?
Sam Witteveen Beginner 1y ago
New Microsoft Vision Model has AMAZING TRICKS!!!
1littlecoder Advanced 1y ago
From Robotics to Recommender Systems // Miguel Fierro // MLOps Podcast #240
MLOps.community Beginner 1y ago
Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum
Microsoft Research Advanced 1y ago
OpenAI CLIP model explained
Machine Learning Studio Beginner 1y ago
Using PAM EXEC to Log Passwords on Linux
IppSec Beginner 1y ago
Robotics AI for Industrial Applications
Weights & Biases Advanced 1y ago
Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...
Cohere Intermediate 1y ago
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
Burned Guitarist Intermediate 1y ago
Getting started With Google's PaliGemma: Open Vision-Language Model
Krish Naik Beginner 1y ago
New2Cyber en Espanol | El final de la era del profesional de seguridad
SANS Institute Intermediate 1y ago
How To Fine-tune LLaVA Model (From Your Laptop!)
Brev Intermediate 1y ago
It's easy to get stuck in our ways
General Musings with Kevin Powell Beginner 1y ago
Analyze documents in BigQuery with Document AI
Google Cloud Tech Beginner 1y ago
Pose landmark detection - ML on Web with MediaPipe: Episode 8
Google for Developers Beginner 1y ago
Build an AI/ML Football Analysis system with YOLO, OpenCV, and Python
Code In a Jiffy Beginner 1y ago
The Longevity Expert: Is There A Link Between Milk & Cancer? + Ozempic Can Really Mess You Up!
The Diary Of A CEO Beginner 1y ago
The lies that sell fast fashion
Vox Beginner 1y ago
Stanford Seminar - Silicon Valley & The U.S. Government: Vannevar Lab's Brett Granberg
Stanford Online Intermediate 2y ago
Dana White: UFC, Fighting, Khabib, Conor, Tyson, Ali, Rogan, Elon & Zuck | Lex Fridman Podcast #421
Lex Fridman Beginner 2y ago
Real-Time Car Speed Tracking & Object Classification Revealed
Mervin Praison Beginner 2y ago
Bringing AI to the Masses with Adam D'Angelo, CEO of Quora
a16z Intermediate 2y ago
How to perform object detection with KerasCV
TensorFlow Official Beginner 2y ago
AI-Assisted Data Labeling | Weekly Roboflow Product Session
Roboflow Beginner 1y ago
Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)
Roboflow Advanced 1y ago
Florence-2: Fine-tune Microsoft’s Multimodal Model
Roboflow Beginner 1y ago
How good is YOLOv10? | Hacking Google's new VLM, PaliGemma | Community Q&A (Jun 6)
Roboflow Beginner 1y ago
PaliGemma by Google: Train Model on Custom Detection Dataset
Roboflow Intermediate 1y ago
What is Document AI?
Google Cloud Beginner 1y ago
Build computer vision applications easily with Roboflow and Google Cloud
Google Cloud Advanced 1y ago
Dwell Time Analysis | Real-Time Stream Processing | Community Q&A (April 11)
Roboflow Beginner 1y ago
Dwell Time Analysis with Computer Vision | Real-Time Stream Processing
Roboflow Beginner 2y ago
YOLOv9 Live Coding & Community Q&A (March 14)
Roboflow Beginner 2y ago
📚 Coursera Courses Opens on Coursera · Free to audit
Customer Relationship Management
📚 Coursera Course ↗
Self-paced
Self-Driving Car Specialization Course
📚 Coursera Course ↗
Self-paced
Intro to Operating Systems 2: Memory Management
📚 Coursera Course ↗
Self-paced
AI Applications: Computer Vision and Speech Recognition
📚 Coursera Course ↗
Self-paced
Build Real-Time Face Recognition with OpenCV
📚 Coursera Course ↗
Self-paced
Future of data and technology in football
📚 Coursera Course ↗
Self-paced