✕ Clear filters
330 lessons

👁️ Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

All ▶ YouTube 176,145📚 Coursera 16,021
RF-DETR: How to Train SOTA for Object Detection on a Custom Dataset | Step-by-step guide
Computer Vision
RF-DETR: How to Train SOTA for Object Detection on a Custom Dataset | Step-by-step guide
Roboflow Intermediate 7mo ago
New Way Now: Simbe's AI robotic vision tech improves retail sales and margin with Google Cloud
Computer Vision
New Way Now: Simbe's AI robotic vision tech improves retail sales and margin with Google Cloud
Google Cloud Intermediate 7mo ago
EV Pickups Are a Bust for US Carmakers
Computer Vision
EV Pickups Are a Bust for US Carmakers
Bloomberg Technology Intermediate 7mo ago
Vision AI in 2025 — Peter Robicheaux, Roboflow
Computer Vision
Vision AI in 2025 — Peter Robicheaux, Roboflow
AI Engineer Intermediate 8mo ago
The Segmentation Tweak That Quietly BOOSTS Klaviyo Revenue #shorts #emailmarketing
1:26
Computer Vision
The Segmentation Tweak That Quietly BOOSTS Klaviyo Revenue #shorts #emailmarketing
Emissary 2.0 Intermediate 8mo ago
I trained an AI Model to Detect Trading Candlesticks (from scratch using ViTs)
Computer Vision
I trained an AI Model to Detect Trading Candlesticks (from scratch using ViTs)
Nicholas Renotte Intermediate 8mo ago
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
Computer Vision
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
Microsoft Research Intermediate 8mo ago
Is Your Business Running on Empty? 🤖
Computer Vision
Is Your Business Running on Empty? 🤖
imFORZA Intermediate 8mo ago
How to Fine-Tune SmolVLM2 | Convert Documents into JSON
Computer Vision
How to Fine-Tune SmolVLM2 | Convert Documents into JSON
Roboflow Intermediate 9mo ago
Transforming Data Governance for Multimodal Data at Amgen With Databricks
Computer Vision
Transforming Data Governance for Multimodal Data at Amgen With Databricks
Databricks Intermediate 9mo ago
Multimodal Open Source at Kyutai, From Online Demos to On-Device - Alexandre Défossez
Computer Vision
Multimodal Open Source at Kyutai, From Online Demos to On-Device - Alexandre Défossez
PyTorch Intermediate 10mo ago
MedGemma LLM: Doctors, Meet Your AI Assistant 🧠
Computer Vision
MedGemma LLM: Doctors, Meet Your AI Assistant 🧠
AI Anytime Intermediate 10mo ago
China’s ByteDance Just Dropped BAGEL — Multimodal AI Beast!
Computer Vision
China’s ByteDance Just Dropped BAGEL — Multimodal AI Beast!
Analytics Vidhya Intermediate 10mo ago
Uber CEO Dara Khosrowshahi on the company's new Route Share feature. Presented by @AdobeExpress
Computer Vision
Uber CEO Dara Khosrowshahi on the company's new Route Share feature. Presented by @AdobeExpress
The Verge Intermediate 10mo ago
The Shape of Intelligence
Computer Vision
The Shape of Intelligence
Latent Space Intermediate 10mo ago
How to Segment Your Audience in Mailchimp
9:16
Computer Vision
How to Segment Your Audience in Mailchimp
Intuit Mailchimp Intermediate 11mo ago
Intuit uses Google Cloud Document AI to further simplify tax prep for millions
Computer Vision
Intuit uses Google Cloud Document AI to further simplify tax prep for millions
Google Cloud Intermediate 1y ago
Multimodal AI & Next Gen Databases | Data Brew | Episode 42
Computer Vision
Multimodal AI & Next Gen Databases | Data Brew | Episode 42
Databricks Intermediate 1y ago
Expedition Aya Kick Off Event
Computer Vision
Expedition Aya Kick Off Event
Cohere Intermediate 1y ago
Building a travel buddy with Gemma
Computer Vision
Building a travel buddy with Gemma
Google for Developers Intermediate 1y ago
Peter Tong - MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
Computer Vision
Peter Tong - MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
Cohere Intermediate 1y ago
How to Quickly Leverage Computer Vision in Python
Computer Vision
How to Quickly Leverage Computer Vision in Python
Data Professor Intermediate 1y ago
Next Multi trillion dollar industry?
Computer Vision
Next Multi trillion dollar industry?
Full Disclosure Intermediate 1y ago
DeepSeek’s Janus-Pro-7B Crushes DALL·E 3!  #deepseek #openai
Computer Vision
DeepSeek’s Janus-Pro-7B Crushes DALL·E 3! #deepseek #openai
Analytics Vidhya Intermediate 1y ago
This Python module is your go-to for speech and image recognition!
Computer Vision
This Python module is your go-to for speech and image recognition!
Tech With Tim Intermediate 1y ago
Not ElevenLabs, This new #1 Text to Speech AI is FREE!!!!
Computer Vision
Not ElevenLabs, This new #1 Text to Speech AI is FREE!!!!
1littlecoder Intermediate 1y ago
Next AI Project is Image Classification in Python🔍🤖
Computer Vision
Next AI Project is Image Classification in Python🔍🤖
Tech With Tim Intermediate 1y ago
Best of 2024 in Vision [LS Live @ NeurIPS]
Computer Vision
Best of 2024 in Vision [LS Live @ NeurIPS]
Latent Space Intermediate 1y ago
How to Do Email Segmentation the Right Way
0:47
Computer Vision
How to Do Email Segmentation the Right Way
Spark Bridge Digital | Email Marketing Agency Intermediate 1y ago
OpenAI DevDay 2024 | Multimodal apps with the Realtime API
Computer Vision
OpenAI DevDay 2024 | Multimodal apps with the Realtime API
OpenAI Intermediate 1y ago
Ethan Norville EXPOSES Coronation Project Secrets
Computer Vision
Ethan Norville EXPOSES Coronation Project Secrets
Professor Charley T Intermediate 1y ago
MediaPipe Web: Bringing cross-platform AI tech to the browser
Computer Vision
MediaPipe Web: Bringing cross-platform AI tech to the browser
Chrome for Developers Intermediate 1y ago
Moondream: how does a tiny vision model slap so hard? — Vikhyat Korrapati
Computer Vision
Moondream: how does a tiny vision model slap so hard? — Vikhyat Korrapati
AI Engineer Intermediate 1y ago
Transformers.js: State-of-the-art Machine Learning for the web
Computer Vision
Transformers.js: State-of-the-art Machine Learning for the web
Chrome for Developers Intermediate 1y ago
Stanford Seminar - Open-world Segmentation and Tracking in 3D
Computer Vision
Stanford Seminar - Open-world Segmentation and Tracking in 3D
Stanford Online Intermediate 1y ago
The Next Decade in AI and Computer Vision
Computer Vision
The Next Decade in AI and Computer Vision
a16z Intermediate 1y ago
Multimodal RAG YT Video
Computer Vision
Multimodal RAG YT Video
Srikantan Sankaran Intermediate 1y ago
Drowsiness Detection with Vision AI | Improve Safety with AI
Computer Vision
Drowsiness Detection with Vision AI | Improve Safety with AI
Roboflow Intermediate 10mo ago
RF-DETR, Batch Processing, Instant Training, Serverless Inference, and More | What's New in Roboflow
Computer Vision
RF-DETR, Batch Processing, Instant Training, Serverless Inference, and More | What's New in Roboflow
Roboflow Intermediate 1y ago
Build an AI-Powered Self-Serve Checkout & Cost Calculator in 10 Minutes (Almost)
Computer Vision
Build an AI-Powered Self-Serve Checkout & Cost Calculator in 10 Minutes (Almost)
Roboflow Intermediate 1y ago
Measure Liquid Levels with AI | Build a Web App Powered by Computer Vision
Computer Vision
Measure Liquid Levels with AI | Build a Web App Powered by Computer Vision
Roboflow Intermediate 1y ago
Florence-2: Create and Deploy a Custom Vision Language Model
Computer Vision
Florence-2: Create and Deploy a Custom Vision Language Model
Roboflow Intermediate 1y ago
YOLO11: Performance Benchmark and Real World Use Cases
Computer Vision
YOLO11: Performance Benchmark and Real World Use Cases
Roboflow Intermediate 1y ago
Video Analytics with AI | Live Coding & Q&A (Oct 9th)
Computer Vision
Video Analytics with AI | Live Coding & Q&A (Oct 9th)
Roboflow Intermediate 1y ago
GPT-4o: Fine-tune OpenAI's Multimodal Model | Live Coding & Q&A (Oct 3rd)
Computer Vision
GPT-4o: Fine-tune OpenAI's Multimodal Model | Live Coding & Q&A (Oct 3rd)
Roboflow Intermediate 1y ago
YOLO11: How to Train for Object Detection | Live Coding & Q&A (Sep 30)
Computer Vision
YOLO11: How to Train for Object Detection | Live Coding & Q&A (Sep 30)
Roboflow Intermediate 1y ago
Using RTSP Streams for Computer Vision | Tracking & Counting Objects
Computer Vision
Using RTSP Streams for Computer Vision | Tracking & Counting Objects
Roboflow Intermediate 1y ago
The era of unbounded products: Designing for Multimodal IO: Ben Hylak
Computer Vision
The era of unbounded products: Designing for Multimodal IO: Ben Hylak
AI Engineer Intermediate 1y ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Networking and Security Architecture with VMware NSX
📚 Coursera Course ↗
Self-paced
Networking and Security Architecture with VMware NSX
Opens on Coursera ↗
Process Images, Create Captioning AI Models
📚 Coursera Course ↗
Self-paced
Process Images, Create Captioning AI Models
Opens on Coursera ↗
Advancing Your Career in Computer Vision Engineering
📚 Coursera Course ↗
Self-paced
Advancing Your Career in Computer Vision Engineering
Opens on Coursera ↗
AI and Disaster Management
📚 Coursera Course ↗
Self-paced
AI and Disaster Management
Opens on Coursera ↗
Build a DIY Multimodal Question Answering System with Vertex AI
📚 Coursera Course ↗
Self-paced
Build a DIY Multimodal Question Answering System with Vertex AI
Opens on Coursera ↗
Camera and Imaging
📚 Coursera Course ↗
Self-paced
Camera and Imaging
Opens on Coursera ↗