Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,346 lessons

Skills in this topic

3 skills

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,121 Reads 225

Use Dedicated Deployments with Computer Vision Workflows

Computer Vision

Roboflow Intermediate 1y ago

I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.

Computer Vision

Neil Patel Intermediate 1y ago

Missy Franklin, Angela Ruggiero & Ashton Eaton | Olympic Panel | Talks at Google

Computer Vision

Talks at Google Advanced 1y ago

C4AI Expedition Aya - Most Promising Prize: Maya: Multimodal Aya

Computer Vision

Cohere Beginner 1y ago

Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum

Computer Vision

Microsoft Research Advanced 1y ago

Qwen2-VL: The Best Open Source Vision Model for OCR & VQA

Computer Vision ⚡ AI Lesson

AI Anytime Intermediate 1y ago

Football AI | Community Q&A (Aug 29)

Computer Vision ⚡ AI Lesson

Roboflow Advanced 1y ago

Exploring Robotics and Python Through Electronic Projects | Real Python Podcast #218

Computer Vision

Real Python Beginner 1y ago

Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed

Computer Vision ⚡ AI Lesson

DataCamp Intermediate 1y ago

Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson

Computer Vision

Latent Space Advanced 1y ago

How to run SAM 2 (Segment Anything AI Model)?

Computer Vision ⚡ AI Lesson

AI Anytime Intermediate 1y ago

JETSON AI LAB | Research Group Meeting (8/6/2024)

Computer Vision

NVIDIA Developer Advanced 1y ago

Meta Unveils Segment Anything 2: Revolutionizing Image and 3D Segmentation! #meta #ai #genai

Computer Vision

Deepak Bhaskaran Beginner 1y ago

Boost #WorkplaceSafety with Intenseye, an AI-powered employee health and safety (EHS) platform.

Computer Vision

Google Cloud Beginner 1y ago

SAM 2 is going to transform COMPUTER VISION!!!

Computer Vision

1littlecoder Intermediate 1y ago

Audience Segmentation Tips: 3 Ways to Segment Your Email List

Computer Vision ⚡ AI Lesson

Klaviyo Advanced 1y ago

An Overview of Object Recognition Tasks

Computer Vision ⚡ AI Lesson

Machine Learning Studio Beginner 1y ago

Excitement for the Generative AI era: Multi-Modal inputs

Computer Vision

Weights & Biases Intermediate 1y ago

Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - 692

Computer Vision ⚡ AI Lesson

The TWIML AI Podcast with Sam Charrington Advanced 1y ago

Denoising Images with OpenCV in Python

Computer Vision ⚡ AI Lesson

NeuralNine Beginner 1y ago

Reimagine document processing and understanding with generative AI

Computer Vision

Google Cloud Intermediate 1y ago

Microsoft's Florence 2: Breaking Boundaries in AI Vision Language!

Computer Vision

Mervin Praison Beginner 1y ago

Florence 2 - The Best Small VLM Out There?

Computer Vision ⚡ AI Lesson

Sam Witteveen Beginner 1y ago

New Microsoft Vision Model has AMAZING TRICKS!!!

Computer Vision ⚡ AI Lesson

1littlecoder Advanced 1y ago

From Robotics to Recommender Systems // Miguel Fierro // MLOps Podcast #240

Computer Vision ⚡ AI Lesson

MLOps.community Beginner 1y ago

Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum

Computer Vision ⚡ AI Lesson

Microsoft Research Advanced 1y ago

OpenAI CLIP model explained

Computer Vision

Machine Learning Studio Beginner 1y ago

Using PAM EXEC to Log Passwords on Linux

Computer Vision ⚡ AI Lesson

IppSec Beginner 1y ago

Robotics AI for Industrial Applications

Computer Vision

Weights & Biases Advanced 1y ago

Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...

Computer Vision ⚡ AI Lesson

Cohere Intermediate 1y ago

Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni

Computer Vision ⚡ AI Lesson

Burned Guitarist Intermediate 2y ago

Getting started With Google's PaliGemma: Open Vision-Language Model

Computer Vision ⚡ AI Lesson

Krish Naik Beginner 2y ago

New2Cyber en Espanol | El final de la era del profesional de seguridad

Computer Vision ⚡ AI Lesson

SANS Institute Intermediate 2y ago

How To Fine-tune LLaVA Model (From Your Laptop!)

Computer Vision

Brev Intermediate 2y ago

New course with Comet: Prompt Engineering for Vision Models

Computer Vision ⚡ AI Lesson

DeepLearningAI Beginner 2y ago

It's easy to get stuck in our ways

Computer Vision ⚡ AI Lesson

General Musings with Kevin Powell Beginner 2y ago

Analyze documents in BigQuery with Document AI

Computer Vision

Google Cloud Tech Beginner 2y ago

Pose landmark detection - ML on Web with MediaPipe: Episode 8

Computer Vision

Google for Developers Beginner 2y ago

Build an AI/ML Football Analysis system with YOLO, OpenCV, and Python

Computer Vision

Code In a Jiffy Beginner 2y ago

Football AI Tutorial: From Basics to Advanced Stats with Python

Computer Vision

Roboflow Intermediate 1y ago

Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI

Computer Vision ⚡ AI Lesson

Roboflow Intermediate 1y ago

AI-Assisted Data Labeling | Weekly Roboflow Product Session

Computer Vision

Roboflow Beginner 1y ago

Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)

Computer Vision

Roboflow Advanced 1y ago

Florence-2: Fine-tune Microsoft’s Multimodal Model

Computer Vision

Roboflow Beginner 1y ago

How good is YOLOv10? | Hacking Google's new VLM, PaliGemma | Community Q&A (Jun 6)

Computer Vision

Roboflow Beginner 1y ago

PaliGemma by Google: Train Model on Custom Detection Dataset

Computer Vision

Roboflow Intermediate 1y ago

What is Document AI?

Computer Vision

Google Cloud Beginner 2y ago

Build computer vision applications easily with Roboflow and Google Cloud

Computer Vision

Google Cloud Advanced 2y ago

📚 Coursera Courses Opens on Coursera · Free to audit

Implementando modelo Computer Vision en Amazon Sagemaker

📚 Coursera Course ↗

AutoML: Build ML Models without Code

📚 Coursera Course ↗

Unity: Design & Deform Meshes for 3D Geometry Control

📚 Coursera Course ↗

Implement Hand Gesture Recognition with OpenCV

📚 Coursera Course ↗

📚 Coursera Course ↗

Sync CRM Contacts

Azure Practical - Cognitive Services

📚 Coursera Course ↗

📚 Coursera Course ↗

Custom Document Extraction with Document AI Workbench

📚 Coursera Course ↗

Create Image Captioning Models - Español

Cisco CCNP Data Center DCCOR (350-601)

📚 Coursera Course ↗

Digital Marketing Foundations: Analyze & Apply Strategies

📚 Coursera Course ↗

H2O Cloud AI Developer Services

📚 Coursera Course ↗

📚 Coursera Course ↗

Humanidades digitales

Interdisciplinarity in Thought and Practice

📚 Coursera Course ↗

Future of data and technology in football

📚 Coursera Course ↗

📚 Coursera Course ↗

Autoscaling TensorFlow Model Deployments with TF Serving and Kubernetes

📚 Coursera Course ↗

Introduction to Vertex AI Embeddings: Text and Multimodal

Introduction to Deep Learning for Computer Vision

📚 Coursera Course ↗

📚 Coursera Course ↗

Marketing Communications: Intro to Consumer Behavior
