Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

2,368

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 1,223

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

Using RTSP Streams for Computer Vision | Tracking & Counting Objects

Computer Vision

Using RTSP Streams for Computer Vision | Tracking & Counting Objects

Roboflow Intermediate 1y ago

The era of unbounded products: Designing for Multimodal IO: Ben Hylak

Computer Vision

The era of unbounded products: Designing for Multimodal IO: Ben Hylak

AI Engineer Intermediate 1y ago

Use Dedicated Deployments with Computer Vision Workflows

Computer Vision

Use Dedicated Deployments with Computer Vision Workflows

Roboflow Intermediate 1y ago

I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.

Computer Vision

I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.

Neil Patel Intermediate 1y ago

Organize PDFs Efficiently: Build a Streamlit PDF Sorter Application using LangChain and Llama 3.1

Computer Vision

Organize PDFs Efficiently: Build a Streamlit PDF Sorter Application using LangChain and Llama 3.1

Muhammad Moin Intermediate 1y ago

Qwen2-VL: The Best Open Source Vision Model for OCR & VQA

Computer Vision ⚡ AI Lesson

Qwen2-VL: The Best Open Source Vision Model for OCR & VQA

AI Anytime Intermediate 1y ago

Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed

Computer Vision ⚡ AI Lesson

Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed

DataCamp Intermediate 1y ago

How to run SAM 2 (Segment Anything AI Model)?

Computer Vision ⚡ AI Lesson

How to run SAM 2 (Segment Anything AI Model)?

AI Anytime Intermediate 1y ago

SAM 2 is going to transform COMPUTER VISION!!!

Computer Vision

SAM 2 is going to transform COMPUTER VISION!!!

1littlecoder Intermediate 1y ago

New Way Now: McLaren Racing is shifting performance into top gear with Google Cloud

Computer Vision

New Way Now: McLaren Racing is shifting performance into top gear with Google Cloud

Google Cloud Intermediate 1y ago

Excitement for the Generative AI era: Multi-Modal inputs

Computer Vision

Excitement for the Generative AI era: Multi-Modal inputs

Weights & Biases Intermediate 1y ago

Reimagine document processing and understanding with generative AI

Computer Vision

Reimagine document processing and understanding with generative AI

Google Cloud Intermediate 2y ago

[CVPR2024] Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

Computer Vision

[CVPR2024] Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

anucvml Intermediate 2y ago

Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...

Computer Vision ⚡ AI Lesson

Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...

Cohere Intermediate 2y ago

Multimodal AI Business Companions

Computer Vision

Multimodal AI Business Companions

Daniel Finkenstadt Intermediate 2y ago

Enrich Scenario Planning with Multimodal Wargames

Computer Vision

Enrich Scenario Planning with Multimodal Wargames

Daniel Finkenstadt Intermediate 2y ago

Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni

Computer Vision ⚡ AI Lesson

Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni

Burned Guitarist Intermediate 2y ago

New2Cyber en Espanol | El final de la era del profesional de seguridad

Computer Vision ⚡ AI Lesson

New2Cyber en Espanol | El final de la era del profesional de seguridad

SANS Institute Intermediate 2y ago

How To Fine-tune LLaVA Model (From Your Laptop!)

Computer Vision

How To Fine-tune LLaVA Model (From Your Laptop!)

Brev Intermediate 2y ago

Mean Average Precision (mAP) | Explanation and Implementation for Object Detection

Computer Vision

Mean Average Precision (mAP) | Explanation and Implementation for Object Detection

ExplainingAI Intermediate 2y ago

Is synthetic data from generative models ready for image recognition? (ICLR 2023, spotlight)

Computer Vision

Is synthetic data from generative models ready for image recognition? (ICLR 2023, spotlight)

MIPAL-SNU Intermediate 2y ago

Bringing AI to the Masses with Adam D'Angelo, CEO of Quora

Computer Vision ⚡ AI Lesson

Bringing AI to the Masses with Adam D'Angelo, CEO of Quora

a16z Intermediate 2y ago

Multi-Modal NSFW Detection with AI

Computer Vision

Multi-Modal NSFW Detection with AI

James Briggs Intermediate 2y ago

This VLM can be your MultiModal AI with less than 6GB Memory!!!

Computer Vision

This VLM can be your MultiModal AI with less than 6GB Memory!!!

1littlecoder Intermediate 2y ago

New course with Hugging Face: Open Source Models with Hugging Face

Computer Vision ⚡ AI Lesson

New course with Hugging Face: Open Source Models with Hugging Face

DeepLearningAI Intermediate 2y ago

Multimodality: The Next Big Step (Demis Hassabis - Google DeepMind CEO)

Computer Vision

Multimodality: The Next Big Step (Demis Hassabis - Google DeepMind CEO)

Dwarkesh Patel Intermediate 2y ago

How hard are computer vision datasets? Calibrating dataset difficulty to viewing time

Computer Vision

How hard are computer vision datasets? Calibrating dataset difficulty to viewing time

MIPAL-SNU Intermediate 2y ago

Vision Transformer (ViT)

Computer Vision

Vision Transformer (ViT)

Machine Learning Studio Intermediate 2y ago

The Future Of Computer Vision

Computer Vision ⚡ AI Lesson

The Future Of Computer Vision

a16z Intermediate 2y ago

Create a Custom Document Extractor with Document AI

Computer Vision ⚡ AI Lesson

Create a Custom Document Extractor with Document AI

Google Cloud Tech Intermediate 2y ago

Tune in to know what are the most exciting opportunities to look out for in computer vision!

Computer Vision ⚡ AI Lesson

Tune in to know what are the most exciting opportunities to look out for in computer vision!

The TWIML AI Podcast with Sam Charrington Intermediate 2y ago

Vision community faces evaluation challenges and should lean on cost-effective automatic evaluation

Computer Vision ⚡ AI Lesson

Vision community faces evaluation challenges and should lean on cost-effective automatic evaluation

The TWIML AI Podcast with Sam Charrington Intermediate 2y ago

Image segmentation - ML on Android with MediaPipe Series

Computer Vision

Image segmentation - ML on Android with MediaPipe Series

Google for Developers Intermediate 2y ago

AI Revolutionizing Immigration: Streamlining Visa Processing 🌐✈️ #AIInImmigration #VisaProcessing

Computer Vision

AI Revolutionizing Immigration: Streamlining Visa Processing 🌐✈️ #AIInImmigration #VisaProcessing

LawSikho Technology & AI Law Intermediate 2y ago

Want to find the BEST segmentation for your business?

Computer Vision

Want to find the BEST segmentation for your business?

Adam Erhart Intermediate 2y ago

How does AI aid in immigration document verification and processing for visas and asylum cases?

Computer Vision

How does AI aid in immigration document verification and processing for visas and asylum cases?

LawSikho Technology & AI Law Intermediate 2y ago

Accelerating Explorations in Vision and Multimodal AI Using Pytorch...- Nicolas, Philip, Evan & Peng

Computer Vision

Accelerating Explorations in Vision and Multimodal AI Using Pytorch...- Nicolas, Philip, Evan & Peng

PyTorch Intermediate 2y ago

TIME Best Invention of 2023: NVIDIA Neuralangelo

Computer Vision

TIME Best Invention of 2023: NVIDIA Neuralangelo

NVIDIA Developer Intermediate 2y ago

Object detection using Yolo V8

Computer Vision

Object detection using Yolo V8

Developers Hutt Intermediate 2y ago

Football AI Tutorial: From Basics to Advanced Stats with Python

Computer Vision

Football AI Tutorial: From Basics to Advanced Stats with Python

Roboflow Intermediate 1y ago

Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI

Computer Vision ⚡ AI Lesson

Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI

Roboflow Intermediate 1y ago

PaliGemma by Google: Train Model on Custom Detection Dataset

Computer Vision

PaliGemma by Google: Train Model on Custom Detection Dataset

Roboflow Intermediate 2y ago

YOLOv9 Tutorial: Train Model on Custom Dataset | How to Deploy YOLOv9

Computer Vision ⚡ AI Lesson

YOLOv9 Tutorial: Train Model on Custom Dataset | How to Deploy YOLOv9

Roboflow Intermediate 2y ago

Big Ideas 2024: New Applications for Computer Vision and Video Intelligence with Kimberly Tan

Computer Vision ⚡ AI Lesson

Big Ideas 2024: New Applications for Computer Vision and Video Intelligence with Kimberly Tan

a16z Intermediate 2y ago

¿La verdadera razón detrás de la transformación digital?

Computer Vision

¿La verdadera razón detrás de la transformación digital?

Google Cloud Intermediate 2y ago

Can AI-Inventions Be Patented in India? Exploring Patent Law Dynamics! 🤖💡 #AIPatents #PatentLaw

Computer Vision

Can AI-Inventions Be Patented in India? Exploring Patent Law Dynamics! 🤖💡 #AIPatents #PatentLaw

LawSikho Technology & AI Law Intermediate 2y ago

C360 for BigQuery powered by Lytics fuels next gen AI, analytics, and predictions

Computer Vision

C360 for BigQuery powered by Lytics fuels next gen AI, analytics, and predictions

Google Cloud Intermediate 2y ago

AI.engineer 2023: Live Coding a Multimodal Game, paint.wtf

Computer Vision

AI.engineer 2023: Live Coding a Multimodal Game, paint.wtf

Roboflow Intermediate 2y ago

📚 Continue on Coursera External links · Free to audit

View all →

Low Code Image Segmentation

📚 External: Coursera ↗

Low Code Image Segmentation

Opens on Coursera ↗

Implementando modelo Computer Vision en Amazon Sagemaker

📚 External: Coursera ↗

Implementando modelo Computer Vision en Amazon Sagemaker

Opens on Coursera ↗

📚 External: Coursera ↗

Image and Video Processing: From Mars to Hollywood with a Stop at the Hospital

Opens on Coursera ↗

Business Economics and Game Theory for Decision Making

📚 External: Coursera ↗

Business Economics and Game Theory for Decision Making

Opens on Coursera ↗

📚 External: Coursera ↗

Humanidades digitales

Opens on Coursera ↗

Digital Marketing Foundations: Analyze & Apply Strategies

📚 External: Coursera ↗

Digital Marketing Foundations: Analyze & Apply Strategies

Opens on Coursera ↗

YOLO-NAS + v8 Full-Stack Computer Vision Course

📚 External: Coursera ↗

YOLO-NAS + v8 Full-Stack Computer Vision Course

Opens on Coursera ↗

International Marketing Strategies and Global Trade

📚 External: Coursera ↗

International Marketing Strategies and Global Trade

Opens on Coursera ↗

CompTIA Cloud CV0-003: Unit 3

📚 External: Coursera ↗

CompTIA Cloud CV0-003: Unit 3

Opens on Coursera ↗

The Social Media Landscape

📚 External: Coursera ↗

The Social Media Landscape

Opens on Coursera ↗

📚 External: Coursera ↗

Optical Character Recognition (OCR) with Document AI (Python)

Opens on Coursera ↗

📚 External: Coursera ↗

Classify Images of Clouds in the Cloud with AutoML Vision

Opens on Coursera ↗

📚 External: Coursera ↗

Form Parsing with Document AI (Python)

Opens on Coursera ↗

Behavioral Marketing

📚 External: Coursera ↗

Behavioral Marketing

Opens on Coursera ↗

6G Evolution: Blockchain, Semantic Communications & Radar

📚 External: Coursera ↗

6G Evolution: Blockchain, Semantic Communications & Radar

Opens on Coursera ↗

📚 External: Coursera ↗

Create Image Captioning Models - Português Brasileiro

Opens on Coursera ↗

Fine-Tuning and Evaluating Vision AI Models

📚 External: Coursera ↗

Fine-Tuning and Evaluating Vision AI Models

Opens on Coursera ↗

Self-Driving Car Specialization Course

📚 External: Coursera ↗

Self-Driving Car Specialization Course

Opens on Coursera ↗