AI Skillset Course
Building AI Agents with Multimodal Models image
Current
Intermediate

Building AI Agents with Multimodal Models

NVIDIA Deep Learning Institute (DLI) · NVIDIA · Updated March 2026

Platform rating

4.6/5

Champ rating

8.6/10

Duration

8 hours instructor-led

Classes

24

Learn to build powerful AI agents using multimodal models that combine text, image, and video understanding for complex reasoning tasks.

What you'll get

Build AI agents using multimodal foundation models
Implement vision-language reasoning pipelines
Deploy multimodal agents for real-world tasks
Evaluate agent performance and reliability

Fit

Best for

Developers
AI Engineers
Data Scientists
Technical Builders

Not ideal for

Learners seeking only entry-level overviews

Prerequisites & pricing

Prerequisites

Python and deep learning experience

Pricing

Contact for pricing

Certification

Certificate

AI Agents
Multimodal AI
LLM
Computer Vision
NVIDIA
Go to Course

Alternatives to Building AI Agents with Multimodal Models

Current
Champ's Pick

Intro to AI Agents

Codecademy · Codecademy

4.8
8.8/10
<1 hour

Beginner course on agentic AI concepts, autonomous systems, retrieval, and tool integration for workplace usage.

Free (certificate with Plus/Pro)
View
Current
Champ's Pick

AI Agents Course

Hugging Face · Hugging Face

4.8
8.8/10
Recommended weekly pace (~3-4 hours/week)

Free interactive course on agent fundamentals, frameworks, real-world assignments, and benchmark challenges with optional certification.

Free
View
Current
Champ's Pick

Model Context Protocol (MCP) Course

Hugging Face · Hugging Face

4.8
8.8/10
Recommended weekly pace (~3-4 hours/week)

Free MCP course (with Anthropic collaboration) focused on protocol architecture, SDKs, end-to-end apps, and deployment-oriented use cases.

Free
View
Current
Champ's Pick

Level Up Your AI Agent Skills

Databricks Academy · Databricks

4.8
8.8/10
90 minutes

Free 90-minute AI agent fundamentals training with four videos, industry use cases, and badge-based assessment.

Free
View