Checkout

Cart () Loading...

    • Quantity:
    • Delivery:
    • Dates:
    • Location:

    $

Contact Sales

Building AI Agents with Multimodal Models

Learn how to build neural network agents that reason across multiple data types using advanced fusion techniques, OCR, and NVIDIA AI Blueprints for real-world applications like robotics and healthcare.

GK# 847003
Vendor Credits:
No matching courses available.
Start learning as soon as today! Click Add To Cart to continue shopping or Buy Now to check out immediately.
Access Period:
Scheduling a custom training event for your team is fast and easy! Click here to get started.

What You'll Learn

In this course, you will learn about:

  • Different data types and how to make them neural network ready
  • Model fusion, and the differences between early, late, and intermediate fusion
  • PDF extraction using OCR
  • The difference between modality and agent orchestration
  • Customization of NVIDIA AI Blueprints with Video Search and Summarization (VSS)