
Extract Insights from Visual Data on Azure (AI-3008)
Course Duration
1 Day
Audience
Employees of federal, state and local governments; and businesses working with the government.
Prerequisites
Basic programming experience and a general understanding of cloud or AI concepts.
Course Description
This 1-day course focuses on building intelligent applications that can see, interpret, and reason over images and documents using multimodal models and agent-based tools. Learners explore how visual and document inputs can be combined with language models to enable structured extraction, analysis, and decision-making workflows. The course emphasizes practical patterns for extracting information, orchestrating tools, and grounding model responses in visual data.
Learning Objectives
- Build vision-enabled generative AI applications using multimodal models
- Generate images and videos programmatically using Microsoft Foundry
- Analyze images and documents using Azure Content Understanding
- Extract structured data from forms and documents with Azure Document Intelligence
- Build knowledge mining pipelines using Azure AI Search
- Combine visual and language models to enable intelligent decision-making workflows
Course Outline
Module 1: Develop a Vision-Enabled Generative AI Application
- Use a vision-capable model in the Microsoft Foundry portal
- Develop a vision-based chat app
Module 2: Generate Images with AI
- What are image-generation models?
- Explore image-generation models in Microsoft Foundry portal
- Create a client application that uses an image generation model
Module 3: Generate Videos with Microsoft Foundry
- Deploy a video generating model
- Generate video from a prompt
- Generate video in Python
Module 4: Analyze Images with Content Understanding
- What is Content Understanding?
- Analyze images with Content Understanding
Module 5: Create a Multimodal Analysis Solution with Azure Content Understanding
- What is Azure Content Understanding?
- Create a Content Understanding analyzer
- Use the Content Understanding API
Module 6: Create an Azure Content Understanding Client Application
- Prepare to use the AI Content Understanding API
- Create a Content Understanding analyzer
- Analyze content
Module 7: Extract Data with Azure Document Intelligence
- What is Azure Document Intelligence?
- Use the Document Intelligence Studio
- Use prebuilt models
- Train and use custom models
Module 8: Create a Knowledge Mining Solution with Azure AI Search
- What is Azure AI Search?
- Extract data with an indexer
- Enrich extracted data with AI skills
- Search an index
- Persist extracted information in a knowledge store
Frequently Asked Questions
What does the Extract Insights from Visual Data on Azure (AI-3008) course cover?
This course covers building intelligent applications that analyze images and documents using multimodal models, Azure Content Understanding, Azure Document Intelligence, and Azure AI Search. IT Dojo delivers it as live instructor-led training for federal and DoD technology professionals.
How long is IT Dojo's AI-3008 training?
AI-3008 is a 1-day course. It is available as live remote online instruction or on-site at your facility.
Who should attend this course?
Developers, AI engineers, and technical professionals who want to build applications that work with images and documents using multimodal, agent-driven approaches on Azure.
Does IT Dojo offer this training on-site at government or DoD facilities?
Yes. IT Dojo delivers AI-3008 on-site at government agencies, DoD commands, military installations, and contractor facilities. Contact IT Dojo to schedule.
How do I register for this course?
IT Dojo training is employer-sponsored — your organization registers and pays for seats. Contact IT Dojo via the Request Training form or call 757-216-3656.