Microsoft AI Transformation Leader AB-731 Exam Questions

Page: 1 / 14
Total 53 questions
Question 1

Your company receives thousands of scanned invoices each month. You need to recommend an AI solution that can automatically extract key details, such as invoice numbers, vendor names, and total amounts. What is the best solution to recommend? More than one answer choice may achieve the goal. Select the BEST answer.



Answer : A

For scanned invoices, the requirement is structured field extraction (invoice number/ID, vendor, totals) from document images or PDFs at scale. The best fit is Azure Document Intelligence because it is purpose-built for document processing and provides prebuilt invoice models that combine OCR with layout/structure understanding to extract common invoice fields into a structured output. Microsoft's invoice model is explicitly designed to analyze invoices (including scanned images) and return key fields and line items in structured form, which directly maps to this scenario.

Azure Vision (B) can perform OCR and basic image analysis, but OCR alone typically returns text without robust invoice-specific field interpretation (e.g., reliably identifying ''Invoice ID'' vs. ''Order ID,'' totals vs. subtotals, vendor vs. ship-to). Document Intelligence is optimized for advanced document structure extraction and is therefore the ''best'' single recommendation.

Azure AI Search (C) focuses on indexing and retrieval/knowledge mining across a corpus; it's not the primary service for extracting invoice fields for downstream processing. Azure Machine Learning (D) could be used to build a custom model, but that adds cost and time compared with a prebuilt invoice extractor designed for this document type.


Question 2

Your company plans to implement a proof of concept PoC agent that uses Azure OpenAI. The solution must start small and provide flexibility to scale usage as demand grows. Which pricing model should you use?



Answer : D

For a proof of concept, the key requirements are low commitment, quick start, and the ability to scale up or down as you learn what real usage looks like. Azure OpenAI Standard On-Demand pricing is designed for exactly that: you pay per token consumed (input and output) on a pay-as-you-go basis, which makes it ideal when demand is uncertain or variable---typical in early pilots and PoCs.

By contrast, Provisioned (PTUs) is best when you have well-defined, predictable throughput and latency requirements---usually a more mature, production workload. PTUs involve reserving model processing capacity to achieve consistent performance and more predictable costs, which is usually premature for a PoC where actual traffic patterns are not yet known.

Batch API is optimized for asynchronous high-volume jobs with a target turnaround (for example, up to 24 hours) and discounted pricing. That's great for offline processing, but it does not match an interactive ''agent'' PoC that typically needs near-real-time responses and iterative testing.

Microsoft 365 Copilot is a separate SaaS licensing model and is not the Azure OpenAI pricing model for building your own agent solution.


Question 3

Your company plans to use generative AI to help project managers and engineers work with construction blueprints stored as PDF files. You need to recommend a generative AI solution that processes both images and text, summarizes building design, answers questions, and extracts information such as locations of electrical, heating, and plumbing systems. What should you recommend?



Answer : A

Construction blueprints in PDFs often contain a mix of text, symbols, linework, and diagrams. The requirements include understanding both visual layout (where systems are located) and textual annotations, producing summaries, and answering Q&A. That combination requires a multimodal generative AI approach---models that can reason over images and text together. Therefore, A is best.

OCR alone (B) can extract printed text, but it won't reliably interpret diagram geometry, symbols, or spatial relationships (e.g., ''electrical riser is on the east core near gridline B-4''). Text completion (C) is too generic and doesn't address image understanding. Document summarization (D) is only one requirement (summary) and still depends on first extracting/understanding both visual and textual elements.

A multimodal solution can ingest the PDF pages as images (or rendered page images) plus extracted text, then answer questions grounded in both modalities. In practice, you may combine OCR and layout extraction with a multimodal LLM so the model can reference drawing regions, legends, callouts, and system diagrams to produce accurate explanations and field extractions.


Question 4

Your company manages an online catalog of office supplies. You plan to use a generative AI solution to create product descriptions for your company's website. The solution must ensure descriptions can be posted immediately after creation, enable selection/inclusion of product details, and be fast and simple for non-technical staff. What is the best type of solution to use? Select the BEST answer.



Answer : D

The task is high-volume content generation with consistent structure and immediate publishing: product descriptions that reliably include chosen product attributes (brand, specs, materials, dimensions, use cases) and can be produced quickly by non-technical staff. The best fit is a fine-tuned LLM (D) because fine-tuning can standardize tone, format, and completeness against your catalog schema, reducing variability and minimizing manual editing before posting. With a fine-tuned model, you can strongly enforce style guidelines (length, voice, prohibited claims), and you can template prompts so staff only supply product fields and get publish-ready copy.

Option A is not best: Azure Machine Learning is excellent for predictive models but is unnecessary for straightforward text generation. B (Researcher) is optimized for multistep research across work data + web, not deterministic product copy generation. C (interactive agent) can help collect requirements, but it's more complexity than needed; the core need is consistent text generation from structured product data, which fine-tuning addresses directly while keeping user interaction simple (fill fields generate description).


Question 5

Your company stores thousands of reports and documents across multiple systems. You recommend using Azure AI Search as part of a new generative AI solution to improve information discovery. What is a key benefit of using Azure AI Search in this scenario?



Answer : B

Azure AI Search provides an indexing and retrieval layer that makes large, distributed document collections searchable in a consistent way. The key benefit in an information discovery scenario is that it can index content from many sources and then retrieve relevant documents/passages using rich query capabilities, including natural language-style queries and semantic ranking. That directly aligns with B.

This retrieval capability is foundational for RAG architectures: the system uses Azure AI Search to find the best matching content, then supplies those results to a generative model so the answer is grounded in organizational knowledge. That improves relevance and reduces hallucinations because the model is guided by retrieved evidence.

Option A is the opposite of what you want---Search is used precisely to reference existing data. C is more aligned to workflow automation platforms (Logic Apps/Power Automate) and document processing services. D describes fine-tuning, which is a different approach; Azure AI Search improves discovery and grounding through retrieval, not by changing model weights.


Question 6

Your company plans to build a generative AI solution based on internal dat

a. You recommend using Microsoft Foundry as a starting point to develop and manage the solution. What is a key benefit of using Microsoft Foundry for this project?



Answer : A

Microsoft Foundry is positioned as a unified, enterprise-grade platform that helps organizations build, deploy, scale, and govern AI apps and agents---especially generative AI solutions that need to work with business context and internal data. That directly aligns with A: Foundry provides a scalable platform for developing and deploying generative AI solutions. Microsoft describes Foundry as an interoperable platform that makes it easier to build, deploy, and scale AI apps and agents, while also providing centralized security and governance features for organizations.

B is incorrect because Foundry does not remove model choice/configuration; in fact, it supports selecting among models and using tools/frameworks to build solutions. You still choose appropriate model(s), configure endpoints, and design grounding and safety controls.

C and D are not the best characterization of Foundry's primary benefit. While Foundry offers ''friendly interfaces,'' Microsoft primarily positions it for developers, model builders, and enterprise AI operations---not as a low-code platform for business users (that role is more commonly filled by Copilot Studio/Power Platform).


Question 7

Which statement accurately describes the difference between a pretrained generative AI model and a fine-tuned generative AI model?



Answer : C

A pretrained generative AI model is trained initially on a large, broad, and diverse dataset so it learns general language (or multimodal) patterns and capabilities. Fine-tuning then takes that pretrained base and performs additional training on a smaller, task- or domain-specific dataset to specialize behavior---improving performance for a particular use case, tone, style, or domain knowledge representation. That is exactly what option C states, making it the correct answer.

Option A is incorrect because both pretraining and fine-tuning may use labeled or unlabeled data depending on the technique; the distinction is not ''labeled vs. unlabeled.'' Option B is incorrect because a pretrained model is not ''faster to train'' due to fewer parameters; pretraining is typically the most compute-intensive phase precisely because it's done at large scale, while fine-tuning is smaller but still trains the same model architecture. Option D is reversed: the pretrained model is the general-purpose foundation, while the fine-tuned model is the specialized variant for a specific task or dataset.


Page:    1 / 14   
Total 53 questions