Exam AIF-C01 Topic 3 Question 147 Discussion

Actual exam question for Amazon's AIF-C01 exam
Question #: 147
Topic #: 3

A social media company wants to use a large language model (LLM) to summarize messages. The company has chosen a few LLMs that are available on Amazon SageMaker JumpStart. The company wants to compare the generated output toxicity of these models.
Which strategy gives the company the ability to evaluate the LLMs with the LEAST operational overhead?

A. Crowd-sourced evaluation B. Automatic model evaluation C. Model evaluation with human workers D. Reinforcement learning from human feedback (RLHF)

Suggested Answer: B Vote an answer

The least operational overhead comes from automated tools that can scan and evaluate LLM outputs for toxicity. AWS and SageMaker JumpStart support integrations with automatic evaluation tools and APIs (such as Amazon Comprehend or third-party toxicity classifiers).
* B is correct: Automated evaluation provides quick, scalable, and repeatable analysis, requiring minimal human intervention.
* A and C require manual effort, increasing operational overhead.
* D (RLHF) is resource-intensive and not designed for rapid, automated model comparison.
"Automated evaluation can quickly assess generated text for specific attributes like toxicity, sentiment, or compliance using pre-trained classifiers, reducing human involvement and operational complexity." (Reference: AWS SageMaker JumpStart Evaluation, AWS AI Practitioner Guide)

by Rachel at Dec 08, 2025, 04:42 AM

Limited Time Offer

15%

Off

Get Premium AIF-C01 Questions as Interactive Self Test Engine or PDF

Comments

0 Satisfied Customers

0 Shares

0 Demo Downloads

10 Years in Business