Exam AIF-C01 Topic 3 Question 147 Discussion

Actual exam question for Amazon's AIF-C01 exam
Question #: 147
Topic #: 3
A social media company wants to use a large language model (LLM) to summarize messages. The company has chosen a few LLMs that are available on Amazon SageMaker JumpStart. The company wants to compare the generated output toxicity of these models.
Which strategy gives the company the ability to evaluate the LLMs with the LEAST operational overhead?

Suggested Answer: B Vote an answer

The least operational overhead comes from automated tools that can scan and evaluate LLM outputs for toxicity. AWS and SageMaker JumpStart support integrations with automatic evaluation tools and APIs (such as Amazon Comprehend or third-party toxicity classifiers).
* B is correct: Automated evaluation provides quick, scalable, and repeatable analysis, requiring minimal human intervention.
* A and C require manual effort, increasing operational overhead.
* D (RLHF) is resource-intensive and not designed for rapid, automated model comparison.
"Automated evaluation can quickly assess generated text for specific attributes like toxicity, sentiment, or compliance using pre-trained classifiers, reducing human involvement and operational complexity." (Reference: AWS SageMaker JumpStart Evaluation, AWS AI Practitioner Guide)

by Rachel at Dec 08, 2025, 04:42 AM

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
Nick name: Submit Cancel
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

0
0
0
10