A team notices vague, inconsistent LLM outputs for the same story for two different prompts. Which technique BEST helps choose the stronger wording among two prompt versions using predefined metrics?
Who typically defines the system prompt in a testing workflow?
Which competency MOST helps testers steer LLMs to produce useful, on-policy testware?
Which setting can reduce variability by narrowing the sampling distribution during inference?
Which standard specifies requirements for managing AI systems within an organization, supporting consistent GenAI use in testing?
Which option BEST differentiates the three prompting techniques?
Which consideration BEST aligns LLM choice with organizational goals in a GenAI testing strategy?
Which concept refers to breaking text into smaller units for processing by LLMs?
Which of the following is NOT a valid form of LLM-driven test data generation?
You are using an LLM to assist in analyzing test execution trends to predict potential risks. Which of the following improvements would BEST enhance the LLM's ability to predict risks and provide actionable alerts?