Basic Concept: LLM API billing is primarily based on token consumption. High costs resulting from staff using powerful, verbose models can be controlled by limiting the maximum tokens processed per interaction and restricting which models staff can access. CompTIA SecAI+ Study Guide covers token management as the primary cost control mechanism for AI deployments.
Why D is Correct: Implementing token limits caps the maximum tokens consumed per API call for both input and output. This directly controls the per-interaction cost by preventing excessively long prompts or overly verbose model responses from generating large token bills. Combined with model tier restrictions, token limits ensure that interactions with powerful models remain within budget constraints regardless of how extensively staff use the service.
Why A is Wrong: Storage monitoring tracks the utilization and performance of data storage systems. Storage costs are separate from LLM API token-based billing and monitoring storage does not address the high API charges resulting from excessive model usage by staff.
Why B is Wrong: Modality types refer to the input formats an AI model accepts such as text, images, audio, or video. While different modalities have different pricing, managing modality types is not the direct cost control lever. Token consumption is the primary cost driver for text-based interactions.
Why C is Wrong: Prompt firewalls inspect and filter prompt content for security and policy compliance. While they can block certain types of queries, they are designed for security purposes, not as financial controls to limit token consumption or enforce cost budgets across staff usage.
Submit