Databricks Certified Generative AI Engineer Associate Databricks-Generative-AI-Engineer-Associate Question # 11 Topic 2 Discussion

Databricks-Generative-AI-Engineer-Associate Exam Topic 2 Question 11 Discussion:
Question #: 11
Topic #: 2

After changing the response-generating LLM in a RAG pipeline from GPT-4 to a self-hosted model with a shorter context length, the Generative AI Engineer is getting the following error:

[Image: error message for Question 11, not available as text]

What TWO solutions should the Generative AI Engineer implement without changing the response-generating model? (Choose two.)


A. Use a smaller embedding model to generate

B. Reduce the maximum output tokens of the new model

C. Decrease the chunk size of embedded documents

D. Reduce the number of records retrieved from the vector database

E. Retrain the response-generating model using ALiBi
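
For anyone weighing options C and D, here is a rough sketch of the token arithmetic involved. It assumes the error (not reproduced above) is a context-length overflow, and that the prompt is built from a fixed template, the user question, and the retrieved chunks. The function names, the 4096-token limit, and the template and question sizes are all hypothetical numbers chosen for illustration, not values from the question.

```python
def prompt_tokens(num_chunks: int, chunk_tokens: int,
                  question_tokens: int = 50, template_tokens: int = 100) -> int:
    """Estimate prompt size: retrieved chunks plus question plus template.

    All sizes are illustrative assumptions, measured in tokens.
    """
    return num_chunks * chunk_tokens + question_tokens + template_tokens


def fits_in_context(context_limit: int, num_chunks: int, chunk_tokens: int) -> bool:
    """Return True if the estimated prompt fits within the model's context limit."""
    return prompt_tokens(num_chunks, chunk_tokens) <= context_limit


# Hypothetical self-hosted model with a 4096-token context limit.
LIMIT = 4096

# Original setup: 10 retrieved records of 512 tokens each -> 5270 tokens.
print(fits_in_context(LIMIT, num_chunks=10, chunk_tokens=512))

# Fewer records retrieved (option D): 5 x 512 -> 2710 tokens.
print(fits_in_context(LIMIT, num_chunks=5, chunk_tokens=512))

# Smaller chunks (option C): 10 x 256 -> 2710 tokens.
print(fits_in_context(LIMIT, num_chunks=10, chunk_tokens=256))
```

Under these assumptions, both the chunk size and the number of retrieved records scale the prompt length directly, while the embedding model's size does not appear in the calculation at all.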

