Answerability time is a common metric used for evaluating inference efficiency. It is defined as the time to first answer for answerable queries, and the total time for unanswerable queries.
Answerability time is a common metric used for evaluating inference efficiency. It is defined as the time to first answer for answerable queries, and the total time for unanswerable queries.