Evaluate LLM responses
Use Label Studio UI for LLM evaluation
Connect to Label Studio
Let’s connect to the running Label Studio instance. You need API_KEY
that can be found in Account & Settings
-> API Key
section.
Different LLM Evaluation Strategies
There are several strategies to evaluate LLM responses, depending on the complexity of the system and specific evaluation goals.
Create Evaluation Task
Picking one of the provided evaluation strategies, you can now upload your task
to created Label Studio project
:
Now open the Label Studio UI and navigate to http://localhost:8080/projects/{project.id}/data?labeling=1
to start LLM evaluation.
Collect Annotated Data
The final step is to collect the annotated data from the Label Studio project. You can export the annotations in various formats like JSON, CSV, or directly to cloud storage providers.
You can also use a Python SDK to retrieve the annotations. For example, to collect and display all user choices from the project: