Answering instructions
This notebook demonstrates how you can modify default instructions for question types and answer comments.
Default question instructions
Each EDSL question type includes default instructions to the model about how to answer the question. We can view these instructions by inspecting the user prompt for a question that has been created (the other type of prompt, the system prompt, is for agent instructions).
For example, here we see that the default instruction for multiple choice questions is:
“Only 1 option may be selected. Respond only with a string corresponding to one of the options. After the answer, you can put a comment explaining why you chose that option on the next line.”
This text is automatically appended to the question text:
[1]:
from edsl import QuestionMultipleChoice, Survey, Model
q = QuestionMultipleChoice(
    question_name = "primary_color",
    question_text = "What is the most common primary color?",
    question_options = ["Red", "Yellow", "Blue"]
)
survey = Survey([q])
survey.show_prompts()
[1]:
user_prompt | system_prompt | interview_index | question_name | scenario_index | agent_index | model | estimated_cost | cache_key | |
---|---|---|---|---|---|---|---|---|---|
0 | What is the most common primary color? Red Yellow Blue Only 1 option may be selected. Respond only with a string corresponding to one of the options. After the answer, you can put a comment explaining why you chose that option on the next line. | nan | 0 | primary_color | 0 | 0 | gpt-4o | 0.000678 | 20e75009c72f3e88c490c58bf13d6a72 |
We can isolate the user prompt:
[2]:
survey.by(Model()).prompts().select("user_prompt")
[2]:
user_prompt | |
---|---|
0 | What is the most common primary color? Red Yellow Blue Only 1 option may be selected. Respond only with a string corresponding to one of the options. After the answer, you can put a comment explaining why you chose that option on the next line. |
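To make the structure of this prompt concrete, here is a minimal sketch of how the three pieces (question text, options, default instruction) combine into the string shown above. The helper `build_user_prompt` and the constant `DEFAULT_MC_INSTRUCTIONS` are hypothetical names used only for illustration; they are not part of EDSL, and the real template may use different whitespace than the flattened table output suggests.

```python
# Instruction text copied from the multiple choice prompt shown above.
DEFAULT_MC_INSTRUCTIONS = (
    "Only 1 option may be selected. "
    "Respond only with a string corresponding to one of the options. "
    "After the answer, you can put a comment explaining why you chose "
    "that option on the next line."
)


def build_user_prompt(question_text, question_options,
                      instructions=DEFAULT_MC_INSTRUCTIONS):
    """Hypothetical helper (not EDSL's implementation): join the question
    text, the options, and the answering instructions with single spaces."""
    parts = [question_text, *question_options, instructions]
    return " ".join(parts)


prompt = build_user_prompt(
    "What is the most common primary color?",
    ["Red", "Yellow", "Blue"],
)
print(prompt)
```

The point of the sketch is only that the instruction is a separate piece of text appended after the question and its options, which is why it can be changed or removed independently of the question itself.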
We can compare this with default instructions for other question types:
[3]:
from edsl import QuestionCheckBox, Survey, Model
q = QuestionCheckBox(
    question_name = "primary_colors",
    question_text = "Which colors are 'primary'?",
    question_options = ["Red", "Orange", "Yellow", "Green", "Blue", "Purple"]
)
survey = Survey([q])
survey.by(Model()).prompts().select("user_prompt")
[3]:
user_prompt | |
---|---|
0 | Which colors are 'primary'? 0: Red 1: Orange 2: Yellow 3: Green 4: Blue 5: Purple Please respond only with a comma-separated list of the code of the options that apply, with square brackets. E.g., [0, 1, 3] After the answer, you can put a comment explaining your choice on the next line. |
[4]:
from edsl import QuestionRank, Survey, Model
q = QuestionRank(
    question_name = "primary_colors_rank",
    question_text = "Rank the primary colors in terms of popularity.",
    question_options = ["Red", "Yellow", "Blue"]
)
survey = Survey([q])
survey.by(Model()).prompts().select("user_prompt")
[4]:
user_prompt | |
---|---|
0 | Rank the primary colors in terms of popularity. The options are 0: Red 1: Yellow 2: Blue You can inlcude up to 3 options in your answer. Please respond only with a comma-separated list of the code of the raked options, with square brackets. E.g., [0, 1, 3] After the answer, you can put a comment explaining your choice on the next line. |
[5]:
from edsl import QuestionLinearScale, Survey, Model
q = QuestionLinearScale(
    question_name = "primary_color_scale",
    question_text = "Most people know what the primary colors are.",
    question_options = [1,2,3,4,5],
    option_labels = {
        1:"This statement is completely inaccurate",
        5:"This statement is completely accurate."
    }
)
survey = Survey([q])
survey.by(Model()).prompts().select("user_prompt")
[5]:
user_prompt | |
---|---|
0 | Most people know what the primary colors are. 1 : This statement is completely inaccurate 2 : 3 : 4 : 5 : This statement is completely accurate. Only 1 option may be selected. Respond only with the code corresponding to one of the options. E.g., "1" or "5" by itself. After the answer, you can put a comment explaining why you chose that option on the next line. |
Formatting answers & comments
We can see that each default instruction includes directions on (1) formatting the answer and (2) providing a comment about the answer. When a question is administered, the contents of the comment that is returned are automatically stored in a separate field of the results. We can see this when we run a question and inspect the columns of the results that have been created. Here we run the multiple choice question created above:
[6]:
from edsl import QuestionMultipleChoice, Survey, Model
q = QuestionMultipleChoice(
    question_name = "primary_color",
    question_text = "What is the most common primary color?",
    question_options = ["Red", "Yellow", "Blue"]
)
r = q.run() # default model will be used
Job UUID | 30ba8018-ee73-4006-b773-06d76a02c9c5 |
Progress Bar URL | https://www.expectedparrot.com/home/remote-job-progress/30ba8018-ee73-4006-b773-06d76a02c9c5 |
Exceptions Report URL | None |
Results UUID | b2b46967-fdae-4b47-8402-4bfafcef37f0 |
Results URL | https://www.expectedparrot.com/content/b2b46967-fdae-4b47-8402-4bfafcef37f0 |
We can see that the results include a comment field:
[7]:
r.columns
[7]:
0 | |
---|---|
0 | agent.agent_index |
1 | agent.agent_instruction |
2 | agent.agent_name |
3 | answer.primary_color |
4 | cache_keys.primary_color_cache_key |
5 | cache_used.primary_color_cache_used |
6 | comment.primary_color_comment |
7 | generated_tokens.primary_color_generated_tokens |
8 | iteration.iteration |
9 | model.frequency_penalty |
10 | model.inference_service |
11 | model.logprobs |
12 | model.max_tokens |
13 | model.model |
14 | model.model_index |
15 | model.presence_penalty |
16 | model.temperature |
17 | model.top_logprobs |
18 | model.top_p |
19 | prompt.primary_color_system_prompt |
20 | prompt.primary_color_user_prompt |
21 | question_options.primary_color_question_options |
22 | question_text.primary_color_question_text |
23 | question_type.primary_color_question_type |
24 | raw_model_response.primary_color_cost |
25 | raw_model_response.primary_color_one_usd_buys |
26 | raw_model_response.primary_color_raw_model_response |
27 | scenario.scenario_index |
We can display it alongside any other fields:
[8]:
r.select("model", "primary_color", "primary_color_comment")
[8]:
model.model | answer.primary_color | comment.primary_color_comment | |
---|---|---|---|
0 | gpt-4o | Red | Red is often considered the most attention-grabbing and vibrant of the primary colors, making it commonly used and recognized. |
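As a rough illustration of why the answer and the comment end up in separate fields, the sketch below splits a raw response of the form requested by the default instruction (answer on the first line, comment on the next). The function `split_answer_and_comment` is a hypothetical stand-in written for this example; EDSL's actual parsing logic is not shown in this notebook and may differ.

```python
def split_answer_and_comment(raw_response):
    """Hypothetical parser (not EDSL's implementation): treat the first
    non-empty line as the answer and any remaining lines as the comment."""
    lines = [line.strip() for line in raw_response.strip().splitlines()]
    lines = [line for line in lines if line]
    if not lines:
        return None, None
    answer = lines[0]
    # Join any remaining lines; use None when no comment was given.
    comment = " ".join(lines[1:]) or None
    return answer, comment


# Assumed raw response, built from the answer and comment shown above.
raw = (
    "Red\n"
    "Red is often considered the most attention-grabbing and vibrant of "
    "the primary colors, making it commonly used and recognized."
)
answer, comment = split_answer_and_comment(raw)
print(answer)
print(comment)
```

A response that contains only an answer yields `None` for the comment in this sketch, which parallels the empty comment field seen when comments are turned off.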
Turning off comments
If desired, we can omit the instruction to provide a comment by passing the parameter include_comment=False to the question constructor. This can be useful when comments are not needed or to save tokens. Here we inspect how the question prompt has been modified and verify that the comment field in the results is blank:
[9]:
from edsl import QuestionMultipleChoice, Survey, Model
q = QuestionMultipleChoice(
    question_name = "primary_color",
    question_text = "What is the most common primary color?",
    question_options = ["Red", "Yellow", "Blue"],
    include_comment = False # optional
)
q.by(Model()).prompts().select("user_prompt")
[9]:
user_prompt | |
---|---|
0 | What is the most common primary color? Red Yellow Blue Only 1 option may be selected. Respond only with a string corresponding to one of the options. |
[10]:
r = q.run() # default model will be used
r.select("model", "primary_color", "primary_color_comment")
Job UUID | 009caecc-8a6a-49e7-b8ed-8f819b845e0a |
Progress Bar URL | https://www.expectedparrot.com/home/remote-job-progress/009caecc-8a6a-49e7-b8ed-8f819b845e0a |
Exceptions Report URL | None |
Results UUID | 60d3b168-7c94-45ed-a76d-97ed258e69e7 |
Results URL | https://www.expectedparrot.com/content/60d3b168-7c94-45ed-a76d-97ed258e69e7 |
[10]:
model.model | answer.primary_color | comment.primary_color_comment | |
---|---|---|---|
0 | gpt-4o | Blue | nan |
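To get a feel for how much prompt text the comment instruction adds, the sketch below compares the two user prompts shown in this notebook. It counts whitespace-separated words, which is only a crude proxy for model tokens (a real tokenizer will give different numbers), and it ignores the tokens saved in the response itself. The names `rough_word_count`, `removed_text` and `saved_words` are illustrative.

```python
# Prompts copied from the outputs shown above.
PROMPT_WITHOUT_COMMENT = (
    "What is the most common primary color? Red Yellow Blue "
    "Only 1 option may be selected. "
    "Respond only with a string corresponding to one of the options."
)
PROMPT_WITH_COMMENT = (
    PROMPT_WITHOUT_COMMENT
    + " After the answer, you can put a comment explaining why you chose "
    "that option on the next line."
)


def rough_word_count(text):
    """Whitespace-separated word count; a crude stand-in for a tokenizer."""
    return len(text.split())


# The text that include_comment=False removes from the prompt.
removed_text = PROMPT_WITH_COMMENT[len(PROMPT_WITHOUT_COMMENT):].strip()
saved_words = (
    rough_word_count(PROMPT_WITH_COMMENT)
    - rough_word_count(PROMPT_WITHOUT_COMMENT)
)
print(removed_text)
print(saved_words)
```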
Modifying comments
We can also modify the default instruction if we want to use the comment field in a different way. This can be done by passing the optional parameter answering_instructions to the question constructor. For example, here we pass an instruction that preserves the directions about the format of the answer to a multiple choice question (“Respond only with a string corresponding to one of the options.”) but replaces the comment part of the instruction with a new instruction for the model to note its second-choice answer instead. We include the original question in the survey as well for ease of comparison:
[11]:
from edsl import QuestionMultipleChoice, Survey, Model
q1 = QuestionMultipleChoice(
    question_name = "primary_color_v1",
    question_text = "What is the most common primary color?",
    question_options = ["Red", "Yellow", "Blue"]
)
q2 = QuestionMultipleChoice(
    question_name = "primary_color_v2",
    question_text = "What is the most common primary color?",
    question_options = ["Red", "Yellow", "Blue"],
    answering_instructions = """
Respond only with a string corresponding to one of the options.
After the answer, please provide your second choice on the next line.
"""
)
survey = Survey([q1, q2])
survey.by(Model()).prompts().select("user_prompt")
[11]:
user_prompt | |
---|---|
0 | What is the most common primary color? Red Yellow Blue Only 1 option may be selected. Respond only with a string corresponding to one of the options. After the answer, you can put a comment explaining why you chose that option on the next line. |
1 | What is the most common primary color? Red Yellow Blue Only 1 option may be selected. Respond only with a string corresponding to one of the options. After the answer, please provide your second choice on the next line. |
[12]:
r = survey.run() # default model will be used
Job UUID | b615466b-ab2e-4ba0-a9e6-5ddbbbf8a9f9 |
Progress Bar URL | https://www.expectedparrot.com/home/remote-job-progress/b615466b-ab2e-4ba0-a9e6-5ddbbbf8a9f9 |
Exceptions Report URL | None |
Results UUID | 5f0c22c9-205f-43c7-89b4-de08902cb8d2 |
Results URL | https://www.expectedparrot.com/content/5f0c22c9-205f-43c7-89b4-de08902cb8d2 |
[13]:
r.select("model", "primary_color_v1", "primary_color_v1_comment", "primary_color_v2", "primary_color_v2_comment")
[13]:
model.model | answer.primary_color_v1 | comment.primary_color_v1_comment | answer.primary_color_v2 | comment.primary_color_v2_comment | |
---|---|---|---|---|---|
0 | gpt-4o | Red | Red is often considered the most attention-grabbing and vibrant of the primary colors, making it commonly used and recognized. | Blue | Red |
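Once the comment field carries a second choice rather than free-form commentary, it may be worth checking that its contents are actually one of the question options. The sketch below shows one way to do that on plain strings. The function `parse_second_choice` is a hypothetical helper written for this example, not an EDSL function, and the rule that a second choice equal to the first is rejected is an assumption of this sketch.

```python
OPTIONS = ["Red", "Yellow", "Blue"]


def parse_second_choice(comment, options, first_choice):
    """Hypothetical helper: return the option named in the comment field,
    or None if the comment is missing, is not exactly one of the options,
    or repeats the first choice."""
    if comment is None:
        return None
    cleaned = comment.strip().rstrip(".")
    for option in options:
        if cleaned.lower() == option.lower():
            return option if option != first_choice else None
    return None


# Values taken from the results above: answer "Blue", comment "Red".
second = parse_second_choice("Red", OPTIONS, first_choice="Blue")
print(second)
```

Exact matching is deliberately strict here: if the model adds extra words around its second choice, the helper returns `None` so that the row can be reviewed by hand instead of being silently misread.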
Further reading
Please see the questions page of the documentation for more examples and details on all of the required and optional parameters for question types!
Here we post this notebook to Coop for reference:
[14]:
from edsl import Notebook
n = Notebook("answering_instructions_example.ipynb")
n.push(description = "Example answering instructions", visibility = "public")
[14]:
{'description': 'Example answering instructions',
'object_type': 'notebook',
'url': 'https://www.expectedparrot.com/content/994ffea0-4597-4442-9ead-f58314a426d4',
'uuid': '994ffea0-4597-4442-9ead-f58314a426d4',
'version': '0.1.39.dev9',
'visibility': 'public'}
[14]:
from edsl import Notebook
n = Notebook("answering_instructions_example.ipynb")
n.patch(uuid = "994ffea0-4597-4442-9ead-f58314a426d4", value = n)
[14]:
{'status': 'success'}