Skip logic & scenarios

This notebook provides example EDSL code for using a language model to simulate a survey that uses skip logic: rules for determining which questions are administered based on responses to other questions in the survey.

In the first example below we construct a survey of questions and then add a rule to skip one question based on the response to another question.

In the second example we add some complexity. We first create different “scenarios” (versions) of questions and combine them in a survey. Then we add multiple rules to skip specific versions of the questions based on responses to a particular version of a question.

EDSL is an open-source library for simulating surveys, experiments and other research with AI agents and large language models. Before running the code below, please ensure that you have installed the EDSL library and either activated remote inference from your Coop account or stored API keys for the language models that you want to use with EDSL. Please also see our documentation page for tips and tutorials on getting started using EDSL.

Example 1

In the first example below we construct questions, combine them in a survey, and add a rule to skip the second question based on the response to the first question. Then we create Scenario objects for contents that will be added to the questions when the survey is run. The effect of this is that the second question will be skipped based on the response to the first question for each individual scenario.

We start by constructing questions:

[1]:

from edsl import QuestionYesNo, QuestionNumerical, QuestionMultipleChoice

q1 = QuestionYesNo(
    question_name = "recent_purchase",
    question_text = "In the last year have you or anyone in your household purchased any {{ scenario.item }}?",
)

q2 = QuestionNumerical(
    question_name = "amount",
    question_text = "In the last year, how much did your household spend on {{ scenario.item }} (in USD)?"
)

q3 = QuestionMultipleChoice(
    question_name = "next_purchase",
    question_text = "When do you next expect to purchase {{ scenario.item }}?",
    question_options = [
        "Never",
        "Within the next month",
        "Within the next year",
        "I do not know"
    ]
)

We combine the questions in a survey to administer them together:

[2]:

from edsl import Survey

survey = Survey(questions = [q1, q2, q3])
survey

[2]:

Survey # questions: 3; question_name list: ['recent_purchase', 'amount', 'next_purchase'];

	question_options	question_name	question_type	question_text
0	['No', 'Yes']	recent_purchase	yes_no	In the last year have you or anyone in your household purchased any {{ scenario.item }}?
1	nan	amount	numerical	In the last year, how much did your household spend on {{ scenario.item }} (in USD)?
2	['Never', 'Within the next month', 'Within the next year', 'I do not know']	next_purchase	multiple_choice	When do you next expect to purchase {{ scenario.item }}?

Here we add a rule to skip q2 based on the response to q1:

[3]:

survey = survey.add_skip_rule(q2, "{{ recent_purchase.answer }} == 'No'")

Next we create scenarios for the “item” to be used with each question:

[4]:

from edsl import Scenario, ScenarioList

s = ScenarioList(
    Scenario({"item":item}) for item in ["electronics", "phones"]
)

Note that we could also use a method for the data type that we are using–this is equivalent:

[5]:

s = ScenarioList.from_list("item", ["electronics", "phones"])
s

[5]:

ScenarioList scenarios: 2; keys: ['item'];

	item
0	electronics
1	phones

We can inspect the flow of the survey that has been created with the scenarios that we’re using:

[6]:

survey.by(s).show_flow()

../_images/notebooks_skip_logic_scenarios_13_0.png

Next we create some agent personas to answer the questions:

[7]:

from edsl import Agent, AgentList

income_levels = ["under $100,000", "$100,000-250,000", "above $250,000"]
ages = [30, 50, 70]

a = AgentList(
    Agent({"annual_income":income, "age":age}) for income in income_levels for age in ages
)
a

[7]:

AgentList agents: 9;

	annual_income	age
0	under $100,000	30
1	under $100,000	50
2	under $100,000	70
3	$100,000-250,000	30
4	$100,000-250,000	50
5	$100,000-250,000	70
6	above $250,000	30
7	above $250,000	50
8	above $250,000	70

Next we select a model to generate the responses (check available models and pricing):

[8]:

from edsl import Model

m = Model("gemini-1.5-flash")

We can inspect (or modify) the default parameters of the model that will be used:

[9]:

[9]:

gemini-1.5-flash

	key	value
0	model	gemini-1.5-flash
1	parameters:temperature	0.500000
2	parameters:topP	1
3	parameters:topK	1
4	parameters:maxOutputTokens	2048
5	parameters:stopSequences	[]
6	inference_service	google

We run the survey by adding any scenarios, agents and models and then calling the run:

[10]:

results = survey.by(s).by(a).by(m).run()

⌃ Job Status 🦜

Completed (18 completed, 0 failed)

Job Links

Results

Progress Report

Content

Remote Jobs

Remote Cache

Identifiers

Results UUID:

28bb67bb...811e

Use Results.pull(uuid) to fetch results.

Job UUID:

ef305b0f...5d03

Use Jobs.pull(uuid) to fetch job.