Skip to main content

Representative samples FAQ

Who is eligible to be in a representative sample of the UK / US?

In order for a participant to be eligible for a representative sample, there must be space remaining in their matching age, sex, or ethnicity subgroup. Additionally, they must be a current resident of the country being sampled and they must be fluent in the language of that country.

Where does Prolific get its census data from?

We use census data from the US Census Bureau and the UK Office of National Statistics.

  • The age by sex by ethnic group proportions for the UK can be calculated from 2021/2022 census data. We have combined data for England & Wales, Northern Ireland and Scotland*.
  • The age by sex by ethnic group proportions for the US can be calculated from US Census Bureau population group estimates from 2021, found here.
  • The age by sex by ethnic group proportions for the US political representative sample can be calculated from the US Census Bureau population group estimates from 2021, found here. The political affiliation data from 2022 is here.

*The age/sex/ethnicity breakdowns for Scotland are due to be released later in 2024. In the interim, we have used the available 2022 census age/sex breakdowns and overlaid the most up-to-date census data on ethnicity from 2011.

Please note that the age brackets and age/sex/ethnicity proportions used for the UK and US non-political representative samples were updated on 27th February 2024 to use the most up-to-date data. If you have any questions about this, please reach out to our Support Team using the link at the bottom of this article.

How does Prolific create the demographic subgroups used for representative samples?

Participant eligibility for representative samples is calculated on the basis of prescreening answers. We use three prescreeners: 'date of birth', 'sex’, and 'ethnicity (simplified)’.

Standard representative sample

  • Starting with the youngest allowable participation age on Prolific, we stratify age using seven brackets: 18-24, 25-34, 35-44, 45-54, 55-64, 65-74 and 74+.
  • ‘Sex’ is stratified into male and female.
  • ‘Ethnicity (simplified)’ is stratified into the five categories recommended by the UK Office of National Statistics: White, Mixed, Asian, Black and Other.

Political representative sample

  • Starting with the youngest allowable participation age on Prolific, we stratify age using seven brackets: 18-24, 25-34, 35-44, 45-54, 55-64, 65-74 and 74+.
  • ‘Sex’ is stratified into male and female.
  • ‘Ethnicity (simplified)’ is stratified into the five categories recommended by the UK Office of National Statistics: White, Mixed, Asian, Black and Other.
  • 'Political affiliation' is stratified into Republican, Democrat, and Independent.

What type of allocation algorithm does Prolific use?

Standard representative samples

Cross stratifying on age (5 brackets), ethnicity (5 groups) and sex (2 groups) results in 50 subgroups: one for every combination of answers.

Political representative samples

Cross stratifying on age (7 brackets), ethnicity (5 groups), sex (2 groups) and political affiliation (3 groups) results in 210 subgroups: one for every combination of answers.

All representative samples

Using census data, we can calculate the proportion of each subgroup in the national population. In order to ensure minimal representation, we first allocate 1 space per subgroup. We then allocate the remaining spaces in the sample proportionally, according to the national population.

Where we need to round subgroup sizes (i.e. proportionate allocation indicates a subgroup should have 2.6 participants) we try to round to the closest whole number, while balancing round-ups and round-downs to ensure the total sample size remains exactly what was asked for.

How are participants recruited to my representative sample?

Participants take part in a representative sample study in exactly the same way as a normal Prolific study. In other words, a representative sample is collected on a (mostly) first come, first serve basis: though we do have processes in place to ensure fair distribution of studies across the participant pool.

How long does it take to collect a representative sample?

Representative samples tend to take longer to recruit than Standard samples. Representative samples include demographics with smaller population numbers, so it can be hard to guarantee all demographics will be online on a daily basis to take your study. We're always striving to recruit more participants in niche demographics to improve this service for you. Typically, we expect a representative sample study to complete within 2 to 4 days.

What can I do if my sample is slow to fill?

If a representative sample study is still awaiting submissions 48 hours after you have launched it on Prolific, you will have the option to reallocate your sample by loosening the eligibility requirements of each remaining place a little in order to strike a balance between timely data collection and sample representativeness.

If you choose to reallocate your sample, you will not be able to undo this or later increase places on your study.

We find most representative samples are at least 90% full after 48 hours and fill completely after loosening. This normally results in a sample accuracy of 95% or more.

How does reallocating work?

Reallocating removes age bracket restrictions from all unfilled places in the sample.

As an example, if you have 2 unfilled places for 28-37-year-old Asian women in your study 48 hours after publication. These places will normally be opened up to all Asian women in the country, regardless of age.

Additionally, the algorithm will monitor the overall demographic distribution based on sex and ethnicity. This means that sometimes reallocated places may be reassigned to a different sex/ethnicity to give the most accurate representation of the population.

How can I calculate sample accuracy?

You can calculate final representative sample accuracy by summing the total number of subgroup requirements met by participants in the final sample, and dividing it by the total number of requirements that could have possibly been met.

For example: You have a final sample of 1000 participants.

  1. The total possible number of requirements your sample could meet is 3000. 3 (age, ethnicity, sex) x 1000 (the total sample size)
  2. 900 places have been filled by participants that met their subgroup’s requirements for age, ethnicity and sex: 900 x 3 = 2700 requirements met.
  3. 80 participants were collected after one round of reallocation: These participants met the requirements for ethnicity and sex: 80 x 2 = 160 requirements met.
  4. 20 participants were collected after two rounds of reallocation: These participants met the requirement for sex only: 20 x 1 = 20 requirements met.
  5. 2700 + 160 + 20 = 2880
  6. 2880 / 3000 = 96% sample accuracy.

 


I need further help

 Click here to contact us


Was this article helpful?
powered by Typeform