When you run a study on Prolific, you'll usually download at least two datasets:
A demographic export from Prolific
Your survey response data from your external survey platform
Combining these datasets can make analysis easier by allowing you to view participant demographics alongside their survey responses in a single spreadsheet.
The exact steps may vary depending on the spreadsheet software you're using, such as Microsoft Excel, Google Sheets, or Apple Numbers.
Before you start
Open both spreadsheets side by side and locate the column containing the Prolific Participant ID in each file.
In the Prolific demographic export, the Participant ID is typically found in column B.
In your survey response export, the Participant ID column may vary depending on the platform.
Because the demographic export is usually the smaller dataset, we recommend adding the demographic information into your survey response spreadsheet.
Option 1: Use a formula (recommended)
Formulas allow you to automatically match participants across both spreadsheets using their Prolific Participant ID.
You can use functions such as:
XLOOKUP
INDEX-MATCH
VLOOKUP
This guide uses VLOOKUP as an example.
Step 1: Insert a VLOOKUP formula
In your survey response spreadsheet, select the cell where you want the demographic data to appear and enter a VLOOKUP formula.
The formula requires four pieces of information:
Lookup Value
The unique identifier used to match participants between spreadsheets.
In this case, use the Prolific Participant ID.
Table Array
The range of data from the demographic export that contains the information you want to import.
Important: The Participant ID column must be the leftmost column in the selected range.
Column Index Number
The position of the demographic column you want to return, counting from the Participant ID column.
For example:
Participant ID is in column A
Age is in column D
Since column D is the fourth column in the selected range, enter 4 as the column index number.
Range Lookup
This determines whether the match must be exact.
Enter FALSE to ensure only exact Participant ID matches are returned.
Step 2: Lock your references
To prevent the selected data range from changing when you copy the formula down the spreadsheet, lock the cell references using $ symbols.
In many spreadsheet applications, you can quickly do this by selecting the reference and pressing F4.
Step 3: Copy the formula down
Once you've confirmed the first formula returns the correct result:
Select the cell containing the formula.
Drag the fill handle down to apply the formula to all remaining rows.
Step 4: Repeat for additional columns
If you need to import multiple demographic fields, repeat the process for each column.
Step 5: Paste as values
After the formulas have returned the correct data:
Copy the populated cells.
Paste them back into the same location using Paste Special → Values (or the equivalent option in your spreadsheet software).
This converts the formulas into static data and prevents changes if the original spreadsheet is moved or edited later.
Option 2: Sort both spreadsheets by Participant ID
If using formulas isn't suitable, you can manually align the spreadsheets by sorting them.
Step 1: Sort both files
In each spreadsheet:
Select the Participant ID column.
Sort the data in ascending order.
Important: Make sure the entire spreadsheet is sorted, not just the Participant ID column. Sorting only one column will misalign your data.
Step 2: Verify the sort
Before copying any data, compare a few rows between both spreadsheets.
For example, check that the Participant ID in row 26 of one spreadsheet matches the Participant ID in row 26 of the other spreadsheet.
Spot-checking helps ensure the data has been sorted correctly.
Step 3: Copy the demographic data
Once both spreadsheets are aligned:
Copy the demographic columns from the Prolific export.
Paste them into the survey response spreadsheet.
We recommend pasting the data into new columns at the end of the spreadsheet so that no existing data is overwritten.
Which method should I use?
We recommend using a lookup formula whenever possible, as it:
Reduces the risk of human error
Automatically matches participants by ID
Works even when the datasets contain different numbers of rows
Makes it easier to update your data later
Sorting and manually copying data can be useful for smaller datasets, but requires extra care to ensure both spreadsheets remain perfectly aligned.

