21 Structure of the Method

The method section is normally split into a series of subsections, each detailing different aspects of how you went about running your study. For your report, we recommend using the following subsections:

  • Participants

  • Materials

  • Procedure

  • Design and Data Analysis

As you read more journal articles, you might notice additional subsections, a different order of presentation, or some of these sections merged into one. However, we encourage you to start by learning this structure, so you can recognise the key information that should be included in the method section.

Keep in mind you are writing a registered report, so there will be some details you see included in a traditional report that you will not yet know. For example, you will not know the final sample size, nor will you know precisely how you analysed the data. You can only report how you designed the study to constrain researcher degrees of freedom, not what the end product is.

One element that might feel a little odd is that you write the method section in the past tense - describing how you planned your study - even though you have not conducted it yet. We do this because a published registered report includes all the sections, with the method and results in the middle, so it is presented as one longer piece of work covering how you designed your study and then what results you found. We retain that style even though you will submit the stage one and stage two reports separately.

As we work through the sections, try to identify all the different decisions you and your group will need to make. Ask yourself whether you know the information yet, or whether it's something you still need to find out. Then, consider how you will justify each decision using supporting evidence where applicable.

21.1 Participants

Normally, this appears first in the method, and what you are trying to show is who took part in your study. You would normally include aspects such as:

  • Method of recruitment

    • Opportunity/convenience, volunteer, or random sampling?

    • How were they encouraged to take part? Was there any incentive?

  • Relevant demographic information

    • What is relevant depends on what you are testing. This is about contextualising your sample for the reader.

    • So, we tend to give an overall description of the sample and then a description relevant to what we are testing. However, if you are not comparing groups, then you might not need the second part and the overall view will suffice.

    • That said, it is not necessary to go into lots of detail about nationality and other demographics if you are not testing that demographic and do not think it is relevant to the outcome of your study.

  • Any inclusion or exclusion criteria

    • Inclusion criteria are features you are looking for in your sample.

    • Exclusion criteria are features that would exclude people from your sample.

    • These are not necessarily opposites. Think of it this way: inclusion criteria are what would get people into your sample, and exclusion criteria are what would remove people from your sample.

  • Think about the order in which you present information within a section.

    • Remember you are leading a reader through the methods, so it is important for them to be able to follow what you are saying. This means thinking about what they will know at certain points and what they will understand. Ask yourself, "will this make sense to a reader if I present this information now, or should I present it later/earlier?"

    • For example, if you plan on excluding people and this changes the possible demographics of your groups, then it would make sense to present the demographics after you have presented the exclusion criteria - so it is clear the demographics are of those left in your study and do not include people you plan on removing.

21.2 Materials

This would normally come second in your method section. It covers the software and questionnaires in this study or, more generally, the stimuli, questionnaires, software, and/or additional materials used. Assume the reader knows nothing about the questionnaires and software; you are trying to convey enough information about the materials that they can understand what the materials are and how they were used.

A well-written materials section should allow someone who was not part of the research team to replicate the study. You are trying to give them enough detail that they could run your study themselves, but longer details (such as a full list of questions) can sometimes be included in the appendix to save space in the main report.

You would normally include aspects such as:

  • Demographic questions

    • How many questions were there?

    • What were the response options?

    • Give an example of a question or two. You do not have to detail every question; you can summarise the main points.

    • For example: "We used a series of {state number} demographic questionnaires to establish age, nationality... with dropdown categorical options to respond to ... and free-response boxes to answer.."

    • Again, think about what is relevant. Focus on detailing the demographics you are using for the analysis. If you are not using a demographic variable in the analysis, then keep this as a broad overview and perhaps just detail the age and gender questions, as you need those for the participants section.

  • Motivated Strategies for Learning Questionnaire (MSLQ)

    • You should describe the full MSLQ and state which subscale(s) you will use data from.

    • You do not have to list all the subscales but give an overview of the general purpose of the questionnaire and then focus on the subscale(s) you are using. It can be useful to reiterate what the subscale(s) mean here.

    • The MSLQ should have a formal citation (available in the MSLQ overview document).

    • State aspects such as how many questions there were in total and how many questions are in your relevant subscale(s).

    • What were the response options on the subscale(s)?

    • For example: "The help-seeking subscale is made up of 5 questions such as "example question" and "example question", with potential responses on a 5-point Likert scale where 1 means .. and 5 means.."

  • You should also cite the software used to host the questionnaire

    • For example: "The study was hosted on Experimentum (DeBruine et al., 2020)"

    • DeBruine, L., Lai, R., Jones, B., Abdullah, R., & Mahrholz, G. (2020). Experimentum (Version v.0.2). Zenodo. doi:10.5281/zenodo.2634355

21.3 Procedure

The procedure details what happened in the study. This section normally comes after the materials section, as the reader needs to understand what you used before they can understand how you used it. It can take a while to recognise the distinction between the two sections, as you are trying to avoid overlap between them. Consider the materials as what participants completed, and the procedure as when in the study they completed each element of the materials.

The main point to keep in mind when writing this section is to think about reproducibility. After reading this sub-section, a reader should be able to reproduce your study exactly, based on what you have written in the materials and procedure.

The procedure is often written as one paragraph, but it does not necessarily need to be if it makes more sense to break things up. However, try to avoid one-sentence paragraphs or presenting it as bullet points.

You would normally include aspects such as:

  • What the participants did in the study, what order they did things in, how the study looked to them, and how long they took overall.

  • Try to think about how the participant accessed the study. It can help to go back and run the study on yourself again and think about the different things that you do in the study.

  • This is where you outline the ethics processes, such as reading the information sheet and providing consent at the start, and reading the debrief at the end. You do not need to explain what information these documents contain, just when they read them and how they interacted with them.

  • You can include details such as the font colour and background colour.

  • You would normally state what order participants did things in: did they do the demographics and then the MSLQ, or vice versa?

  • Try to state how the questions were presented and whether they were randomised or not.

  • Try to state whether all the questions were on the same screen or whether there was one question on the screen at any one time.

  • Try to state how participants responded: did they use the mouse, did they use certain keys, did they speak their answer?

  • Did participants have a time limit to respond, or was it untimed? What did they do if they did not want to respond to a question?

If you find you are repeating a lot of information from the materials, then there is a good chance you are mixing up what information goes where. Remember: the procedure is about what participants saw and did, and how the study was presented to them. The main goal here is to find a balance between providing enough detail for someone to replicate your study and avoiding surplus detail that is not directly relevant.

For example: "Participants saw an advert on social media and then they clicked on the link and then they read the consent form and then if they decided they wanted to do the study they consented and then they started answering the questions" can be reduced to: "Participants accessed the study through a link on social media. Before starting the study, participants read an information sheet and gave consent to taking part."

21.4 Design and Data Analysis

This subsection usually comes last in the method because it leads the reader into the results. It is the transition point from how the study was run and what you used to how you plan on analysing the data. It is not about the procedure - it is not about what your participants saw or did - it is about what you plan on doing with the data.

Thinking about logical flow again, if you were to present this at the start of the method section, then it would not make much sense to the reader, as they simply would not have enough information about the study yet.

You would normally include aspects such as:

  • Specify the research design of your study

    • Is it within-subjects, between-subjects, correlational?

    • If you have more than one analysis in your study, you must remember to discuss and present each of them - either in one paragraph or in separate paragraphs.

  • State the dependent/measured variables

    • If you are planning a t-test, then it would be better to talk about your dependent variable.

    • If you are planning a correlation, then it would be better to talk about measured variables as there is no independent and dependent variable in a correlational design.

    • Be precise: you are testing the mean or sum score on a subscale of the MSLQ, not, e.g., self-efficacy as an abstract concept.

  • State the independent variable and its conditions/levels (levels is just another name for conditions or groups), if you have one (i.e., if you are planning a t-test)

    • You can clarify it is a quasi-experimental variable.

    • For example: "In our study, we have an independent variable of level of study with two levels: postgraduate and undergraduate.

    • Your levels will potentially link back to information in the participants section if you have an independent variable. Remember though: if you have a correlational design, you probably will not have one.

  • Typically, you could report a power analysis here to outline how many participants you estimate you need to detect your smallest effect size of interest. Alternatively, you could report it at the start of your results section. If you and your group do attempt a power analysis, think about:

    • You will want to state and justify the alpha value, the power value, and the smallest effect size of interest. Typically, you will be calculating the sample size as the output here (see the first sketch after this list).

    • For example: "Previous research (Paterson & Robertson, 2014; Swingler & Toivo, 2019; Toivo & Swingler, 2020) found small- to medium-sized correlations in related studies looking at test anxiety and help-seeking. As such, using the field standard of alpha = .05, power = .80, and a smallest effect size of interest of r = .3 (Swingler & Toivo, 2019), we require 35 participants.

  • Briefly state how you plan on analysing the data

    • You would state the type of test, your use of R (with citations), and any relevant packages (see the second sketch after this list).

    • The idea here is to constrain those researcher degrees of freedom: you outline how you plan on analysing the data to the best of your knowledge, so that you can explain and justify any deviations from this plan in your stage two individual report.
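
To make the example power analysis concrete, here is a minimal sketch in R using the pwr package. The package choice is our assumption rather than a course requirement, and the input values simply mirror the worked example above.

```r
# A minimal sketch of the example power analysis using the pwr package.
library(pwr)

pwr.r.test(r = 0.3,           # smallest effect size of interest
           sig.level = 0.05,  # alpha
           power = 0.80)      # desired power
# n comes out at roughly 84.1, which rounds up to 85 participants.
```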
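For the analysis plan itself, a minimal sketch might look like the following, assuming a correlational design like the running test anxiety and help-seeking example. The data file and column names are hypothetical placeholders, not part of the actual project materials.

```r
# A minimal sketch of a planned correlational analysis in R.
# "mslq_data.csv" and the column names are hypothetical placeholders.
library(readr)

mslq <- read_csv("mslq_data.csv")

# Planned test: Pearson correlation between the two mean subscale scores
cor.test(mslq$test_anxiety, mslq$help_seeking, method = "pearson")
```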

21.5 Guidance on writing a good methods section

  • Methods sections are formulaic and work by putting the right information in the right sections. Try to follow the guidance above on what goes where.

  • You must use sub-headings, and we highly recommend using the ones we have stated here.

  • Try to avoid repeating information across sections. If you find you are doing that, then it is likely that you are including information in the wrong sections. Each section has a different focus.

  • Be concise but detailed. Try not to waste words in the method section and state things clearly; save words for building your narrative in the introduction and discussion sections. It's a balancing act, as your method section should also be detailed enough to allow someone to replicate your study.

  • Reading journal articles and focusing on the method section is a good way to see what information can go where. There is a lot of variation in published articles, though, and they are often not replicable, so another thing to think about when reading papers is: "what is missing here that would allow me to replicate this study?".

  • Read the ILOs in the Assessment Information Sheet (AIS) for aspects about demonstrating technical knowledge of the different parts, for writing clearly and succinctly, and for professional communication and organisation.

21.6 Example sections

Here, we are going to look at a procedure section and a materials section from a couple of papers to help drive home some of the ideas we have discussed above. Try to think about the questions and pointers we are raising here when you look at your materials and procedure sections, but also your participants and design and data analysis sections.

21.6.1 Example 1: A procedure sub-section

This is the Procedure section from Tsantani et al. (2016). Given that it is published, it would be considered good. However, there are also aspects that could be improved. Read the paragraph then look at the highlighted sections after.

"The experiment took place in the experimental laboratories of the University of Glasgow. Participants were required to complete a 2AFC task during which they listened to pairs of voices comprising high- and low-pitched versions of the original recordings. The sound samples were presented through headphones (participants' own) connected to a computer with the sound set at approximately 80 dB Sound Pressure Level (SPL): System volume was measured prior to the experiment using a standard headphone set (Sennheisser Beyerdynamic DT 770 PRO 250 OHM) and sound meter. At the beginning of the experiment, participants were informed, via on-screen instructions, that they would hear pairs of voices in two blocks, by trait, and would be asked to make a decision regarding each pair. Participants were told that there was no time limit to their decision but were encouraged to answer with their first impression. After each pair of voices the question "Which voice did you perceive as more {dominant} {trustworthy}?" was displayed on the screen. Pressing the "s" key would mean that they perceived Voice 1 as being most dominant or trustworthy, whereas the "k" key represented Voice 2. The definitions of dominance and trustworthiness used in the instructions were "Dominance means having power and influence over others" and "Trustworthiness means able to be relied on as honest or truthful." The order of the dominance and trustworthiness blocks, as well as the order in which the voice trials were presented within the block, were counterbalanced across participants. Male and female trials were presented randomly within the same block, as opposed to being presented in different blocks, to avoid an additional potential block-order effect caused by the gender of the voice. Finally, the order of the high- and low-pitched versions of the recordings within each trial was counterbalanced by including two trials of each pair in a block, with the high- and low-pitched versions in a different order. Therefore, within each block, the 20 pairs of voice samples were presented twice. The voices within each pair were played consecutively with a 1-s pause between the first voice and the second voice, and participants proceeded to the next trial by pressing "space." The experiment lasted approximately 14 min."

21.6.1.1 Highlighted section 1

"After each pair of voices the question "Which voice did you perceive as more {dominant} {trustworthy}?" was displayed on the screen. Pressing the "s" key would mean that they perceived Voice 1 as being most dominant or trustworthy, whereas the "k" key represented Voice 2."

  • This is clear on which keys participants were told to use. This is quite specific detail, but clearly the authors had a reason for this. Normally, this will be because it helps reduce reaction times if specific keys are used and participants hold their fingers on those keys at all times.

  • However, what is not stated is whether participants were told to always keep their fingers on those keys, and whether this "key-mapping" (which key represents which answer) was shown to participants all the time, just at the start, or just at breaks.

21.6.1.2 Highlighted section 2

"The definitions of dominance and trustworthiness used in the instructions were "Dominance means having power and influence over others" and "Trustworthiness means able to be relied on as honest or truthful."

  • This is really clear on the specific definitions participants were instructed to use, but it is not stated whether these definitions were displayed throughout or just at the start. Would this make a difference if you tried to replicate the study? It might help to add one or two words to make it clear what actually happened.

21.6.1.3 Highlighted section 3

"The order of the dominance and trustworthiness blocks, as well as the order in which the voice trials were presented within the block, were counterbalanced across participants."

  • Words like "counterbalanced" are technical terms that help clarify what happened while reducing the word count of a longer explanation. However, it is a bit unclear what it means in terms of "voice trials", so that might need further clarification.

21.6.1.4 Highlighted section 4

"Finally, the order of the high- and low-pitched versions of the recordings within each trial was counterbalanced by including two trials of each pair in a block, with the high- and low-pitched versions in a different order. Therefore, within each block, the 20 pairs of voice samples were presented twice. The voices within each pair were played consecutively with a 1-s pause between the first voice and the second voice, and participants proceeded to the next trial by pressing "space."

  • This is getting rather complex and could maybe have been improved with an example (A vs B, B vs A) and the total number of trials, i.e., 40. The main thing, though, is that it is trying to give enough detail for someone to replicate it (1-second gap, press space). A sketch of how such a counterbalanced trial list could be built follows.
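
To make the (A vs B, B vs A) suggestion concrete, here is a minimal R sketch of that trial structure. It is our reconstruction from the quoted procedure, not the authors' actual code, and all names in it are hypothetical.

```r
# Sketch of the counterbalanced trial list: 20 voice pairs, each presented
# twice per block with the pitch order swapped, giving 40 trials per block.
pairs <- sprintf("pair_%02d", 1:20)

trials <- rbind(
  data.frame(pair = pairs, first = "high", second = "low"),
  data.frame(pair = pairs, first = "low",  second = "high")
)

set.seed(1)                           # illustrative shuffle only
trials <- trials[sample(nrow(trials)), ]
nrow(trials)                          # 40 trials per block
```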

21.6.1.5 Highlighted section 5

"The experiment lasted approximately 14 min."

  • It is always good to include the approximate time for participants to complete the study. If someone is replicating your study and their version takes 45 minutes, they know something is wrong.

21.6.2 Example 2: A materials sub-section

This is adapted from the materials section of a first draft of a paper by Stuart McLaren and Phil McAleer.

"The Statistical Anxiety Rating Scale (STARS; Cruise et al., 1985) consists of 51 items comprised of two sections and six subscales. The first section includes 23 statements rated on a 5-point Likert scale that range from "1 = No Anxiety" to "5 = Strong Anxiety". This section addresses three factors related to how individuals experience specific statistical situations: (a) Test and Class Anxiety (8-items with scores that range from 8 to 40), (b) Interpretation Anxiety (11-items with scores that range from 11 to 55) and (c) Fear of Asking for Help (4-items with scores that range from 4 to 20). For instance, statements include "Doing an examination for a statistics course" or "Going to ask my statistics teacher for individual help with material I am having difficulty in understanding". The second section consists of 28 statements rated on a 5-point Likert scale that range from "1 = Strongly Disagree" to "5 = Strongly Agree". This section examines levels of attitudes towards scenarios that involve statistics and statistics teachers over three factors: (d) Worth of Statistics (16-items with scores that range from 16 to 80), (e) Fear of Statistic Teachers (5-items with scores that range from 5 to 25), and (f) Computational Self-Concept (7-items with scores that range from 7 to 35). For example, statements include "I wish the statistics requirement would be removed from my academic program" or "Statisticians are more number orientated than they are people orientated". A compound score of each subscale is calculated by summing item scores in each subscale, with higher scores indicating higher levels of statistics anxiety. The six subscales show good Cronbach's alpha reliabilities between .81 and .94 (Chew et al., 2018)."

21.6.2.1 Highlighted section 1

"The Statistical Anxiety Rating Scale (STARS; Cruise et al., 1985) consists of 51 items comprised of two sections and six subscales."

  • Although we may have stated this information earlier in the introduction, we are now in the method section and all that relevant information needs to be here.

21.6.2.2 Highlighted Section 2

"The first section includes 23 statements rated on a 5-point Likert scale that range from "1 = No Anxiety" to "5 = Strong Anxiety".

  • Note here that we are talking about the scale: what it looked like and specifically not how the participants responded. Think about what information goes where at all times. In addition, would it maybe help here if we stated whether the values 2, 3, and 4 had labels on them? It would at least help if we said they had no labels.

21.6.2.3 Highlighted Section 3

"A compound score of each subscale is calculated by summing item scores in each subscale, with higher scores indicating higher levels of statistics anxiety."

  • This works here, but it could also appear in the data analysis subsection. The idea is that by the time you get to the results, the reader knows what is happening with the scales and how they are calculated and interpreted. A key observation here is that there is not one perfect approach, but when you start to think about the logical flow of information, the better approach often becomes apparent. A short sketch of computing such a compound score follows.
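
As a minimal sketch (with an invented data frame and item names), computing such a compound score in R might look like this:

```r
# Sketch: summing item scores into a compound subscale score.
# The data frame and item columns are invented for illustration.
stars <- data.frame(item_1 = c(1, 4), item_2 = c(2, 5), item_3 = c(3, 3))

stars$subscale_total <- rowSums(stars[, c("item_1", "item_2", "item_3")])
stars$subscale_total  # higher scores indicate higher statistics anxiety
```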

21.6.2.4 Highlighted Section 4

"The six subscales show good Cronbach's alpha reliabilities between .81 and .94 (Chew et al., 2018)."

  • Finally, we have not explored the reliability of scales in much detail (briefly in lecture 2), but if you have that information from a previous paper, then it can be useful and helps give context to the scales, showing they are valid and reliable. The one thing that is maybe missing here is stating what these values mean: for example, do they indicate high reliability or low reliability? It could be extended to "The six subscales show good Cronbach's alpha reliabilities between .81 and .94 (Chew et al., 2018), meaning that...". A sketch of how such a reliability estimate could be computed follows.
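
If you wanted to produce such a reliability estimate for your own data, a minimal sketch using the psych package (our assumption; other packages also do this) might look like the following, with invented item responses.

```r
# Sketch: estimating Cronbach's alpha for one subscale with psych.
# The item responses below are invented for illustration only.
library(psych)

items <- data.frame(
  q1 = c(1, 4, 5, 2, 3),
  q2 = c(2, 5, 4, 2, 3),
  q3 = c(1, 4, 5, 3, 2)
)

psych::alpha(items)  # reports raw_alpha among other reliability statistics
```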