PsychDisclosure.org provides a platform for authors of recently published articles in Psychology to publicly disclose four categories of methodological details not required to be disclosed under current reporting standards, but which are essential for interpreting research findings. Click here for in press article reporting on our initiative. Learn More
Disclosure categories:
  1. Exclusions: Disclosed total number of observations excluded and criterion for doing so.
  2. Conditions: Disclosed all tested experimental conditions, including failed manipulations.
  3. Measures: Disclosed all administered measures and items.
  4. Sample size: Disclosed (a) basis for chosen sample sizes and (b) basis for stopping data collection.
Submit your disclosure information for any article in any journal! Please see Contact Us page for more details.
Current response rate overall = 50.3% (190/378); PS = 50% (71/141), JPSP = 49% (50/102), JEPLMC = 49% (44/90), JEPG = 56% (25/45). [Website up-to-date as of May 16, 2013 12:35PM EST]
IssueAuthorsArticle Title
In pressLeBel & CampbellHeightened sensitivity to temperature cues in highly anxiously attached individuals: Real or elusive phenomenon? 
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure (details provided in the preregistered study protocols, accessible via links in the article)
Mar 2013Ackerman, Kashy, Donnellan et al.The Interpersonal Legacy of a Positive Family Climate in Adolescence
  1. Exclusions: We excluded some people from the sample because they did not have the observational data in adolescence or data about marital romantic partnerships in adulthood.  We did not exclude anyone who otherwise met the selection criteria for our study.
  2. Conditions: Full Disclosure (We had no manipulations. In our analyses we did examine a couple of parenting variables but then we learned that another team working with the same data set was using those outcomes so we dropped them.)
  3. Measures: No.  There are many more measures of different constructs in the dataset. We selected those most relevant to our investigation.
  4. Sample Size: The data were already collected so we simply analyzed what was available to us from this existing project.
Goldfarb & TreismanCounting Multidimensional Objects: Implications for the Neural-Synchrony Theory
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We decided ahead of time to collect data until minimum sample size was achieved and this was followed.
Hehman, Leitner, Deegan et al.Facial Structure Is Indicative of Explicit Support for Prejudicial Beliefs
  1. Exclusions: Full Disclosure
  2. Conditions: In Study 3, there was no difference between conditions on any of our dependent measures, nor interactions. Therefore, there was no evidence that this manipulation was effective or that it even functionally "existed."
  3. Measures: Full Disclosure
  4. Sample Size: In Study 1, we collected as many as possible during a semester. In Study 2, we aimed for 100 participants as we were unsure what effect size to expect, and stopped when we reached that goal. In Study 3, we based our sample goal on the size of the effect demonstrated in Study 2.
Schneider, Eerland, van Harreveld et al.One Way and the Other: The Bidirectional Relationship Between Ambivalence and Body Movement
  1. Exclusions: Full Disclosure
  2. Conditions: Ambivalence is hard to manipulate experimentally among Dutch students, we pretested more self-written articles but used only the strongest manipulation.  Because there were no differences in ambivalence, we did not analyze these data further, and as such, this information was not interesting.
  3. Measures: Some were significant, some were not, but the most important reason was doubts regarding validity. However, we mention the additional measures in the paper and interested researchers may contact us about these measures. 
  4. Sample Size: We decided ahead of time to collect data until minimum sample size achieved, or data collection period ended, and this was followed.
Sutin, Terracciano, Milaneschi et al.The Effect of Birth Cohort on Well-Being: The Legacy of Economic Hard Times
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: The Baltimore Longitudinal Study of Aging (BLSA) is an on-going epidemiological study of normal aging. BLSA participants undergo extensive testing during each visit that lasts for 2-3 days. This testing includes numerous measures of physical, cognitive, and emotional health. The National Health and Nutrition Examination Survey (NHANES I) was likewise a large study that included numerous measures of health and nutrition. From both studies, we selected the measure that was relevant to our research question.
  4. Sample Size: We selected every participant who had completed the CES-D from the time it was introduced into the BLSA (1979) to the time of the initial data analysis (2010). BLSA participants continue to fill out the CES-D at every visit. From NHANES I, we selected adult participants who completed the CES-D.
Van der Burg, Awh, & OliversThe Capacity of Audiovisual Integration Is Limited to One Item
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We included a predetermined number of subjects in each experiment. This was based on experience.
Feb 2013Caparos, Linnell, Bremner et al.Do Local and Global Perceptual Biases Tell Us Anything About Local and Global Selective Attention?
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Participants performed two blocks of trials. Only the data obtained in the first block are reported in the paper. The effects (reported in the paper) were also present in the second block, however, there were also carry-over effects that were not sufficiently reliable to be reported. The paper format (brief report) was not adequate for us to discuss these effects.
  4. Sample Size: We aimed to test at least 50 participants in each group (a group of British participants and a group of traditional African participants). In Africa, two weeks of testing were dedicated to data collection for this experiment. We tested as many participants as we could during these two weeks (reaching a sample size of 58 in the African group). We then tested an equivalent number of British participants.
Jamieson, Koslov, Nock et al.Experiencing Discrimination Increases Risk Taking
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We collected pre-experiment questionnaires on-line that all participants completed and were beyond the scope of the current article. Measures included items such as intergroup contact, personality measures, and other individual differences
  4. Sample Size: As in all the studies in my lab we decide on the targeted N based on previous studies and power analysis. We then run 10% over the targeted amount due to typical loss in physiolgocial measures and biological samples (due to electrical interference, loss of signal, contaminated saliva samples, etc). We *never* analyze our data until the study is complete primarily because we send out biological samples in batch so that they are assayed at one time. I didn't respond (yes or no) above because this stopping rule is not stated explictly, but we do cite standard articles and chapters that outline this protocol and space restraints prevent this type of extra information.
Laran & SalernoLife-History Strategy, Food Choice, and Caloric Consumption
  1. Exclusions: Full Disclosure
  2. Conditions: An entire study, from the first submission, did not make the final version of the paper as per editorial request.
  3. Measures:  In study 2, we included a few other filler questions unrelated to our research questions that were included to support our cover story. These measures did not vary as a function of our experimental conditions.
  4. Sample Size: Study 1: We aimed to collect at least 25 participants per cell. We obtained our final sample by asking our undergraduate research assistants to recruit as many participants as they could over a two day period of a few hours each day and ended up with more participants than the 25 per cell initially expected (n = 121). Studies 2 and 3: We aimed to collect 40 subjects per cell given that the dependent variable was binary. Our total n was lower than expected in Study 2 (n = 238) and Study 3 (n = 144) based on fluctuation in attendance rates for the sessions held at our lab.
Marinovic, Pearce, & ArnoldAttentional-Tracking Acuity Is Modulated by Illusory Changes in Perceived Speed
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Based on past experience of motion adaptation phenomena, we determined on a sample size of 8, as this should be more than ample to detect a low-level visual aftereffect. We stopped testing once we had tested all the participants.
Simonsohn & GinoDaily Horizons: Evidence of Narrow Bracketing in Judgment From 10 Years of M.B.A. Admissions Interviews
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
Jan 2013Briñol, Gascó, Petty et al. Treating Thoughts as Material Objects Can Increase or Decrease Their Impact on Evaluation
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We excluded any items that were unrelated to the research questions or that were included for exploratory purposes. Furthermore, we focused on the items that were included in all the studies within the paper in order to maintain convergence across experiments.
  4. Sample Size: The number selected was based on our prior experience with this research topic and the number of participants that could be successfully recruited within an academic term (without crossing terms). Also, given that not all participants who signed up for the experiments in advance showed up to participate, the total number of subjects per cell was not identical in all cases. Also, we did not conduct any statistical tests until we were done collecting data.
Cook, Johnston, & HeyesFacial Self-Imitation: Objective Measurement Reveals No Improvement Without Visual Feedback
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Decided ahead of time to collect data until minimum sample size achieved and this was followed
Kille, Forest & WoodTall, Dark, and Stable: Embodiment Motivates Mate Selection Preferences
  1. Exclusions: Data from 2 participants were excluded: 1 was unable to sit in either of our chairs—which constituted our manipulation of physical stability—due to due his/her weight, and 1 did not comply with the researcher's assignment to condition. When these participants are included in our analyses (the participant who was unable to use our chair was assigned to a separate "stable" chair), results remained significant.
  2. Conditions: Full Disclosure
  3. Measures: We also gathered measures to address a separate research question regarding participants’ perceptions of the stability of their own singlehood status that we did not report. As we predicted, we found that participants in the physically unstable (vs. stable) condition felt that their singlehood was less likely to last. After participants completed all of the measures reported in the paper, they went on to complete measures assessing their preferences for products (e.g., Aerobics step bench) unrelated to relationships.   
  4. Sample Size: We recruited participants in a campus student center, which requires reserving time slots in advance. We reserved a number of timeslots that we felt would give us adequate access to participants to obtain at least 20 participants per cell in our design and collected the data until our slots were completed.
Lerner, Yi, & WeberThe Financial Costs of Sadness
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We aimed for 30 subjects per cell, based on past experience with these kinds of studies. We did not conduct a power analysis.  Once we reached at least 30 per cell, we continued running until all previously scheduled subjects had been run
Spunt & LiebermanThe Busy Social Brain: Evidence for Automaticity and Control in the Neural Systems Supporting Social Cognition and Action Understanding
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Participants in the study completed several validated personality questionnaires following their MRI session. To be perfectly honest, their inclusion was primarily motivated by convenience: given that MRI data is expensive to collect, we often include additional measurements that are secondary to the main purpose of the study but which will permit theoretically-related follow-up analyses (for instance, examining the moderating influence of a personality variable on the strength of an observed group effect). For the published study in question, I have not had the time to even begin to look at this individual difference data.
  4. Sample Size: We determined sample size in heuristic-fashion based on our previously published studies using this paradigm (Spunt, Satpute, & Lieberman, 2012, Journal of Cognitive Neuroscience; Spunt, Falk, & Lieberman, 2010, Psychological Science). We collected a few more subjects than in those previous studies given that this study was examining the moderating effect of an additional manipulation (i.e., memory load). I completely acknowledge that this is a highly informal procedure; at the time it was unclear how best to formally determine sample size. While there are still many ambiguities in how to best determine sample size for fMRI studies, recent publications and software-releases (e.g., http://fmripower.org/) are beginning to clarify things.
Dec 2012Berntsen, Johannessen, Thomsen et al.Peace and War: Trajectories of Posttraumatic Stress Disorder Symptoms Before, During, and After Military Deployment in Afghanistan
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure (As stated in the supplementary material as well as in the paper: The reported study was part of a large survey conducted through the military. It included many questionnaires and we had to focus on the ones of key relevance. This is often the case with this size of data bases (unlike the typical lab experiment).)
  4. Sample Size: Full Disclosure
Gaissmaier & Gigerenzer9/11, Act II A Fine-Grained Analysis of Regional Variations in Traffic Fatalities in the Aftermath of the Terrorist Attacks
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (Note: We analyzed publicly available observational data from the 50 US states (+DC) only and thus did not have any experimental conditions)
  3. Measures: In response to the editor’s and reviewers’ comments, we conducted and/or discussed some additional analyses, which were only presented to the editors and reviewers, but not included in the paper – either because they yielded redundant results (e.g., statistics per inhabitant with driver’s licence rather than per each inhabitant) or because the number of observations was too small to yield reliable results (e.g., number of drunk driving citations on a state-by-state level)
  4. Sample Size: Full Disclosure (Note: The sample size was simply determined by the number of states)
Korjoukov, Jeurissen, Kloosterman et al.The Time Course of Perceptual Grouping in Natural Scenes
  1. Exclusions: In the “Size” experiment, described in the Appendix, we excluded data from 15 participants due to a technical error. Another data set, collected over 12 participants, was excluded due to a difference in procedure (difference in the overall number of trials and session duration).
  2. Conditions: We did not report the differences between three d-conditions in the experiments because they are irrelevant for main focus of the study.
  3. Measures: Full Disclosure
  4. Sample Size: We found that the results were highly significant after we tested a predetermined number of participants.
Rodeheffer, Hill & LordDoes This Recession Make Me Look Black? The Effect of Resource Scarcity on the Categorization of Biracial Faces
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: For both studies, we decided ahead of time to aim for 30 participants per cell and stopped data collection once we reached that target.  In Study 1 we went slightly over (N = 35) and in Study 2 we were slightly under (N = 27). Discrepancies were due to fluctuations in participant attendance rates. We did not look at my data before data collection ended, nor did we run more participants once we had decided when the last experiment session would be.
Nov 2012MatthewsHow Much Do Incidental Values Affect the Judgment of Time
  1. Exclusions: When participants were excluded on the basis of their responses to questions asked during the task (e.g., extreme values), I explained how many participants were excluded and the basis for exclusion in the Supplementary Materials for the paper (which describe the methods in detail). In addition, several of my studies were run on-line. For these studies, I applied eligibility criteria to determine whether the participant was eligible to be included in the sample. These included age (participants had to be at least 16), answering all questions (i.e., not choosing to withdraw from the task), and not having an ip address that appeared earlier in the study or in one of the earlier studies in the series. These eligibility criteria are fully documented in the Supplementary Materials. Ineligible responses were never analysed and there is no way of knowing, for example, how many actual participants they represent (e.g., duplicate ip addresses may be one person or several), so I did not report the precise numbers of responses that were screened on these grounds (although I am happy to provide that information).
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample sizes were based on power analysis and sampling continued until a minimum sample size was achieved. It was not possible to specify in advance precisely what sample size would be tested because I could not control exactly how many eligible people would sign up. The policy was to recruit more participants than were needed to give high power to detect the effect of interest (see Table 1 of the paper). That is, I aimed to “over-shoot” slightly so as to have high power after removing ineligible respondents. Samples were intentionally larger for on-line studies because (a) participants were easier to recruit, and (b) the more heterogeneous sample and testing environment might reduce effect size. There was no optional stopping.
Stallen, De Dreu, Shalvi et al.The Herding Hormone: Oxytocin Stimulates In-Group Conformity
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: The number of participants was determined before starting the study, and in line with typical sample sizes used in studies in this field. Data collection was terminated upon reaching the predefined sample size
Weems, Scott, Banks & GrahamIs TV Traumatic for All Youths? The Role of Preexisting Posttraumatic-Stress Symptoms in the Link Between Disaster Coverage and Stress
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (non-experimental research)
  3. Measures: Data was from a larger longitudinal study and we do reference this fact and the other work (previously published studies) in the paper. We also tested a number of alternative explanations with additional measures and this we report in our supplemental data available online.
  4. Sample Size: Full Disclosure
Oct 2012Berman & SmallSelf-interest without selfishness: The hedonic benefit of imposed self-interest
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We took additional measures that were not reported in the final manuscript. These measures were not included because: a) they were irrelevant to the main hypothesis and were not analyzed; b) they were removed during the review process; or c) reporting the results did not fit within the manuscript word count. 
  4. Sample Size: All of our sample sizes were determined in advance of collecting data and data collection stopped when the target sample sizes were reached. For study 2, we purchased a set of gift cards ahead of time in bulk, and stopped when we ran out of gift cards.
Brascamp & BlakeInattention abolishes binocular rivalry: Perceptual evidence
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We empirically determined the amount of data needed to get a clear and interpretable data pattern in our two reference conditions (called 'Attended' and 'Absent'). For our condition of interest ('Unattended') we then collected the same amount of data.
Fairbanks, Way, Breidenthal et al.Maternal and offspring dopamine D4 receptor genotypes interact to influence juvenile impulsivity in vervet monkeys
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: The sample size was not determined a priori. We tested all of the juvenile monkeys available in our colony during the 7 year time period of the research.
Grossman, Karasawa, Izumi et al.Aging and wisdom: Culture matters
  1. Exclusions: Data from 22 American participants excluded for being outside the matching age range of the corresponding Japanese sample (25-75). The survey company in charge of subject recruitment in Japan did not recruit Japanese over 76 yrs. To match the age range of its Japanese equivalent, we reduced the American sample. Results remain virtually identical when examining all American adults. The full American sample was reported in the initial paper from this project (collected before the Japanese counterpart; Grossmann et al., 2010). Results are very similar across both types of samples.
  2. Conditions: Full Disclosure
  3. Measures: The study was part of a large-scale project examining cultural differences between Americans and Japanese in cognition and emotion. Thus, in other sessions participants were tested for a variety of instruments dealing with cultural constructs of independence vs. interdependence; holistic attention; positivity bias in memory, etc. Reporting these measures was outside the scope of the paper both thematically and in terms of page length. Further, many of these tasks were not yet entirely coded and analyzed at the time this paper was in press.
  4. Sample Size: We used an age-stratified random sample with oversampling. The latter was done to ensure that we have a comparable number of individuals of both genders, different levels of education (junior High vs. college), and in each of the three age groups (25-40; 41-55; 60-75). Our goal was to have at least 25 people in each cell. In the U.S., we stopped collecting data when we achieved this quota for the cells we have to oversample; in Japan a survey company made a corresponding decision.
Hu, Rosenfeld, & BodenhausenCombating automatic autobiographical associations: The effect of instruction and training in strategically concealing information in the autobiographical Implicit Association Test
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample sizes are determined based on previous similar experiments conducted by the first author. We decided to stop data collection after we reached the predetermined number, which is N=16 in each condition.
Shalvi, Eldar, & Bereby-MeyerHonesty requires time (and lack of justifications)
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We instructed the RA to target at 30-35 participants per cell, based on our experience with the strength of studied effects and a general convention of good practice in the field. Data collection was stopped once this target was met. We ended up with a slightly higher n-per-cell due to good show up rates in some experimental sessions.
Ybarra, Lee, & GonzalezSupportive social relationships attenuate the appeal of choice
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Study 1 contained measures not related to research question.
  4. Sample Size: No (Disclosure statement coming soon.)
Sep 2012Cain, Vul, Clark et al.A Bayesian optimal foraging model of human visual search.
  1. Exclusions: Full Disclosure
  2. Conditions: All between-subjects conditions and manipulations were reported. We attempted a within-subjects version but it was too difficult for participants and was canceled.
  3. Measures: We also collected additional demographic information (e.g. video game playing behavior) for future recruitment purposes. This were not analyzed in relation to the dependent measures of this study.
  4. Sample Size: We collected 10 participants per group (30 total) and examined the results. The results were unclear so we decided to collect an additional 5 participants per group. At that point a clearer picture had emerged and we stopped data collection. These values were informed by previous studies from our lab using related paradigms that tested 12 participants per group.
Emery, Finkel, & PedersenPulmonary function as a cause of cognitive aging
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (Not applicable.)
  3. Measures: Other measures assessed but not reported because data came from longitudinal, population based study of multiple outcomes
  4. Sample Size: Coming soon.
Grant & DuttonBeneficiary or benefactor: Are people more prosocial when they reflect on receiving or giving?
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We collected additional questionnaire measures unrelated to the research question.
  4. Sample Size: For Study 1 the sample was the population of employees at the call center. For Study 2 we set our data collection termination rule in advance based on power calculations from Cohen (1992 PB) and sample size availability in the behavioral lab. We did not modify the rule in the course of the research.
O’Hara, Gibbons, Gerrard et al.Greater exposure to sexual content in popular movies predicts earlier sexual debut and increased sexual risk taking
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (non-experimental)
  3. Measures: These data came from an extensive longitudinal study of media and health and many measures were not related to this research question.
  4. Sample Size: A power analysis was used to determine that the analyses from the original grant proposal required successful follow-up with 2200 never-smokers at baseline resulting in an original sample of 6522 participants at Time 1.
Raby, Cicchetti, Carlson et al.Genetic and caregiving-based contributions to infant attachment: Unique associations with distress reactivity and attachment security
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Our study was part of' a longitudinal project that collected several other measures not relevant to our hypotheses. However exploratory analyses (involving measures of infant temperament) were completed at a reviewer's request but the results were not reported because the measures were not sufficiently reliable.
  4. Sample Size: Full Disclosure
Aug 2012Monti, Parsons, & OshersonThought beyond language: Neural dissociation of algebra and natural language
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample size (20 subs + 1 pilot) was decided at the time we put in our request MRI scanning slots for the study and set in line with typical sample sizes in the field. Collection was terminated upon reaching the target N.
PleskacComparability effects in probability judgments.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: I sought to obtain 30 participants. The design was completely within subjects and so this sample size provides sufficient power (greater than 80%) at the aggregate level. Note also I collected enough observations per subject 450 so I can actually treat each participants as his or her own experiment.
Jul 2012Bélanger, Slattery, Mayberry et al.Skilled deaf readers have an enhanced perceptual span in reading.
  1. Exclusions: One participant was excluded from the experiment based on he/she not meeting our inclusion criterion on non-verbal IQ. This was not reported because of the limited space available to report more relevant results.
  2. Conditions: Full Disclosure
  3. Measures: We had a background test to assess ASL skills and the test was not well suited to our adult population. Participants's scores reached ceiling or near ceiling score so it could not be used as a covariate as originally planned.
  4. Sample Size: We ran the maximum number of people that we could find in our special population that also met our inclusion criteria (those were included in the paper).
Fuller-Rowell, Evans, & OngPoverty and health: The mediating role of perceived discrimination
  1. Exclusions: Full Disclosure (We used FIML estimation in order to be able to include all individuals who participated in W3 of the study in the models.)
  2. Conditions: Full Disclosure (Not applicable. Our study was not experimental.)
  3. Measures: Study included a large number of measures. We only discussed the measures relevant to the analyses presented in our paper.
  4. Sample Size: Coming soon.
Leander, Chartrand, & BarghYou give me the chills: Embodied reactions to inappropriate amounts of behavioral mimicry
  1. Exclusions: Full Disclosure
  2. Conditions: In the third study we ran (now Study 1), I attempted to add a second, male experimenter, but his data were uninterpretable and only seemed to add error variance. That is why all studies specifically report using only a female experimenter.
  3. Measures: Scales/questionnaires unrelated to the research question were not reported. The DVs were the first things we assessed after the manipulations and we included additional questionnaires afterwards so as to make full use of the participants' time while we had them in the lab. It seems superfluous and distracting to report such information if it is independent of the study procedure and would not be meaningful for the purpose of someone trying to replicate the findings (which, in my mind, is the essence of how to write a research report).
  4. Sample Size: An a priori decision was made to stop data collection at the end of the given block/semester.
Longo, Long, & HaggardMapping the invisible hand: A body model of a phantom limb
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: This was a single case-study of an individual with congenital limb absence so the issue of determining sample size is not applicable.
Mazerolle, Régner, Morisset et al.Stereotype threat strengthens automatic recall and undermines controlled processes in older adults
  1. Exclusions: We removed 4 participants (two young and two old participants) for being outliers (on Cook's D and SSD, following Judd & McClelland, 1989; McClelland, 2000). Because of PS word count for short reports, we didn'tmentioned these informations.
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: A french version of the process dissociation procedure (PDP) was constructed in French, including four sets of words (similar in letters and syllables numbers, and frequency, based on Jacoby's recommendations, 1998) corresponding to four instructions conditions (inclusion, inclusion filler, exclusion, exclusion filler). To account for possible differences in words sets, we counterbalanced each set, creating 4 PDP versions. Then, we counterbalanced each PDP version with threat conditions and age groups. We decided that 56 participants for each PDP version was sufficient to account for potential differences, resulting in 56 participants X 4 PDP versions = 224 participants. Analysis didn't shown any difference between the 4 PDP versions. Sample size was decided ahead and was followed. Again, because of PS word count for short reports, we didn't mentioned these informations.
Parise & CsibraElectrophysiological evidence for the understanding of maternal speech by 9-month-old infants
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: 14 infants / group were targeted but the sample overshoot because less infants were excluded due to bad data than expected.
WolfeSaved by a log: How do humans perform hybrid visual and memory search?
  1. Exclusions: We excluded outlier trials with RTs > 7000 msec. That seems to have eliminated 8 of 12000+ trials. Usually we report that exclusion. I think that must have fallen victim to the word count restriction in Psych Sci.
  2. Conditions: Frankly this is a bit of a silly question. We would typically run various pilot versions of experiments to make sure that the code works that we know how long the task takes etc. Once we are sure we are not wasting our time collecting garbage we would run a decently powered experiment. What you (I assume) really want to know is whether we ran more or less the same experiment 20 times and are only reporting the one time that p scraped over 0.05. That we did not do.
  3. Measures: Oh come now we record all sorts of things out of a sense of completeness. For example in this experiment I know the location on the screen of every target item. There are undoubtedly effects of this variable on reaction time. Those effects might be interesting. We do not happen to have analyzed that variable. Is it unrelated to the research question? I dont know. Might be a good exploration for a rainy day or for someone who asks for our data.
  4. Sample Size: Many years of experience with experiments of this sort suggest that if we collect on the order of 50 data points in each cell of the experiment (in this case 50 trials target present and absent for each combination of visual and memory set size) and if we run 10-12 observers that our results will have sufficient power to see differences between conditions when such differences exist.
Jun 2012Donkin & NosofskyA power-law model of psychological memory strength in short- and long-term recognition
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We decided to collect four participants (each of whom completed 10 sessions) because our goal was to fit a quantitative model to individual subject response time distributions. This number is standard for this type of analysis. The experiment was a replication of a previous study and our own pilot studies revealed that the effect was remarkably robust in individuals (even when they completed the task for just one hour) which told us that we did not need to collect more participants than is standard.
Hirsh, Kang, & BodenhausenPersonalized persuasion: Tailoring persuasive appeals to recipients' personality traits
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample size was determined a priori using power analysis software based on effect size estimates from previous research. Data collection was stopped once we achieved the target sample size.
Hofmann, Vohs, & BaumeisterWhat people desire, feel conflicted about, and try to resist in everyday life
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Other measures were assessed but not reported because they were unrelated to the research question (Large experience sampling project addressing multiple research questions not all of which were addressed in the above publication as explicitly stated.)
  4. Sample Size: The goal was to collect as many participants as possible with the available project funds.
Hulme, Bowyer-Crane, Carroll et al.The causal role of phoneme awareness and letter-sound knowledge in learning to read: Combining
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Article reports analyses of a subset of measures from an earlier study. Article only reports data relevant to assessing a specific hypothesis that the effects of the reading and phonology training in the previous study were mediated by changes in phoneme awareness and letter-sound knowledge.
  4. Sample Size: The data come from a previously conducted randomized controlled trial wherein sample size was based on a power calculation and we recruited samples that were as large as possible within the time and resources available.
Keysar, Hayakawa, & AnThe foreign-language effect: Thinking in a foreign tongue reduces decision biases
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: For experiments conducted on campus: We targeted around 30 subjects per condition in advance. For experiments abroad and out of state, we instructed RAs to recruit as many subjects as they could within their limited time-frame.
May 2012Bernard, Gervais, Allen et al.Integrating sexual objectification with object versus person recognition: The sexualized-body-inversion hypothesis.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We also measured potential moderators: ambivalent sexism (ASI) internalization of beauty standards (SATAQ) and self-objectification. We did not find any significant correlations and we decided to report all tested experimental conditions (i.e. recognition of inverted males upright males inverted females and upright females) without mentioning these additional measures (i.e. moderators). Participants were also asked to complete two other tasks unrelated to the body inversion paradigm we used.
  4. Sample Size: Before data collection we decided to test approximately 80 participants, based on past studies we have done using this task. We tested during several testing sessions and we stopped data collection after the last testing session (when we had > 80 participants)
John, Loewenstein, & PrelecMeasuring the prevalence of questionable research practices with incentives for truth telling
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure (As stated in the supplement, “We stopped collecting data approximately ten days after the final follow-up email was sent. By this point, the rate of incoming responses had dropped off substantially (from over 100 per day in the days immediately following the first solicitation email, to on average fewer than one respondent per day).” That is, the decision to stop was independent from results of data analysis.
Sweeny & VohsOn near misses and completed tasks: The nature of relief.
  1. Exclusions: We excluded one participant in Study 2 because the RA failed to record the experimental condition for that session.
  2. Conditions: Full Disclosure
  3. Measures: Other measures were assessed but were not reported because they were not related to the research question.
  4. Sample Size: We aimed for approximately 100 participants for each study and this was followed. In Study 1 we went slightly over (n = 114) before noting the sample size and cutting off recruitment and in Study 2 we didn't quite reach 100 (n = 79) by the end of the data collection period at the end of term.
Terburg, Aarts, & van HonkTestosterone affects gaze aversion from angry faces outside of conscious awareness.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Measure of digit ratio was included in the research design as a possible mediator in the effects of testosterone. This was however not the case which was not reported for reasons of space/word-limits. These additional data are however reported and discussed in: Terburg D. & van Honk (in press). Approach-avoidance versus dominance-submissiveness: A multilevel framework on how testosterone promotes social status Emotion Review
  4. Sample Size: Sample size was predetermined based on earlier testosterone administration studies; We collected data until the sample-size as written in the protocol (N=20) was reached.
VessWarm thoughts: Attachment anxiety and sensitivity to temperature cues
  1. Exclusions: Full Disclosure (Criterion used for 2 participants excluded in Study 2 was studentized-deleted residual values greater than |3.0|. This criterion was provided in the original submission, but was excluded in the final version due to strict word limits)
  2. Conditions: Full Disclosure
  3. Measures: I included 2 items regarding participants experience with the sentence unscrambling task. These items assessed task difficulty and task enjoyment. Description of these items were included in a supplement to the original submission but were excluded in the final version due to space restrictions and their null impact on the primary results.
  4. Sample Size: A minimum sample size was targeted and each study was opened on-line for a set amount of time. Because the minimum sample size was met in both studies after this set time, each study was stopped at that point.
Apr 2012Chandler & ProninFast thought speed induces risk taking
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: In Study 2 we asked participants items from the CARE (a measure of risk taking) that pertained to sex substance abuse and disorderly conduct. However due to researcher (my) error participants also completed a single item from the sports subscale of the CARE (the extent to which participants enjoy skiing) at the end of the questionnaire. We did not report this item because (aside from the fact that we did not intend to include it) it seemed inappropriate to make claims about behaviors measured by this subscale when it was represented by only a single item. All descriptive results reported in the paper are virtually identical and all statistical tests of significance are unchanged if this item is included.
  4. Sample Size: We had a sense that the effect would likely be large based on earlier research using similar manipulations so we were not too concerned about obtaining a large sample. This was really one of those situations where sample size was determined by resource limitations rather than a solid methodological rationale - I had to be present in the eating hall with the RA while data were collected and so we could only collect at times both of us were free. The rule was collect until the end of the semester and see how things looked then. In both cases we collected a single semester's worth of data checked and terminated.
O'Brien & Ellsworth More than skin deep: Visceral states are not projected onto dissimilar others
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We included a question about perspective taking (To what extent did you step inside Jim's shoes while reading the story? from 0-9); there were no differences on this measure & it didn't affect the results, and a number of people left it blank or put question marks next to it. We also included 5 yes/no questions taken from Van Boven & Loewenstein 2003: Have you ever been lost in the woods? Engaged in backpacking? Engaged in mountaineering? Engaged in hiking? Engaged in wilderness activities? Almost everyone circled No, and many people left them blank. They were printed on the back page of the last sheet of the study packet, so I think some people forgot to flip it over. Hence, due to methodological difficulties and to fit word limits, we dropped these measures.
  4. Sample Size: The general rule of thumb is that we always try to get at least 20-30 people per cell before looking at any of the data, and if more data are needed to run additional blocks of 20-30 before looking. There's also some practical constraints. For example, I think we stopped data collection in Study 2 because we ran out of subject pool hours (the 20-30 rule had also been met).
Mar 2012Forest & WoodWhen social networking is not working: Individuals with low self-esteem recognize but do not reap the benefits of self-disclosure on Facebook
  1. Exclusions: Full Disclosure (Criteria for exclusion were reported in the paper. In general, data from participants who complete a survey multiple times or double-submit pages of the survey are also discarded, or the first set of responses is retained but subsequent responses are discarded). Without going back to the raw user-input data, I cannot be certain which of these strategies was employed, or whether there were any such participants who completed the survey multiple times. However, discarding data from participants who submit multiple times is always done before any analyses are conducted.)
  2. Conditions: Full Disclosure
  3. Measures: Other measures included for example, measures of the Big 5 Personality traits and narcissism, questions about participants' Facebook settings--that were collected for purposes unrelated to the main research questions addressed in the paper and were therefore not reported in the paper.
  4. Sample Size: These data were collected several years ago and I do not recall the specific reasons for sample size decisions in these particular studies. In general, we terminated data collection at the end of an academic term or when a given study had reached its maximum credit allocation from the research participant pool, unless a sample size we deemed sufficient was reached prior to these cutoffs.
Gupta, Jang, Mednick et al.The road not taken: Creative solutions require avoidance of high-frequency responses
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We collected the data during one term and then spent the next year applying mathematical models to these data. There was no modification of sample size.
Jackson, Thoemmes, Jonkmann et al.Military training and personality trait development: Does the military make the man, or does the man make the military
  1. Exclusions: Full Disclosure
  2. Conditions: N/A (non-experimental study)
  3. Measures: There were scores of unreported measures and items given the study was part of a large, multi-wave longitudinal study.
  4. Sample Size: Sample size determined by power analysis for initial grant that took into account the number of schools that we would need to sample (assuming a particular response rate).
McCaffreyInnovation relies on the obscure: A key to overcoming the classic problem of functional fixedness
  1. Exclusions: Full Disclosure
  2. Conditions: Another condition was testing a secondary hypothesis, which did not reach significance. We reported on the primary hypothesis but not the secondary hypothesis.
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
Feb 2012Frankenstein, Mohler, Bülthoff et al.Is the map in our head oriented north
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We decided ahead of time to collect a sample size of 30 participants (15 male 15 female). Due to time constrains of the project and the lab space used by a lot of research groups (i.e. we had only a limited amount of time to collect data in that lab facility) we were not able to run 30 participants and had a few less. All data collection was finished before starting any analyses no additional participants were run after analyses started.
Hodson & Busseri Bright minds and dark attitudes: Lower cognitive ability predicts greater prejudice through right-wing ideology and low intergroup contact.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure (We conducted secondary analyses of large-scale datasets with our analyses focusing on the key variables reported by the original authors.)
  4. Sample Size: Full Disclosure (We used the participant samples as used by the original authors in our secondary analyses.)
Howell & Shepperd Reducing information avoidance through affirmation
  1. Exclusions: Full Disclosure
  2. Conditions: When we conducted Study 1 we included a manipulation intended to have the opposite effect of affirmation. However manipulation check measures suggested that our manipulation failed to produce the intended psychological effect. Thus we dropped it from the remaining studies and do not report it in the paper.
  3. Measures: We included some measures that were not related to the research question. We also included a variety of process variables which either were not reliable or did not predict any variance in our outcomes. We chose to stick to our primary effects for publication because of space constraints and to streamline our story.
  4. Sample Size: We determined that we were going to collect 20-25 participants per condition in advance of the study based on standard power recommendations for the analyses we intended. We ran our research in our university's human-subjects participant pool and uploaded blocks of participation slots each week. For the first study we stopped when the semester ended. Our second two studies included more than 25 participants per cell because of an unanticipated influx of signups at the end of the studies.
O'Brien & Ellsworth Saving the last for best: A positivity bias for end experiences
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We also measured current mood, current hunger level, and general enjoyment of chocolate (on 0-10 scales); there were no differences between groups, and none of these variables influenced the results; I think I dropped them to fit word limits.
  4. Sample Size: We stopped data collection simply because I had to travel to Poland for a summer research exchange program, and I wanted to finish data collection before I left. I collected data up until the last possible day before the trip.
Wang, Li, Fang et al.Individual differences in holistic processing predict face recognition ability
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: At the first step we planned to collect 500 subjects. (For genetic studies usually you need three independent samples and the first sample is for exploratory investigation and the second and the third samples for validation. That is the sample size is determined by the genetic study). However we were only able to collect data from 337 subjects in this step. We happened to learn that our data were also capable of addressing the relation between face recognition and holistic face processing so we used this set of data. We used the data designated for another study (i.e. the genetic basis for face recognition). We used the full data set collected.
Jan 2012Duguid & GoncaloLiving large: The powerful overestimate their own height
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
Hehman, Gaertner, Dovidio et al.Group status drives majority and minority integration preferences.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Article was based on data from a larger multi-year project hence multiple measures were collected many unrelated to the research question addressed in the manuscript and so were not included. Additionally since we created the scale used for this research several items originally intended to measure our construct of interest were cut from analysis based on a confirmatory factor analysis (though the use of a CFA is reported in the manuscript).
  4. Sample Size: Our original goal was for 150 participants of each type of student (Black or White) at two universities or 600 participants. We quickly realized collecting data from 150 White participants at one university (a historically Black college) was unrealistic. For these groups we collected as many as possible and stopped collecting at the end of the semester. For the groups for which we were able to meet our target goal we stopped when hitting that goal (~150 participants).
Sandman, Davis, & GlynnPrescient human fetuses thrive
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (We didn't have experimental conditions but naturalistic observations)
  3. Measures: We also assessed hundreds of other measures in this longitudinal project--everything from fetal growth (using ultrasound) to childhood MRI; We did not report these other measures because it was not be feasible.
  4. Sample Size: Longitudinal Study; Sample included all subjects for whom complete data were available for the variables of interest. For the NIH grants that supported the studies detailed descriptions of power were included.
Sternberg & McClellandTwo mechanisms of human contingency learning
  1. Exclusions: Full Disclosure
  2. Conditions: For comparison with the causal framing instructions, we required a comparison framing condition that led to comparable learning of the direct contingencies in the training phase of the experiment across the causal framing and comparison framing condition. Two conditions that did not meet this requirement were tried before the object framing condition. Details are reported in the first author's dissertation (Sternberg, 2012).
  3. Measures: Participants were also asked to give subjective ratings about the probabilities of the outcome for each item at the very end of the experiment, immediately before debriefing. These are not reported in the paper, as we found early on that while they demonstrated participants had declarative knowledge of the direct contingencies, they were in general not reliably sensitive to the observed indirect effects in either task.
  4. Sample Size: As cue competition/indirect effects in fast-paced response time tasks have not to our knowledge been previously observed in the contingency learning literature, we could not perform a direct power analysis based on previous findings. However, we decided on specific target sample sizes (48 per condition in the RT task and 24 per condition, in the prediction task, respectively) from the outset, and stuck with them.
Szpunar, Addis, & SchacterMemory for emotional simulations: Remembering a rosy future.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample size was determined on basis of results of previous studies run in the lab. We aspired to reach 24 participants in each delay condition half using the memory sampling technique and half using the list sampling technique. This goal was not modified in the course of the experiment.
Waytz & YoungThe Group-Member Mind Trade-Off: Attributing Mind to Groups Versus Group Members
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Targeted sample sizes were based on previous similar studies, taking into account the particular design of the study at hand, and the total number of cells. We stopped data collection when we reached our pre-determined targets. The first two studies relied on item-wise analyses, so we aimed for approximately 20 subjects accounting for data loss/gain that results from typical discrepancies between MTurk's reported number of hits accepted and the actual numbers of participants that we identified as completing the study upon data inspection. Studies 3 and 4 relied on subject-wise analyses and either a mixed design (Study 3) or a within-subjects design (Study 4). The main analysis of Study 3 involved a 3x2 ANOVA (6 cells) as well as a between-subjects factor, included in an initial analysis, reported in the paper. The primary analysis of Study 4 was a 2x2 ANOVA (4 cells). We aimed for approximately 60 subjects and 30 subjects for Studies 3 and 4 respectively, again accounting for data loss/gain.
IssueAuthorsArticle Title
May 2013Lucas, Lawless et al.Does life seem better on a sunny day? Examining the association between daily weather conditions and life satisfaction judgments.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (N/A; no experimental conditions)
  3. Measures: Dataset used had many different questions, but only one was of interest to us, the single-item life satisfaction question we analyzed. We did provide a reference to the broader dataset that lists all questions, and we did not try running our analyses using any other measures before writing up the study. There were other weather variables that could have been analyzed; we used those that were most relevant for judging whether the day was a pleasant weather day or not. We did not download any additional weather variables from our sources that were analyzed but not reported.
  4. Sample Size: We used existing data and used all participants who we could match with weather conditions on the day of the survey.
Silvestrini, Gendolla et al.Automatic effort mobilization and the principle of resource conservation: One can only prime the possible and justified.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Actually, the answer is “yes”. However, our impedance cardiograph system (Cardioscreen 1000, Medis, Ilmenau, Gemany) automatically assess various measures of cardiac functioning that we don't analyze, because they are not related to our predictions (e.g., stroke volume or left ventricular ejection time). For the pre-ejection period, our main effort-related cardiac measure, and heart rate we don't use the automatic assessment of the Cardioscreen system but process the raw data with a lab-made software specifically designed to allow visual inspection of the individual cardiac cycles and of the B-point location, as reported in the methods section. That is, the measurement device automatically assess several indices of cardiac functioning, but we did neither measure them actively, nor did we analyze them.
  4. Sample Size: For the first study, we determined the sample size according to the sample sizes of our previous studies (at least 10 valid participants per cell). For the second study, we determined our sample size according to the recently published article of Simmons, Nelson, and Simonsohn (2011), which recommends at least 20 participants per cell. Most relevant, we didn't analyze the data before the data collection was completed and we didn't add participants after the original data collection.
Apr 2013Aknin, Barrington-Leigh et al.Prosocial spending and well-being: Cross-cultural evidence for a psychological universal.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full disclosure for all studies expect for Study 1. Study 1 used data collected in by Gallup World Poll, which contains hundreds of questions. We reported all questions of interest and direct interested readers to the long list of items used. 
  4. Sample Size: We recruited as many participants as possible in a given term or until study funds ran out.  
Shallcross, Ford et al.Getting better with age: The relationship between age, acceptance, and negative affect.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (N/A: Our study did not involve any experiments/manipulations.)
  3. Measures: We only reported measures that pertained to our research question. The manuscript states that data reported were part of a larger research project. 
  4. Sample Size: (a) Our sample size was determined based on power analyses that were conducted to address a different research question than was the focus of this manuscript. (b) We discontinued data collection when this sample size was achieved.
Van den Akker, Dekovic et al.Personality types in childhood: Relations to latent trajectory classes of problem behavior and overreactive parenting across the transition into adolescence.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (N/A; this is a longitudinal non-experimental study)
  3. Measures: Unreported measures were not related to research question
  4. Sample Size: The sample size was decided at the time that the study started (1999), based on power analysis, expected attrition  and practical concerns (available funding).
Mar 2013Van Dillen, Papies, & HofmannTurning a blind eye to temptation: How cognitive load can facilitate self-regulation.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: For participant selection purposes we implemented two additional measures (Body Mass Index and Dietary Restraint Questionnaire), but given that we observed no extreme scores on these measures which could have affected our results, we have not used them as a basis for excluding participants.
  4. Sample Size: We decided ahead of time on the minimum sample size and stopped recruiting participants when this number was reached.
Yang, Wu, Zhou et al.Diverging effects of clean versus dirty money on attitudes, values, and interpersonal behavior.
  1. Exclusions: Data from one participant in Experiment 4 excluded because she left before finishing the tasks, 3 participants in Experiment 5 excluded for answering their cell phones during the experiment, 4 participants in Experiment 6 excluded for skipping some trials during the main dependent measure.
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample sizes are determined based on pilot studies or previous studies and power analysis; Data collection stopping rule: decided ahead of time and stopped when the number was reached.
Feb 2013Conway & GawronskiDeontological and utilitarian inclinations in moral decision making: A process dissociation approach.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Study 1 included exploratory measures of five additional constructs for which we did not have a priori hypotheses (administered at the end of the study). Because these measures failed to show any significant relations to the constructs of interest (i.e., deontology/utilitarianism), they were not reported in the final paper. Full disclosure for Studies 2 and 3.
  4. Sample Size: The sample size was determined a priori on the basis of previous work in this area. The data were collected in one shot until we attained the desired sample for all studies. 
Gurven, von Rueden, Massenkoff et al.How universal is the Big Five? Testing the five-factor model of personality variation among forager-farmers in the Bolivian Amazon.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: The personality interviews were administered as part of a larger series of interviews (demographic, economic, etc.) and medical checkup we conduct among the Tsimane. Subjects are seen once a year by anthropologists and physicians affiliated with the Tsimane Health and Life History Project (http://www.unm.edu/~tsimane/). We average an 85% participation rate among Tsimane adults in the villages that are part of the THLHP. The sample size of our personality interviews was not pre-determined. Our self-report sample (n=632) resulted from all adults who participated in our project during 2009 and 2010 and who agreed to take the personality questionnaire.  For the first half of 2009, our clinic was mobile and traveled to each of the 28 villages then part of the THLHP. Subsequently, our clinic was located in a fixed position and we transported Tsimane adults from those 28 villages (over 40 years of age) to our clinic on a village by village basis (in addition to their immediate family members and any other village residents reporting injury or illness).  We targeted older individuals for our fixed clinic given the larger goals of the THLHP to investigate senescence in a traditional population. Thus, most of the individuals in our self-report sample under the age of 40 were interviewed at their village in the first half of 2009. During the year 2010, we administered the personality questionnaire to any adults visiting our clinic who did not complete the questionnaire the prior year.  At that point we deemed the sample size (n=632) and representativeness of the sample in terms of age, sex, village affiliation, education, etc. to be sufficiently robust. The spouse-report sample (n=430) was obtained the same way at our fixed clinic during 2011 and 2012.  No data analysis (with respect to the principal results of the paper) was conducted prior to collection of the full datasets.
Peetz & WilsonThe post-birthday world: Consequences of temporal landmarks for temporal self-appraisal and motivation.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Studies 4, 5, and 6 were the earlier studies conducted. To explore potential DVs,  we included additional measures and exploratory items in these studies. Most DVs showed the same pattern but in reporting the studies, we focused on reporting those items that were assessed across all studies for simplicity.
  4. Sample Size:  In Study 1, sample size was determined by inviting every student in an introductory psychology class present at a given day to participate (paper and pencil format). In Study 2a-c, and 3b, we posted a estimated number of participant slots on MTurk, aiming for around 30 participants per condition, accounting for data loss/gain that results from typical discrepancies between MTurk's reported number of hits accepted and the actual numbers of participants that completed the study.  In Study 3a, sample size was determined by collecting 30 participants per condition (paper and pencil format). In Study 4 and 6, we emailed a subset of potential respondents from an email list, aiming for around 25 participants per condition. Once the responses stopped coming in (less than two responses a day), we closed the study. In Study 5, sample size was determined by having a day of data collection assigned to this study and one other study. Three research assistants collected as many people as possible on that day for the two studies.
Sah, Loewenstein, & CainThe burden of disclosure: Increased compliance with distrusted advice.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Our minimum sample size was approximately 30 per cell and the research assistants collected as much data as they could on data collection days.
Syed & Seiffge-KrenkePersonality development from adolescence to emerging adulthood: Linking trajectories of ego development to the family context and identity formation.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: This analysis made use of a very large 10-year longitudinal study that contained many measures. We only included measures that were relevant to our research questions. 
  4. Sample Size: There was no stopping rule. We recruited as many participants as we could given the resources available. 
Jan 2013Bélanger, Lafrenière, Vallerand et al.Driven by fear: The effect of success and failure information on passionate individuals' performance.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We measured an additional potential mediator (self-clarity). We did not find any significant correlation with this measure and its reliability wasn’t great either (alpha < .70), therefore we decided not to discuss this measure.
  4. Sample Size: Sample size was determined based on our experience with this research topic. We stopped data collection at the end of the semester; this was decided a priori. Lastly, we did not conduct statistical tests until the end of data collection.
Cheng, Tracy, Foulsham et al.Two ways to the top: Evidence that dominance and prestige are distinct yet viable avenues to social rank and influence.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We excluded measures that are unrelated to the present research question, but that were collected as part of a larger study.
  4. Sample Size: Data collection was terminated at the end of term when our team of research assistants could no longer conduct study sessions.
Meissner & RothermundEstimating the contributions of associations and recoding in the Implicit Association Test: The ReAL model for the IAT.
  1. Exclusions: Full Disclosure
  2. Conditions: In Experiment 7, we manipulated cognitive capacity during the behavioral measure. While half of the participants had to memorize a 2-digit number, the other half had to keep an 8-digit number in mind. We hoped that the latter would show stronger correlations between the association parameters and the behavioral measures. However, this was not the case (nonsignificant moderation). Furthermore, a manipulation check revealed no significant group differences. Actually, the pattern suggested that both experimental groups had been under (equally) high cognitive load. Full Disclosure for Experiment 1 to 6.
  3. Measures: Subsequent to the measures reported in the article, we collected additional questionnaire measures in Experiment 5, 6, and 7. As these items were unrelated to the research question they were not included in the analysis.
  4. Sample Size: The sample size was determined a priori, based on prior studies in the research area as well as considerations of the maximum sample size that could be achieved within the period of data collection. Data analysis started after data collection had been completed.
Vohs, Park, & SchmeichelSelf-affirmation can enable goal disengagement.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We included a thought perception measure in experiment 2 that was not germane to the present investigation.
  4. Sample Size: The stopping rules for Exps 1 and 4 were simple: the data came only from participants in certain courses and there were only a set number of students in those courses at the time. Exps 2-3 used a stopping rule of obtaining as many participants we could during the course of the semester.
Vorauer & SucharynaPotential negative effects of perspective-taking efforts in the context of close relationships: Increased bias and reduced satisfaction.
  1. Exclusions: Full Disclosure (In footnote 2 we reported the criteria for exclusion, but we did not report the numbers excluded. Footnote 2 states: "Throughout the three studies reported in this paper, participant numbers do not include participants who failed to follow instructions (e.g., did not complete the manipulation or completed the measures incorrectly) or for whom data on entire measures were missing." The actual numbers of participants excluded were 16 in Study 1, 15 (pairs) in Study 2, and 1 (pair) in Study 3. We had much better control over data collection in Study 3 because only one pair participated (in person) at a time. In the online and group session studies (preferred by participants with less strong English language skills) participants were more apt to find the transparency measures confusing and were able to skip the manipulation if they so chose)
  2. Conditions: Full Disclosure
  3. Measures: We included some additional items on an exploratory basis that were not of central importance to the focal research question in the paper. We also asked additional background questions about participants' relationship with their acquaintance/friend/partner for the purpose of understanding our samples.
  4. Sample Size: In each study we determined a priori how many participants would be sufficient and feasible to run. We did not analyze or even enter the data into SPSS until we had come as close to our target N as we could and stopped data collection.
Dec 2012Chan, Allik, Nakazato et al.Stereotypes of age differences in personality traits: Universal and accurate
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Data were collected as part of a larger battery in 26 countries, other administered measures were previously published (e.g. De Fruyt et al., 2009, Assessment). Three additional NCS items were examined and discarded before analyses of age stereotypes began.
  4. Sample Size: Sample size determination was reported in the first publication of the larger study (De Fruyt et al., 2009, Assessment). Collaborators from each site were requested to collect a minimum sample size of 50 M and 50 F participants.
Galak, LeBoeuf, Nelson et al.Correcting the past: Failures to replicate psi
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
Kochanska, Kim, & Koenig NordlingChallenging circumstances moderate the links between mothers' personality traits and their parenting in low-income families with young children
  1. Exclusions: Full Disclosure (One mother (out of 186) did not provide any questionnaire data, and one more did not provide self-reported power assertion data. Behavioral data are complete)
  2. Conditions: There were no experimental conditions that were relevant at the time (all those measures were collected during pretest in a large intervention study, prior to any randomization to conditions). At that point, it was simply one group of mothers.
  3. Measures: Multiple measures are collected in this very large research program (a large longitudinal study). We reported the measures pertinent to the questions asked in the paper.
  4. Sample Size: The sample size was proposed in the original NIH grant application, subject to further discussions with Program Officers (e.g., budget constraints).  We collected all the data we had promised. We never "stopped" collecting data for any reasons. 
Lemay, Overall, & ClarkExperiences and interpersonal consequences of hurt feelings and anger
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Measures not related to research question, and too many items to report all of them.
  4. Sample Size: Decided ahead of time to collect data until minimum sample size achieved and this was followed.
Paluck & ShepherdThe salience of social referents: A field experiment on collective norms and harassment behavior in a school social network
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We measured a few more aspects of the social network, and these seemed to add too much to the length of the survey; we received few responses on these questions in the first survey wave (they were some of the final survey questions; also one of them was a "negative" network question about conflict that people didn't seem to want to answer) so we subsequently dropped these questions in our later survey waves. We also tried out some questions about "narrative" that did not seem to be responsive to our intervention--two questions that we added to waves 2 and 3 of our survey that reflected some oft-repeated phrases that students used during their intervention assembly. Finally, we measured a few other items that we are reserving for future papers, one on girl vs. boy norms (we are writing a gender paper on that now), and some aspects of students' cultural tastes (e.g. music, internet use), for a potential future paper on the diffusion of taste in the network.
  4. Sample Size: Full Disclosure (we simply measured the whole network)
Sullivan, Landau, Kay et al.Collectivism and the meaning of suffering
  1. Exclusions: Full Disclosure
  2. Conditions: Study 3 originally included a control condition, in which we asked participants to focus on neutral aspects of a story (rather than on singular pronouns, to prime individualism, or plural pronouns, to prime collectivism). Whereas our individualism/collectivism primes were based on prior research and have been repeatedly validated, the additional control condition was our own creation and previously untested. Our control condition did not significantly differ from either individualism or collectivism conditions on our dependent measure. We removed the participants who were assigned to this condition, because their responses did not seem to illuminate the question at hand (the differences between individualism and collectivism), and because it is not conventional in research priming individualism and collectivism to include a control condition.
  3. Measures: In Study 2 we included a belief in a just world scale (based on Lipkus, 1991). Our prime of individualism-collectivism did not have an effect on this scale. We originally reported this finding, but the editor and a reviewer asked that we remove it to streamline our presentation (or to research this particular issue more thoroughly; since it was not our priority in this project, we chose the former option).
  4. Sample Size: Based on a power analysis, assuming a medium effect, we sought to obtain at least 20-25 participants per condition across the studies. In some instances we collected more participants to better insure a robust effect. The editor asked us to explain our procedures in correspondence but did not require that we report them in the text.
Nov 2012Locke, Craig, Baik et al. Binds and bounds of communion: Effects of interpersonal values on assumed similarity of self and others
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: In the cross-cultural studies (studies 3-4) we included a few exploratory questions (concerning knowledge of the other country and imagined experiences as an exchange student in the other country) but did not mention them because they were unrelated to our hypotheses.
  4. Sample Size: Collected data until we obtained a predetermined sample size. (In Study 5 we kept collecting data after exceeding that sample size for reasons completely unrelated to the study.)
Oct 2012Becker The system-stabilizing role of identity management strategies: Social creativity can undermine collective action for social change.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: I also measured identification in all studies but did not report the full results due to a reviewer's request. However I mentioned some brief results in footnotes.
  4. Sample Size: Student assistants were instructed that they need at least N=20 participants per experimental condition. I used the full data set that student assistants collected within a given time.
Sep 2012Federico, Deason, & FisherIdeological asymmetry in the relationship between epistemic motivation and political attitudes
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Not all assessed measures were relevant to this study because data were from a large national survey addressing several research questions.
  4. Sample Size: We simply tried to go for the largest sample size possible within our budget; we contracted for a sample of N = 1500 and received a sample of that size from Knowledge Networks. No decisions were made along the way by us about when to terminate and no analysis was done until we received the full dataset. The sample size was well within the range needed in order to sufficiently power the analyses we did.
Lilienfield, Waldman, Landfield et al. Fearless dominance and the U.S. presidency: Implications of psychopathic personality traits for successful and unsuccessful political leadership
  1. Exclusions: Full Disclosure (We reported all observations that were central tests of our original hypotheses.)
  2. Conditions: Full Disclosure
  3. Measures: We examined other personality measures that we didn't report but these weren't relevant to our hypotheses (and we didn't analyze these).
  4. Sample Size: Full Disclosure
Shiota & LevensonTurn down the volume or change the channel? Emotional effects of detached versus positive reappraisal.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: A battery of trait questionnaire measures completed prior to the study and a few additional thought content and facial expression measures assessed were not reported as these were not relevant to the research question addressed in the paper.
  4. Sample Size: Original target sample size for the entire study (including a 3rd experimental reported in the paper but not used in analyses) was 252 based on power analyses for the grant supporting the research. Final sample size for the entire study was 223 with data collection ended due to departure of the project director (Shiota) for a new job.
Aug 2012Cohrs, Kämpfe-Hargrave & RiemannIndividual differences in ideological attitudes and prejudice: Evidence from peer-report data
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (non-experimental research)
  3. Measures: Analyses were of secondary nature; data stemmed from large-scale surveys involving numerous measures unrelated to the research questions
  4. Sample Size: Sample sizes were determined by practical issues of data collection; for Study 2, we refer to a publication describing the complete data collection procedure in detail (Stößel et al., 2006)
Hill, Rodeheffer, Griskevicius et al.Boosting beauty in an economic decline: Mating, spending, and the lipstick effect
  1. Exclusions: When we do studies on mate attraction (the current research), we routinely remove Ps who are not heterosexual and who are married. Although we screen Ps for this ahead of time, we inevitably get a few who register for the experiments anyway. This information was included in our sample descriptions for most of the experiments, but it wasn’t mentioned in all of them. This was simply an oversight on our part.
  2. Conditions: Full Disclosure
  3. Measures: Each experiment included some variables measuring potential mediators (i.e., process variables) and moderators (e.g., SOI) which we did not report. The last two of the experiments also included measures that were used to pilot a new study that we were developing.
  4. Sample Size: We typically go with the conventional n = 30 within in each experimental cell. Because we are at the mercy of our research Ps to show up to experiments for which they are registered & not show up at experiments for which they are not, sometimes we get somewhat more than this & sometimes we get less.
Smillie, Cooper, Wilt et al.Do extraverts get more bang for the buck? Refining the affective-reactivity hypothesis of extraversion.
  1. Exclusions: Two participants were omitted from study 4 as they were disproportionately older than the rest of the sample and one reviewer was not happy about this (although this had no effect on the results).
  2. Conditions: Full Disclosure
  3. Measures: Additional measures were assessed but not reported because they were not related to the research question.
  4. Sample Size: We aimed for approximately 30-40 participants in each condition as a rough rule of thumb and the exact point of termination was decided according to convenience (e.g. once the student research participation period had ended for the year).
Jul 2012Aviezer, Trope, & TodorovHolistic person processing: Faces with bodies tell the whole story.
  1. Exclusions: Full Disclosure
  2. Conditions: One experiment was removed from the analysis due to editorial suggestion because the data showed a more complex picture. it was however by no means a failed manipulation. It is being prepared for publication in a different outlet.
  3. Measures: Full Disclosure
  4. Sample Size: In this paradigm (face in body context) I typically run up to 20 subjects per condition. This is based on several previous studies showing that an n of 15-20 is sufficient for reliably detecting the effect (it is a robust effect). We used the same rule of thumb here. I instructed the RA that we should run between 20-30 subjects and data collection stopped when the RA told me that we are in the range that we set for data collection. Data analysis only took place after all the data was collected, so we never stopped running as a function of reaching significance or particular data patterns.
Ermer, Kahn, Salovey et al.Emotional intelligence in incarcerated men with psychopathic traits.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: A variety of neuroimaging personality genetic and neuropsychological measures were also assessed from participants. We only discussed measures related to the research question addressed in this paper.
  4. Sample Size: We used all available MSCEIT (emotional intelligence) data collected over a three year period (except for reported exclusions). Original sample sizes for the NIH grants were determined based on power analyses to detect the effects of interest for the grant's specific aims (largely focused on brain imaging research questions). However, we ended up collecting as many subjects as possible given the funds awarded.
Jun 2012Bergsieker, Leslie, Constantine et al.Stereotyping by omission: Eliminate the negative accentuate the positive
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (though there was a partially successful very similar parallel study with manipulated self-presentation concerns that didn't make it into the final paper, but which is reported elsewhere: Fiske, Bergsieker, Russell, & Williams, 2009)
  3. Measures: Other measures assessed but were not related to the research question
  4. Sample Size: Ran the max # of Ps available to us in the rather limited Princeton participant pool each term
Carter & GilovichI am what I do not what I have: The differential centrality of experiential and material purchases to the self.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Standard demographic variables (e.g. major or GPA) might have been assessed but weren't reported because they were not relevant to the research question.
  4. Sample Size: We would never take a first look at the data before hitting a rather high threshold for sample size (at *least* 25 per cell) and if we decided to run additional participants would run until we hit a new target N which was typically some multiple of the initial sample. In the event that I looked at the data before we hit that target (to make sure there were no problems with the survey) I did not stop data collection until we hit the target N even if the data were already significant.
Danziger, Montal, & BarkanIdealistic advice and pragmatic choice: A psychological distance account.
  1. Exclusions: Full Disclosure
  2. Conditions: Throughout the course of the project we first tested several scenarios comparing choice and advice. Many of them yielded significant differences, some did not. We then focused on some of the scenarios that yielded significant differences, as reported in the accepted version of the manuscript.
  3. Measures: Full Disclosure
  4. Sample Size: Our decision was based on general conventions (central limit sample size vs. effect size) as well as on ease of recruiting subjects for the study and the number of conditions in the study. Studies ranged from ~40 to 60~ subjects per condition. In studies with 4 conditions we had slightly less subjects per condition than in studies with 2 conditions. In study 5 in which there was a confederate we had slightly less subjects per condition (~20). Pretests had fewer subjects.
Mumenthaler & SanderSocial appraisal influences recognition of emotions
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: we defined the sample size in both experiments before collecting data based on similar experiments and the availability of participants in our subject pool. Data analyses were performed once the minimum sample size was achieved.
Rothschild, Landau, Sullivan et al.A dual-motive model of scapegoating: Displacing blame to reduce guilt or increase control.
  1. Exclusions: 26 participants were excluded from Study 2. Of these excluded cases, 11 were excluded for indicating that they were over 25 years old. Although we tried to specifically recruit participants between the ages of 18 to 25 years, 11 participants outside that age range ended up participating anyway. Given that our manipulation specifically targeted young people ages 18-25, we only included participants that fell within that age range. The remaining 15 cases were excluded for failing a attention check question that followed the article that comprised our threat manipulation which read "According to the article you just read, who is the primary cause of climate change? a) young people (ages 18-25), b) middle-aged people (ages 35-55), c) older people (ages 60-80), d) the primary cause of climate change is unknown."    16 participants were excluded from Study 3. Of these excluded cases, 5 were excluded for indicating that they were over 25 years old. The remaining 11 participants were excluded for failing the same attention check question used in Study 2.
  2. Conditions: Full Disclosure
  3. Measures: We did not report filler questions unrelated to our research questions that were included to support our cover story.
  4. Sample Size: For Study 1 we ran a power analysis assuming a medium to large effect size that indicated a optimal sample size of 111 participants. We gave 6 RAs 20 questionnaires each to hand out around campus. Our RAs returned with 114 questionnaires. For Study 2 we performed a power analysis assuming a large effect size that indicated a optimal sample size of 87 participants. Although we reached this target by the end of the semester, because we had to exclude 26 cases we ended up with only 61 participants in our final analysis. For Study 3 we were shooting for at least 20 participants per cell. We stopped collecting data once we had reached 80 participants. However, because 16 cases ended up being excluded from data analysis we ended up with 64 participants.
Shu & GinoSweeping dishonesty under the rug: How unethical actions lead to forgetting of moral rules
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
May 2012Briñol, McCaslin, & PettySelf-generated persuasion: Effects of the target and direction of arguments
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We excluded any items that were unrelated to the research questions or that were included for exploratory purposes. Furthermore we focused on the items that were included in all the studies within the paper in order to maintain convergence across experiments.
  4. Sample Size: The number selected was based on our prior experience with this research topic and the number of participants that could be successfully recruited within an academic term (without crossing terms). Also given that not all participants who signed up for the experiments in advance showed up to participate the total number of subjects per cell was not identical in all cases. Also we did not conduct any statistical tests until we were done collecting data.
Apr 2012Kim, Schimmack, & OishiCultural differences in self- and other-evaluations and well-being: A study of European and Asian Canadians. 
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (Not Applicable. Our research did not involve an experimental design. )
  3. Measures: The data sets included other measures (e.g., importance ratings of different domains in life, gratitude questions and etc). Including the measure(s) would have made the model too complex. 
  4. Sample Size: We did not determine sample size ahead of time but our sample size was very large relative to other studies. We had sufficient power to demonstrate small effects (r = .1, N = 780, power = 80%)  The sample was based on five data sets of which one was collected in 2010 until the end of the term.
Malle & HolbrookIs there a hierarchy of social inferences? The likelihood and speed of inferring intentionality, mind, and personality
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: First study's sample size was determined by rough power analysis on the basis of preliminary dissertation studies (which were underpowered but showed general pattern), but we fell short of target of 50, at end of semester. Second, third study, and fourth study sampling ended when just above target of 50. For fifth study we targeted more, around 70, but I can't recall the exact number. We expected more missing values and greater error variance, because of the changed procedure.
Mar 2012Sullivan, Landau, Branscombe et al.Competitive victimhood as a response to accusations of ingroup harm doing
  1. Exclusions: Full Disclosure
  2. Conditions: Study 4 originally included an additional control condition, which claimed that neither Black Americans nor White Americans experience any discrimination in university admissions. This condition did not significantly differ from the other two conditions. Upon reflection we realized that this condition was ineffective because our outcome measure asked participants to compare the amount of discrimination their group experiences to that of the outgroup. Whereas in the threat condition participants increased their rating as a function of defensive processes (as evidenced by the condition’s differentiation from the other comparison condition, and by our mediational finding), in the control condition participants may have rated the groups as equal due to non-defensive processes. We dropped the condition due to its irrelevance to the overall project and its lack of significant differentiation.
  3. Measures: In Studies 2 and 4 we included items measuring the extent to which participants engaged in competitive victimhood with outgroups other than the group they were accused of harming. In one case there was an effect of condition on these items, in the other there was not. Based on these results and the results of other studies not included in this paper, it seems to be the case that the order of measurement makes the difference. When participants engage in competitive victimhood with the relevant outgroup first, condition has no effect on ratings of other outgroups. However, since we never tested the order effect in a full factorial design, we decided not to report these inconclusive findings.
  4. Sample Size: Based on past convention, we sought to obtain at least 15 participants in each condition. Once this was obtained, we would continue to collect data until the end of the semester. In Study 3 (a one-shot online sample), we simply collected a number of participants that seemed more than sufficient (approx. 150)
Zhou, He, Yang et al.Control deprivation and styles of thinking
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We also assessed other measures at the end of the experiment but did not report these as they were meant for pretesting for another experiment.
  4. Sample Size: Sample sizes were predetermined by pilot studies.
Feb 2012Wohl, Hornsey, & BennettWhy group apologies succeed and fail: Intergroup forgiveness and the role of primary and secondary emotions
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: In the first two studies we included other measures (e.g. collective angst and SDO) for exploratory purposes. These were unrelated to the main hypothesis and were included to see if the dvs would respond to the variations in primary and secondary emotions for future studies. In subsequent studies in the paper we included a measure of perceived ulterior motives response satisfaction and perceived likelihood that the transgressor would re-offend. In the last study we included a measure assessing perceptions of the Canadian Military as a pilot for work that a person in Matthew Hornsey's lab was doing on dehumanization of the military.
  4. Sample Size: Number of participants was based on providing sufficient power (?=.80) to detect a moderate effect size d = .5 as significant (p < .05 two-tailed; Cohen 1988). My rule of thumb to get this is to have 30 participants per cell in an experiment. Data collection continued until we could reach at least that number or until participation dropped off. For the studies in which we were assessing more complex models (moderated mediation) I like to up that number to between 40-50. As can be seen in the manuscript we had over 150 participants in studies where more complex models were tested. Analysis did not occur until we stopped collecting data.
Jan 2012Feinberg, Willer, & Keltner Flustered and faithful: Embarrassment as a signal of prosociality
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We also conducted a study displaying a picture of President Obama expressing an ambiguous emotion with a caption indicating he was expressing embarrassment or amusement. When labeled as embarrassed, the president was rated as more prosocial. This study was included in the initial submission, but reviewers suggested results had confusing implications for our theory, so it was removed.
  4. Sample Size: Study 1a: Without prior precedent, we intended to run about 50. 57 participants had taken part when the data-collection week ended. Study 1b: Research assistants were directed to continually advertise the survey online for a week with the hope of collecting about 40 participants, a target based on Study 1a results.Study 2: Participants were recruited as part of a larger recruitment block. The sample size was a product of the number of participants who signed up during the block. We estimated 80-100 would sign up during this time, a total we thought would offer enough statistical power to find a significant result if one existed. Studies 3 & 4: Participants were recruited as part of two online surveys that involved the reported study questions and batteries that served as “prescreening” for unrelated lab studies (our items appeared first). Sample sizes were a product of the number of participants who took the surveys. Study 5: We used a rule of thumb of 15-20 participants per condition. Because the study was very labor-intensive – employing an experimenter and trained confederates – we looked at the data after approximately 15 participants to check the progress of the study. Results were promising so we continued collecting data until we reached the target sample range.
Johnson, Freeman, & PaukerRace is gendered: How covarying phenotypes and stereotypes bias sex categorization.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Data collection occurred for a pre-specified period of time and data from all participants who completed the study during that period were included in analyses.
Laurin, Kay, & FitzsimonsDivergent effects of activating thoughts of God on self-regulation
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure (with the exception of some demographic information such as country of birth)
  4. Sample Size: During the time at which these studies were conducted, our typical approach was to simply aim for 15-20 observations per cell, and to analyze the data at the end of the semester if the sample size had or had nearly reached that threshold.
Oishi, Miao, Koo et al.Residential mobility breeds familiarity-seeking
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We assessed other measures (e.g. 5-item satisfaction with life scale: SWLS) but did not report them as they weren't relevant to our hypotheses.
  4. Sample Size: Basis of data collection termination was that we set out to collect x participants and terminated when that goal was reached. We never peaked at the data before terminating the data collection.
de Haan, Deković, & PrinzieLongitudinal impact of parental and adolescent personality on parenting
  1. Exclusions: Full Disclosure
  2. Conditions: N/A (non-experimental study)
  3. Measures: Measures not related to research question were not reported.
  4. Sample Size: We aimed for the largest sample size possible within our budget. The desired sample size was decided ahead of time, and the data collection termination rule was not modified during the course of data collection.
IssueAuthorsArticle Title
Mar 2013Gillespie, Pearlmutter et al.Against structural constraints in subject-verb agreement production.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: There were subsets of items that were considered "fillers" in these experiment that were not related to the research questions at hand. None of these have been reported elsewhere.
  4. Sample Size: In these experiments we ran as many participants as possible (far more than previous research has) in order to show that our null effects were not due to a lack of power. There was not set stopping rule, but we decided to collect over 100 participants' data in each experiment.  In Experiment 2, we collected over 150 participants' data, while in similar studies it is common to collect data from 40-50 participants.
Halali, Bereby-Meyer et al.Pitfall or scaffolding? Starting-point pull in configuration tasks.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: At the end of experiments we also measured individual thinking style with the REI (24 items) thinking style questionnaire (Pacini and Epstein, 1999). Since we didn't find any systematic effects and since it was not part of our hypotheses we did not report the results of this questionnaire.
  4. Sample Size: We did not determine the sample size by power analysis, but given our previous experience with similar experiments we aim to collect between 25-30 participants in each between participants experimental condition. In Experiment 3 we ran 40 participants in each condition because we had an hypothesis about a null effect.
Ortells, Mari-Beffa et al.Unconscious congruency priming from unpracticed words is modulated by prime-target semantic relatedness.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
Schotter, Ferreira, & ReynerParallel object activation and attentional gating of information: Evidence from eye movements in the multiple object naming paradigm.
  1. Exclusions: Data from 7 subjects were excluded from the analyses due to data loss (blinking during the trial, which cannot be classified according to eyetracking data analysis methods). This was not reported in the manuscript to save space and to focus on the important results and because it is peripheral to the theoretical focus of the study (i.e., the fact that subjects blinked excessively is independent from the purpose of the experiment). For all other data (i.e., that included in the analyses), data exclusion procedures and number of included observations were reported.
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure (As noted in Rayner, Staub, Li, et al., this question not entirely relevant in the context of eye movement research. There are tons of measures that could be examined - eye movements give you data extended over a large temporal window. We reported the most theoretically relevant eye movement measure.)
  4. Sample Size: We estimated the number of subjects needed based on the number of subjects and items used in previous, related research.
Jan 2013D'Angelo, Jimanez, Milliken et al.On the specificity of sequential congruency effects in implicit learning of motor and perceptual sequences.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We decided ahead of time to collect data until a sample size of of 40 was achieved, based on previous work examining similar effects.
Grzyb & HabnerExcessive response-repetition costs under task switching: How response inhibition amplifies response conflict.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Target sample sizes of experiments were based on pilot work. Target sample size of Experiment 3 was larger because of an additional manipulation. We stopped recruiting participants when target sample size was reached and, if already more participants enrolled, also collected data from those participants.
Liesefeld & ZimmerThink spatial: The representation in mental rotation is nonvisual.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Participants of Experiment 2 also filled out a paper-and-pencil mental-rotation test (Peters et al., 2005). This test was administered to explore a research question unrelated to the topic of the article. It cannot have influenced the data presented in this article, because it was administered after participation in the experiment.
  4. Sample Size: We have decided on (a generous) sample size based on our experience with previous experiments of this type. Because mental rotation is probably influenced by gender, we made sure to include the same number of men and women. In Experiment 2, a considerable number of participants was excluded based on our predetermined exclusion criteria and due to excessive EEG artifacts (as reported). In order to obtain our desired sample size and to balance the cells of the design, we, therefore, replaced excluded participants.
PecherNo role for motor affordances in visual working memory.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Based on sample sizes of comparable studies in the literature I aimed to run 24 subjects in Experiment 1. I had a slightly higher number of subjects sign up to take no-shows into account, therefore the number of actual participants was 26. The results of Experiment 1 indicated that this number provided enough power. Therefore, I used the same procedure for the other four experiments, that is, aim for at least 24 subjects. The actual numbers were between 26 and 28.
Sulpizio, Arduino, Paizi et al.Stress assignment in reading Italian polysyllabic pseudowords.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: The number of participants was based on the psycholinguistic literature and on our previous experience with other studies addressing the same topic.
Nov 2012Atalay & MisirlisoyCan contingency learning alone account for item-specific control? Evidence from within- and between-language ISPC effects
  1. Exclusions: In Experiments 1 we excluded one participant, because the participant's native language wasn't Turkish. In Experiment 2, we excluded 2 participants, one for being color-blind and the other for not reporting better proficiency in Turkish than in English. These participants were replaced.
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We estimate sample size from similar studies. For Experiments 1A and 1B, sample size in original submission was 34 and 20 respectively. To increase statistical power as recommended by the reviewers sample size was increased to 52 in both experiments.
ChuderskiThe contribution of working memory to fluid reasoning: Capacity, control, or both
  1. Exclusions: Full Disclosure (Two data were excluded from exp. no. 5, and we reported that in the paper. Actually, we did not report on participants who resigned during test taking - assuming that they did not agree to participate in experiment at all, but delayed their decision. As I remember, it could be as few as 3-4 such persons in total, in all our six experiments reported in JEPLMC (i.e., out of around 670 participants in total), but unfortunately we did not register that.)
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Our criteria for sample sizes in each experiment were exactly like this: "decided ahead of time to collect data until minimum sample size achieved and this was followed" - in correlational studies on WM and Gf like the one reported in JEPLMC it is more or less known how many observations one needs to get significant correlations (if they exist) - from 100 to 150 participants (only in the last experiment no. 6 we examined 75 persons, because it was a simple follow-up study - a continuation of exp. no. 5). Moreover, as we paid our participants, we had to estimate the necessary funds for each experiment in advance.
ClapperThe effects of prior knowledge on incidental category learning
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: I had run previous experiments of this type, and determined my sample size in advance based on that (just using the same number as in previous studies of this type). Data collection stopped when the target sample size was reached.
Pansky Inoculation against forgetting: Advantages of immediate versus delayed initial testing due to superior verbatim accessibility
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: The sample size was predetermined at 32 participants per experimental group (X 3 groups = 96) because it was the minimal sample size that satisfied the following two requirements: 1) n>30 to ensure normality of the sampling distribution and 2) n divisible by 8 so that all the counterbalancing combinations were equally repeated. I stopped data collection when n=32 for each experimental group.
Perfect & WeberHow should witnesses regulate the accuracy of their identification decisions: One step forward, two steps back
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (We reported the existence of a manipulation of disguise in footnote 2, but we did not report the analyses because it did not add interact with our key manipulation, and we were mindful of the word-limit for a research report.)
  3. Measures: The computer program used to collect responses automatically recorded response latencies, but we did not look at these data because we had no theoretical interest in this measure.
  4. Sample Size: We based our sample size on effect sizes found in previous work in the lab. We estimated that we would need a sample of 50 per cell (400 in total) and the research assistant was instructed to stop once that target was reached. In fact they exceeded this total because they over-recruited on the final day, and these data were retained.
Pratte & Rouder Assessing the dissociability of recollection and familiarity in recognition memory
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Simulations were conducted before data collection to determine how many participants and trials were needed to obtain reasonable parameter estimates. We then collected data from about this many participants perhaps with fewer in followup experiments in which highly accurate estimates were not needed. No statistical analyses were conducted before all data were collected.
Rayner, Staub, Li et al.Plausibility effects when reading one- and two-character words in Chinese: Evidence from eye movements
  1. Exclusions: Data from 2 participants were excluded due to poor calibration of the eye tracker.
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure (This question not entirely relevant in the context of eye movement research.  There are tons of measures that could be examined - eye movements give you data extended over a large temporal window.  We did report all of the relevant/standard measures associated with the designated target words in the study.)
  4. Sample Size: The experiment had 4 conditions and we wanted 10 data points from subjects in each condition. We did not modify this goal during the course of the research.
Sep 2012Foster & SahakyanMetacognition influences item-method directed forgetting
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We decided to collect 32 participants in each group. If, by the end of data collection, we obtained an effect in any direction that was close to significance (e.g., p=.08), we continued to run an additional 16 participants.
Gaschler, Frensch, Cohen et al.Implicit sequence learning based on instructed task set
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample size for Exp. 1a was based on a prior study (Wenke & Frensch, 2005). Based on the effects in Exp. 1a, a larger sample size was chosen for Exp. 2a and following. In addition, an increase in sample size for Exp. 2b was suggested (and achieved) during the review process.
Hino, Lupker, & TaylorThe role of orthography in the semantic activation of neighbors
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: (a) We usually collect 30-40 participants' data in a single experiment. Thus, we collected 44 participants in each of the two experiments in this article. (b) We kept running the experiments to see whether the effect becomes significant not only in the subject analysis but also in the item analysis. After getting 44 subjects in these experiments, we stopped collecting data because the effect became significant in both the subject and item analyses.
KleinA role for self-referential processing in tasks requiring participants to imagine survival on the savannah
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: I determined sample size based on two factors (a) subject availability (doled out each quarter to each researcher) and (b) previous experience with memory research of this nature suggesting that a sample size of between 20-40 provides a reasonably stable estimate of population parameters.
Liefooghe, Wenke, & De HouwerInstruction-based task-rule congruency effects
  1. Exclusions: Full Disclosure
  2. Conditions: A randomization error occurred in one condition of Experiment 2, this condition was replaced by a properly randomized condition.
  3. Measures: Full Disclosure
  4. Sample Size: Approximate sample size was decided ahead on the basis of pilot work.
Monaghan, Mattock, & Walker The role of sound symbolism in language learning
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample size was determined on the basis of previous similar studies (in particular Monaghan Christiansen & Fitneva 2011 JEPG); we stopped collecting data once we had 24 participants in each experiment.
Quinlan & CohenGrouping and binding in visual short-term memory
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample size in original submission in both studies was set at 22 (estimated from previous similar studies). To address concerns about statistical power in revising the manuscript the sample size was increased to 30 in both studies.
Uzer, Lee, & BrownOn the prevalence of directly retrieved autobiographical memories
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We decided sample size ahead of time based on previous studies
Jul 2012Bissett & LoganPost-stop-signal adjustments: Inhibition improves subsequent inhibition
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We decided to run 24 subjects in each experiment before we began collecting data.  This decision was based on the number of subjects we needed to complete counterbalancing (e.g., groups of 4) and the number of subjects we have run in similar experiments in the past that detected differences of the same magnitude we expected here.
Boywitt & MeiserThe role of attention for contextcontext binding of intrinsic and extrinsic features.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We usually aim for around 100 participants in two group designs with the type of model used here because it gives adequate power. For Experiment 2 we didn't reach that goal however because we couldn't find any more volunteering participants. Therefore sample size in Experiment 2 (N=92) was slightly smaller than in Experiment 1 (N=100).
Davis, Love, & PrestonStriatal and hippocampal entropy and recognition signals in category learning: Simultaneous processes revealed by model-based fMRI
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Sample size was chosen to match sample size from previous high resolution fMRI studies after the expected number of excluded subjects were taken into account.
Pachur & ScheibehenneConstructing preference from experience: The endowment effect reflected in external information search
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We had originally planned for a sample size of 75 per condition (150 in total). Given that this was the first study on external search processes preceding WTA/WTP judgments, we aimed for a rather high number of participants. Eventually, 2 additional participants registered for the experiment on the institute’s website and we decided not to exclude them.
Rabovsky, Sommer, & Abdel RahmanImplicit word learning benefits from semantic richness: Electrophysiological and behavioral evidence.
  1. Exclusions: Full Disclosure
  2. Conditions: We also manipulated the number of associates (orthogonal with the number of features) but there were no effects and we did not report this manipulation in the paper due to the conciseness and brevity required by the short report format.
  3. Measures: Full Disclosure
  4. Sample Size: We decided ahead of time to collect data from 24 participants and this was followed.
May 2012ArndtThe influence of forward and backward associative strength on false recognition
  1. Exclusions: We completed a stimulus counterbalance at 104 subjects, and collected data from a total of 106 participants. One of the participants’ data was thrown out due to being part of an incomplete counterbalance (participant # 105), while the other was a participant number (#50 in our order of collection) that was mistakenly run by two research assistants. Thus, there were two peoples’ data in the same counterbalance condition at that point in our data collection. Our standard practice is to only use the first person’s data that were collected and to toss the other (as opposed to breaking the randomization of counterbalance orders we set up prior to running).
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: I generally seek to collect a large number of observations per condition (~1000) for a study. No a-priori power analysis was done. If the sample size is 50 or 60, an effect looks very large (informally gauged by comparing the mean difference to the standard error of the mean) and when we reach the end of data collection in a study for the semester (people are no longer signing up for that study very often or the semester is over with), I’ll often terminate the study and consider it finished. Otherwise, we typically try to run until we get to around 1000 observations per condition, look at the data then, and decide if it makes sense to collect more data (e.g., when effects are small, but potentially meaningful and we’re in something of an inferential “no man’s land” with a small mean difference that isn’t outside of the margin of error).
Fedor, Varga, & SzathmárySemantics boosts syntax in artificial grammar learning tasks with recursion
  1. Exclusions: Data form the last participant was not analyzed and not included. He participated in the study a week later than the rest of the participants and his data got lost.
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We aimed to have 15-20 participants (in accordance with previous similar studies in the literature) in each group and tested as many as volunteered within the time frame we had.
Griffiths, Hayes, & NewellFeature-based versus category-based induction with uncertain categories
  1. Exclusions: Full Disclosure
  2. Conditions: In Experiment 1 we replicated the primary result of prior studies (i.e. that feature-based reasoning is frequently used to make predictions about instances whose category membership is uncertain). For pragmatic reasons, when we ran this study we also constructed a novel procedure (resulting in a between-subjects manipulation) to act as a pilot for a second (unrelated) research stream. This new procedure was a failure, but the existing procedure replicated prior data. We reported on the existing procedure, and omitted the data from the new, failed procedure. We did not collapse data across this manipulation.
  3. Measures: We conducted two “manipulation check” measures (after all the critical measures were taken). These were not reported because: (i) in hindsight these measures were redundant (performance on the critical measures provided sufficient validation of the manipulation), and (ii) any explanation would have been lengthy. Indeed, data from these measures have never been analyzed.
  4. Sample Size: We based our sample size on other samples used in closely related studies (several are cited in the paper), and our experience with the pilot studies for this task. We chose n=20 in advance, and complied with that as close as possible (some conditions were more difficult, and thus fewer participants reached criterion performance in these groups. This was unsurprising and is fully described in the article – including the performance of the excluded participants). Data collection stopped once we reached the target sample size.
McVay & KaneDrifting from slow to doh: Working memory capacity and mind wandering predict extreme reaction times and executive control errors.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: One measure was not closely enough related to our central research questions to report. As in a previous article (McVay & Kane 2009 JEPLMC) subjects here also completed a general retrospective self-report questionnaire about cognitive failures the Cognitive Failures Questionnaire - Memory and Attention Lapses (CFQ-MAL). We did not report the results from this measure here because the current article was already trying to accomplish too much both analytically and theoretically to include them; however the new findings closely replicated those of McVay & Kane (2009) with the CFQ-MAL score not correlating significantly with working memory capacity but correlating significantly positively with mind-wandering rates during the SART task and significantly negatively with performance in the SART task.
  4. Sample Size: We had aimed to test at least 100 subjects in each of our two between-subjects conditions. After running the study for one semester (Spring 2008) we were well below that target (with Ns in the 50-60 range in each group) and so we collected data throughout the entire subsequent semester (Fall 2008) which brought our samples well above the target. To be explicit we did not stop testing subjects until we had tested as many people as possible over two complete semesters.
Olds & WestermanCan fluency be interpreted as novelty? Retraining the interpretation of fluency in recognition memory
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Samples sizes were determined based on past experiments on this topic in our lab and whether the key variables were within-subjects or between-subjects, with one exception: Experiment 5 yielded ambiguous data when it was initially submitted. The reviewers asked us to collect more data, which we did (the results did not change--the results of that experiment are still somewhat unclear).
Pyc & RawsonWhy is test-restudy practice beneficial for memory? An evaluation of the mediator shift hypothesis.
  1. Exclusions: Data were excluded from analyses if participants failed to return for the second session (2 in Experiments 1a/1b and 5 in Experiment 2) or if they failed to follow task instructions (5 from Experiment 2).
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We decided ahead of time to collect data until we had 25 subjects per group.
Sprenger & DoughertyGenerating and evaluating options for decision making: The impact of sequentially presented evidence.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
Mar 2012Hampton, Aina, Andersson et al.The Rumsfeld effect: The unknown unknown
  1. Exclusions: Full Disclosure (only participants who did not return for the second session were excluded, necessarily as their data were missing)
  2. Conditions: One study in the series was not reported. It was underpowered, and the results did not achieve significance. Other studies related to those published have been conducted and await publication.
  3. Measures: Full Disclosure
  4. Sample Size: Sample size for experiments after the first was based on a power analysis from the first experiment in the series (there was no prior way to estimate an effect size for this, so a large effect size was assumed). In no case were the data analysed before data collection had been completed, so no stopping rule problems arise. Sample size varied as a function of availability of suitable participants, and variable return rate for the second session.
Ingram, Mickes, & WixtedRecollection can be weak and familiarity can be strong
  1. Exclusions: Full Disclosure
  2. Conditions: We did not report pilot data that were collected to test different confidence scales (but these were not failed manipulations).
  3. Measures: Full Disclosure
  4. Sample Size: We chose a minimum sample size (e.g., n = 20) and then our lab assistants brought us data to analyze when they reached or exceeded that minimum (e.g., if they reached 23 by the end of the day, our actual n was 23).
Schuck, Gaschler, Keisler et al.Position-item associations play a role in the acquisition of order knowledge in an implicit serial reaction time task
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: After a first round of reviews, we increased the sample size upon the reviewers suggestion.
Jan 2012Andrews & LoNot all skilled readers have cracked the code: Individual differences in masked form priming
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure (A second set of less well-matched items were included in the experiment but not reported in the analysis, as noted in the paper.)
  4. Sample Size: We decided ahead of time to collect data for approximately 100 participants and stopped recruitment when this total was reached.
Bowles, Harlow, Meeking et al.Discriminating famous from fictional names based on lifetime experience: Evidence in support of a signal detection model based on finite mixture distributions.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Data collection was terminated when it was clear that the effect we were interested in could be demonstrated consistently in most individual participants.
Herrera & MacizoSemantic processing in the production of numerals across notations
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
Klein, Robertson, Delton et al.Familiarity and personal experience as mediators of recall when planning for future contingencies
  1. Exclusions: Full Disclosure
  2. Conditions: Coming soon.
  3. Measures: Full Disclosure
  4. Sample Size: Decided based on subject pool to allot equal numbers across cells. Had sufficient power -- as indicated in paper.
Mutter, Atchley, & PlumleeAging and retrospective revaluation of causal learning
  1. Exclusions: Experiment 1:  One young participant was replaced - failed to follow instructions. Experiment 2a:  One older adult was replaced - missing data. Experiment 2b:  Two older adults were replaced - 1 did not finish the session, 1 was colorblind and thus could not perceive the colors of the stimuli. Experiment 3:  Three older adults were replaced - 1 due to experimenter error (run in wrong experiment), 1 due to failure to follow instructions, 1 due to medical condition that could affect results (stroke)
  2. Conditions: Full Disclosure
  3. Measures: We collect data on several demographic and individual difference measures (e.g., SES, MMSI) as part of the screening protocol for our young and older participants.  We report only those measures that are relevant to the research question and to characterizing the typical differences that are observed between young and older adults.
  4. Sample Size: The number of participants in our young and older groups for all three experiments was determined in advance of data collection based on (1) the number of participants used in the original experiment upon which our procedure was based (i.e., Larkin et al., 1998) and (2) the number of participants needed for counterbalancing over 9 list orders (3 per list).  We did not modify these requirements over the course of data collection for any of the three experiments.
IssueAuthorsArticle Title
Feb 2013Bonnefon, Hopfensitz, & De NeysThe modular nature of trustworthiness detection.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure (We reported all the measures and items that we administered -- but note that other researchers administered other measures to the same subject pool. We had access to some of these measures, but did not use them in our analyses.)
  4. Sample Size: We used the whole subject pool that was made available to us by the university.
Callan, Ferguson, & BindemannEye movements to audiovisual scenes reveal expectations of a just world.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: There was a paper-based questionnaire after the eye tracking part of the experiment (participants made judgments of deservingness and immanent justice across various scenarios). This was to pilot test materials for use in subsequent, unrelated experiments; these data were never linked to, or analysed with, the eye tracking data.
  4. Sample Size: We aimed to test 10 participants in each experimental list, as is standard in this sort of research (we had 4 lists to rotate the items around the four within-subjects conditions), but the participant sign-ups were such that we over-recruited by two. We had to run two more participants to balance the lists out again (resulting in 44 participants in total). Data analysis didn't commence until the lists were balanced at 44 participants.
Cowan & SaultsWhen does a good working memory counteract proactive interference? Surprising evidence from a probe recognition task.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We knew about how many participants this type of study would have and we aimed for as many as we could get within a practical period of time, until the needed number was surpassed.
Dshemuchadse, Scherbaum, & GoschkeHow decisions emerge: Action dynamics in intertemporal decision making.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We increased the sample size due to a reviewer's request. 
Ferrante, Girotto, Straga et al.Improving the past and the future: A temporal asymmetry in hypothetical thinking.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: In the first experiment, we included some additional measures but due to limits on the length of the paper we reported only the measures relevant to our research question. In the supplementary materials, we fully described the materials and procedure of both experiments so that interested researchers may contact us about measures which were not reported.
  4. Sample Size: The size of our samples was determined in advance and based on our previous studies (Girotto et al., 2007 in Psychological Science, Pighin et al. 2011 in Thinking and Reasoning).
Redden & GalakThe subjective sense of feeling satiated.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: For all studies, we  periodically checked on sample size as Ss completed each study. We stopped collecting data when that number was approximately 300 Ss for each study. The selection of 300 as the necessary sample size was approximated based on the sample sizes required in past studies of satiation. 
Tentori, Crupi, & RussoOn the determinants of the conjunction fallacy: Probability versus inductive confirmation.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We decided ahead of time a proper sample size and collected data until it was achieved.
Nov 2012Bijleveld, Custers & AartsAdaptive reward pursuit: How effort requirements affect unconscious reward responses and conscious reward decisions
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: For all experiments, we booked the lab for a fixed amount of time and we tried to run as many participants as possible during this period.
Cook, Dickinson, & HeyesContextual modulation of mirror and countermirror sensorimotor associations
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: The sample size was determined a priori based on i) the requirement to fully counter-balance a number of factors, and ii) previous effect sizes seen in similar experiments conducted within our lab over a number of years. 
Inzlicht & Al-KhindiERN and the placebo: A misattribution approach to studying the arousal properties of the error-related negativity
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We administered a number  of other measures unrelated to the research question, but related to other questions (e.g., scales related to mindfulness, scales related to self-determined motivation, scales related to construal level). We also administered standard demographic measures (age, race, handedness, etc.) that we did not report fully.
  4. Sample Size: We decided ahead of time to stop after we had collected around 20 subjects per cell, and this was what we did.
Lyons, Ansari, & BeilockSymbolic estrangement: Evidence against a strong association between numerical symbols and the quantities they represent
  1. Exclusions: Data from 1 participant in Experiment 1 was removed because that participant performed at chance. Removing this participant did not affect results but was deemed advisable due to the participant's uninterpretable accuracy performance (primary analyses were done on response-times). Note that this information was originally included but removed to meet journal space requirements (This information can now be found in the updated online supplemental document.).
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: The number of 'roughly 20 subjects' was initially chosen as it is in-keeping with many articles in cognitive science that rely on simple within-subjects contrasts on RTs (where rho is expected to be very high across conditions). Indeed, in Experiment 1, 21 subjects yielded adequate power (>.95), so sample sizes for subsequent Experiments (2-3) were yoked to this number (21). Significant effects were found in Experiment 2 but not Experiment 3. Note that a null result was predicted for Experiment 3, and we directly tested the relative size of effects in Experiments 2 and 3 via ANOVA interaction terms (which obtained significance - see page 639 in the article for details).
Mirman & GrazianoIndividual differences in the strength of taxonomic versus thematic relations
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Decided ahead of time to collect data from 15 participants in each of the 2 task versions based on previous experience with the paradigm.
Simmons & Massey Is optimism real
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
Zhang & EpleyExaggerated, mispredicted, and misplaced: When “it's the thought that counts” in gift exchanges
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We included several additional items to gain more information about the gifts, such as questions about the relationship between the giver and receiver (e.g., Response options for Experiment 1): Friends, romantic partner or spouse, family members, colleague, other), or whether the participant enjoyed the conversation at the beginning of study 3. We did not report these items because they do not bear directly on our hypotheses.
  4. Sample Size: All of our experiments were conducted at the Museum of Science and Industry in Chicago.  We targeted a specified number of participants in each experiment (20 per condition), but would run our experiment for the length of time we were scheduled for the space at the MSI.  Because the number of visitors at the Museum varies from one day to another, our sample sizes varied around that targeted number.  We did not describe this in the text.
Sep 2012Sewell & Lewandowsky Attention and working memory capacity: Insights from blocking, highlighting, and knowledge restructuring
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Because we wanted to analyze our data with structural equation modeling, we aimed to have at least 100 usable data sets (i.e., data from people who performed well above chance and were clearly following instructions). Testing was carried out until we exceeded this aspirational sample size.
Aug 2012Bub & Masson On the dynamics of action representations evoked by names of manipulable objects
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We used a sample size that we have found provides adequate power for effect sizes we expected to obtain in our experiments.  We did not make this point in the article.
Fenn & HambrickIndividual differences in working memory capacity predict sleep-dependent memory consolidation
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: At the start of the study, we set out to obtain a sample size of approximately 100 individuals per condition, based on previous research on individual differences.  We always over-recruit for these types of studies because each participant must complete two separate sessions and one group must remain awake between the sessions.  We therefore lose a large number of participants due to attrition and napping. Our final sample sizes were 111 and 114 per condition.
Lakens, Semin & ForoniBut for the bad, there would not be good: Grounding valence in brightness through shared relational structures
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Goal was 20 participants per within-subject condition. All studies have slightly higher sample sizes because participants were recruited through flyers, and lab regulations required the experiment to continue to accept participants untill 16:00 hours.
Slezak & SigmanDo not fear your opponent: Suboptimal changes of a prevention strategy when facing stronger opponents
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
May 2012Burns, Caruso, & BartelsPredicting premeditation: Future behavior is seen as more intentional than past behavior
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: In Studies 1 and 2 we administered additional items that were not reported because they did not relate to the main hypothesis. At the end of Study 3 we asked participants how believable they found the scenario to be. As we expected, there was no significant difference in believability between the past and future conditions (p=.83). We did not report this measure in the paper because of word limit constraints.
  4. Sample Size: Study 1: We decided ahead of time to administer the study to all students in an MBA course in decision making to which we had access. The results confirmed our hypotheses with respect to temporal perspective and outcome; however, there was a nonsignificant but suggestive temporal perspective X outcome interaction. To determine whether this interaction was reliable, we then conducted a second wave of data collection. We administered this second wave to all students in another MBA course in decision making (taught by a different instructor) the following quarter. The suggestive temporal perspective X outcome interaction did not emerge in this sample. Because the effect of the temporal perspective manipulation on intentionality ratings did not differ significantly across the two samples (p=.92), we combined results from the two waves in the paper. In both waves, all students agreed to complete the study. Study 2: We instructed research assistants to collect the survey from as many people as they could during specified hours (e.g., 11am and 5pm) on each of three consecutive days prior to and following tax day (April 15). Study 3: We instructed research assistants to collect 60 participants per condition and to stop after reaching that number.
Mason & BarThe effect of mental progression on mood
  1. Exclusions: We excluded one participant from Study 1. The reason was because the experimenter gave the wrong instructions.  More specifically, he or she gave instructions for one condition but ran the script for the second condition. I'm reasonably confident that we neither cleaned nor analyzed this participant's responses.  In other words, this person was excluded in advance of our analyses.  As far as I can tell, no participants were excluded from Study 2.  An earlier version of the manuscript included a third study that went unreported in the published paper. 
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Based on past experience with repeated measure studies, we intuitively judged this. I cant say for sure if we checked and realized we needed more participants or if the results were significant the first time we analyzed the data (It's very possible that the former happened with one or both studies.)  I do know that we collected more data to deal with a reviewer's request to rule out order effects. I believe this request came from someone who reviewed the piece for psych science and not JEPG. 
McVay & KaneWhy does working memory capacity predict variation in reading comprehension? On the influence of mind wandering and executive attention
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: The original proposal was to collect data from 200-300 participants.  We were able to meet that goal (actual sample size: 258) over the course of one semester and so we did not resume data collection the next semester.
Shaw & OlsonChildren discard a resource to avoid inequity
  1. Exclusions: Full Disclosure
  2. Conditions: We excluded an additional condition (ran with different participants) that was in a footnote that was a replication of our Equality Condition. This condition was not a failure but was instead ruling out an explanation of the Equality Condition--that children did not like throwing out 2 erasers. In this condition there were four erasers that were given out, 2 to Mark and 1 to Dan. Children were then asked what to do with the final eraser, give it to Dan or Throw it away. In this condition 100% of participants said to give the eraser to Dan. Based on other experiments in the paper this experiment was thought to be redundant since later studies demonstrated that the explanation based on children not wanting to throw two erasers away was not plausible. If anything, the inclusion of this condition would have help, not hindered, our case since it provided a replication of a control condition.
  3. Measures: Full Disclosure
  4. Sample Size: We decided ahead of time to use 20 participants and used 20 participants in all of our studies,with the exception of the 3- to 5-year-olds from experiment 1 which were run at a particularly busy day at the museum and so we ran an additional 4 participants. The pattern of results was the same (actually slightly more significant) if we stopped at 20 participants for the 3- to 5-year-olds.
TopolinskiThe sensorimotor contributions to implicit memory, familiarity, and recollection
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Because the present experiments are conceptual replications and generalizations of earlier papers, for most of the experiments I decided sample size ahead of time using my experience with the manipulation from those earlier published studies (i.e., minimum 40 per between condition had always been necessary to get the interaction significant). Also, because some of the experiments ran in longer experimental sessions involving other unrelated tasks (as is reported), practical considerations and required sample sizes for those other tasks also sometimes influenced sample size decisions. For Experiments 3 and 5, I collected additional data during the revision process due to concerns of statistical validity.
Feb 2012Jensen, Vangkilde, Frokjaer et al.Mindfulness training affects attention- Or is it attentional effort
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Full Disclosure
IssueAuthorsArticle Title
Jan 2013Cox & HasselmanThe case of Watson vs. James: effect-priming studies do not support ideomotor theory.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (In fact, we predicted null-results and because of this a previous version of the manuscript was rejected (at another journal). This prompted us to come up with the decisive last experiment. In a way we should be grateful for the rejection, our argument is much stronger now.)
  3. Measures: Full Disclosure
  4. Sample Size: We used the original sample size for the replication and G*power 3 to get an estimate for the sample size of further experiments.
Dec 2012Zwaan & PecherRevisiting Mental Simulation in Language Comprehension: Six Replication Attempts
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We collected some demographic information as part of a standard questionnaire that is not relevant to the hypotheses tested and is used to get a better sense of the subject population.
  4. Sample Size: Full Disclosure
IssueAuthorsArticle Title
Mar 2013LeBel & CampbellThe interactive role of implicit and explicit partner evaluations on ongoing affective and behavioral romantic realities.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure (N/A; no experimental conditions)
  3. Measures: Several other measures were also assessed of relationship constructs unrelated to our research question. For the DV in our second model, several other (daily) positive relationship behaviors were assessed, but these went unreported because we didn't analyze them or because more complex patterns emerged with those behaviors.
  4. Sample Size: Recruited as many heterosexual couples as possible via advertisements placed in campus newspapers.
IssueAuthorsArticle Title
Jan 2013Wijnants, Cox, Hasselman et al.Does sample rate introduce an artifact in spectral analysis of continuous processes?
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: We used unpublished data whose measurement was disclosed in a previous paper. We refer to this previous paper, but did not spell out the context exactly.
  4. Sample Size: We used unpublished data whose sample size was disclosed in a previous paper. We refer to this previous paper, but did not spell out the context exactly.
IssueAuthorsArticle Title
Apr 2010LeBelAttitude accessibility as a moderator of implicit and explicit self-esteem correspondence
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Other measures assessed (e.g., causal attribution measures), but went unreported given they were unrelated to the research question at hand.
  4. Sample Size: Data collection stopped after reaching a predetermined sample size in both samples (which were eventually combined due to editorial request). No analyses were performed (in either samples) before data collection was completed.
IssueAuthorsArticle Title
Jul 2012Wijnants, Hasselman, Cox et al.An interaction-dominant perspective on reading fluency and dyslexia
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: We based our sample size on literature of 1/f noise in ADHD. For 30 participants 550 naming latencies were measured.
IssueAuthorsArticle Title
Sep 2012Giner-Sorolla, Caswell, Bosson et al. Emotions in sexual morality: Testing the separate elicitors of anger and disgust. 
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Disclosed all measures for Study 1. For Study 2, ten emotion measures not relevant to anger and disgust (measuring positive affect, sympathy, fear and contempt) were included as filler among the anger and disgust items but not analyzed. Among the antecedents, four measures were included but not analyzed because of interpretive problems. There were two items intended to measure character attributions (“To what extent do you think James behaved the way he did because of something about his personality?”; “To what extent do you think James behaved the way he did because of something about who he is as a person?”) These were ambiguous as to whether they measured the more theoretically relevant construct of bad character (not all acts were judged as bad), so we excluded them from the measure, relying on the two variables that explicitly referred to “bad person” and “weak/flawed character.” There was an item assessing how much James had harmed himself, which was not relevant to the moral emotions under scrutiny (cf. Gutierrez & Giner-Sorolla, 2007). We also had a single exploratory item, “To what extent does James’ behavior violate people’s values?” This proved to show general high correlations with all the other measures and was not further explored.
  4. Sample Size: Study 1: Determined arbitrarily as a function of recruitment into the study; stopping was determined prior to looking at the results. Study 2: determined arbitrarily as a function of completion in a single class session; stopping was determined prior to looking at the results.
IssueAuthorsArticle Title
In pressRatliff, Swinkels, Klerx et al. Does one bad apple(juice) spoil the bunch? Implicit attitudes toward one product transfer to other products by the same brand.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: In both studies we aimed to get 300 participants (an arbitrary number decided a priori). Because data collection at Project Implicit is unpredictable, and because it takes 24 hours after stopping a study for data collection to actually end, we ended up a bit below (Study 1) and above (Study 2) this target N.
IssueAuthorsArticle Title
Dec 2011Ratliff & NosekNegativity and outgroup biases in attitude formation and transfer. 
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Study 2 was part of the first author’s dissertation. There were several other measures collected as part of the dissertation project (prejudice, internal and external motivation to avoid prejudice) that were not reported in the final version of the published manuscript.
  4. Sample Size: In both studies we aimed to get 450 participants (an arbitrary number decided a priori). Because data collection at Project Implicit is unpredictable, and because it takes 24 hours after stopping a study for data collection to actually end, we ended up a bit below (Study 1) and above (Study 2) this target N.
Apr 2011Peters & GawronskiAre we puppets on a string? Comparing the impact of contingency and validity on implicit and explicit evaluations.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: For Experiments 1 and 2, data collection stopped after reaching a predetermined sample size. For Experiment 3, N =120 subjects were run in the first round (sample size pre-determined by power analysis). 14 subjects had to be dropped due to a programming error and 15 were dropped due to chance responding (as noted in MS). Following initial analysis and marginal results, the programming bug was corrected and an additional 98 subjects were run (limited by subject availability), with combined data analysis (N = 218) occurring only after all 98 had been run.
IssueAuthorsArticle Title
Mar 2011Peters & GawronskiMutual influences between the implicit and explicit self-concepts: The role of memory activation and motivated reasoning. 
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Full Disclosure
  4. Sample Size: Experiment 1: First sample stopped at pre-determined size (N = 118), limited by subject availability. This was the sample submitted in the MS; at editor's request, an additional 20 data points were collected for a revision. Experiment 2: First sample stopped at pre-determined size (N = 81), limited by subject availability; following analysis, another sample of N = 47 was collected before analyses were repeated. This was the sample submitted in the MS; at editor's request, an additional 20 data points were collected for a revision.
Sep 2010Ratliff & NosekCreating distinct implicit and explicit attitudes with an illusory correlation paradigm.
  1. Exclusions: Full Disclosure
  2. Conditions: Full Disclosure
  3. Measures: Study 1 included an additional measure of explicit illusory correlation (frequency estimates) that was not reported in the final version of the published manuscript. We also included an additional measure of implicit attitudes (the Sorting Paired Features Task; SPF) that was being pilot tested at the time.
  4. Sample Size: For both studies we collected as much data as we could during a semester either in the lab (Study 1) or on Project Implicit (Study 2).

Note: To protect the anonymity of non-respondents, only 50% of the authors in each issue have been randomly chosen to participate.