Jump to content

Two-alternative forced choice

From Wikipedia, the free encyclopedia
(Redirected from Drift-diffusion model)

Two-alternative forced choice (2AFC) is a method for measuring the sensitivity of a person or animal to some particular sensory input, stimulus, through that observer's pattern of choices and response times to two versions of the sensory input. For example, to determine a person's sensitivity to dim light, the observer would be presented with a series of trials in which a dim light was randomly either in the top or bottom of the display. After each trial, the observer responds "top" or "bottom". The observer is not allowed to say "I do not know", or "I am not sure", or "I did not see anything". In that sense the observer's choice is forced between the two alternatives.

Both options can be presented concurrently (as in the above example) or sequentially in two intervals (also known as two-interval forced choice, 2IFC). For example, to determine sensitivity to a dim light in a two-interval forced choice procedure, an observer could be presented with series of trials comprising two sub-trials (intervals) in which the dim light is presented randomly in the first or the second interval. After each trial, the observer responds only "first" or "second".

The term 2AFC is sometimes used to describe a task in which an observer is presented with a single stimulus and must choose between one of two alternatives. For example in a lexical decision task a participant observes a string of characters and must respond whether the string is a "word" or "non-word". Another example is the random dot kinetogram task, in which a participant must decide whether a group of moving dots are predominately moving "left" or "right". The results of these tasks, sometimes called yes-no tasks, are much more likely to be affected by various response biases than 2AFC tasks. For example, with extremely dim lights, a person might respond, completely truthfully, "no" (i.e., "I did not see any light") on every trial, whereas the results of a 2AFC task will show the person can reliably determine the location (top or bottom) of the same, extremely dim light.

2AFC is a method of psychophysics developed by Gustav Theodor Fechner.[1]

Behavioural experiments

[edit]

There are various manipulations in the design of the task, engineered to test specific behavioral dynamics of choice. In one well known experiment of attention that examines the attentional shift, the Posner Cueing Task uses a 2AFC design to present two stimuli representing two given locations.[2] In this design there is an arrow that cues which stimulus (location) to attend to. The person then has to make a response between the two stimuli (locations) when prompted. In animals, the 2AFC task has been used to test reinforcement probability learning, for example such as choices in pigeons after reinforcement of trials.[3] A 2AFC task has also been designed to test decision making and the interaction of reward and probability learning in monkeys.[4]

Example of a random dot kinetogram as used in a 2AFC task.

Monkeys were trained to look at a center stimulus and were then presented with two salient stimuli side by side. A response can then be made in the form of a saccade to the left or to the right stimulus. A juice reward is then administered after each response. The amount of juice reward is then varied to modulate choice.

In a different application, the 2AFC is designed to test discrimination of motion perception. The random dot motion coherence task, introduces a random dot kinetogram, with a percentage of net coherent motion distributed across the random dots.[5][6] The percentage of dots moving together in a given direction determines the coherence of motion towards the direction. In most experiments, the participant must make a choice response between two directions of motion (e.g. up or down), usually indicated by a motor response such as a saccade or pressing a button.

Biases in decision making

[edit]

It is possible to introduce biases in decision making in the 2AFC task. For example, if one stimulus occurs with more frequency than the other, then the frequency of exposure to the stimuli may influence the participant's beliefs about the probability of the occurrence of the alternatives.[4][7] Introducing biases in the 2AFC task is used to modulate decision making and examine the underlying processes.

Models of decision making

[edit]

The 2AFC task has yielded consistent behavioral results on decision-making, which lead to the development of theoretical and computational models of the dynamics and results of decision-making.[8][9][10][11][12][13][14][15][16][17]

Normal distribution model

[edit]
The optimal strategy in a 2AFC task for univariate normal stimuli from categories and is to classify between the two joint bivariate normal distributions and .[18] The probability of the correct choice is 0.74 here.

Suppose the two stimuli and in the 2AFC task are random variables from two different categories and , and the task is to decide which was which. A common model is to assume that the stimuli came from normal distributions and . Under this normal model, the optimal decision strategy (of the ideal observer) is to decide which of two bivariate normal distributions is more likely to produce the tuple : the joint distributions of and , or of and .[18]

The probability of error with this ideal decision strategy is given by the generalized chi-square distribution: , where

This model can also extend to the cases when each of the two stimuli is itself a multivariate normal vector, and also to the situations when the two categories have different prior probabilities, or the decisions are biased due to different values attached to the possible outcomes.[18]

Drift-diffusion model

[edit]

There are typically three assumptions made by computational models using the 2AFC:

i) evidence favoring each alternative is integrated over time; ii) the process is subject to random fluctuations; and iii) the decision is made when sufficient evidence has accumulated favoring one alternative over the other.

— Bogacz et al., The Physics of Optimal Decision Making[7]

It is typically assumed that the difference in evidence favoring each alternative is the quantity tracked over time and that which ultimately informs the decision; however, evidence for different alternatives could be tracked separately.[7]

Example of six evidence accumulation sequences from an unbiased (100% noise) source. The dotted lines indicate the thresholds for decision making for each of the two alternatives.

The drift-diffusion model (DDM) is a well defined[19] model, that is proposed to implement an optimal decision policy for 2AFC.[20] It is the continuous analog of a random walk model.[7] The DDM assumes that in a 2AFC task, the subject is accumulating evidence for one or other of the alternatives at each time step, and integrating that evidence until a decision threshold is reached. As the sensory input which constitutes the evidence is noisy, the accumulation to the threshold is stochastic rather than deterministic – this gives rise to the directed random-walk-like behavior. The DDM has been shown to describe accuracy and reaction times in human data for 2AFC tasks.[13][19]

Formal model

[edit]
Example of ten evidence accumulation sequences for the DDM, where the true result is assigned to the upper threshold. Due to the addition of noise, two sequences have produced an inaccurate decision.

The accumulation of evidence in the DDM is governed according to the following formula:

[7]

At time zero, the evidence accumulated, x, is set equal to zero. At each time step, some evidence, A, is accumulated for one of the two possibilities in the 2AFC. A is positive if the correct response is represented by the upper threshold, and negative if the lower. In addition, a noise term, cdW, is added to represent noise in input. On average, the noise will integrate to zero.[7] The extended DDM[13] allows for selection of and the starting value of from separate distributions – this provides a better fit to experimental data for both accuracy and reaction times.[21][22]

Other models

[edit]

Ornstein–Uhlenbeck model

[edit]

The Ornstein–Uhlenbeck model[14] extends the DDM by adding another term, , to the accumulation that is dependent on the current accumulation of evidence – this has the net effect of increasing the rate of accumulation towards the initially preferred option.

[7]

Race model

[edit]

In the race model,[11][12][23] evidence for each alternative is accumulated separately, and a decision made either when one of the accumulators reaches a predetermined threshold, or when a decision is forced and then the decision associated with the accumulator with the highest evidence is chosen. This can be represented formally by:

[7]

The race model is not mathematically reducible to the DDM,[7] and hence cannot be used to implement an optimal decision procedure.

Mutual inhibition model

[edit]

The mutual inhibition model[16] also uses two accumulators to model the accumulation of evidence, as with the race model. In this model the two accumulators have an inhibitory effect on each other, so as evidence is accumulated in one, it dampens the accumulation of evidence in the other. In addition, leaky accumulators are used, so that over time evidence accumulated decays – this helps to prevent runaway accumulation towards one alternative based on a short run of evidence in one direction. Formally, this can be shown as:

[7]

Where is a shared decay rate of the accumulators, and is the rate of mutual inhibition.

Feedforward inhibition model

[edit]

The feedforward inhibition model[24] is similar to the mutual inhibition model, but instead of being inhibited by the current value of the other accumulator, each accumulator is inhibited by a fraction of the input to the other. It can be formally stated thus:

[7]

Where is the fraction of accumulator input that inhibits the alternate accumulator.

Pooled inhibition model

[edit]

Wang[25] suggested the pooled inhibition model, where a third, decaying accumulator is driven by accumulation in both of the accumulators used for decision making, and in addition to the decay used in the mutual inhibition model, each of the decision driving accumulators self-reinforce based on their current value. It can be formally stated thus:

[7]

The third accumulator has an independent decay coefficient, , and increases based on the current values of the other two accumulators, at a rate modulated by .

Neural correlates of decision making

[edit]

Brain areas

[edit]

In the parietal lobe, lateral intraparietal cortex (LIP) neuron firing rate in monkeys predicted the choice response of direction of motion suggesting this area is involved in decision making in the 2AFC.[4][24][26]

Neural data recorded from LIP neurons in rhesus monkeys supports the DDM, as firing rates for the direction selective neuronal populations sensitive to the two directions used in the 2AFC task increase firing rates at stimulus onset, and average activity in the neuronal populations is biased in the direction of the correct response.[24][27][28][29] In addition, it appears that a fixed threshold of neuronal spiking rate is used as the decision boundary for each 2AFC task.[30]

See also

[edit]

References

[edit]
  1. ^ Fechner, Gustav Theodor (1889). Elemente der Psychophysik (2 Volumes) (2nd ed.). Leipzig: Breitkopf & Härtel. Vol 2.
  2. ^ Posner, M I (February 1980). "Orienting of attention" (PDF). Quarterly Journal of Experimental Psychology. 32 (1): 3–25. doi:10.1080/00335558008248231. ISSN 0033-555X. PMID 7367577. S2CID 2842391. Retrieved 2012-06-12.
  3. ^ Shimp, Charles P. (July 1966). "Probabilistically reinforced choice behavior in pigeons". Journal of the Experimental Analysis of Behavior. 9 (4): 443–455. doi:10.1901/jeab.1966.9-443. ISSN 0022-5002. PMC 1338246. PMID 5961513.
  4. ^ a b c Platt, Michael L.; Paul W. Glimcher (1999-07-15). "Neural correlates of decision variables in parietal cortex". Nature. 400 (6741): 233–238. Bibcode:1999Natur.400..233P. doi:10.1038/22268. ISSN 0028-0836. PMID 10421364. S2CID 4389701.
  5. ^ Britten, Kenneth H.; Michael N. Shadlen, William T. Newsome, J. Anthony Movshon (1993). "Responses of neurons in macaque MT to stochastic motion signals". Visual Neuroscience. 10 (6): 1157–1169. doi:10.1017/S0952523800010269. PMID 8257671. S2CID 38024406.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  6. ^ Gold, Joshua I.; Michael N. Shadlen (2000-03-23). "Representation of a perceptual decision in developing oculomotor commands". Nature. 404 (6776): 390–394. Bibcode:2000Natur.404..390G. doi:10.1038/35006062. ISSN 0028-0836. PMID 10746726. S2CID 4410921.
  7. ^ a b c d e f g h i j k l Bogacz, Rafal; Eric Brown, Jeff Moehlis, Philip Holmes, Jonathan D. Cohen (October 2006). "The Physics of Optimal Decision Making: A Formal Analysis of Models of Performance in Two-Alternative Forced-Choice Tasks". Psychological Review. 113 (4): 700–765. CiteSeerX 10.1.1.212.9187. doi:10.1037/0033-295x.113.4.700. ISSN 0033-295X. PMID 17014301. Retrieved 2012-06-09.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  8. ^ Stone, M. (1960). "Models for choice-reaction time". Psychometrika. 25 (3): 251–260. doi:10.1007/bf02289729. S2CID 121644881.
  9. ^ Link, S. W.; R. A. Heath (1975). "A sequential theory of psychological discrimination". Psychometrika. 40 (1): 77–105. doi:10.1007/bf02291481. S2CID 49042143.
  10. ^ Link, S. W (1975). "The relative judgment theory of two choice response time". Journal of Mathematical Psychology. 12 (1): 114–135. doi:10.1016/0022-2496(75)90053-x.
  11. ^ a b Pike, A. R. (1966). "Stochastic Models of Choice Behaviour: Response Probabilities and Latencies of Finite Markov Chain Systems1". British Journal of Mathematical and Statistical Psychology. 19 (1): 15–32. doi:10.1111/j.2044-8317.1966.tb00351.x. PMID 5939142.
  12. ^ a b Vickers, D. (1970). "Evidence for an accumulator model of psychophysical discrimination". Ergonomics. 13 (1): 37–58. doi:10.1080/00140137008931117. PMID 5416868.
  13. ^ a b c Ratcliff, R. (1978). "A theory of memory retrieval". Psychological Review. 85 (2): 59–108. doi:10.1037/0033-295x.85.2.59. S2CID 1166147.
  14. ^ a b Busemeyer, J. R; J. T Townsend (1993). "Decision field theory: a dynamic-cognitive approach to decision making in an uncertain environment". Psychological Review. 100 (3): 432–459. doi:10.1037/0033-295x.100.3.432. PMID 8356185. S2CID 15604757.
  15. ^ Ratcliff, R.; T. Van Zandt, G. McKoon (1999). "Connectionist and diffusion models of reaction time". Psychological Review. 106 (2): 261–300. doi:10.1037/0033-295x.106.2.261. PMID 10378014.
  16. ^ a b Usher, M.; J. L McClelland (2001). "The time course of perceptual choice: the leaky, competing accumulator model". Psychological Review. 108 (3): 550–592. doi:10.1037/0033-295x.108.3.550. PMID 11488378.
  17. ^ Ratcliff, R.; P. L Smith (2004). "A comparison of sequential sampling models for two-choice reaction time". Psychological Review. 111 (2): 333–367. doi:10.1037/0033-295x.111.2.333. PMC 1440925. PMID 15065913.
  18. ^ a b c Das, Abhranil; Geisler, Wilson (2021). "A method to integrate and classify normal distributions". Journal of Vision. 21 (10): 1. arXiv:2012.14331. doi:10.1167/jov.21.10.1. PMC 8419883. PMID 34468706. S2CID 229934512.
  19. ^ a b Smith, P. L (2000). "Stochastic dynamic models of response time and accuracy: A foundational primer". Journal of Mathematical Psychology. 44 (3): 408–463. doi:10.1006/jmps.1999.1260. PMID 10973778.
  20. ^ Laming, Donald Richard John (1968). Information theory of choice-reaction times. Academic P.
  21. ^ Ratcliff, R.; J. N Rouder (1998). "Modeling response times for two-choice decisions". Psychological Science. 9 (5): 347–356. doi:10.1111/1467-9280.00067. S2CID 1045352.
  22. ^ Ratcliff, R.; J. N Rouder (2000). "A diffusion model account of masking in two-choice letter identification". Journal of Experimental Psychology: Human Perception and Performance. 26 (1): 127–140. doi:10.1037/0096-1523.26.1.127. PMID 10696609.
  23. ^ LaBerge, D. (1962). "A recruitment theory of simple behavior". Psychometrika. 27 (4): 375–396. doi:10.1007/bf02289645. S2CID 120502141.
  24. ^ a b c Shadlen, M. N.; W. T. Newsome (1996-01-23). "Motion Perception: Seeing and Deciding". Proceedings of the National Academy of Sciences. 93 (2): 628–633. Bibcode:1996PNAS...93..628S. doi:10.1073/pnas.93.2.628. ISSN 1091-6490. PMC 40102. PMID 8570606.
  25. ^ Wang, X. J (2002). "Probabilistic decision making by slow reverberation in cortical circuits". Neuron. 36 (5): 955–968. doi:10.1016/s0896-6273(02)01092-9. PMID 12467598.
  26. ^ Shadlen, Michael N.; William T. Newsome (2001-10-01). "Neural Basis of a Perceptual Decision in the Parietal Cortex (Area LIP) of the Rhesus Monkey". Journal of Neurophysiology. 86 (4): 1916–1936. doi:10.1152/jn.2001.86.4.1916. ISSN 1522-1598. PMID 11600651. S2CID 272332.
  27. ^ Hanes, D. P; J. D Schall (1996). "Neural control of voluntary movement initiation". Science. 274 (5286): 427–430. Bibcode:1996Sci...274..427H. CiteSeerX 10.1.1.408.5678. doi:10.1126/science.274.5286.427. PMID 8832893. S2CID 11202961.
  28. ^ Schall, J. D; K. G Thompson (1999). "Neural selection and control of visually guided eye movements". Annual Review of Neuroscience. 22 (1): 241–259. doi:10.1146/annurev.neuro.22.1.241. PMID 10202539. S2CID 1851931.
  29. ^ Gold, J. I; M. N Shadlen (2002). "Banburismus and the brain: decoding the relationship between sensory stimuli, decisions, and reward". Neuron. 36 (2): 299–308. doi:10.1016/s0896-6273(02)00971-6. PMID 12383783.
  30. ^ Roitman, J. D; M. N Shadlen (2002). "Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task". The Journal of Neuroscience. 22 (21): 9475–9489. doi:10.1523/JNEUROSCI.22-21-09475.2002. PMC 6758024. PMID 12417672.