When.com Web Search

Search results

  1. Results From The WOW.Com Content Network
  2. Reinforcement - Wikipedia

    en.wikipedia.org/wiki/Reinforcement

    In behavioral psychology, reinforcement refers to consequences that increase the likelihood of an organism's future behavior, typically in the presence of a particular antecedent stimulus. [1] For example, a rat can be trained to push a lever to receive food whenever a light is turned on. In this example, the light is the antecedent stimulus ...

  3. Reinforcement learning - Wikipedia

    en.wikipedia.org/wiki/Reinforcement_learning

    Machine learningand data mining. Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent ought to take actions in a dynamic environment in order to maximize the cumulative reward. Reinforcement learning is one of three basic machine learning paradigms, alongside ...

  4. Operant conditioning - Wikipedia

    en.wikipedia.org/wiki/Operant_conditioning

    Operant conditioning. Operant conditioning, also called instrumental conditioning, is a learning process where voluntary behaviors are modified by association with the addition (or removal) of reward or aversive stimuli. The frequency or duration of the behavior may increase through reinforcement or decrease through punishment or extinction.

  5. B. F. Skinner - Wikipedia

    en.wikipedia.org/wiki/B._F._Skinner

    Institutions. University of Minnesota. Indiana University. Harvard University. Signature. Burrhus Frederic Skinner (March 20, 1904 – August 18, 1990) was an American psychologist, behaviorist, inventor, and social philosopher. [2][3][4][5] He was the Edgar Pierce Professor of Psychology at Harvard University from 1958 until his retirement in ...

  6. Q-learning - Wikipedia

    en.wikipedia.org/wiki/Q-learning

    Q-learning is a model-free reinforcement learning algorithm to learn the value of an action in a particular state. It does not require a model of the environment (hence "model-free"), and it can handle problems with stochastic transitions and rewards without requiring adaptations. [1]

  7. Rescorla–Wagner model - Wikipedia

    en.wikipedia.org/wiki/Rescorla–Wagner_model

    The Rescorla–Wagner model (" R-W ") is a model of classical conditioning, in which learning is conceptualized in terms of associations between conditioned (CS) and unconditioned (US) stimuli. A strong CS-US association means that the CS signals predict the US. One might say that before conditioning, the subject is surprised by the US, but ...

  8. Reinforced concrete - Wikipedia

    en.wikipedia.org/wiki/Reinforced_concrete

    Reinforcing schemes are generally designed to resist tensile stresses in particular regions of the concrete that might cause unacceptable cracking and/or structural failure. Modern reinforced concrete can contain varied reinforcing materials made of steel, polymers or alternate composite material in conjunction with rebar or not.

  9. Matching law - Wikipedia

    en.wikipedia.org/wiki/Matching_law

    Matching law. In operant conditioning, the matching law is a quantitative relationship that holds between the relative rates of response and the relative rates of reinforcement in concurrent schedules of reinforcement. For example, if two response alternatives A and B are offered to an organism, the ratio of response rates to A and B equals the ...