Editors' Picks
Great books about your topic, Learning, selected by Encarta editors
Related Items
Encarta Search
Search Encarta about Learning

Advertisement

Windows Live® Search Results

See all search results in
Windows Live® Search Results
Also on Encarta
Page 4 of 9

Learning

Encyclopedia Article
Find | Print | E-mail | Blog It
Multimedia
Classical ConditioningClassical Conditioning
Article Outline
C

Principles of Operant Conditioning

In a career spanning more than 60 years, Skinner identified a number of basic principles of operant conditioning that explain how people learn new behaviors or change existing behaviors. The main principles are reinforcement, punishment, shaping, extinction, discrimination, and generalization.

C 1

Reinforcement

In operant conditioning, reinforcement refers to any process that strengthens a particular behavior—that is, increases the chances that the behavior will occur again. There are two general categories of reinforcement, positive and negative. The experiments of Thorndike and Skinner illustrate positive reinforcement, a method of strengthening behavior by following it with a pleasant stimulus. Positive reinforcement is a powerful method for controlling the behavior of both animals and people. For people, positive reinforcers include basic items such as food, drink, sex, and physical comfort. Other positive reinforcers include material possessions, money, friendship, love, praise, attention, and success in one’s career.

Depending on the circumstances, positive reinforcement can strengthen either desirable or undesirable behaviors. Children may work hard at home or at school because of the praise they receive from parents and teachers for good performance. However, they may also disrupt a class, try dangerous stunts, or start smoking because these behaviors lead to attention and approval from their peers. One of the most common reinforcers of human behavior is money. Most adults spend many hours each week working at their jobs because of the paychecks they receive in return. For certain individuals, money can also reinforce undesirable behaviors, such as burglary, selling illegal drugs, and cheating on one’s taxes.

Negative reinforcement is a method of strengthening a behavior by following it with the removal or omission of an unpleasant stimulus. There are two types of negative reinforcement: escape and avoidance. In escape, performing a particular behavior leads to the removal of an unpleasant stimulus. For example, if a person with a headache tries a new pain reliever and the headache quickly disappears, this person will probably use the medication again the next time a headache occurs. In avoidance, people perform a behavior to avoid unpleasant consequences. For example, drivers may take side streets to avoid congested intersections, citizens may pay their taxes to avoid fines and penalties, and students may do their homework to avoid detention.



C 2

Reinforcement Schedules

A reinforcement schedule is a rule that specifies the timing and frequency of reinforcers. In his early experiments on operant conditioning, Skinner rewarded animals with food every time they made the desired response—a schedule known as continuous reinforcement. Skinner soon tried rewarding only some instances of the desired response and not others—a schedule known as partial reinforcement. To his surprise, he found that animals showed entirely different behavior patterns.

Skinner and other psychologists found that partial reinforcement schedules are often more effective at strengthening behavior than continuous reinforcement schedules, for two reasons. First, they usually produce more responding, at a faster rate. Second, a behavior learned through a partial reinforcement schedule has greater resistance to extinction—if the rewards for the behavior are discontinued, the behavior will persist for a longer period of time before stopping. One reason extinction is slower after partial reinforcement is that the learner has become accustomed to making responses without receiving a reinforcer each time. There are four main types of partial reinforcement schedules: fixed-ratio, variable-ratio, fixed-interval, and variable-interval. Each produces a distinctly different pattern of behavior.

On a fixed-ratio schedule, individuals receive a reinforcer each time they make a fixed number of responses. For example, a factory worker may earn a certain amount of money for every 100 items assembled. This type of schedule usually produces a stop-and-go pattern of responding: The individual works steadily until receiving one reinforcer, then takes a break, then works steadily until receiving another reinforcer, and so on.

On a variable-ratio schedule, individuals must also make a number of responses before receiving a reinforcer, but the number is variable and unpredictable. Slot machines, roulette wheels, and other forms of gambling are examples of variable-ratio schedules. Behaviors reinforced on these schedules tend to occur at a rapid, steady rate, with few pauses. Thus, many people will drop coins into a slot machine over and over again on the chance of winning the jackpot, which serves as the reinforcer.

On a fixed-interval schedule, individuals receive reinforcement for their response only after a fixed amount of time elapses. For example, in a laboratory experiment with a fixed-interval one-minute schedule, at least one minute must elapse between the deliveries of the reinforcer. Any responses that occur before one minute has passed have no effect. On these schedules, animals usually do not respond at the beginning of the interval, but they respond faster and faster as the time for reinforcement approaches. Fixed-interval schedules rarely occur outside the laboratory, but one close approximation is the clock-watching behavior of students during a class. Students watch the clock only occasionally at the start of a class period, but they watch more and more as the end of the period gets nearer.

Variable-interval schedules also require the passage of time before providing reinforcement, but the amount of time is variable and unpredictable. Behavior on these schedules tends to be steady, but slower than on ratio schedules. For example, a person trying to call someone whose phone line is busy may redial every few minutes until the call gets through.

C 3

Punishment

Whereas reinforcement strengthens behavior, punishment weakens it, reducing the chances that the behavior will occur again. As with reinforcement, there are two kinds of punishment, positive and negative. Positive punishment involves reducing a behavior by delivering an unpleasant stimulus if the behavior occurs. Parents use positive punishment when they spank, scold, or shout at children for bad behavior. Societies use positive punishment when they fine or imprison people who break the law. Negative punishment, also called omission, involves reducing a behavior by removing a pleasant stimulus if the behavior occurs. Parents’ tactics of grounding teenagers or taking away various privileges because of bad behavior are examples of negative punishment.

Considerable controversy exists about whether punishment is an effective way of reducing or eliminating unwanted behaviors. Careful laboratory experiments have shown that, when used properly, punishment can be a powerful and effective method for reducing behavior. Nevertheless, it has several disadvantages. When people are severely punished, they may become angry, aggressive, or have other negative emotional reactions. They may try to hide the evidence of their misbehavior or escape from the situation, as when a punished child runs away from home. In addition, punishment may eliminate desirable behaviors along with undesirable ones. For example, a child who is scolded for making an error in the classroom may not raise his or her hand again. For these and other reasons, many psychologists recommend that punishment be used to control behavior only when there is no realistic alternative.

C 4

Shaping

Shaping is a reinforcement technique that is used to teach animals or people behaviors that they have never performed before. In this method, the teacher begins by reinforcing a response the learner can perform easily, and then gradually requires more and more difficult responses. For example, to teach a rat to press a lever that is over its head, the trainer can first reward any upward head movement, then an upward movement of at least one inch, then two inches, and so on, until the rat reaches the lever. Psychologists have used shaping to teach children with severe mental retardation to speak by first rewarding any sounds they make, and then gradually requiring sounds that more and more closely resemble the words of the teacher. Animal trainers at circuses and theme parks use shaping to teach elephants to stand on one leg, tigers to balance on a ball, dogs to do backward flips, and killer whales and dolphins to jump through hoops.

Prev.
| | | | | | | |
Next
Find
Print
E-mail
Blog It


More from Encarta


© 2008 Microsoft