Reinforcement
Adapted from Wikipedia · Discoverer experience
In behavioral psychology, reinforcement means giving something good after a behavior so the person or animal is more likely to do that behavior again. For example, if a rat pushes a lever when a light turns on and gets food, it will keep pushing the lever when it sees the light. The light is what starts the behavior, pushing the lever is the behavior, and the food makes the rat want to do it again. In the same way, when a student answers a question in class and gets praise, they are more likely to answer questions next time.
Punishment is the opposite of reinforcement. It means something happens after a behavior that makes the person or animal less likely to do that behavior again. Punishment does not always mean hurting someone; even just hearing someone say they are unhappy can act as punishment.
There are two kinds of reinforcement. One kind is called positive reinforcement, where you give something nice to make a behavior happen more often. The other kind is called negative reinforcement, where you take away something bad to make a behavior happen more often. For example, taking medicine to make a headache go away is a way of making the behavior of taking medicine more likely because it removes something unpleasant.
Reinforcement is very important in teaching and changing behavior. It is used in many places, such as helping children learn, guiding athletes, supporting people in therapy, and even in business management.
Terminology
In behavioral science, words like positive and negative describe the type of action an adult takes, not how the child feels about it. A positive action means adding something—like giving a treat or praise. A negative action means taking something away—like stopping a loud noise.
The word reinforcement is used only for rewards that make a behavior more likely. Taking away something pleasant or adding something unpleasant is called punishment.
So, positive reinforcement means adding something nice, like giving a treat. Positive punishment adds something unpleasant. Negative reinforcement means stopping something unpleasant, like turning off a loud noise when a dog behaves well. Negative punishment means taking away something nice.
Overview
B. F. Skinner was an important researcher who helped us understand how rewards and consequences affect behavior. He believed that something is only a reward if it makes a person more likely to repeat the behavior that led up to it. For example, if a child asks for a cookie and gets one, and then asks for cookies more often, the cookie was a reward for asking.
In simple terms, a reward works when it makes a behavior happen more often in the future. This idea is important in teaching, helping people learn new skills, and understanding how habits form.
History
Laboratory research on reinforcement began with the work of Edward Thorndike, who is famous for his experiments with cats escaping from puzzle boxes. Later, B.F. Skinner published important work on this topic in his book The Behavior of Organisms in 1938. Skinner believed that giving rewards, known as positive reinforcement, is better than punishment for changing behavior. He said that rewards lead to lasting changes in behavior, while punishment only works for a short time and can cause other problems.
Many other researchers have built on this work and sometimes disagreed with Skinner's ideas. For example, some said that punishment can also change behavior effectively. Today, scientists still study how rewards and punishments affect learning because these ideas are important for understanding how people and animals learn new things.
Operant conditioning
Main article: Operant conditioning
The term operant conditioning was introduced by Skinner to describe situations where an organism can act on its environment. In this type of learning, the person or animal waits for the right moment to act, and then something good happens as a result.
Reinforcement is a key idea in operant conditioning. It means that when a behavior leads to a good result, the behavior is more likely to happen again. For example, if reading a book is fun, a person will want to read more books to have more fun. This is called positive reinforcement—it adds something pleasant to encourage the behavior.
Another type is negative reinforcement, which happens when a behavior stops something unpleasant. For example, if someone feels sad and reads a book to feel better, the book reading behavior is reinforced because it removed the sadness.
When a behavior is ignored and gets no reward, it often stops over time. This is called extinction. For example, if a child reads books because they are fun, and the parents ignore this, the child will still keep reading because the fun is the reward. But if the child does something to get attention and the parents ignore them, the child will stop that behavior and try something else to get attention.
Reinforcers make behaviors more likely, while punishers make behaviors less likely. Positive reinforcers are things people work to get, and negative reinforcers are things people work to end or avoid.
| Rewarding (pleasant) stimulus | Aversive (unpleasant) stimulus | |
|---|---|---|
| Positive (adding a stimulus) | Positive reinforcement Example: Reading a book because it is fun and interesting | Positive punishment Example: Telling someone that their actions are inconsiderate |
| Negative (taking a stimulus away) | Negative punishment Example: Loss of privileges (e.g., screen time or permission to attend a desired event) if a rule is broken | Negative reinforcement Example: Reading a book because it allows the reader to escape feelings of boredom or unhappiness |
Natural and artificial reinforcement
In 1967, Charles Ferster suggested that rewards can be natural or created by people. Natural rewards happen as a direct result of a behavior, like eating after finding food. Created rewards, such as in a token economy, need someone else to give them, like a teacher giving points for good behavior.
In 1970, Baer and Wolf introduced the idea of "behavioral traps." These are situations that start very easily but then encourage people to keep behaving well. Behavioral traps use rewards to guide people into habits that continue on their own, helping them learn new skills that last.
Intermittent reinforcement schedules
Behavior isn’t always rewarded every time it happens. How often rewards are given changes how quickly and consistently someone learns a behavior, and how long the behavior lasts even after rewards stop. The simplest way to reward behavior is to give a reward every time the behavior happens. But sometimes, rewards are given less often, which creates different patterns of behavior.
Scientists have studied these patterns using animals like pigeons. They found that the timing and frequency of rewards can change how often and how long animals keep behaving in certain ways. For example, if a pigeon gets food after every few pecks on a button, it might peck more quickly than if it only gets food after a long wait. These patterns help explain how people and animals learn from rewards.
Shaping
Main article: Shaping (psychology)
Shaping is a way to help someone learn a new behavior by rewarding them little by little. For example, when training a rat to press a lever, you might first reward it for just looking at the lever. Then, you reward it for moving closer to the lever. Finally, you reward it only when it presses the lever. Each small step gets rewarded until the rat learns the full behavior.
Shaping is used to help people learn new skills, especially those with special needs like autism. It can also help people who are picky eaters try new foods by rewarding small steps toward trying them. This method often gives rewards every time at first, but later, rewards might come less often.
Chaining
Main article: Chaining
Chaining means linking different actions together in a sequence. Each action's reward becomes the signal for the next action. There are different ways to teach chaining, like starting from the first action, starting from the last action, or teaching all actions at once. For example, many people have a morning routine where they shower, dry off, and get dressed, doing these actions in order because they are used to it.
Studies have shown that chaining can help manage challenging behaviors in people with autism and other similar conditions. One way to help is through training that encourages better communication by using rewards in a chained way.
Mathematical models
Researchers have studied how to create a mathematical model to understand reinforcement better. This model is called MPR, which stands for mathematical principles of reinforcement. Peter Killeen did important work in this area by studying pigeons.
Applications
Reinforcement and punishment are common in how people interact with each other. Here are a few examples of how these ideas are used.
Addiction and dependence
Positive and negative reinforcement are important in how addictions and dependence on substances develop. A substance can feel very rewarding, making a person want to use it again. Over time, seeing things linked to using the substance, like certain places or objects, can make cravings return. When someone depends on a substance, they might use it to feel better when they experience uncomfortable symptoms from not using it.
Animal training
Animal trainers and pet owners have always used ideas about learning without knowing the exact terms. Training animals shows clearly how these ideas work. Important parts include giving rewards right away when the animal does something right, making sure rewards follow the right behavior, using small signals to mark good behavior, gradually improving skills, and sometimes giving rewards less often to keep the animal interested.
Child behavior – parent management training
Helping parents use positive reinforcement for good behavior in children is a big part of training parents. Parents learn to reward good behavior with praise, smiles, or small treats. They also learn to focus on small steps toward a bigger goal and use charts to track progress. Using positive reinforcement in classrooms can help students succeed, and it works best when it fits each student's needs.
Economics
Both psychologists and economists look at how people behave when they buy things. They study how changes in price affect how much of something people buy. Some things, like food, people buy a lot more or less when the price changes, while other things, like gas, people keep buying about the same no matter the price. These ideas help explain how people make choices based on what they value.
Gambling – variable ratio scheduling
Gambling machines give rewards after an unpredictable number of tries, which makes people keep playing. Because these machines are set to take in more money than they pay out, people who play them often lose money over time. This type of reward schedule has been linked to gambling problems.
Praise
Praise is a way to encourage good behavior. When people praise someone right after they do something good, it makes them more likely to do it again. Studies show that praise works well for children and adults, helping them behave better and do well in school or at work. For praise to work best, it needs to be given right after the good behavior, clearly describe what was done right, and come from someone the person trusts.
Traumatic bonding
Traumatic bonding can happen in relationships where there are repeated patterns of kind and hurtful behavior. This mix of good and bad treatment can create strong emotional ties that are hard to break.
Video games
Many video games are designed to keep players interested by giving rewards at unpredictable times. While this can be fun, it can also lead some people to spend too much time playing. Some games include special rewards called "loot boxes," which give random items. This practice has been compared to gambling because players never know what they will get, and in some places, it is considered a form of gambling. These items can sometimes be used or traded for real money, which is being looked at by lawmakers.
Criticisms
People have questioned the usual way we talk about behavioral reinforcement. They say it can seem like we're saying that a behavior gets stronger because something makes it stronger, which doesn't really explain why.
Some think reinforcers work more like signals. They show which actions might lead to a reward. This idea helps explain some tricky behavior patterns, like how animals act when rewards are given only sometimes.
Related articles
This article is a child-friendly adaptation of the Wikipedia article on Reinforcement, available under CC BY-SA 4.0.
Images from Wikimedia Commons. Tap any image to view credits and license.
Safekipedia