In this series I will be sharing 6 interesting facts I didn’t know about when I started using positive reinforcement in training animals. This is part 6. This one is really an eye-opener! This is a phenomenon you only see in R+ training! (more…)
Posts tagged ‘fading out click’
In this series I will be sharing 6 interesting facts I didn’t know about when I started using positive reinforcement in training animals. This is part 1.
Some of these are common misunderstandings people have about clicker training while others are facts most equestrians don’t know at all.
The goal of this blog is to help more people understand how well positive reinforcement (R+) works in training our horses. I want every one to know that clicker training offers more great benefits besides training your goal behaviour. Positive side-effects you won’t get in negative reinforcement (R-) based training methods (traditional and natural horsemanship). I wish I had known these benefits earlier in life.
#1 The purpose of clicker training is to teach new behaviours or retrain undesired behaviours
People often get the wrong impression about equine clicker training. They think you need to keep clicking and feeding for ever. That’s not true at all!
I think it is because there are so many videos out there about teaching our horses new behaviours. If you see a lot of those videos you indeed can get the wrong impression and could be mistakenly thinking that we clicker trainers never stop clicking and are always giving treats.
Once the horse understands the new or more desirable behaviour, the marker (click) and food are faded out.
We still reinforce the behaviour once in a while with an appetitive (treat, praise, scratches or with other reinforcing behaviour), but we don’t keep clicking and feeding treats for the same behaviour over and over.
If we would do that, it would decrease the goal behaviour rather than it would keep it’s quality or increase it.
Part of the power of positive reinforcement is that there is a chance of getting a reward once the behaviour is trained. That chance can also involve to do other behaviour (one that they really like to do). That will make the horse always want to perform his best.
After the first few sessions of clicker training the horse starts to pay attention to the click and his behaviour at the the time of the click.
In clicker training he focus shifts pretty quickly from the food to the click and their own behaviour.
If people make videos about clicker training their horse, they are usually filming behaviour that is in the process of being taught, not behaviours that are already well trained and established. Therefor the horse is clicked and reinforced a lot in those videos.
The clicks and treats are faded out after the goal behaviour is trained.
Read the other articles in this series:
Share the passion!
If you want to share this blog on your social media, use one of the share buttons below. It’s very much appreciated!
I love to hear from you, so please add a comment or let me know if you have a question. I read them all!
Don’t know what to say? Simply hit the like button so I know you liked this article.
PS Do you know about the HippoLogic membership?
Safe the date: March 6, 2019
Ultimate Horse Training Formula, Your Key to Succes
- Want to get the results in training you really, really want?
- Want train your horse with confidence?
- Want to learn all there is to know about training your horse with positive reinforcement?
Join this online course and participate for free every time! Click here
Clicker Training Mastery (online course) starts March 6, 2019
Happy Horse training!
Sandra Poppema, B.Sc.
I help horse owners get results in training they really, really want. Getting results with ease and lots of fun for both horse and human is important to me. Win-win!
Sign up for HippoLogic’s newsletter (it’s free and it comes with a gift) or visit HippoLogic’s website and join my online course Ultimate Horse Training Formula in which you learn the Key Lessons, Your Key to Success in Clicker Training.
In a variable ratio schedule a desired behaviour (once it is established and put on cue) will be reinforced randomly. There is no way the horse can predict when he can expect a reward, so this will keep him motivated to perform well.
Benefits of a variable reinforcement schedule
With a variable ratio schedule it will take a very long time before a behaviour will become extinct. Extinction means that the behaviour will no longer be displayed in a certain situation. There is 0% chance of a reward so therefor the behaviour has become ‘useless’ in that situation.
A variable ratio schedule is the most powerful reward schedule. Your horse figures ‘This could be the time my behaviour gets rewarded, so let’s try this again’. No reward? ‘Maybe this time I will get a reward… Let’s give it a bit more effort… Yes! It worked’.
A variable reward schedule is also the reason why most horses keep displaying undesired behaviours. I explain this further in this post.
If a behaviour is never rewarded (intrinsically or extrinsically) it will go extinct. Just before a behaviour goes extinct there is usually an ‘extinction burst’.
Often when an in the past rewarded behaviour doesn’t result in a reward the animal shows a sudden and temporary increase in the behaviour followed by the eventual decline and extinction of the behaviour targeted for elimination. Novel behaviour, or emotional responses or aggressive behaviour, may also occur (Miltenberger, R. (2012). Behaviour modification, principles and procedures. (5th ed., pp. 87-99). Wadsworth Publishing Company.)
The same principle occurs in a consciously applied variable reward schedule. Just before the horse loses interest in displaying the behaviour he will show a little ‘extinction burst’ as a last attempt to influence the reinforcement (reward). This is the improved behaviour a trainer is looking for and wants to mark and reward.
Withhold the click
If the horse already has a strong positive reinforcement history with a certain behaviour or with positive reinforcement training in general, it can react differently to a withdrawn click than when he is in the beginning of the learning stage of an exercise.
A well used withdrawal of the click will induce an improvement of behaviour (extinction burst). It also can help the horse figure out quicker which behaviour is rewarded and which isn’t. In this way you can give more information about what you want.
Instead of the trainer acting like a ‘vending machine’: put money (behaviour) in and expect a reward (treat comes out), the trainer now behaves more like a ‘gambling machine’ with a fair chance to win.
The horse may become ‘superstitious’ and tries to figure out if there was a difference with the behaviour that was similar and didn’t get rewarded and the one that did. Just like superstitious people who are suddenly paying attention to the colour of their socks in order to influence their chances of winning, the animal will also pay more attention to the details of the behaviour in order to influences the chances of a click and reward.
Pitfalls of withholding a click too long
Withholding a click can also trigger impatience, frustration or confusion in the horse. So use this technique with caution. You don’t want to discourage your horse. A little bit of frustration is no big deal, as long as the horse stays in learning mode.
Sometimes a bit of frustration can actually benefit the learning process. It is the trainers responsibility to walk this line. If the horse gets frustrated or shuts down, turn back to a continuous reward schedule for a while and make your training steps smaller and lower your criteria.
When you start teaching a new behaviour it is really important to click every improvement and use a continuous reward schedule. The next step in training should be only rewarding the behaviour when you have cued it. Once the cue is established, switch to a variable reward schedule.
Fading out the rewards
So once your horse has learned a specific behaviour you can reward less and less and still get the behaviour. This is called fading out the click.
Continuous reward schedules are very easy to use (reward 100%) because you don’t have to think about it. What about a variable reward schedule, are you using this in your training?
For tailored positive reinforcement training advise, please visit my website and book a free intake consult!