3 Tips for Treats in Horse Training

When you clicker train your horse you need something to reinforce (strengthen) the behaviour. After all, it is the receiver (horse) that determines the reinforcer.

Food is an excellent reinforcer for most horses, although I have come across a few who didn’t seem interested in food at all for training purposes. In that case you have to become a bit more resourceful. I will write a post about that if there is demand for that topic. Please let me know in the comments.

Tip #1 Experiment!

Don’t be afraid to try out different treats and explore what your horse likes and what he doesn’t.

When I shared the post Clicker training 101: Tips for Treats on my Facebook group Happy Herd, Equine Clicker Training Network wonderful experiences about all kind of different food rewards were given.

Some trainers use large kibbles, that are just ‘maintenance pellets’ horse feed, some use a mix of alfalfa cubes mixed with corn or one of the many commercial treats available for horses.

I like to use Timothy hay cubes and grass pellets since Kyra loves them very much. My horse is insuline resistance so I don’t want to use treats loaded with sugar of molasses. Kyra prefers natural flavours. My own home baked (sugar free) horse cookies made out of brown rice and flax seeds are her favorite.

Other choices are: cut up carrots, apples, zucchini, cucumber, grain, carob pods, sunflower seeds, grapes and other fruits and vegetables.

Tip #2 Measure the value of the reinforcer

You can make a list of all the treats you tried and how much value they seemed to have for _carrot_reward_reinforcer_horsetreat_tips for treats_horsetraining_hippologicyour horse at that moment. Did he really like the treat or did he ‘just ate them’? Did he seem to like it or did he spit them out?

The value of certain foods can change over time or depending on the circumstance. Sometimes it is depending on the season. Gras pellets can have a higher value for your horse in Winter when his diet is mainly hay, than in Summer when he grazes in a juicy pasture all day long.

So try again if your horse didn’t seem to like a certain treat. He might have changed his mind. Sometimes a horse has to ‘learn’ to eat it. When Kyra came fresh out of the nature reserve she didn’t like carrots and apples at all because her mom hadn’t taught her they were ‘safe foods’. When she saw how other horses ate it again and again she started to try them too and eventually like them. So don’t give up too easy if it is a healthy treat that you would like to use a reinforcer.

Tip #3 Vary!

Variety is the spice of life. If you change your reinforcers in training, it contributes to a certain ‘chance’ of getting a certain treat. Since the horse doesn’t know what treat he will receive, you will get him on top of his game to earn that ‘special one’.

You can also mix two different kinds of food. If you use hay cubes and it’s values fades quickly for your horse, you can mix in some high value food like a bit of grain or corn to make it more interesting without making the reward too rich in calories.

 

What about you?

I (and I am sure my readers, too) would love to hear about your choice of food reward and the reason for your choice.

Please share your valuable experience in the comments and help your fellow positive reinforcement trainers.

 HippoLogic.jpgSandra Poppema, B.Sc.
My mission is to improve human-horse relationships. I reconnect horse women with their inner wisdom and teach them the principles of learning and motivation, so they become confident and skilled to train their horse in a safe and effective way that is a lot of FUN for both human and horse. Win-win.
Sign up for HippoLogic’s newsletter (it’s free and it comes with a reinforcer) or visit HippoLogic’s website and discover what else I have to offer.
Follow my blog  on Bloglovin

 

Most powerful reward schedule: Variable ratio

In a  variable ratio schedule a desired behaviour (once it is established and put on cue) will be reinforced randomly. There is no way the horse can predict when he can expect a reward, so this will keep him motivated to perform well.

Benefits of a variable reinforcement schedule

With a variable ratio schedule it will take a very long time before a behaviour will become extinct. Extinction means that the behaviour will no longer be displayed in a certain situation. There is 0% chance of a reward so therefor the behaviour has become ‘useless’ in that situation.

A variable ratio schedule is the most powerful reward schedule. Your horse figures ‘This could be the time my behaviour gets rewarded, so let’s try this again’. No reward? ‘Maybe this time I will get a reward… Let’s give it a bit more effort… Yes! It worked’.__rewards_hippologic

A variable reward schedule is also the reason why most horses keep displaying undesired behaviours. I explain this further in this post.

Extinction burst

If a behaviour is never rewarded (intrinsically or extrinsically) it will go extinct. Just before a behaviour goes extinct there is usually an ‘extinction burst’.

Often when an in the past rewarded behaviour doesn’t result in a reward the animal shows a sudden and temporary increase in the behaviour followed by the eventual decline and extinction of the behaviour targeted for elimination. Novel behaviour, or emotional responses or aggressive behaviour, may also occur (Miltenberger, R. (2012). Behaviour modification, principles and procedures. (5th ed., pp. 87-99). Wadsworth Publishing Company.)

Extinction_Graph

Extinction, extinction burst and spontaneous recovery graph from study.com

The same principle occurs in a consciously applied variable reward schedule. Just before the horse loses interest in displaying the behaviour he will show a little ‘extinction burst’ as a last attempt to influence the reinforcement (reward). This is the improved behaviour a trainer is looking for and wants to mark and reward.

Withhold the click

If the horse already has a strong positive reinforcement history with a certain behaviour or with positive reinforcement training in general, it can react differently to a withdrawn click than when he is in the beginning of the learning stage of an exercise.

A well used withdrawal of the click will induce an improvement of behaviour (extinction burst). It also can help the horse figure out quicker which behaviour is rewarded and which isn’t. In this way you can give more information about what you want.

Instead of the trainer acting like a ‘vending machine’: put money (behaviour) in and expect a reward (treat comes out), the trainer now behaves more like a ‘gambling machine’ with a fair chance to win.

_reward_schedules_hippologic

The horse may become ‘superstitious’ and tries to figure out if there was a difference with the behaviour that was similar and didn’t get rewarded and the one that did. Just like superstitious people who are suddenly paying attention to the colour of their socks in order to influence their chances of winning, the animal will also pay more attention to the details of the behaviour in order to influences the chances of a click and reward.

Pitfalls of withholding a click too long

Withholding a click can also trigger impatience, frustration or confusion in the horse. So  use this technique with caution. You don’t want to discourage your horse. A little bit of frustration is no big deal, as long as the horse stays in learning mode.

Sometimes a bit of frustration can actually benefit the learning process. It is the trainers responsibility to walk this line. If the horse gets frustrated or shuts down, turn back to a continuous reward schedule for a while and make your training steps smaller and lower your criteria.

When you start teaching a new behaviour it is really important to click every improvement and use a continuous reward schedule. The next step in training should be only rewarding the behaviour when you have cued it. Once the cue is established, switch to a variable reward schedule.

Fading out the rewards

So once your horse has learned a specific behaviour you can reward less and less and still get the behaviour. This is called fading out the click.

Continuous reward schedules are very easy to use (reward 100%) because you don’t have to think about it. What about a variable reward schedule, are you using this in your training?

Sandra Poppema
For tailored positive reinforcement training advise, please visit my website and book a free intake consult!

Follow my blog with Bloglovin

 

New uses for ‘Old’ Tools in Clicker Training , part II

When I started clicker training I had no clue that there was a whole science behind the use of rewards. I only knew about one primary reward: treats.

I didn’t know anything about ‘primary’ and ‘secondary’ rewards, how to use high and low value rewards to my advantage. Neither did I know about ‘chaining’ behaviours, I learned all these things in the following years.

High and low value rewards
When I started clicker training I didn’t realize that a horse can consider some rewards as ‘high value’ rewards, and other as ‘low value’ rewards. I never thought that I could use this knowledge to my advantage and to help the horse perform better or make training less stressful.

I didn’t realize some rewards can lose their value and how to use this knowledge in my training. Of course I noticed when my pony became less eager to work for his treats after some training. Then I just stopped the training session and gave him a break. I was not consious enough about this fact to use it in my training approach. Now I am aware, I use high value rewards or low value rewards depending on the circumstances.

Variable rewards_key_lesson_standing_on_mat_hippologic
I also vary my treats depending on the difficulty level of the behaviour I am training, or depending on the context. In situations where there is more distraction I might use very high value treats (for Kyra that is apple) rather than lower value rewards (non-food) in order to keep my horses attention focused on me.

If Kyra is in a more relaxed state, she values the ear scratches (see picture) more than grain. I experiment with the different value treats during one training session. You can use different treats with different values to build in a certain unpredictability for the horse, in order to stimulate his efforts.

With horses who get overly excited by certain treats I start with lower value treats, horses who have to overcome something really scary or aversive, get really high value treats or rewards. Sometimes I give bigger rewards, sometimes I give smaller amounts. I know how I can really ‘stretch’ the reward moment by throwing in lots of verbal praise and feeding multiple hands full of treats (jackpotting).

Secondary rewards
I now also know know that I can use secondary rewards, like standing on a mat or verbal praise. A secondary reward is something the horse has ‘learned to like’ because there is an appetite association attached to it.

Behaviour chains
If you chain a series of behaviours, certain behaviours can become a reward, too. You can also use ‘back chaining’. Then you start teaching the last behaviour in the chain first: behaviour #3. Next step is to teach behaviour #2 and then you chain them together: #2, #3. Behaviour #3 is clicked and rewarded in the chain.

Then you teach another behaviour( #1) and back chain it, so you have chained #1, #2, #3. The horse has already a solid reward history built with behaviour #3 and he will anticipate on what is coming after #1 and #2. By asking the chain, the reward for the horse will lie in performing behaviour #3, which you may or may not reward with a click and reward once the horse knows the chain.

Looking back
So when I started clicker training I thought it was a simple concept: click and treat for the desired behaviours. I understood that really complex behaviours would be better taught in small steps. That was basically what I thought was clicker training.

__hippologic_beautiful_thing_about_learningThe more I learned about reward-based horse training, the more I realize there is so much more to learn. After 15 plus years of experimenting and reading and learning I still enjoy the puzzles my horse gives me that I really want to solve with positive reinforcement only. I just love it!

What are the things you have learned over the years that really stunned you?

Read here part I of New Uses for ‘Old Tools’ in Clicker Training

Read here part III.

If you think this is a blog that someone can benefit from, please use one of the share buttons  below. Or post your comment, I read them all!

Or simply hit the like button so I know you appreciated this blog. Thank you!

HippoLogic.jpg
Sandra Poppema, B.Sc.
My mission is to improve human-horse relationships. I reconnect horse women with their inner wisdom and teach them the principles of learning and motivation, so they become confident and skilled to train their horse in a safe and effective way that is a lot of FUN for both human and horse. Win-win.
Sign up for HippoLogic’s newsletter (it’s free and it comes with a reinforcer) or visit HippoLogic’s website and discover my online 8 week course Key Lessons, Your Key to Success in Positive Reinforcement Horse Training.
Follow my blog  on Bloglovin