Card Values for Three Notrump
In other articles in this series, we are basing our search of evaluators on efforts to determine very generally how many tricks we expect to take. But what happens if we instead want to zero in on a particular contract and level?
We represent our side's combined honor holdings as a vector
(A,K,Q,J,T)
, where 0<=A<=4
, 0<=K<=4
, 0<=Q<=4
, 0<=J<=4
, 0<=T<=4.
We let P(A,K,Q,J,T)
be the probability we can make
nine or more tricks in notrump if we hold A
aces,
K
kings, Q
queens, J
jacks,
and T
tens.
Here are some rules that we know "intuitively:"
(1) P(A+1,K,Q,J,T)>P(A,K,Q,J,T) (A<4) (2) P(A,K+1,Q,J,T)>P(A,K,Q,J,T) (K<4) (3) P(A,K,Q+1,J,T)>P(A,K,Q,J,T) (Q<4) (4) P(A,K,Q,J+1,T)>P(A,K,Q,J,T) (J<4) (5) P(A,K,Q,J,T+1)>P(A,K,Q,J,T) (T<4) (6) P(A+1,K,Q,J,T)>P(A,K+1,Q,J,T) (A,K<4) (7) P(A,K+1,Q,J,T)>P(A,K,Q+1,J,T) (K,Q<4) (8) P(A,K,Q+1,J,T)>P(A,K,Q,J+1,T) (Q,J<4) (9) P(A,K,Q,J+1,T)>P(A,K,Q,J,T+1) (J,T<4)
(1)-(5) simply state that additional honors make it more likely we can make 3NT. (6)-(9) says that swapping a card with a card of higher rank makes it more likely we can make 3NT.
For the mathematicians in the house, these relations create a partial order on the vectors (A,K,Q,J,T).
We can come up with estimates for P(A,K,Q,J,T)
with
double-dummy simulations. Restricting to a limited range, we get
a table which looks like:
AKQJT % freq P ==================== ... 13431 0.052 17.0% 13432 0.056 29.0% 13433 0.025 37.5% 13434 0.004 40.0% 13440 0.003 32.0% 13441 0.009 47.5% 13442 0.008 51.5% 13443 0.003 51.5% 13444 0.000 59.5% ...
The Work Point Count assigns the values 4, 3, 2, and 1 to the ace, king, queen, and jack, and traditionally, we say we have enough for game if we have 25 points. But is this an ideal set of values?
Let's say our goal is to bid all games which are better than 50% likely. Is it possible to come up with values for the honors card, and some limit, to get a "perfect" evaluator - one which seperates the rows into those with >50% and <50%?
The answer to that is "no." Look at these rows:
(a) 24041 0.0136 60.0% (b) 24121 0.3911 31.8%If we have a perfect evaluator for honor cards, then we'd know from these two lines that the value of a queen is less than twice the value of a jack, because (b) adds a queen and subtracts two jacks from (a), and ends up not being a good game.
On the other hand:
(c) 31332 0.5163 37.6% (d) 31412 0.1744 50.6%In this case, adding a queen and subtracting two jacks improves the odds.
So, we can only get close with this simple approach. So, how do we determine how "close" an evaluator is to ideal? By measuring the error, which we can do row by row and sum up.
If the evaluator says it is right to bid game, but the table says it is wrong, or visa versa, we define the error contributed by the row as the absolute value of the difference between the game percentage and 50%, multiplied by the frequency of the row.
For example, if we are using the Work Point Count, and setting a threshhold of 25 for game, then row (d) above would contribute an error of 0.6%*0.001744. This means that rows which are rare, and rows which are close to 50% games, are not weighted nearly as high.
Obviously, we can apply the same technique for any percentage - if we want to bid 40% games, for example, we'd adjust the above error estimate.
The Sample
I've generated a considerable sample on the following conditions:- South is balanced.
- There is no 8-card major fit
- North is not "freakish" shape
- Neither hand has a doubleton queen or jack without a higher honor (no Qx, Jx), and north doesn't have any singleton kings.
The last condition is slightly controversial, but I wanted to test for the "essential" values of honors. I will, at some later date, investigate the effect of less ideal conditions.
The Results
So, without further ado, here are some data points for 50% 3NT contracts:Card Values Game 100* A K Q J T Goal Error 4.00 3.00 2.00 1.00 0.00 25.00 38.99 Work point count 4.50 3.00 1.50 0.75 0.25 24.75 59.85 BUM-RAP 4.00 3.00 2.00 1.00 0.50 25.50 16.12 Work point plus 1/2 per T 4.25 3.00 2.00 1.00 0.50 26.25 9.70 " + 1/4 per ace 4.00 2.80 1.80 1.00 0.40 24.20 6.31 WPC- 1/5 per K&Q + 2/5 per T
The goals for each row are chosen to minimize the error. So if we are using the Work Point Count, we want to have 25 points or more. If we add 1/2 for a ten, we want 25.5 to have a 50% chance for game, and we get a significantly smaller error. We get a further gain by adding 1/4 point more for each ace.
If we are aiming for 40% games - say, at IMPs rather than MPs - we get different errors and goals:
Card Values Game 100* A K Q J T Goal Error 4.00 3.00 2.00 1.00 0.00 24.00 65.12 Work point count 4.50 3.00 1.50 0.75 0.25 24.50 67.06 BUM-RAP 4.00 3.00 2.00 1.00 0.50 25.00 12.15 Work point plus 1/2 per T 4.25 3.00 2.00 1.00 0.50 25.50 9.87 " + 1/4 per ace 4.00 2.80 1.80 1.00 0.40 23.80 9.18 WPC- 1/5 per K&Q + 2/5 per TNote that the errors, when we are seeking 40% games, are different from the 50% games. In particular, all but one of the errors are increased, and BUM-RAP now does almost as well as Work Point Count.
As before, the fifth row (A=4, K=2.8, Q=1.8, J=1, T=0.4) is the best, but this time, only slightly better than the third. Also, in this case, WPC beats BUM-RAP by just a slight bit compared to the considerable difference in the 50% case.
So far, I've only tested card values by trial and error. But the more "fractional" the evaluator, the harder it is to use in bidding.
Less Ideal Situations
If we remove the last condition on the above sample we get:Seeking 50% Games
Card Values Game 100* A K Q J T Goal Error 4.00 3.00 2.00 1.00 0.00 25.00 32.13 Work point count 4.50 3.00 1.50 0.75 0.25 25.00 61.76 BUM-RAP 4.00 3.00 2.00 1.00 0.50 25.50 16.86 Work point plus 1/2 per T 4.25 3.00 2.00 1.00 0.50 26.25 7.50 " + 1/4 per ace 4.00 2.80 1.80 1.00 0.40 24.40 6.71 WPC- 1/5 per K&Q + 2/5 per T
Seeking 40% Games
Card Values Game 100* A K Q J T Goal Error 4.00 3.00 2.00 1.00 0.00 24.00 71.95 Work point count 4.50 3.00 1.50 0.75 0.25 24.25 61.21 BUM-RAP 4.00 3.00 2.00 1.00 0.50 25.00 20.40 Work point plus 1/2 per T 4.25 3.00 2.00 1.00 0.50 25.75 10.90 " + 1/4 per ace 4.00 2.80 1.80 1.00 0.40 23.80 13.17 WPC- 1/5 per K&Q + 2/5 per TNote that BUM-RAP does slightly better than HCP for the 40% games, but the other three still significantly beat BUM-RAP.
Conclusions
BUM-RAP is designed better for suit contracts than notrump, but another reason BUM-RAP might fail is that it is designed to work across all levels, and not geared towards the 3NT contract, specifically.When we target the specific notrump contract level, we find that Work Point Count is as at least as good an evaluator as BUM-RAP, and that we can get a very good evaluator by tweaking the Work Point Count slightly.
The nice thing about the last evaluator, (A=4,K=2.8,Q=1.8,J=1,T=0.4) is that the total count in the deck is still 40, as with WPC and BUM-RAP. I call this evaluator the "Fifths evaluator" because you take WPC and subtract one fifth for each king and queen, and add two fifths for each ten.
Caveats
Do not necessarily take these results to mean that you should use these new evaluators.For example, if your partner opens 1 NT with a hand worth 14.6 to 17.4 in the Fifths valuator, then when should you bid 3 NT? Obviously, if you use the above rules, you would always bid 3 NT with a value of 9.8 in the same evaluator, because your side is always worth at least 24.4, which is our cut-off point for 50% games. You also won't want to bid 3NT with less than 7.0 points, because there is no chance you will have as many as 24.2 points. But do you want to bid with 7.2 points? 7.4? At what point do you invite? At what strength does partner accept?
The problem with the fractional values is that the borders become fuzzy. The actual "error" when you bid to 3NT is some combination of the errors for various ranges. For example, if you decide to always bid game with 9.0 points or more, and never for less, you get two sets of errors, one where this "underestimates" the total strength, and another where it over-estimates. Again, we'd need to come up with some definition of "error," here.
comments powered by Disqus