Topic · Arithmetic

Averages and Mixtures

One number to summarize many — and the elegant little algebra that lets you blend two of anything to hit a target in between. Averages look simple until you average the wrong way; mixtures look hard until you see they're just weighted averages with a costume on.

What you'll leave with

  • The arithmetic mean and its workhorse identity $\sum x_i = n \bar{x}$ — the move that unlocks nearly every average problem.
  • Weighted averages: when not all items count the same (grades, GPAs, group merges).
  • The harmonic and geometric means, and the trap of averaging speeds with the wrong one.
  • Alligation — the cross-method for mixture problems — and why it works.
  • A toolkit for the classic genres: dilution, blending, alloys, and successive replacement.

1. The arithmetic mean

Arithmetic mean

For a list of numbers $x_1, x_2, \ldots, x_n$, the arithmetic mean is their sum divided by their count:

$$ \bar{x} \;=\; \frac{x_1 + x_2 + \cdots + x_n}{n} \;=\; \frac{1}{n}\sum_{i=1}^{n} x_i. $$

The mean is the "balance point" of the data — if you imagine the numbers as weights placed along a ruler at their values, the mean is where you'd put the fulcrum for the ruler to balance. That mental picture explains a lot of behaviour: pull one value far to the right and the balance point drifts that way; add a value at exactly $\bar{x}$ and nothing changes at all.

The identity that does the work

Rearrange the definition and you get the single most useful identity for solving average problems:

$$ \sum_{i=1}^{n} x_i \;=\; n \cdot \bar{x}. $$

Almost every "find the missing value" or "what score do I need on the final" problem reduces to this. Don't think in averages — think in sums, then divide at the end.

The workhorse move

If five test scores average 80, their sum is $5 \times 80 = 400$. If four of them are 72, 85, 91, and 68 (summing to 316), the fifth must be $400 - 316 = 84$. You never need to "average" again — you only need the total.

Two derived rules

Addition rule. Append a value $y$ to a list of $n$ items with mean $\bar{x}$:

$$ \bar{x}_{\text{new}} \;=\; \frac{n \bar{x} + y}{n + 1}. $$

If $y > \bar{x}$, the mean rises; if $y < \bar{x}$, it falls; if $y = \bar{x}$, it doesn't budge.

Replacement rule. Swap one value $y_{\text{out}}$ for another $y_{\text{in}}$ (keeping $n$ fixed):

$$ \bar{x}_{\text{new}} \;=\; \bar{x}_{\text{old}} + \frac{y_{\text{in}} - y_{\text{out}}}{n}. $$

The change in the mean is the change in the sum, spread evenly across all $n$ items. A swap that lifts the sum by 30 in a class of 10 lifts the average by 3.

2. Weighted averages

Plain averages assume every item counts the same. The real world rarely does. A final exam matters more than a homework set. A 4-credit course should pull harder on a GPA than a 1-credit one. A class of 30 should outweigh a class of 10 in any "combined" measure. The fix is to attach a weight $w_i$ to each value:

$$ \bar{x}_w \;=\; \frac{\sum w_i x_i}{\sum w_i}. $$

The denominator is the total weight, not the count. When every $w_i = 1$, this collapses back to the ordinary mean — the plain mean is just a weighted mean where every weight happens to equal 1.

Grade-style example

Suppose a course grade is 40% exam, 30% project, 30% homework. A student scores 90, 80, and 70 respectively. Their grade is

$$ \bar{x}_w \;=\; \frac{0.40(90) + 0.30(80) + 0.30(70)}{0.40 + 0.30 + 0.30} \;=\; \frac{36 + 24 + 21}{1} \;=\; 81. $$

Weights stated as percentages already sum to 1, so the denominator vanishes — but it's still there in principle. Forgetting that is one of the most common sources of subtle errors.

Combining two groups

Two groups of sizes $n_1$ and $n_2$ with means $\bar{x}_1$ and $\bar{x}_2$ have a combined mean

$$ \bar{x}_{\text{combined}} \;=\; \frac{n_1 \bar{x}_1 + n_2 \bar{x}_2}{n_1 + n_2}. $$

A class of 20 averaging 70 combined with a class of 30 averaging 80 gives

$$ \frac{20(70) + 30(80)}{50} \;=\; \frac{1400 + 2400}{50} \;=\; 76, $$

not the naive $(70 + 80)/2 = 75$. The larger group pulls the average closer to its own mean.

Don't average averages

If the groups have different sizes, the simple average of their means is wrong. You can only get away with $(\bar{x}_1 + \bar{x}_2)/2$ when $n_1 = n_2$. Always weight by size unless you've checked.

3. Harmonic and geometric means

"Average" is a category, not a single formula. Three classical members of the family show up often enough to deserve names. For positive numbers $x_1, \ldots, x_n$:

NameFormulaBuilt for
Arithmetic mean (AM)$\bar{x} = \dfrac{1}{n}\sum x_i$Additive quantities — scores, prices, counts
Geometric mean (GM)$\sqrt[n]{x_1 x_2 \cdots x_n}$Multiplicative quantities — growth rates, ratios
Harmonic mean (HM)$\dfrac{n}{\sum (1/x_i)}$Rates measured against a fixed denominator — speed over a fixed distance

The three are linked by the AM–GM–HM inequality: for any positive numbers, $\text{AM} \geq \text{GM} \geq \text{HM}$, with equality only when every $x_i$ is the same. The means collapse onto each other when the data is uniform, and spread apart as the data spreads.

The average-speed trap

You drive 60 km to a town at 40 km/h, then 60 km home at 60 km/h. What's your average speed for the round trip?

The intuition that screams "50 km/h" is wrong. Average speed is total distance over total time, never the simple average of speeds:

$$ \text{time out} = \frac{60}{40} = 1.5 \text{ h}, \quad \text{time back} = \frac{60}{60} = 1 \text{ h}. $$ $$ \text{avg speed} = \frac{\text{total distance}}{\text{total time}} = \frac{120}{2.5} = 48 \text{ km/h}. $$

Why 48 and not 50? Because the slower leg takes longer. You spend more of the trip at 40 km/h than at 60 km/h, so the average sags toward the slower speed. The formula for two equal-distance legs is the harmonic mean:

$$ v_{\text{avg}} \;=\; \frac{2 v_1 v_2}{v_1 + v_2} \;=\; \frac{2 \cdot 40 \cdot 60}{40 + 60} \;=\; \frac{4800}{100} \;=\; 48. $$
Pitfall

For equal distances at different speeds, average speed is the harmonic mean. For equal times at different speeds, it really is the arithmetic mean. Which kind of "equal" you're holding fixed decides which mean applies — and getting it backward is the canonical mistake.

When the geometric mean is the right choice

An investment grows 50% in year one and loses 50% in year two. What's the "average" annual return?

Arithmetic says $(50 + (-50))/2 = 0\%$ — but a $\$100$ stake becomes $\$150$, then $\$75$. You're down 25%, not flat. The correct average is the geometric mean of the growth factors:

$$ \sqrt{1.5 \cdot 0.5} \;=\; \sqrt{0.75} \;\approx\; 0.866. $$

So the average annual factor is about 0.866 — a roughly $13.4\%$ annual loss, which compounded over two years gives the actual 25% drop. Multiplicative processes call for multiplicative means.

4. Mixtures as weighted averages

A mixture problem combines two (or more) components with different characteristic values — prices, concentrations, alcohol percentages — and asks about the result. The mixture's characteristic is simply the weighted average of the components, weighted by quantity:

$$ m \;=\; \frac{w_1 a + w_2 b}{w_1 + w_2}, $$

where $a, b$ are the components' values and $w_1, w_2$ are how much of each you used. That's the whole secret. Everything that follows is bookkeeping.

A worked blend

You mix 2 L of orange juice (10% sugar) with 1 L of soda (15% sugar). The blend's sugar content is

$$ \frac{2 \cdot 0.10 + 1 \cdot 0.15}{2 + 1} \;=\; \frac{0.20 + 0.15}{3} \;=\; \frac{0.35}{3} \;\approx\; 11.67\%. $$

The answer sits between 10% and 15%, biased toward the juice's value because there's twice as much juice. That bias is the whole point: more of something pulls the average toward that something.

Two equations, every time

Most mixture problems boil down to two conservation statements: total quantity (volume, mass, count) and total characteristic content (grams of sugar, dollars of value, litres of pure alcohol). Write both, and the unknowns fall out.

5. Alligation: the cross method

When you only need the ratio of two components — not their absolute amounts — there's a shortcut that has been taught in commerce schools since the medieval Islamic world. It's called alligation, and it reduces the algebra to a picture.

To mix components valued $a$ (cheaper) and $b$ (dearer) into a blend valued $m$ (with $a < m < b$):

$$ \frac{\text{quantity of } a}{\text{quantity of } b} \;=\; \frac{b - m}{m - a}. $$

Visually, write $a$ and $b$ at the top, $m$ in the middle, and "cross" them — each component's share of the mix is its opposite's distance from the target:

cheaper a dearer b target m parts of a b − m parts of b m − a ratio (b − m) : (m − a)

Where the formula comes from

It's just the weighted-average equation, solved for the ratio. If quantities $w_1$ of $a$ and $w_2$ of $b$ produce a mixture worth $m$, then

$$ \frac{w_1 a + w_2 b}{w_1 + w_2} = m. $$

Multiply out: $w_1 a + w_2 b = m(w_1 + w_2)$. Group the $w$'s: $w_1(a - m) = w_2(m - b)$, or equivalently $w_1(m - a) = w_2(b - m)$. Therefore

$$ \frac{w_1}{w_2} \;=\; \frac{b - m}{m - a}. $$

The cross diagram is the formula made visual: each component sits on the opposite diagonal from its share of the mix.

The coffee blend

Arabica costs $\$8$/kg and Robusta costs $\$14$/kg. To blend a $\$10$/kg coffee:

$$ \frac{\text{Arabica}}{\text{Robusta}} \;=\; \frac{14 - 10}{10 - 8} \;=\; \frac{4}{2} \;=\; 2 : 1. $$

Two parts Arabica to one part Robusta. Check: 2 kg at $\$8$ plus 1 kg at $\$14$ is $\$30$ total over 3 kg, which is $\$10$/kg exactly. The target is closer to the cheaper component, so the cheaper component dominates the blend — a pattern worth memorising.

Mind the direction

The cross differences give the ratio of quantities, not "more of the cheap one" by reflex. The mnemonic: the closer a component's value is to the target, the more of it you need. The smaller distance, the bigger share — they're inversely related.

6. Successive dilution

A vessel holds $V$ litres of pure liquid. You remove $k$ litres of the mixture and replace them with water. Then you do it again. And again. After $n$ rounds, how much of the original liquid is left?

Each operation keeps the same fraction $(1 - k/V)$ of whatever original liquid was present, because you're removing a uniform mixture and replacing the missing volume with water (which contains none of the original). So the original-liquid amount after $n$ rounds is

$$ V \left(1 - \frac{k}{V}\right)^n. $$

A 100 L wine cask diluted by removing 10 L and refilling with water three times retains $100 \cdot 0.9^3 = 100 \cdot 0.729 = 72.9$ L of wine. Not 70 L — that would be the answer if dilution were additive, but it's multiplicative. This is exactly the geometric-decay shape that shows up in radioactive half-lives, drug clearance, and compound interest with negative rates.

Pitfall

Successive dilution is multiplicative, not additive. Removing 10% three times leaves $0.9^3 \approx 72.9\%$ — not $100\% - 30\% = 70\%$. The error grows with more rounds; after ten 10%-removals you have $34.9\%$ left, not the (nonsensical) "100% − 100% = 0%".

7. Common pitfalls

Averaging averages

Combining two groups by averaging their means works only when the groups are the same size. If a class of 20 averages 70 and a class of 30 averages 80, the combined average is 76 — not 75. Always weight by group size.

Averaging speeds the simple way

For equal distances at different speeds, the arithmetic mean overstates. Use total distance over total time — equivalently, the harmonic mean $2v_1 v_2 / (v_1 + v_2)$. Only equal times at different speeds let you use the simple average.

Alligation direction reversed

The cross differences give the ratio of quantities, where the bigger share goes to the component closer to the target. A common error is to read off the diagram and assign the bigger ratio to the component on the same side as the bigger difference — that's backwards.

Treating dilution as additive

Repeated dilution by a fixed fraction is multiplicative: $(1 - k/V)^n$, not $1 - nk/V$. The error gets dramatic fast — after enough rounds the additive version would predict negative liquid, which is absurd.

Forgetting what's conserved

When you add water to a salt solution, the salt doesn't change — only the total volume does. When you evaporate water, again the salt is constant. Find the conserved quantity first; the rest is algebra.

8. Worked examples

Try each before opening the solution. The goal is to see whether your reasoning matches the canonical move, not just to land on the right number.

Example 1 · A score is missing from the average

The mean of five numbers is 12, and four of them are 10, 11, 14, 15. Find the fifth.

Step 1. The total sum must be $5 \times 12 = 60$.

Step 2. The four known values sum to $10 + 11 + 14 + 15 = 50$.

Step 3. The fifth value is $60 - 50 = 10$.

Check. $(10+11+14+15+10)/5 = 60/5 = 12$. ✓

Example 2 · Combining two unequal groups

A school has 30 students averaging 75 and 20 students averaging 85. What's the combined average across all 50?

Step 1. Sum from the first group: $30 \times 75 = 2250$.

Step 2. Sum from the second group: $20 \times 85 = 1700$.

Step 3. Combined: $\dfrac{2250 + 1700}{50} = \dfrac{3950}{50} = 79$.

The naive $(75 + 85)/2 = 80$ overstates by 1 — because the lower-scoring group is larger and pulls the mean down.

Example 3 · The round-trip speed (harmonic mean)

Gita drives 30 km to work at 60 km/h and returns home at 40 km/h. What's her average speed for the full trip?

Step 1. Time out: $30/60 = 0.5$ h. Time back: $30/40 = 0.75$ h.

Step 2. Total time: $1.25$ h. Total distance: $60$ km.

Step 3. Average speed: $60 / 1.25 = 48$ km/h.

Check with the harmonic-mean formula:

$$ \frac{2 \cdot 60 \cdot 40}{60 + 40} = \frac{4800}{100} = 48 \text{ km/h}. $$

The "obvious" arithmetic answer of 50 km/h is 2 km/h too high, because the slower leg (40 km/h) takes more time than the faster leg.

Example 4 · A weighted GPA

Karan takes three courses: a 3-credit A (grade 4.0), a 4-credit B (3.0), and a 3-credit C (2.0). What's his GPA?

Step 1. Multiply each grade by its credit weight:

$$ 3(4.0) + 4(3.0) + 3(2.0) = 12 + 12 + 6 = 30. $$

Step 2. Divide by total credits: $3 + 4 + 3 = 10$.

Step 3. GPA $= 30 / 10 = 3.0$.

If you'd averaged the grades unweighted you'd have gotten $(4.0 + 3.0 + 2.0)/3 = 3.0$ — the same answer, but only by coincidence. Change the credits to 1, 5, 1 and the weighted GPA falls to $2.86$ while the unweighted stays at $3.0$.

Example 5 · Alligation: blending coffee to a target price

Sugar at $\$4$/kg is mixed with sugar at $\$8$/kg to sell at $\$5.50$/kg. In what ratio?

Step 1. Cross differences: $|8 - 5.50| = 2.50$ and $|5.50 - 4| = 1.50$.

Step 2. Ratio of cheaper to dearer:

$$ \frac{8 - 5.50}{5.50 - 4} = \frac{2.50}{1.50} = \frac{5}{3}. $$

So 5 parts of the cheap sugar to 3 parts of the expensive one. The target $\$5.50$ sits closer to the cheap side ($\$4$), so the cheap sugar dominates — exactly the pattern alligation predicts.

Check. 5 kg at $\$4$ plus 3 kg at $\$8$ is $20 + 24 = \$44$ over 8 kg, or $\$5.50$/kg. ✓

Example 6 · Diluting a solution to a target concentration

You have 5 L of 30% alcohol solution and want to dilute it to 20% by adding water. How much water?

Step 1. Identify what's conserved. Adding water doesn't change the alcohol — only the volume.

Alcohol present: $5 \times 0.30 = 1.5$ L.

Step 2. Let $V$ be the new total volume. Require $1.5/V = 0.20$, so $V = 7.5$ L.

Step 3. Water added: $7.5 - 5 = 2.5$ L.

The shortcut: $c_1 v_1 = c_2 v_2$ when only the solvent is added or removed.

Example 7 · Successive dilution

A 200 L milk vat has 20 L removed and replaced with water, four times. How much milk remains?

Step 1. Each round multiplies the remaining milk by $1 - 20/200 = 0.9$.

Step 2. After four rounds:

$$ 200 \cdot 0.9^4 = 200 \cdot 0.6561 = 131.22 \text{ L}. $$

A naive subtraction of $4 \times 20 = 80$ L would give 120 L — close, but wrong by about 11 L because each round removes a fraction of progressively less milk, not a fixed 20 L of pure milk.

Sources & further reading

The content above synthesizes established sources on arithmetic averages and mixture problems. When this page doesn't go deep enough, these do.

Test your understanding

A quiz that builds from easy to hard. Pick an answer to get instant feedback and a worked explanation. Your progress is saved in this browser — come back anytime to continue.

Question 1 of N
0 correct