1. The defining equation
Given a square matrix $A$, a non-zero vector $\vec{v}$ is an eigenvector of $A$ if there is a scalar $\lambda$ such that
$$ A\vec{v} = \lambda \vec{v}. $$The scalar $\lambda$ is the corresponding eigenvalue. The pair $(\lambda, \vec{v})$ is called an eigenpair.
Read the equation slowly. The left side, $A\vec{v}$, is "apply the transformation $A$ to the vector $\vec{v}$." The right side, $\lambda \vec{v}$, is "stretch $\vec{v}$ by the number $\lambda$." The eigenvalue equation says: the complicated thing on the left and the simple thing on the right are the same vector. The matrix's action on $\vec{v}$, with all its potential mixing and shearing, collapses to a plain scalar multiplication.
That collapse is the whole point. Generic vectors get pushed around in complicated ways by $A$ — bent off their original direction, mixed with other components. Eigenvectors are the survivors. The transformation can stretch them, squash them, even flip them around (negative $\lambda$), but it never tilts them off the line they're on.
The eigenvalue $\lambda$ tells you how much the stretching is. The eigenvector $\vec{v}$ tells you where — which direction in space is invariant.
"Eigen" is German for "own" or "characteristic" — these are the matrix's own directions, the ones intrinsic to it rather than to any choice of coordinate system. You'll sometimes see them called characteristic vectors and characteristic values.
2. Geometric intuition
Picture a 2-D plane with a few vectors scattered around it. Now apply some linear transformation $A$ — say, the matrix that horizontally stretches space by a factor of 2 and vertically squishes by a factor of $\tfrac{1}{2}$. Each input vector lands somewhere new.
For most vectors, the output points in a different direction than the input. The transformation rotated them. But two special directions are exceptions: anything purely horizontal stays horizontal (now twice as long), and anything purely vertical stays vertical (now half as long). Those two directions are the eigenvectors. The two stretch factors — $2$ and $\tfrac{1}{2}$ — are the eigenvalues.
The picture below shows three vectors under such a transformation. The eigenvector (orange, along the $x$-axis) is merely stretched — its image is on the same line. The other two vectors are rotated off their original directions.
Notice what happened. Vectors $\vec{u}$ and $\vec{w}$ both pointed somewhere diagonal on the left; after the transformation, they point somewhere different diagonal. They've been rotated. But the orange vector $\vec{v}$, which started along the $x$-axis, is still along the $x$-axis — just twice as long. That's the eigenvector signature: same line, different length.
If you can find the lines a matrix leaves invariant, you've found its eigenvectors. The stretching factor along each such line is the corresponding eigenvalue. Everything else about eigenanalysis is built on this picture.
3. Finding them
The defining equation $A\vec{v} = \lambda \vec{v}$ can be rearranged. Subtract $\lambda \vec{v}$ from both sides, and factor out $\vec{v}$ — being careful, because $A$ is a matrix and $\lambda$ is a scalar, so we need the identity $I$ to make the dimensions agree:
$$ A\vec{v} - \lambda \vec{v} = \vec{0} \quad\Longrightarrow\quad (A - \lambda I)\vec{v} = \vec{0}. $$This is now a homogeneous linear system in $\vec{v}$. The trivial answer $\vec{v} = \vec{0}$ always works — but we excluded that from the definition. So we want a non-trivial solution. Such a solution exists exactly when the matrix $A - \lambda I$ is singular: that is, when its determinant is zero.
The eigenvalues of $A$ are the roots of
$$ \det(A - \lambda I) = 0. $$This is a polynomial equation in $\lambda$ — the characteristic polynomial of $A$. For an $n \times n$ matrix, it has degree $n$.
The recipe is therefore two stages:
- Find the eigenvalues. Compute $\det(A - \lambda I)$ as a polynomial in $\lambda$. Set it equal to zero. Solve.
- Find the eigenvectors. For each eigenvalue $\lambda_i$, plug back in and solve the homogeneous system $(A - \lambda_i I)\vec{v} = \vec{0}$. The null space of that matrix is the eigenspace for $\lambda_i$ — every non-zero vector in it is an eigenvector.
A 2×2 walkthrough
Let
$$ A = \begin{pmatrix} 4 & 1 \\ 2 & 3 \end{pmatrix}. $$Step 1. Form $A - \lambda I$ and take its determinant:
$$ A - \lambda I = \begin{pmatrix} 4 - \lambda & 1 \\ 2 & 3 - \lambda \end{pmatrix}, \quad \det(A - \lambda I) = (4 - \lambda)(3 - \lambda) - 2. $$Expand:
$$ (4 - \lambda)(3 - \lambda) - 2 = \lambda^2 - 7\lambda + 12 - 2 = \lambda^2 - 7\lambda + 10. $$Step 2. Solve $\lambda^2 - 7\lambda + 10 = 0$. This factors as $(\lambda - 5)(\lambda - 2) = 0$, giving
$$ \lambda_1 = 5, \qquad \lambda_2 = 2. $$Step 3. For each eigenvalue, find the null space of $A - \lambda I$.
For $\lambda_1 = 5$:
$$ A - 5I = \begin{pmatrix} -1 & 1 \\ 2 & -2 \end{pmatrix}. $$The rows are proportional, so the system reduces to $-v_1 + v_2 = 0$, i.e. $v_2 = v_1$. Any non-zero scalar multiple of $(1, 1)$ works. Pick the cleanest:
$$ \vec{v}_1 = \begin{pmatrix} 1 \\ 1 \end{pmatrix}. $$For $\lambda_2 = 2$:
$$ A - 2I = \begin{pmatrix} 2 & 1 \\ 2 & 1 \end{pmatrix}. $$Again rank-one, with $2v_1 + v_2 = 0$, so $v_2 = -2v_1$. Pick:
$$ \vec{v}_2 = \begin{pmatrix} 1 \\ -2 \end{pmatrix}. $$That's everything: two eigenvalues, two eigenvectors. Verify with a quick check:
$$ A \vec{v}_1 = \begin{pmatrix} 4 & 1 \\ 2 & 3 \end{pmatrix}\begin{pmatrix} 1 \\ 1 \end{pmatrix} = \begin{pmatrix} 5 \\ 5 \end{pmatrix} = 5 \vec{v}_1 \quad \checkmark $$For a $2 \times 2$ matrix, the characteristic polynomial is always $\lambda^2 - (\mathrm{tr}\,A)\lambda + \det A$. The sum of eigenvalues equals the trace; the product equals the determinant. Both facts generalize to $n \times n$ matrices and are an excellent sanity check on hand calculations.
4. Why they matter
Suppose you have an $n \times n$ matrix $A$ with $n$ linearly independent eigenvectors $\vec{v}_1, \dots, \vec{v}_n$ and corresponding eigenvalues $\lambda_1, \dots, \lambda_n$. Stack the eigenvectors as columns of a matrix $P$, and put the eigenvalues on the diagonal of $D$:
$$ P = \begin{pmatrix} | & | & & | \\ \vec{v}_1 & \vec{v}_2 & \cdots & \vec{v}_n \\ | & | & & | \end{pmatrix}, \qquad D = \begin{pmatrix} \lambda_1 & & & \\ & \lambda_2 & & \\ & & \ddots & \\ & & & \lambda_n \end{pmatrix}. $$Then a beautiful identity holds:
$$ A = P D P^{-1}. $$This is diagonalization. It says: in the coordinate system whose axes are the eigenvectors, $A$ is just $D$ — a diagonal matrix that does nothing more than scale each axis by its eigenvalue. The horrible-looking matrix $A$ in the standard basis turns into a simple stretching machine once you change perspective.
The payoff is enormous because diagonal matrices are trivial to compute with. For example, what is $A^{10}$?
$$ A^{10} = (PDP^{-1})^{10} = P D^{10} P^{-1}, $$and $D^{10}$ is just the diagonal matrix with $\lambda_i^{10}$ on the diagonal — no matrix multiplications required. The same trick simplifies matrix exponentials, solutions to linear ODEs ($\frac{d\vec{x}}{dt} = A\vec{x}$ becomes $n$ uncoupled scalar equations, one per eigenvector), Markov chain long-run behavior, and almost any iterative process driven by a linear operator.
The takeaway: eigenvectors aren't just curiosities of a matrix. They're the right basis — the coordinate system in which $A$ shows its true, simple nature.
If $A = PDP^{-1}$, then anything you can do to a diagonal matrix, you can do to $A$: just sandwich the operation between $P$ and $P^{-1}$. Eigen-decomposition is a license to pretend $A$ is diagonal.
5. Applications, briefly
Eigenanalysis shows up wherever a linear operator drives a system, which turns out to be most of applied mathematics.
- Google PageRank. The original algorithm models the web as a giant stochastic matrix $M$ of link-following probabilities. The PageRank of every page on the internet is just the entries of the dominant eigenvector of $M$ (the one with eigenvalue $1$). Find that eigenvector, you've ranked the web.
- Principal Component Analysis (PCA). Given a cloud of data points, the directions of maximum variance are the eigenvectors of the data's covariance matrix; the eigenvalues give the variance along each. Keep the top few and you've compressed the data while preserving its dominant structure.
- Vibrational modes. Springs, drum heads, molecules, bridges — any system with linear restoring forces vibrates in a superposition of normal modes, each of which is an eigenvector of the system's stiffness matrix, with its eigenvalue setting the squared frequency.
- Quantum mechanics. Observable quantities are operators; their eigenvalues are the values you can measure; their eigenvectors are the states that produce a definite outcome. The entire formalism is eigenanalysis dressed in physics.
- Stability analysis. Linearize any dynamical system near a fixed point. The eigenvalues of the Jacobian tell you whether small disturbances decay (stable) or blow up (unstable) — and how fast.
This is why eigenvalues earn the description "the single most influential idea in applied linear algebra": each item above is a billion-dollar industry, a Nobel-winning theory, or both, and each is at heart the same calculation you did by hand in section 3.