Algebra

This is an online resource for instructors and students. While the material is designed to be taught to strong middle school students, these notes are written for instructors who are invited to guide and discuss topics with their students.

  1. Algebra
    1. Fundamentals
      1. The Integers
        1. Exercise 1: Arithmetic with Integers
          1. Solution
      2. Variables
        1. Exercise 2: More Lumber Is Required
          1. Solution
        2. Exercise 3: Properties of Whole Number Addition
          1. Solution
        3. Exercise 4: Counterexamples
          1. Solution
    2. Rational Numbers
      1. Fraction Multiplication
        1. Fact 1: Fraction Multiplication
      2. Equivalence Classes
        1. Fact 2: Equivalence of Fractions With Equal Denominator
        2. Fact 3: Common Factor of Numerator and Denominator
        3. Fact 4: Cross-multiplication
      3. Addition and Subtraction
        1. Exercise 5: Fraction Addition and Subtraction
          1. Solution
        2. Fact 5: Fraction Addition & Subtraction
      4. Simplification
        1. Exercise 6: Fraction Operations and Simplification
          1. Solution
        2. Exercise 7: A Telescoping Product
          1. Solution
    3. Linear Equations
      1. Ratios and Rates
        1. Exercise 8: Carcross Car Count
          1. Solution
        2. Exercise 9: Produce Price Sum
          1. Solution
        3. Exercise 10: An Unlikely Sprint?
          1. Solution
        4. Exercise 11: Raccoon Population Growth
          1. Solution
      2. A General Approach
    4. Real Numbers
      1. Exponents
        1. Exercise 12: Positive Integer Exponents
          1. Solution
        2. Fact 6: Sum of Exponents
        3. Fact 7: Product of Exponents
        4. Exercise 13: Negative and Zero Exponents
        5. Exercise 14: Scientific Notation
        6. Exercise 15: Fractions, Exponents & Radicals
      2. Definition of a Real Number
      3. Operations on Real Numbers
        1. Exercise 16: A Linear Equation using Real Numbers
          1. Solution
    5. Polynomials
      1. Linear Equations, Again
      2. Two Simple Quadratics
        1. Exercise 17: Roots of x21x^2 - 1
          1. Solution
        2. Exercise 18: Roots of (x3)(x+2)(x-3)(x+2)
          1. Solution
      3. Factoring Quadratics
        1. Exercise 19: Factoring a Quadratic via Trial and Error
          1. Solution
      4. Quadratic Formula
        1. Exercise 20: Vertex Form to Standard Form
          1. Solution
        2. Exercise 21: Solving a Quadratic in Vertex Form
      5. Polynomials
      6. Factoring Polynomials
        1. Exercise 22: Examples of Polynomial Long Division
          1. Solution
        2. Exercise 23: Applications of the Remainder Theorem
          1. Solution
        3. Exercise 24: Applications of the Rational Root Theorem
          1. Solution
        4. Exercise 25: Factorization with Real Numbers
          1. Solution
    6. Linear Algebra
      1. Vectors
      2. Homogeneous Linear Equations (Two Unknowns)
        1. Exercise 26: Solutions to a Homogeneous Linear Equation
          1. Solution
      3. Vector Spaces
        1. Exercise 27: Vector Space of Solutions
          1. Solution
        2. Exercise 28: Solutions to Homogeneous Linear Equations
          1. Solution
      4. Inhomogeneous Linear Equations
        1. Exercise 29: Solutions to an Inhomogeneous Linear Equation
          1. Solution
      5. Linear Systems

Fundamentals

The Integers

The first kinds of numbers discovered were the natural numbers: 0,1,2,3,4,5,6,7,8,9,10, 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, \dots

There are (infinitely) many natural numbers, and there are many things that we can do with them. For example, we can add any two natural numbers to obtain another. It does not matter how big the numbers become: their sum is always still a natural number.

We may also multiply two natural numbers to obtain another. Again, it does not matter how big or small the numbers are: multiplication is something we can do with any natural numbers.

Natural numbers are great for representing “things”. In particular, they're great for representing “some number of things”. But they fall short when we want to represent less than nothing. But you can’t have less than nothing, so why would we ever want that?

The desire for a more general kind of number comes from the desire to represent change. If yesterday there were 2020 students in attendance, and today there are 2424 students in attendance, then we can say 44 more people attended today than did yesterday. But what if tomorrow there will again be 2424 students in attendance? Then how many more students will have attended? Even worse, what if the day after tomorrow, there will be 2020 students in attendance again. Then what is the change in the number of students?

It is not that these problems cannot be solved with positive natural numbers. In fact, they certainly can. The number of students could increase by 44, and we have a number for that. Or the number of students could decrease by 44. We already have a number for that.

So to express a change in a quantity, we must convey two pieces of information: firstly, in what direction is the change; and secondly, by how much the quantity is changed. This seems quite complicated. It would be simpler to introduce a new kind of quantity that could represent these changes more conveniently and more compactly. This, of course, is the motivation for negative numbers.

Negative numbers do not “exist” in the real world. We cannot have a negative number of people in a class. But they provide an easy way to talk about change: if positive numbers represent increase, then negative numbers represent decrease.

The introduction of negative numbers allows us to expand our set of mathematical objects to the integers, ,3,2,1,0,1,2,3,\dots, -3, -2, -1, 0, 1, 2, 3, \dots. As with natural numbers, we still have these two useful properties of integers, which we will call closure properties (think of a room withh closed door; if applying these operations inside the room of the integers, you never need to worry about anything outside the room):

But in addition, we have a third closure property, which enables us to describe changes:

Exercise 1: Arithmetic with Integers

Evaluate each of the following expressions:

  1. 1+(1)=1 + (-1) =

  2. 1(2)=-1 - (-2) =

  3. 3×4=-3 \times 4 =

  4. (4)×(10)=(-4) \times (-10) =

Solution
  1. 1+(1)=01 + (-1) = \boxed{0}

  2. 1(2)=1-1 - (-2) = \boxed{1}

  3. 3×4=12-3 \times 4 = \boxed{-12}

  4. (4)×(10)=40(-4) \times (-10) = \boxed{40}

Variables

When we first learn arithmetic, we only have to worry about natural numbers. As we continue to develop new techniques, we find new kinds of numbers — like the integers mentioned above. Let’s take a step back and examine the philosophy behind what numbers really are.

We are familiar with everyday objects: things we can see, feel, or otherwise interact with. Mathematical objects are like those, but more abstract. In the mathematical world, you can interact with mathematical objects through statements of fact. A common example of such a statement is equality: 1+1=21 + 1 = 2 is a statement of fact that the mathematical object referred to by 1+11 + 1 is in fact the same as the mathematical object referred to by 22.

Numbers are the most common mathematical objects, but as we will see later, they are not the only kind. Typically, we study mathematical objects because of some motivation from a real-world problem. In the real world, we may talk of quantity, location, a process, or any number of other things. To use mathematics to solve these problems, we must translate these real-world concepts into mathematical objects by using a model. Inside the model, we have many mathematical objects. Some of these objects represent the information given to us by the problem. Other objects might represent information we need to figure out — they might not be known from the start!

There are three ways we can describe a mathematical object in a simple model:

For instance, a real world problem is as follows:

Exercise 2: More Lumber Is Required

Yahui and Zhen want to build a wooden treehouse. 100100 planks are required. Yahui has 3333 in her shed, and Zhen has 2525 in his shed. How many additional planks must they buy to complete the treehouse?

Solution

We can use whole numbers to model this problem. First, let’s give names (variables) to all mathematical objects (numbers) that the problem gives us or requests that we find. Let yy be the number of planks that Yahui has, zz be the number of planks that Zhen has, rr be the total number of planks required, and xx be the number of additional planks they must buy. The equation is: y+z+x=r y + z + x = r

This equation is the governing equation of our model. We are given the values of yy, zz, and rr, and we’re asked to find the value of xx. (This is not always possible!) As mentioned above, we don’t need to use names for the known values, so we can substitute the literals: 33+25+x=100 33 + 25 + x = 100

To find the unknown value we must add to 33+25=5833 + 25 = 58 to arrive at 100100, we can use subtraction. That is, x=10058=42 x = 100 - 58 = \boxed{42}

Another way to understand what we have done in the above step is that 58+x58 + x and 100100 refer to the same mathematical object, 100100. Therefore, we can subtract 5858 from both of them, and they will still be different names for the same object. That is, 58+x58=1005858 + x - 58 = 100 - 58. By using the properties of addition that we are familiar with, the left hand side is just another name for xx — and so we obtain the result seen above.

In this example, all of the variables we used represent specific mathematical objects. Three of them were immediately given to us in the question. The other, xx, still represented a specific mathematical object, but we had to figure it out.

It is not always the case that variables represent specific mathematical objects — sometimes, we can attach a quantifier to a variable, to say that a statement is true of all mathematical objects of a certain type at once.

Exercise 3: Properties of Whole Number Addition

Give a concrete example for each of the following properties of addition of whole numbers:

  1. For all integers aa, a+0=aa + 0 = a.

  2. For all integers aa and bb, a+b=b+aa + b = b + a.

  3. For all integers aa, bb and cc, (a+b)+c=a+(b+c)(a + b) + c = a + (b + c).

Solution

Solutions may vary.

  1. Take a=2a = 2. Then 2+0=22 + 0 = 2.

  2. Take a=3a = 3 and b=7b = 7. Then 3+7=10=7+33 + 7 = 10 = 7 + 3.

  3. Take a=1a = 1, b=2b = 2 and c=3c = 3. Then (1+2)+3=3+3=6=1+5=1+(2+3)(1 + 2) + 3 = 3 + 3 = 6 = 1 + 5 = 1 + (2 + 3).

Note that these concrete examples are applications of, not justifications for, the properties in question. It can be difficult to give a formal justification for true properties that involve quantifiers. However, if a statement is false, it is often much easier: we can simply give a single concrete example which does not satisfy the statement.

Exercise 4: Counterexamples

Give a counterexample for each of the following incorrect statements about whole numbers:

  1. For all integers aa, a+a>aa + a > a.

  2. For all integers aa and bb, ab=baa - b = b - a.

  3. For all integers aa, bb and cc, a+b=a+ca + b = a + c.

Solution

Solutions may vary.

  1. Take a=0a = 0. Then a+a=0+0=00=aa + a = 0 + 0 = 0 \not > 0 = a.

  2. Take a=1a = 1 and b=0b = 0. Then ab=10=11=01=baa - b = 1 - 0 = 1 \ne -1 = 0 - 1 = b - a.

  3. Take a=0a = 0, b=0b = 0, and c=1c = 1. Then a+b=0+0=01=0+1=a+ca + b = 0 + 0 = 0 \ne 1 = 0 + 1 = a + c.

Note that even though each of the statements is false, they all have certain cases where they do hold. In the future, we will see techniques to find out exactly which cases the statement is true in.

Rational Numbers

The motivation behind fractions is similar to that of expanding the whole numbers to the integers. Many things in the real world are divisible into portions. For instance, we can cut a cake into slices. Natural numbers are good at counting wholes, but bad at measuring parts of wholes.

To solve this problem, we introduce the concept of splitting a single whole into nn equal parts, where nn is a positive whole number. (Later, we will allow nn to be negative, but it cannot be zero — it is not possible to split something into zero equal parts.) We write this as 1n\frac{1}{n}, and call each of the equal parts an nnth part. (If n:=10n := 10, then they are called tenths.) We may sometimes require more than one such equal part. If we want mm parts where mm is any integer, we will write it as mn\frac{m}{n}.

All numbers that can be formed from fractions of integers are called rational numbers. We can talk about rational numbers as an extension to the integers, just like how we did with integers above, as an extension to the natural numbers. Each integer is still a rational number: for every integer nn, and n=n1n = \frac{n}{1}.

There are various simple properties of fraction addition and subtraction, and multiplication, by counting the number of of nnth parts:

  1. For all integers aa, bb, and cc, if c>0c > 0, then ac+bc=a+bc\frac{a}{c} + \frac{b}{c} = \frac{a+b}{c}.

  2. For all integers aa, bb, and cc, if c>0c > 0, then acbc=abc\frac{a}{c} - \frac{b}{c} = \frac{a-b}{c}.

  3. For all integers aa, bb, and cc, if c>0c > 0, then ac×b=abc\frac{a}{c} \times b = \frac{ab}{c}.

With the integers, we could not in general perform exact division on any two integers. Fractions cover the case of dividing almost any two integers (almost, since the denominator may not be 00). But do we still have closure under addition, subtraction, and multiplication? Moreover, do we almost have closure under division (if we disallow a zero divisor)? We only saw some special cases of these operations above. We need to introduce stronger techniques — a general way to do addition, subtraction, and multiplication of fractions, to show that this closure is indeed the case.

Fraction Multiplication

We saw above how to multiply a fraction by an integer. This has the same meaning as multiplying two integers; we can think of it as adding (maybe in the negative direction) repeated copies of the fraction. But it is harder to extend this idea toward multiplying two fractions: what do we mean when we say we want 12\frac{1}{2} copies of 13\frac{1}{3}?

Luckily, there is in fact a single reasonable meaning for this. Recall that mn\frac{m}{n} means to split a single whole into equal nnth parts, then take mm copies of the nnth parts. We can replace the single whole with something that is itself an arbitrary rational number: let q×mnq \times \frac{m}{n} mean to split qq into nnth parts, and then take mm of those. Then, 1×mn=mn1 \times \frac{m}{n} = \frac{m}{n}, as we would expect.

How do we split ac\frac{a}{c} into nnth parts? We can split each of the aa copies of the bbth parts into nn parts, then only take one of every nn. Each small part is a bbth part split further into nn parts. In a single whole, there would be c×nc\times n equal such parts, so these small parts are in fact 1c×n\frac{1}{c\times n} each.

Therefore, we can justify the following fact (or definition, or a sort) about fraction multiplication:

Fact 1: Fraction Multiplication

Let aa, bb, cc and dd be integers with c0dc \ne 0 \ne d. Then ac×bd=a×bc×d \frac{a}{c} \times \frac{b}{d} = \frac{a \times b}{c \times d}

Equivalence Classes

It happens to be the case with fractions that distinct ordered pairs might represent the same quantity. For instance, 36\frac{3}{6} and 714\frac{7}{14} are different pairs of numbers, but they represent the same fraction: one half. All the fractions that represent the same particular quantity form a so-called equivalence class. In a sense, this is not very different from 252 - 5 and 363 - 6 representing the same change in quantity 3-3.

How can we decide whether two fractions represent the same quantity? That is, suppose that ac\frac{a}{c} and bd\frac{b}{d} are rational numbers. Are they equal? In the case where the denominator is the same, this is easy to answer: just compare the numerators. Rational numbers with the same denominator are equal if and only if the numerators are equal.

Fact 2: Equivalence of Fractions With Equal Denominator

Let aa, bb, and cc be integers, and c0c \ne 0. Then ac=bc    a=b \frac{a}{c} = \frac{b}{c} \iff a = b

If the denominators are not the same, we need to rewrite the two fractions to have the same denominator. We can do this by first noticing the following fact, which we can obtain from our knowledge of fraction multiplication:

Fact 3: Common Factor of Numerator and Denominator

Let aa, bb and cc be integers, and c0bc \ne 0 \ne b. Then ac=ac×1=ac×bb=a×bc×b \frac{a}{c} = \frac{a}{c} \times 1 = \frac{a}{c} \times \frac{b}{b} = \frac{a \times b}{c \times b}

This fact allows us to rewrite ac\frac{a}{c} as a×dc×d\frac{a\times d}{c\times d}, and bd\frac{b}{d} as b×cd×c\frac{b\times c}{d\times c}. Now the denominators are the same (remember c×d=d×cc\times d = d\times c for any integers cc and dd). So we can simply compare a×da\times d with b×cb\times c!

Visually, we are multiplying the top-left with the bottom-right, and the top-right with the bottom-left. This makes a cross shape, so one way to remember this technique is that it is often called “cross-multiplication”.

Fact 4: Cross-multiplication

Let aa, bb, cc and dd be integers, and c0dc \ne 0 \ne d. Then ab=cd    a×d=b×c \frac{a}{b} = \frac{c}{d} \iff a\times d = b\times c

Addition and Subtraction

We have seen above the example of rewriting fractions with a common denominator in order to compare them. But another use of fractions with a common denominator is that they are easy to add and subtract. We can apply the same technique:

Exercise 5: Fraction Addition and Subtraction

Rewrite the fractions using a common denominator in order to calculate:

  1. 12+13=\displaystyle\frac{1}{2} + \frac{1}{3} =

  2. 1213=\displaystyle\frac{1}{2} - \frac{1}{3} =

  3. 1323=\displaystyle\frac{1}{3} - \frac{2}{-3} =

Solution
  1. 12+13=1×32×3+1×23×2=36+26=56\displaystyle\frac{1}{2} + \frac{1}{3} = \frac{1\times 3}{2\times 3} + \frac{1\times 2}{3\times 2} = \frac{3}{6} + \frac{2}{6} = \boxed{\frac{5}{6}}

  2. 1213=1×32×31×23×2=3626=16\displaystyle\frac{1}{2} - \frac{1}{3} = \frac{1\times 3}{2\times 3} - \frac{1\times 2}{3\times 2} = \frac{3}{6} - \frac{2}{6} = \boxed{\frac{1}{6}}

  3. 1323=132×(1)3×(1)=1323=13\displaystyle\frac{1}{3} - \frac{2}{-3} = \frac{1}{3} - \frac{2\times(-1)}{-3\times(-1)} = \frac{1}{3} - \frac{-2}{3} = \boxed{\frac{1}{3}}

In general, we can derive formulas for addition and subtraction of fractions, but you should not memorize them. It is more useful to understand the process of arriving at the formulas.

Fact 5: Fraction Addition & Subtraction

Let aa, bb, cc and dd be integers with c0dc \ne 0 \ne d. Then ac+bd=a×dc×d+b×cd×c=a×d+b×cc×d \frac{a}{c} + \frac{b}{d} = \frac{a \times d}{c \times d} + \frac{b \times c}{d \times c} = \frac{a\times d + b\times c}{c \times d} and acbd=a×dc×db×cd×c=a×db×cc×d \frac{a}{c} - \frac{b}{d} = \frac{a \times d}{c \times d} - \frac{b \times c}{d \times c} = \frac{a\times d - b\times c}{c \times d}

Notice the resemblance to cross-multiplication. This is not accidental! From the subtraction formula, we see that if two fractions are equal, their difference is zero, and vice versa.

Simplification

Often, given some fraction, we want to find the equivalent fraction with the smallest possible positive integer denominator. This is called the simplest form and is is useful for various reasons:

Reducing a fraction to simplest form is a matter of finding the largest common factor of both the numerator and denominator, and then dividing both the numerator and denominator by it.

Exercise 6: Fraction Operations and Simplification

Compute each of the following, then reduce it to simplest form.

  1. 38×27=\displaystyle \frac{3}{8} \times \frac{2}{7} =

  2. 59×25=\displaystyle \frac{5}{9} \times \frac{2}{5} =

Solution
  1. 38×27=656=328\displaystyle \frac{3}{8} \times \frac{2}{7} = \frac{6}{56} = \boxed{\frac{3}{28}}

  2. 59×25=1045=29\displaystyle \frac{5}{9} \times \frac{2}{5} = \frac{10}{45} = \boxed{\frac{2}{9}}

Exercise 7: A Telescoping Product

Compute and reduce to simplest form: 12×23×34××99100 \frac{1}{2} \times \frac{2}{3} \times \frac{3}{4} \times \dots \times \frac{99}{100}

Solution

Each fraction in this product, except for the last one, has a numerator which is the same as the denominator of the following fraction. These will cancel out if we multiply the fractions. For instance, 12×23=1×22×3\frac{1}{2} \times \frac{2}{3} = \frac{1\times 2}{2 \times 3}, and we can divide 22 from both the numerator and the denominator to get 13\frac{1}{3}.

In this manner, all the numbers except for the first 11 in the numerator and the last 100100 in the denominator will get cancelled out. So we are left with 1100\boxed{\frac{1}{100}}.

Linear Equations

Ratios and Rates

Frequently, we may know certain quantities not in absolute terms, but only in relative terms. What does this mean? Let’s say you see two weights on the ground, labeled A and B. You might notice that B is twice as hard to lift up as A. Without a scale, it is hard for you to measure the actual weight of the objects, but you might be able to estimate the ratio of their weights.

In another example, suppose you are counting cars as they pass by on a highway. You might notice that for every 55 personal cars you count, you see about one truck. It might be hard for you to estimate how many cars are passing by each minute, since it is hard to guess how long a minute is, but you could estimate the ratio of personal cars to trucks on this highway.

We will see a couple of word problems that involve known ratios, and try to determine the absolute quantities using additional information provided to us.

Exercise 8: Carcross Car Count

The community of Carcross, Yukon is quite small, with a population of only 301. Caroline counts the number of cars that passed her house over an hour and noticed that:

  • There were 1515 cars that passed in total.

  • All cars were either blue or silver.

  • Twice as many cars were blue than silver.

How many blue cars passed by? How many silver cars?

Solution

Let bb and ss be integers representing the number of cars that passed her house. Then our equations are: b+s=15b=2s \begin{aligned} b + s &= 15 \\ b &= 2s \\ \end{aligned}

We can substitute the second equation into the first equation, since 2s2s and bb refer to the same mathematical object. Thus: 2s+s=153s=1513×3s=13×15s=5 \begin{aligned} 2s + s &= 15 \\ 3s &= 15 \\ \textcolor{blue}{\frac{1}{3}} \times 3s &= \textcolor{blue}{\frac{1}{3}} \times 15 \\ s &= 5 \\ \end{aligned} so there were 5\boxed{5} silver cars. Then we can substitute this back into that second equation b=2sb = 2s. So b=2×5b = 2 \times 5, and so b=10b = 10. Therefore there were 10\boxed{10} blue cars.

Exercise 9: Produce Price Sum

A supermarket stocks four kinds of produce: apples, oranges, tomatos, and potatos. Apples cost twice as much as oranges, and oranges cost twice as much as tomatos. August bought 1kg1\,\mathrm{kg} of each kind of produce, and the total price was $20\$20.

Can we figure out what was the price of tomatos? If so, what was it?

Solution

Let aa, oo, tt, and pp be rational numbers representing the prices of apples, oranges, tomatos, and potatos, all in dollars per kilogram. Then our equations are a=2oo=2ta+o+t+p=20 \begin{aligned} a &= 2o \\ o &= 2t \\ a + o + t + p &= 20 \\ \end{aligned}

If we substitute the first and second equations into the third, we get 4t+2t+t+p=20 4t + 2t + t + p = 20 and thus 7t+p=207t + p = 20. But there is a problem: there are multiple solutions to this! There are even multiple integer solutions; for instance, maybe t=2t = 2 and p=6p = 6, or t=1t = 1 and p=13p = 13. So we do not have enough information to figure out the price of tomatos.

Exercise 10: An Unlikely Sprint?

Miran, Gosse, and Brayan participated in a 100m100\,\mathrm{m} sprint. Miran tells you that she won and was twice as fast as Brayan. Gosse agrees that Miran won, and says he was close behind with a time only 20%20\% higher than Miran’s. Brayan says that he came in last with a time 8s8\,\mathrm{s} longer than Gosse’s time.

You know, however, that sometimes Miran, Gosse, and Brayan aren’t the most reliable. Is it mathematically possible for all of these accounts to be accurate? If so, do we have enough information to determine what were each of their times? If so, calculate the times.

Solution

Let xx denote Miran’s time, yy denote Gosse’s time, and zz denote Brayan’s time. Based on what everyone said, the equations are: x=12zy=1.2xz=y+8s \begin{aligned} x &= \frac{1}{2} z \\ y &= 1.2 x \\ z &= y + 8\,\mathrm{s} \\ \end{aligned}

Substitute the expression for xx given by first equation into the second equation, to get y=1.2×12z=35z y = 1.2 \times \textcolor{blue}{\frac{1}{2} z} = \frac{3}{5} z

Now substitute this into the third equation, to get z=35z+8s35z+z=35z+35z+8s25z=8s52×25z=52×8sz=20s \begin{aligned} z &= \textcolor{blue}{\frac{3}{5} z} + 8\,\mathrm{s} \\ \textcolor{blue}{-\frac{3}{5} z} + z &= \textcolor{blue}{-\frac{3}{5} z} + \frac{3}{5} z + 8\,\mathrm{s} \\ \frac{2}{5} z &= 8\,\mathrm{s} \\ \textcolor{blue}{\frac{5}{2}} \times \frac{2}{5} z &= \textcolor{blue}{\frac{5}{2}} \times 8\,\mathrm{s} \\ z &= 20\,\mathrm{s} \\ \end{aligned}

We can now substitute this back into y=35zy = \frac{3}{5} z to get y=12sy = 12\,\mathrm{s}, and into x=12zx = \frac{1}{2} z to get x=10sx = 10\,\mathrm{s} (very fast indeed, maybe suspiciously so!).

We can check that these times match all three of the equations above, so it is mathematically possible and unique. This doesn’t mean that the statements were accurate, but they are not mathematically contradictory.

Exercise 11: Raccoon Population Growth

The number of raccoons in the city of Raccoonville is plotted on the following chart:

If the current trend continues, by what year will there be 180 raccoons in Raccoonville?

Solution

In this problem, we have to figure out the rate of increase of raccoons from the chart. The trend seems to be a straight line with an increase of 1010 raccoons every year. We can assume this trend will continue as the question asks us in that hypothetical. Let xx denote the number of years after 20192019. Then the number of raccoons will be 130+10x130 + 10x. We want to solve: 130+10x=180130+130+10x=130+18010x=5010x10=5010x=5 \begin{aligned} 130 + 10x &= 180 \\ \textcolor{blue}{-130} + 130 + 10x &= \textcolor{blue}{-130} + 180 \\ 10x &= 50 \\ \frac{10x}{\textcolor{blue}{10}} &= \frac{50}{\textcolor{blue}{10}} \\ x &= 5 \\ \end{aligned}

Since this means 55 years after 20192019, the year that there will be 180180 raccoons in Raccoonville is 2019+5=20242019 + 5 = \boxed{2024}.

A General Approach

The examples above all have the same general form, where we have a number of equations of the form ax=bax = b, where aa, bb, and xx are rational numbers, and we know aa and bb (but not xx). Equations of this form are called “linear equations”. Why are they linear? Intuitively, one reason is that if we draw a line graph of the value of axax as we increase the value of xx, we will find a straight line:

The solution to ax=bax=b, if one exists, is simply where this straight line reaches a vertical height of bb. We saw a general technique to do this if a0a\ne 0: we can multiply both sides by 1a\frac{1}{a} (or equivalently, divide both sides by aa). Thus the solution is x=bax = \frac{b}{a}.

But what if a=0a = 0? In this case, we cannot divide by aa, since division by 00 is meaningless. Instead, we have the flat blue line in the graph. Obviously, this line will never reach any vertical height except 00! Therefore, there is no solution if b0b \ne 0. If b=0b = 0, then we still have no information about xx: any rational number will do. In this case, there are multiple solutions.

Real Numbers

Exponents

Recall that a positive exponent represents repeated multiplication, much like how a positive multiplier represents repeated addition. We can express this rule recursively using the following identity: xn+1=xxn x^{n+1} = xx^n which says that if you increase the exponent by 11 it is the same as multiply one more copy of the base.

Exercise 12: Positive Integer Exponents

Evaluate each expression. Write your answer as an integer in literal form.

  1. 24=2^4=

  2. 32=3^2=

  3. 106=10^6=

Solution
  1. 24=2×2×2×2=162^4=2\times 2\times 2\times 2=\boxed{16}

  2. 32=3×3=93^2=3\times 3=\boxed{9}

  3. 106=10×10×10×10×10×10=100000010^6=10\times 10 \times 10\times 10\times 10\times 10=\boxed{1000000}

There are a variety of facts about positive integer exponents that we can justify using the properties of multiplication. Here are a few. You do not need to memorize these, but it is helpful to understand why they are true.

Fact 6: Sum of Exponents

If aa is a rational number, and nn and mm are positive integers, then am×an=am+na^m \times a^n = a^{m+n}. That is: am×an=a××am times×a××an times=a××am+n times=am+n a^m \times a^n = \underbrace{a\times\dots\times a}_{m\text{ times}} \times \underbrace{a\times\dots\times a}_{n\text{ times}} = \underbrace{a\times\dots\times a}_{m+n\text{ times}} = a^{m+n}

(Notice how when n=1n=1, this is just the recursive rule we discussed above.)

Fact 7: Product of Exponents

If aa is a rational number, and nn and mm are positive integers, then (am)n=amn{(a^m)}^n = a^{mn}. That is: (am)n=(a××am times)××(a××am times)n times=a××amn times=amn {(a^m)}^n = \underbrace{\left(\underbrace{a\times\dots\times a}_{m\text{ times}}\right) \times \dots \times \left(\underbrace{a\times\dots\times a}_{m\text{ times}}\right)}_{n\text{ times}} = \underbrace{a\times\dots\times a}_{mn\text{ times}} = a^{mn}

It is frequently useful to extend the system of exponents to non-positive numbers, which can be done by applying the recursive rule in the other direction. Thus we can derive that x0=1x^0 = 1 and that x1=1xx^{-1} = \frac{1}{x} for all non-zero values of xx. We can check that this extension retains the sum and product rules of exponents that we mentioned above, which is a useful feature.

Exercise 13: Negative and Zero Exponents

Evaluate each expression. Write your answer in simplest form as a fraction or as an integer literal.

  1. 11=1{-1}^{-1}=\boxed{-1}

  2. 42=1164^{-2}=\boxed{\frac{1}{16}}

  3. 9990=1{999}^0=\boxed{1}

  4. (174)0=1{\left(\frac{-17}{4}\right)}^0=\boxed{1}

A useful application of exponents is in shrinking large numbers to an more humanly understandable format. Indeed, we have a poor conception of how large certain numbers are. In science, it's common to see numbers way too large to count or way too small to visualize. Scientists have developed notation using exponents to make comparing such numbers easier. In scientific notation, a number xx is written as y×10ny\times 10^n, where yy is a number with exactly one non-zero decimal digit before the decimal point, and nn is a (positive, negative, or zero) exponent.

Exercise 14: Scientific Notation

Express in scientific notation.

  1. 1234=1.234×1031234 = \boxed{1.234\times 10^3}

  2. 0.000987=9.87×1040.000987 = \boxed{9.87\times 10^{-4}}

A natural question to ask after having defined negative exponents is: what about rational exponents? Could those be useful? In fact, for a positive rational number base, we may sometimes define rational exponents in a way that preserves both the sum and product laws of exponents mentioned above. The only way to do this is to ensure that (x1n)n=x{\left(x^{\frac{1}{n}}\right)}^n = x, that is, x1nx^{\frac{1}{n}} must be the nnth root of xx. We can also write that as xn\sqrt[n]{x}. With this definition and the product law, we can define xqx^q for any positive rational base xx and any rational exponent qq.

Exercise 15: Fractions, Exponents & Radicals

Evaluate each expression. Write your answer in simplest form as a fraction, or as an integer using the place value system.

  1. 412=24^{\frac{1}{2}}=\boxed{2}

  2. 932=279^{\frac{3}{2}}=\boxed{27}

  3. (23)3=827{\left(\frac{2}{3}\right)}^3=\boxed{\frac{8}{27}}

  4. 1625=45\sqrt{\frac{16}{25}}=\boxed{\frac{4}{5}}

  5. 256814=43\sqrt[4]{\frac{256}{81}}=\boxed{\frac{4}{3}}

With integer exponents of rational numbers, we are always guaranteed that the result exists and is a rational number (since we compute these exponents by multiplying and dividing rational numbers, which are closed under these operations). As we will see later, when rational exponents are concerned, the result may not exist as a rational number.

Definition of a Real Number

In previous sections, we were careful to only discuss square roots and other rational exponents when we knew that there was in fact a rational number that worked. In general, we cannot assume this is always the case.

We can give a proof that no rational number is equal to 3\sqrt{3}. One way to see this is a so-called proof by contradiction. In this kind of argument, we assume that 3\sqrt{3} is in fact rational. That is, if 3=pq\sqrt{3} = \frac{p}{q} for some integers pp and q0q\ne 0 in simplest form, then p2q2=3\frac{p^2}{q^2} = 3, so p2=3q2p^2 = 3q^2. We see that 33 is a factor of the right hand side, so it must also be a factor of the left hand side. But the left hand side is a square, so p=3mp=3m for some integer nn. Then 9k2=3q29k^2 = 3q^2 so 3m2=q23m^2 = q^2. Now 33 is a factor of the left hand side, so it should also be a factor of the right hand side. But the right hand side is a square, so q=3nq=3n for some integer nn. But then pq=3m3n\frac{p}{q} = \frac{3m}{3n} is clearly not in simplest form, so we have reached an absurd state — a contradiction. But of course, this is not possible, so something has gone wrong. Our argument is correct, so it must be our assumption that was wrong. The assumption we made was that 3\sqrt{3} is a rational number.

Many of you will probably be uneasy with how we are talking about 3\sqrt{3} as if it must exist, when we have already shown that no rational number squares to 33. We have already complicated things by introducing fractions to the easier world of the integers! If we make the claim that 3\sqrt{3} should exist as a number, then we run the risk of making things even more complicated and difficult. We don’t, in general, say that everything which doesn’t exist must be a new kind of number (we are happy to say that 10\frac{1}{0} simply does not exist).

There is in fact a good reason, however, to suggest that 3\sqrt{3} might be a useful number to have. The reason for this is that we can already get very very close to a potential square root of 33! An example of this is the rational number 390023168577698122517998136852482.9999999999999997\frac{3900231685776981}{2251799813685248} \approx 2.9999999999999997. In fact, we can get arbitrarily close. A way to visualize this is to see a graph that maps numbers to their squares.

We can start by plotting a point on some axes for integer values. The horizontal distance represents the number xx, which we vary to take on the integer values we want to show. The vertical distance represents the square of that number, x2x^2.

Of course, we can also take the square of rational numbers. We can think of this as increasing the precision of our graph by plotting more points, for example, every 0.10.1.

If we imagine that we continue this process, getting more and more precision, we would expect this curve to become continuous. We can see that it reaches a vertical value of 33 at some point, and we saw earlier that no such rational number point exists. But the curve suggests that we can define a new kind of number that is on the number line, and we can get close to using rational numbers, but can’t get exactly there. This concept is called a real number.

There are many ways to formally define a real number, but it is not necessary to understand such a definition to understand what a real number is. Intuitively, we expect real numbers to plug the holes in continuous lines that we can draw. We can get as close as we want to a real number using rational numbers. In fact we don’t even need to use all rational numbers. All finite decimals are rational numbers, and by adding more decimal points, we can get closer and closer to any rereal number we are interested in. 31.73\sqrt{3} \approx 1.73, but an even better approximation is 31.732\sqrt{3} \approx 1.732. We can keep getting more and more precise, but we can never reach the number itself because it is not rational (hence not a decimal).

Operations on Real Numbers

We are able to add, subtract, multiply, and divide (except by zero) real numbers, just as we can with rational numbers. (Remember that we can get as close as we want to a real number using rational numbers. So it makes sense that real numbers behave almost identically to rational numbers!)

However, unlike rational numbers, it is not always possible to write real numbers in a canonical simplest form. Instead, we can use algebraic techniques to make expressions look simpler from a human perspective.

Exercise 16: A Linear Equation using Real Numbers

Solve the following equation for real number xx: 2x=2 \sqrt{2} x = 2

Solution

We divide both sides by 2\sqrt{2}, yielding: 22x=22 \frac{\sqrt{2}}{\sqrt{2}} x = \frac{2}{\sqrt{2}}

Then, simplifying, x=22=212=21212=22212=2112=212=2 x = \frac{2}{\sqrt{2}} = 2\cdot \frac{1}{\sqrt{2}} = 2\cdot \frac{1}{2^{\frac{1}{2}}} = 2^2\cdot 2^{-\frac{1}{2}} = 2^{1 - \frac{1}{2}} = 2^{\frac{1}{2}} = \boxed{\sqrt{2}}

At this stage, it is helpful to understand some supplementary material on sets. This material is on a seperate page because it is not strictly related to what we are studying right now about algebra, but the notational conveniences of the material will be useful.

Polynomials

Linear Equations, Again

Recall earlier when we solved linear equations of the form ax=bax = b, where aa and bb are known rational (or real) numbers, and xx is an unknown rational (or real) number. The solution is to divide both sides of the equation by aa, which is valid because axax and bb are different names for the same rational (or real) number. This results in the solution x=bax = \frac{b}{a}.

Note that many equations that may look different are really linear equations after some rearranging. For example, ax+b=0ax + b = 0 can be rearranged into the linear equation ax=bax = -b. ax+b=cax + b = c can be rearranged into the linear equation ax=cbax = c - b. We will say that a linear equation that looks like ax+b=0ax + b = 0 is in “standard form”.

Let’s now look at a graphical method to solve a linear equation in standard form. What we will do is rewrite the right hand of the equation from 00 to another variable yy. We will then draw a graph, similar to what we did earlier when we introduced real numbers. Let us first consider the linear equation 2x6=02x - 6 = 0.

Note that the equation 2x6=y2x - 6 = y is more general than 2x6=02x - 6 = 0. If we set y:=0y := 0, then we get back the original equation 2x6=02x - 6 = 0. Therefore, to solve this equation we can look on the graphical plot for all the values on the line corresponding to y=0y = 0. We see that this is where the line intersects with the x-axis. This is called an x-intercept, or root, of the function y=2x6y = 2x - 6. From the plot, we see that the only root is x=3x = 3, which corresponds to the only solution to this equation.

Two Simple Quadratics

We will now use the knowledge about plots and roots to solve equations which are not linear. Let us start with a simple example.

Exercise 17: Roots of x21x^2 - 1

Plot y=x21y = x^2 - 1, determine its roots, and use this information to solve the equation x2=1x^2 = 1.

Solution

Here is our plot:

The roots are marked. They are x=1x = -1 and x=1x = 1, which correspond to the solutions to our equation x2=1x^2 = 1.

Exercise 18: Roots of (x3)(x+2)(x-3)(x+2)

Without using a plot, determine the roots of y=(x3)(x+2)y = (x-3)(x+2). These are also solutions to the standard form equation x2x6=0x^2 - x - 6 = 0; explain why.

Solution

The roots occur when y=0y = 0, so we want to solve 0=(x3)(x+2)0 = (x-3)(x+2). If the product of two numbers is 00, that means either the first number is 00 or the second number is 00 (or both). Therefore, the set of solutions to the equation 0=(x3)(x+2)0 = (x-3)(x+2) is the union of the set of solutions to x3=0x - 3 = 0 and the set of solutions to x+2=0x + 2 = 0. If x3=0x - 3 = 0, this is a simple linear equation, where x=3x = 3 is the only solution. If x+2=0x + 2 = 0, this is also a simple linear equation, where x=2x = -2 is the only solution.

Therefore, the roots are x{2,3}x \in \{-2, 3\}.

Note that by using the distributive property, we find that (x3)(x+2)=(x3)x+(x3)2=x23x+2x6=x2x6(x-3)(x+2) = (x-3)x + (x-3) \cdot 2 = x^2 - 3x + 2x - 6 = x^2 - x - 6. So in fact, x2x6=(x3)(x+2)x^2 - x - 6 = (x-3)(x+2) for all values of xx. Hence the solutions of the two equations must be the same!

Exercise 18 suggests a general approach to solving equations that involve xx and x2x^2 might be to decompose it into the product of two components which are both linear. This process is called factoring. An expression of the form ax2+bx+cax^2 + bx + c, where a0a\ne 0, bb, and cc are known, is called a quadratic polynomial, just as ax+bax + b where a0a\ne 0 and bb are known is called a linear polynomial.

Factoring Quadratics

Suppose we have (sxu)(txv)=0(sx - u)(tx - v) = 0, where ss, tt, uu, and vv are known real numbers with s0ts \ne 0 \ne t. Then we know the solutions are x{us,vt}x \in \left\{\frac{u}{s}, \frac{v}{t}\right\}. A quadratic polynomial written this way is easy to solve! Our goal is to take a polynomial in standard form, and convert it into this factored form.

It is easier to go backwards. From factored form, we can use distributivity to expand: (sxu)(txv)=stx2(sv+tu)x+uv(sx - u)(tx - v) = st x^2 - (sv + tu)x + uv. But what we want to figure out is how to turn standard form into factored form.

If we can write a polynomial in standard form to look like stx2(sv+tu)x+uvstx^2 - (sv + tu)x + uv, then we have found the solutions! The standard form is ax2+bx+cax^2 + bx + c, a0a \ne 0, so we need: a=sta = st, b=(sv+tu)b = - (sv + tu), c=uvc = uv. This is not easy to solve (in fact, it does not have a unique solution), so we need to make some simplifications first.

First of all, we need to to fix the fact that the solutions are not unique. We can factor, for example, 2x22=(2x2)(x+1)2x^2 - 2 = (2x-2)(x+1), but we can also factor it as 2x22=(x1)(2x+2)2x^2 - 2 = (x-1)(2x+2). The roots are of course the same, because the equations x+1=0x+1 = 0 and 2x+2=02x+2 = 0 have the same solutions. But they do not quite look the same. In order to force the factored form to be unique, we need to enforce that the linear polynomials are monic, that is, they have no leading coefficient (multiplier for the xx term). Instead, we will pull out those coefficients into a single multiplier for the entire quadratic polynomial.

That is, given (sxu)(txv)(sx - u)(tx - v), we would like to turn this into a(xu)(xv)a(x-u')(x-v'), where uu' and vv' are new coefficients. How do we calculate aa, uu', and vv'? Let’s focus on the two linear polynomials seperately. We know that sxu=s(xus)sx - u = s(x - \frac{u}{s}), by distributivity. Similarly, txv=t(xvt)tx - v = t(x - \frac{v}{t}). Therefore: (sxu)(txv)=s(xus)t(xvt)=st(xus)(xvt) (sx - u)(tx - v) = s(x - \frac{u}{s})t(x - \frac{v}{t}) = st(x - \frac{u}{s})(x - \frac{v}{t})

Therefore, we can set a:=sta := st and u:=usu' := \frac{u}{s} and v:=vsv' := \frac{v}{s}. Note that the expansion of a(xu)(xv)a(x-u')(x-v') into standard form is ax2a(u+v)x+auvax^2 - a(u' + v')x + au'v'. This is not actually any different from what we had before, but it looks somewhat closer to what we need! We also now see why we have reused aa for the leading coefficient in both forms — in fact, this leading coefficient will be the same going from monic factored form to standard form.

Again, we have not really made much progress — the goal is not to go from a factored form to standard form, but actually the opposite! But it turns out the new system of equations is easier to solve. We need: b=a(u+v)b = -a(u' + v'), c=auvc = au'v'. Another way to write this is ba=(u+v)\frac{b}{a} = -(u' + v'), and ca=uv\frac{c}{a} = u'v'. That is: after dividing by the leading coefficient, we want the constant coefficient to be the product of the solutions, and the xx coefficient to be the negative sum of the solutions. We can solve this problem with trial and error.

Exercise 19: Factoring a Quadratic via Trial and Error

Using trial and error, factor 25+5x6x225 + 5x - 6x^2.

Solution

In standard form, this is 6x2+5x+25-6x^2 + 5x + 25. We can divide by the leading coefficient to obtain 6(x256x256)6 (x^2 - \frac{5}{6}x - \frac{25}{6}). We want two numbers whose sum is 56\frac{5}{6}, and whose product is 256-\frac{25}{6}. One of them will need to be negative! Let’s guess that uu' and vv' will be rational numbers. Write them as mp\frac{m}{p} and nq\frac{n}{q}. We would like mn=25mn = -25 and pq=6pq = 6. There are essentially two options for pp and qq, such as 11 and 66, or 22 and 33 (all other options involve negatives or just switching around pp and qq). There are then six options for mm: 25,5,1,1,5,25-25, -5, -1, 1, 5, 25. The value of nn corresponding to each of those options would be 1,5,25,25,5,11, 5, 25, -25, -5, -1. Now we just need to guess and check!

  • If p=1p=1, q=6q=6, m=25m=-25, n=1n=1, then the sum is 251+16=149656\frac{-25}{1} + \frac{1}{6} = \frac{-149}{6} \ne \frac{5}{6}.

  • If p=1p=1, q=6q=6, m=5m=-5, n=5n=5, then the sum is 51+56=25656\frac{-5}{1} + \frac{5}{6} = \frac{-25}{6} \ne \frac{5}{6}.

  • If p=1p=1, q=6q=6, m=1m=-1, n=25n=25, then the sum is 11+256=19656\frac{-1}{1} + \frac{25}{6} = \frac{19}{6} \ne \frac{5}{6}.

  • If p=1p=1, q=6q=6, m=1m=1, n=25n=-25, then the sum is 11+256=19656\frac{1}{1} + \frac{-25}{6} = \frac{-19}{6} \ne \frac{5}{6}.

  • If p=1p=1, q=6q=6, m=5m=5, n=5n=-5, then the sum is 51+56=25656\frac{5}{1} + \frac{-5}{6} = \frac{25}{6} \ne \frac{5}{6}.

  • If p=1p=1, q=6q=6, m=25m=25, n=1n=-1, then the sum is 251+16=149656\frac{25}{1} + \frac{-1}{6} = \frac{149}{6} \ne \frac{5}{6}.

  • If p=2p=2, q=3q=3, m=25m=-25, n=1n=1, then the sum is 252+13=73656\frac{-25}{2} + \frac{1}{3} = \frac{-73}{6} \ne \frac{5}{6}.

  • If p=2p=2, q=3q=3, m=5m=-5, n=5n=5, then the sum is 52+53=5656\frac{-5}{2} + \frac{5}{3} = \frac{-5}{6} \ne \frac{5}{6}.

  • If p=2p=2, q=3q=3, m=1m=-1, n=25n=25, then the sum is 12+253=47656\frac{-1}{2} + \frac{25}{3} = \frac{47}{6} \ne \frac{5}{6}.

  • If p=2p=2, q=3q=3, m=1m=1, n=25n=-25, then the sum is 12+253=47656\frac{1}{2} + \frac{-25}{3} = \frac{-47}{6} \ne \frac{5}{6}.

  • If p=2p=2, q=3q=3, m=5m=5, n=5n=-5, then the sum is 52+53=56\frac{5}{2} + \frac{-5}{3} = \frac{5}{6} — we’re finally done!

So the factored form is 6(x52)(x+53)-6 (x - \frac{5}{2})(x + \frac{5}{3}).

You might have noticed that this exercise was really tedious. In fact, we will find that there is a more direct way to figure out the numbers we want. Nevertheless, this primitive method can be useful sometimes when the solutions are less difficult to find.

Quadratic Formula

We will introduce an additional form, besides standard form and factored form, for a quadratic polynomial. This new form will be called vertex form.

The motivation for vertex form is that the graph of any quadratic polynomial will look like a parabola, either opening upwards or downwards based on the leading coefficient. All parabolas have either a minimum or a maximum yy-value, which is attained at exactly one point. This point is called the vertex.

The vertex form of a quadratic polynomial is: a(xx0)2+y0a(x - x_0)^2 + y_0, where x0x_0 and y0y_0 are real numbers such that (x0,y0)(x_0, y_0) are the coordinates of the vertex.

Exercise 20: Vertex Form to Standard Form

Rewrite 4(x+1)294(x + 1)^2 - 9 in standard form.

Solution

We expand the square using distributivity: 4(x+1)29=4(x+1)(x+1)9=4(x2+2x+1)9=4x2+8x+49=4x2+8x5 \begin{aligned} 4(x+1)^2 - 9 &= 4(x+1)(x+1) - 9 \\ &= 4(x^2 + 2x + 1) - 9 \\ &= 4x^2 + 8x + 4 - 9 \\ &= 4x^2 + 8x - 5 \end{aligned} which is in standard form.

Exercise 21: Solving a Quadratic in Vertex Form

Solve 4(x+1)29=04(x + 1)^2 - 9 = 0.

First, add 99 to both sides: 4(x+1)2=9 4(x+1)^2 = 9

Then, divide both sides by 22: (x+1)2=94 (x+1)^2 = \frac{9}{4}

We know that there are two possibilities for x+1x+1, i.e. x+1{32,32} x + 1 \in \left\{\frac{-3}{2}, \frac{3}{2}\right\}

Therefore, the two possibilities for xx are x{52,12} x \in \left\{\frac{-5}{2}, \frac{1}{2}\right\}

Therefore, we can solve quadratic equations if we put them in vertex form. Based on whether the parabola opens upwards or downwards, and where the vertex is relative to the xx-axis, the equation will either have 00, 11, or 22 real solutions.

Now the question becomes: given a quadratic polynomial in standard form, can we put it in vertex form? Yes, we can! We just need to use distributivity in a clever way. We know that (xx0)2=x22x0x+x02(x - x_0)^2 = x^2 - 2x_0 x + {x_0}^2. Therefore, we want to get something which looks like this by manipulating the standard form expression. Let’s start with a concrete example:

4x2+8x5=4(x2+2x)5so we need x0=1=4(x2+2x+11)5=4(x2+2x+1)45=4(x+1)29 \begin{aligned} 4x^2 + 8x - 5 &= 4\left(x^2 + 2x\right) - 5 && \text{so we need }x_0 = 1 \\ &= 4\left(x^2 + 2x + \textcolor{blue}{1 - 1}\right) - 5 \\ &= 4\left(x^2 + 2x + 1\right) - 4 - 5 \\ &= 4{(x + 1)}^2 - 9 \end{aligned}

So in fact, it is possible to rewrite a standard form quadratic polynomial into vertex form, and this time we did not need any trial and error. In general:

ax2+bx+c=a(x2+bax)+c=a(x2+2b2ax+b24a2b24a2)+c=a(x2+2b2ax+b24a2)b24a+c=a(x+b2a)2b24a+c=a(xb2a)2b24ac4a \begin{aligned} ax^2 + bx + c &= a\left(x^2 + \frac{b}{a}x\right) + c \\ &= a\left(x^2 + 2\frac{b}{2a}x + \textcolor{blue}{\frac{b^2}{4a^2} - \frac{b^2}{4a^2}}\right) + c \\ &= a\left(x^2 + 2\frac{b}{2a}x + \frac{b^2}{4a^2}\right) - \frac{b^2}{4a} + c \\ &= a{\left(x + \frac{b}{2a}\right)}^2 - \frac{b^2}{4a} + c \\ &= a{\left(x - \frac{-b}{2a}\right)}^2 - \frac{b^2 - 4ac}{4a} \end{aligned}

We can furthermore find the roots of this quadratic polynomial in vertex form, using the method above:

ax2+bx+c=0a(xb2a)2b24ac4a=0a(xb2a)2=b24ac4a(xb2a)2=b24ac4a2xb2a=±b24ac2ax=b2a±b24ac2ax=b±b24ac2a \begin{aligned} ax^2 + bx + c &= 0 \\ a{\left(x - \frac{-b}{2a}\right)}^2 - \frac{b^2 - 4ac}{4a} &= 0 \\ a{\left(x - \frac{-b}{2a}\right)}^2 &= \frac{b^2 - 4ac}{4a} \\ {\left(x - \frac{-b}{2a}\right)}^2 &= \frac{b^2 - 4ac}{4a^2} \\ x - \frac{-b}{2a} &= \pm \frac{\sqrt{b^2 - 4ac}}{2a} \\ x &= \frac{-b}{2a} \pm \frac{\sqrt{b^2 - 4ac}}{2a} \\ x &= \frac{-b \pm \sqrt{b^2 - 4ac}}{2a} \\ \end{aligned}

This formula gives us a way to find the solutions of any quadratic polynomial in standard form. We will call it the quadratic formula.

Polynomials

Let us first define two terms that will be useful. A monomial is a whole number power of a variable xx multiplied by some coefficient. For example, the following are monomials:

The following are not monomials:

A polynomial in a single variable xx is an expression that involves the sum of some monomials (maybe just one). All monomials are also polynomials. For example, the following are polynomials:

The following are not polynomials:

A polynomial equation in a single variable xx is an equation that has polynomials on the left and right hand sides. For example, the following are polynomial equations:

In particular, all quadratic equations and linear equations (which we saw in the previous few weeks) are also polynomial equations.

The degree of a polynomial is the highest exponent (with a non-zero coefficient). For instance, the following polynomials correspond to degrees:

We will define the degree of 00 to be -∞, because there is no term with a a non-zero coefficient.

The leading coefficient is the coefficient of the highest exponent term. The constant term is the coefficient of the x0x^0 term, i.e. the monomial which does not depend on xx (hence, constant).

Factoring Polynomials

We have already seen how to solve polynomial equations of degrees 11 and 22 (they are linear equations and quadratic equations respectively). To solve polynomial equations with higher degrees, we would ideally want to write the equation in factored form (just like what we did with quadratic equations). There are a few tools we can use to do this.

The first tool we will need to learn is called long division. The idea behind this technique is that if we know one factor of a polynomial, we can get the other factor. This is similar to the concept of division for rational numbers that we are familiar with. In fact, the technique is exactly the same, except that we use the exponents of the variables instead of place values. The idea is to consider only the leading coefficient (coefficients corresponding to the highest degree monomial) each time. Let us do some examples of long division.

Exercise 22: Examples of Polynomial Long Division

Use polynomial long division to compute the following quotients.

  1. x21x+1\frac{x^2 - 1}{x + 1}

  2. x327x2+3x+9\frac{x^3 - 27}{x^2 + 3x + 9}

Solution
  1. We write x21=x(x+1)+x21(x2+x)=x(x+1)x1=x(x+1)1(x+1)x1(x1)=x(x+1)1(x+1)=(x1)(x+1) \begin{aligned} x^2 - 1 &= \textcolor{blue}{x(x + 1)} + x^2 - 1 - \textcolor{red}{(x^2 + x)} \\ &= x(x + 1) - x - 1 \\ &= x(x + 1) - \textcolor{blue}{1(x + 1)} - x - 1 - \textcolor{red}{(- x - 1)} \\ &= x(x + 1) - 1(x + 1) \\ &= (x - 1)(x + 1) \\ \end{aligned} and so x21x+1=x1\frac{x^2 - 1}{x + 1} = x - 1.

  2. We write x327=x(x2+3x+9)+x327(x3+3x2+9x)=x(x2+3x+9)3x29x27=x(x2+3x+9)3(x2+3x+9)3x29x27(3x29x27)=x(x2+3x+9)3(x2+3x+9)=(x3)(x2+3x+9) \begin{aligned} x^3 - 27 &= \textcolor{blue}{x(x^2 + 3x + 9)} + x^3 - 27 - \textcolor{red}{(x^3 + 3x^2 + 9x)} \\ &= x(x^2 + 3x + 9) - 3x^2 - 9x - 27 \\ &= x(x^2 + 3x + 9) - \textcolor{blue}{3(x^2 + 3x + 9)} - 3x^2 - 9x - 27 - \textcolor{red}{(- 3x^2 - 9x - 27)} \\ &= x(x^2 + 3x + 9) - 3(x^2 + 3x + 9) \\ &= (x - 3)(x^2 + 3x + 9) \\ \end{aligned} and so x327x2+3x+9=x3\frac{x^3 - 27}{x^2 + 3x + 9} = x - 3.

We now know what to do when we have a factor of the polynomial. How do we figure out The next tool that will be useful is called the remainder theorem. In fact, the remainder theorem is more general (it tells us more) than the version we will look at, but the version we will learn is enough for our purposes.

The remainder theorem states that if p(x)p(x) is a polynomial, and aa is some number, then p(a)=0p(a) = 0 if and only if (xa)(x - a) is a factor of p(x)p(x). This means that if we can guess a root of a polynomial, then we can find at least one factor of it. Let us look at a few examples.

Exercise 23: Applications of the Remainder Theorem

Find a single factor of each polynomial by using the remainder theorem.

  1. x7+1x^7 + 1

  2. x4+x3+x2+xx^4 + x^3 + x^2 + x

  3. 10x109x110x^{10} - 9x - 1

Solution
  1. (1)7+1=0{(-1)}^7 + 1 = 0, so by the remainder theorem, x+1x + 1 is a factor.

  2. 04+03+02+0=00^4 + 0^3 + 0^2 + 0 = 0, so by the remainder theorem, xx is a factor.

  3. 10110911=010\cdot 1^{10} - 9\cdot 1 - 1 = 0, so by the remainder theorem, x1x - 1 is a factor.

By combining the remainder theorem with the technique of long division, we can factor some polynomials for which we can easily guess a root. But we will want yet a third tool to make guessing roots easier, if they are rational. This third tool is called the rational root theorem. It allows us to use trial and error to guess all the possible rational roots; if none of them work, then we know there are no rational roots. (Recall that we did something similar to find the rational roots of a quadratic equation.)

The rational root theorem states that if we have a polynomial p(x)p(x) with integer coefficients, and the leading coefficient is aa while the constant term is cc, then any rational root q=mnq = \frac{m}{n} (in lowest form) must satisfy the following: mm is a factor of cc, and nn is a factor of aa. As a special case, if a=1a = 1 (so that the polynomial is monic) then the rational root theorem is called the integral root theorem, and we have that any rational root is an integer and is a factor of cc. (You might remember the integral root theorem from a homework question a few weeks ago.)

Exercise 24: Applications of the Rational Root Theorem

Use the rational root theorem to find all rational roots of the following polynomials.

  1. x7+1x^7 + 1

  2. 2x10x212x^{10} - x^2 - 1

  3. 6+17x9x217x3+3x46 + 17x - 9x^2 - 17x^3 + 3x^4

Solution
  1. By the integral root theorem, we need to check only 11 and 1-1. Only x=1x = -1 is a root.

  2. By the rational root theorem, we need to check 11, 1-1, 12\frac{1}{2}, and 12-\frac{1}{2}. We see that x=1x = 1 and x=1x = -1 are indeed roots, but the other two are not.

  3. By the rational root theorem, we need to check 13\frac{1}{3}, 23\frac{2}{3}, 11, 22, 33, 66, and 13-\frac{1}{3}, 23-\frac{2}{3}, 1-1, 2-2, 3-3, and 6-6. If p(x)=6+17x9x217x3+3x4p(x) = 6 + 17x - 9x^2 - 17x^3 + 3x^4, then:

    • p ⁣(13)=27227p\!\left(\frac{1}{3}\right) = \frac{272}{27}

    • p ⁣(13)=0p\!\left(\frac{-1}{3}\right) = 0

    • p ⁣(23)=809p\!\left(\frac{2}{3}\right) = \frac{80}{9}

    • p ⁣(23)=10027p\!\left(\frac{-2}{3}\right) = \frac{-100}{27}

    • p(1)=0p(1) = 0

    • p(1)=0p(-1) = 0

    • p(2)=84p(2) = -84

    • p(2)=120p(-2) = 120

    • p(3)=240p(3) = -240

    • p(3)=576p(-3) = 576

    • p(6)=0p(6) = 0

    • p(6)=7140p(-6) = 7140

    Therefore, only 13-\frac{1}{3}, 11, 1-1, and 66 are rational roots.

We can now combine our knowledge of the three tools we have learned to factor polynomials:

  1. If the polynomial is linear or quadratic, we already have tools to find the roots, and thus factor them (if possible) using the remainder theorem.

  2. If the polynomial has integer coefficients, we can apply the rational root theorem to find all rational roots. This gives us, by the remainder theorem, some linear factors. We then can use long division to find what is left, and then repeat.

  3. Whatever polynomial we have left has no rational roots. This means it is already fully factored into rational polynomials, but there might still be real roots which are not rational. If the polynomial is quadratic, then the quadratic formula can tell us what these roots are, and thus we can fully factor into real polynomials.

Exercise 25: Factorization with Real Numbers

Define R[x]\mathbf{R}[x] to be the set of polynomials with real coefficients. In R[x]\mathbf{R}[x], fully factor the following.

  1. 15x21 - 5x^2

  2. 1+x+x2-1 + x + x^2

  3. 33xx2+x33 - 3x - x^2 + x^3

Solution
  1. This is a quadratic, and it is already essentially in vertex form. We can solve it by the technique we learned earlier: 15x2=01=5x215=x2x=±15 \begin{aligned} 1 - 5x^2 &= 0 \\ 1 &= 5x^2 \\ \frac{1}{5} &= x^2 \\ x &= \pm\frac{1}{\sqrt{5}} \end{aligned}

    Therefore the factors are 15x2=(5x1)(5x+1)1 - 5x^2 = -(\sqrt{5}x - 1)(\sqrt{5}x + 1), where note that we needed to add the 1-1 to ensure the leading coefficient is still correct.

  2. This is a quadratic, and so we can use the quadratic formula to find the roots: x=b±b24ac2a=1±1+4112=1±52 \begin{aligned} x &= \frac{-b \pm \sqrt{b^2 - 4ac}}{2a} \\ &= \frac{-1 \pm \sqrt{1 + 4\cdot 1\cdot 1}}{2} \\ &= \frac{-1 \pm \sqrt{5}}{2} \end{aligned}

    Thus, the factors are 1+x+x2=(x1+52)(x152)-1 + x + x^2 = \left(x - \frac{-1 + \sqrt{5}}{2}\right)\left(x - \frac{-1 - \sqrt{5}}{2}\right). The leading coefficient is already correct and so we do not need to multiply by a constant.

  3. Since the coefficients are integers, we should consider the rational root theorem. The leading coefficient is 11 and the constant term is 33, so we only need to check 3-3, 1-1, 11, and 33. We evaluate:

    • 33(3)(3)2+(3)3=243 - 3\cdot (-3) - {(-3)}^2 + {(-3)}^3 = -24

    • 33(1)(1)2+(1)3=43 - 3\cdot (-1) - {(-1)}^2 + {(-1)}^3 = 4

    • 33112+13=03 - 3\cdot 1 - 1^2 + 1^3 = 0

    • 33332+33=123 - 3\cdot 3 - 3^2 + 3^3 = 12

    Thus, the only rational root is x=1x = 1. We can now use long division to remove the factor of (x1)(x - 1): 33xx2+x3=x2(1+x)+33xx2+x3(x2+x3)=x2(1+x)+33x=3(1+x)+x2(1+x)+33x(33x)=(3+x2)(1+x) \begin{aligned} 3 - 3x - x^2 + x^3 &= \textcolor{blue}{x^2(-1 + x)} + 3 - 3x - x^2 + x^3 - \textcolor{red}{(-x^2 + x^3)} \\ &= x^2(-1 + x) + 3 - 3x \\ &= \textcolor{blue}{-3(-1 + x)} + x^2(-1 + x) + 3 - 3x - \textcolor{red}{(3 - 3x)} \\ &= (-3 + x^2)(-1 + x) \end{aligned}

    We are not quite done, because x23x^2 - 3 is not irreducible. We know that this is a quadratic polynomial in vertex form, so we can solve it easily: 3+x2=0x2=3x=±3 \begin{aligned} -3 + x^2 &= 0 \\ x^2 &= 3 \\ x &= \pm \sqrt{3} \end{aligned}

    Thus, we know that x23=(x3)(x+3)x^2 - 3 = (x - \sqrt{3})(x + \sqrt{3}). Hence we have found a full factorization of the original polynomial: 33xx2+x3=(1+x)(3+x)(3+x) 3 - 3x - x^2 + x^3 = (-1 + x)(\sqrt{3} + x)(-\sqrt{3} + x)

Linear Algebra

Previously, we have been looking at polynomial equations with one unknown variable. Recall the simplest such equations are linear equations of the form ax=bax = b, where aa and bb are known real (or rational) numbers.

We will now extend our attention to polynomial equations when we have multiple unknown variables. For example, suppose that neither xx nor yy are known, and we have a linear equation ax+by=cax + by = c. What are the solutions to this linear equation?

One way to solve this problem is to choose any value for xx, and then we see that axax becomes a known value. For instance, suppose x=1x = 1. Thus we can now solve the linear equation by=caby = c - a where the right hand side is known. We can see that in this case, if b0b \ne 0, then y=caby = \frac{c - a}{b}. If b=0b = 0, then there are either no solutions or infinitely many solutions for yy, depending on whether ca=0c - a = 0 also.

However, the number of possible values to choose for xx is very high — in fact, usually there are infinitely many possible values for xx that work! This implies that we are not looking for only a single solution, but instead we want to describe the shape of all possible solutions. We have figured out one of the solutions, so if we can describe how the solution changes when we change the assumed value of xx, then this would be enough. The result will be some geometric shape that passes through our particular solution.

In order to visualize these shapes geometrically, we shall introduce the concept of a vector, which previously we have touched on but not discussed in detail.

Vectors

An ordered pair is two things written in an order. For example, (3,4)(3, 4) is an ordered pair of numbers. Ordered pairs frequently represent a single concept that is made of two components — though keep in mind that these components are not always written as in this example.

A simple ordered pair like (3,4)(3, 4) does not itself have much meaning, aside from being a collection of two numbers. However, we may assign an interpretation to particular ordered pairs to give them a meaning.

We saw earlier that fractions, like ab\frac{a}{b}, are made up of two components (which we called the numerator and denominator). Indeed, fractions are a kind of ordered pair (a,b)(a, b), with the first element of this ordered pair representing the number of fractional pieces, and the second element representing the size of a whole relative to a single fractional piece.

Another interpretation of ordered pairs is as vectors in a two-dimensional plane. The components of the vector [a,b][a, b] represent the displacement in two directions. For example, the first component might represent displacement to the right, and the second component displacement toward top of the page. (Such an interpretation is called a vector space, and the choices of directions are collectively called a basis.)

We have notation for the set of two-dimensional vectors where both components are real numbers: R2\mathbf{R}^2. The superscript 2^2 denotes that the vector space is two dimensional, i.e. has two components.

It will become more clear later why we have defined this concept, which currently seems to be just a fancy way of writing down two numbers at once. First, we shall define some operations on vectors, which will have geometric meaning:

Homogeneous Linear Equations (Two Unknowns)

Let us now return to considering the linear equations with two unknown variables. A homogeneous linear equation is an equation of the form ax+by=0ax + by = 0. Note that the only change from the more general linear equations that we have looked at is that we are forcing the cc in ax+by=cax + by = c to be 00. For simplicity, let us assume a0ba \ne 0 \ne b (this assumption is not really required, but if a=0a = 0 or b=0b = 0, then this case is degenerate because we know that one of the unknown variables is not important, so we can just solve a linear equation with the other unknown). What do the solutions to this kind of equation look like? One solution that comes to mind is [x,y]=[0,0][x, y] = [0, 0], but this solution does not tell us where to look for other solutions. Therefore, it would be useful to find solutions that are not [0,0][0, 0]. We shall investigate first a concrete example.

Exercise 26: Solutions to a Homogeneous Linear Equation

Describe all the rational solutions [x,y][x, y] to 3x+2y=03x + 2y = 0.

Solution

Recall the trick that we talked about earlier: if we pick some arbitrary value of xx and then solve for yy, then we will find a particular solution (but there may be others). We should not try x=0x = 0 because that will give us the solution [x,y]=[0,0][x, y] = [0, 0], which we already know about and which is not interesting.

Let’s choose x=2x = 2 to make calculations simpler (do you see why)? Then we have 32+2y=03\cdot 2 + 2y = 0 and therefore, by rearranging, 2y=62y = -6. Therefore y=3y = -3 is a solution, when x=2x = 2. Hence [x,y]=[2,3][x, y] = [2, -3] is a solution to the original equation.

Now we will use the fact that the equation is homogeneous to find all other solutions! How? Note that if 32+2(3)=03\cdot 2 + 2\cdot (-3) = 0, then 5(32+2(3))=50=05(3\cdot 2 + 2\cdot (-3)) = 5 \cdot 0 = 0. In fact, for any rational number qq, we always have q(32+2(3))=q0=0q(3\cdot 2 + 2\cdot (-3)) = q\cdot 0 = 0. Using distrubutivity and commutativity of multiplication, we find that 3(2q)+2(3q)=03(2q) + 2(-3q) = 0. But this looks exactly like the original equation! That is, if [x,y]=[2,3][x, y] = [2, -3] is a solution, then [x,y]=[2q,3q][x, y] = [2q, -3q] is a solution also. Does the right hand side look familar? That’s right: it is the scalar mumultiplication by qq of vector [2,3][2, -3].

This gives us a general fact about solutions to homogeneous linear equations: if [x,y][x, y] is a solution, then any rational (or real) scalar multiple of it is a solution to the homogeneous equation also. This is why we have decided to first consider the simpler homogeneous case: as soon as we find one non-zero solution, we have found infinitely many non-zero solutions. That is, all vectors [x,y]=q[2,3][x, y] = q[2, -3] are solutions.

In fact, we have found all of them. One way to see this is to notice that if we were somehow missing one solution [u,v][u, v], then first note u0u \ne 0 (since otherwise, only [0,0][0, 0] is a solution, but [u,v]=[0,0]=0[2,3][u, v] = [0, 0] = 0[2, -3] is already accounted for). Since u0u \ne 0, therefore 2u[u,v]\frac{2}{u} [u, v] is also a solution (by the same argument as above, since scalar multiples of homogeneous solutions are also homogeneous solutions). Hence, by simplifying, [2,2vu][2, \frac{2v}{u}] is a solution. We know that 32+2(3)=03\cdot 2 + 2\cdot (-3) = 0, and now the new solution tells us that 32+2(2vu)=03\cdot 2 + 2\cdot (\frac{2v}{u}) = 0. Taking the difference of these equations, we see that 2(3)=2(2vu)2\cdot (-3) = 2\cdot(\frac{2v}{u}). Hence, 3=2vu-3 = \frac{2v}{u}, i.e. v=32uv = -\frac{3}{2}u. But then the solution we were missing is actually [u,32u]=u2[2,3][u, -\frac{3}{2}u] = \frac{u}{2}[2, -3], which it turns out we weren’t missing after all!

This exercise generalizes to an important property: all solutions to a linear homogeneous equation in two variables (where the coefficients are not both zero) are scalar multiples of a particular non-zero solution. We can plot these as a straight line going through the origin. The particular non-zero solution we found is any arbitrary direction on this line.

Vector Spaces

Recall from before that we saw that the solutions to a homogeneous linear equation in two variables are all scalar multiples of a particular non-zero solution. We will now increase the number of variables and seek to find a similar result. First, a useful concept is that of a vector space.

A rational (or real) vector space is a set of vectors VV, possibly infinite, such that:

  1. A zero vector is in the set, i.e. 0V\mathbf{0} \in V

  2. For any two vectors uu, vv in the set, their sum u+vu+v is in the set

  3. For any vector vv in the set, all rational (or real, if we are talking about a real vector space) scalar multiples qvqv are in the set

In other words, within a vector space we can always add vectors, or multiply vectors by rational (or real) numbers, without leaving the vector space. For instance, all two-dimensional rational vectors V=Q2={[a,b]:aQ,bQ}V = \mathbf{Q}^2 = \{[a, b]: a\in \mathbf{Q}, b\in\mathbf{Q}\} form a vector space (we can add and subtract these without leaving VV). Another rational vector space is the singleton set V={[0,0]}V = \{[0, 0]\}. There is only one vector in here (the zero vector), but any combination of sums and scalar multiples with the zero vector will result in a zero vector, so this is still a vector space.

However, the set U={[0,0],[1,0]}U = \{[0, 0], [1, 0]\} is not a vector space. This is because, for example, the sum [1,0]+[1,0]=[2,0][1, 0] + [1, 0] = [2, 0] is not in UU.

Exercise 27: Vector Space of Solutions

Let VV be the set of all rational solutions [x,y][x, y] to xy=0x - y = 0. Show that VV is a vector space.

Solution

We need to check three things:

  1. Is [0,0]V[0, 0] \in V? This is true if [x,y]=[0,0][x, y] = [0, 0] is a solution. Indeed, xy=00=0x - y = 0 - 0 = 0, so it is a solution and thus [0,0]V[0, 0] \in V.

  2. Say that [a,b]V[a, b] \in V and [c,d]V[c, d] \in V. This means ab=0a - b = 0 and cd=0c - d = 0. If we add these equations, we get ab+cd=0a - b + c - d = 0, which we can rearrange to (a+c)(b+d)=0(a + c) - (b + d) = 0. Hence, [a+c,b+d]V[a + c, b + d] \in V also.

  3. Say that [a,b]V[a, b] \in V and qQq \in \mathbf{Q} is a rational number. This means ab=0a - b = 0. If we multiply this equation by qq, we get qaqb=0qa - qb = 0. Hence, [qa,qb]V[qa, qb] \in V also. (Remember, this is what we said about solutions to homogeneous linear equations last time too!)

Therefore, VV is in fact a vector space.

What Exercise 27 tells us is that when homogeneous linear equations in two variables have solutions, those solutions form a vector space. In fact, we do not have to limit ourselves to two variables. It is an important fact that the set of solutions to homogeneous linear equations in any number of unknown variables always forms a vector space: we can always add solutions, or multiply them by rational numbers, and get another solution.

We can often describe vector spaces by giving just a few vectors, and saying that all the other vectors are going to be linear combinations: i.e. sums and scalar multiples, and sums of scalar multiples. If uVu \in V and vVv \in V, we can see that all vectors that look like au+bvVau + bv \in V also (this is just by applying the 2nd and 3rd rules together). In these cases, we will write that V=span(u,v)V = \operatorname{span}(u, v), to mean that all vectors in VV are in fact linear combinations of uu and vv. Hence, we now have a brief notation to describe the solution to homogeneous linear equations.

Exercise 28: Solutions to Homogeneous Linear Equations

Describe all rational solutions to each homogeneous linear equation. Use spans to describe infinite vector spaces.

  1. xQx\in\mathbf{Q} such that 2x=02x = 0 (Hint: this vector space contains only one vector)

  2. [x,y]Q2[x, y]\in\mathbf{Q}^2 such that 2x+15y=02x + \frac{1}{5}y = 0

  3. [x,y,z]Q3[x, y, z]\in\mathbf{Q}^3 such that 2x+15y+13z=02x + \frac{1}{5}y + \frac{1}{3}z = 0 (Hint: this is a span of two vectors that do not point in the same or opposite direction)

Solution
  1. This is a linear equation that we have seen already, and the only solution is x=0x = 0. Therefore the set of solutions is {0}\{0\}, which is a vector space.

  2. We have seen how to solve these earlier: fix some non-zero value of xx, such as x=1x = 1. Then we need to solve 15y=2\frac{1}{5}y = -2, which has solution y=10y = -10. Therefore, [1,10][1, -10] is a particular non-zero solution, and all the solutions are scalar multiples of it. Hence, the set of solutions is span([1,10])\operatorname{span}([1, -10]). (Note that all the non-zero scalar multiples of this vector can also be used to describe this vector space, so our solution could look different, but would be mathematically the same.)

  3. Note that if we set z=0z = 0, we get the same equation as last time. So [1,10,0][1, -10, 0] is a particular non-zero solution. However, since we have more than two variables, the set of solutions won’t be all scalar multiples of this. We need to find another solution that does not point in the same direction (so, for example, not [2,20,0][2, -20, 0]). Let’s set (for example) z=6z = 6, y=0y = 0 to guarantee this. Then 2x=22x = -2. Hence [1,0,6][-1, 0, 6] is another nonzero particular solution. Assuming the hint is true, the answer is then span([1,10,0],[1,0,6])\operatorname{span}([1, -10, 0], [-1, 0, 6]). (Note that there are many ways to describe the same vector space, so if we picked different assumptions our solution would look different, but would be mathematically the same.)

Inhomogeneous Linear Equations

We have spent a lot of time talking about homogeneous linear equations, where the right hand side is 00. The solution to homogeneous linear equations is always a vector space. Now we will shift our attention to linear equations in general, including those where the right hand side is not 00. How do we solve such systems? Let us start with an example, x+y=2x + y = 2. There is an obvious solution [x,y]=[1,1][x, y] = [1, 1], but of course there are infinitely many other solutions. How can we describe them?

The most important observation is that this inhomogeneous equation is related to the homogeneous linear equation x+y=0x + y = 0. Let the vector space of solutions to this equation be VV. Any homogeneous solution [a,b]V[a, b] \in V can be added to [1,1][1, 1] to get another solution to x+y=2x + y = 2! That is, a+1+b+1=(a+b)+(1+1)=0+2=2a + 1 + b + 1 = (a + b) + (1 + 1) = 0 + 2 = 2. Here we see why homogeneous linear equations are important: we can always translate by a homogeneous solution to x+y=0x + y = 0 without changing the value of x+yx + y.

Therefore, we can describe all the solutions to x+y=2x + y = 2 by simply saying [x,y]=[1,1]+V[x, y] = [1, 1] + V. What does this mean? We mean that we start at [1,1][1, 1] (some particular solution), and then translate by any solution in VV. Since the solution to the homogeneous equation is [x,y]V=span([1,1])[x, y] \in V = \operatorname{span}([1, -1]), another way to write it is: x+y=2    [x,y]=[1,1]+q[1,1] for some qQ x + y = 2 \iff [x, y] = [1, 1] + q[1, -1] \text{ for some }q \in\mathbf{Q}

Exercise 29: Solutions to an Inhomogeneous Linear Equation

Find all rational solutions to 32x+23y+12z=2\frac{3}{2}x + \frac{2}{3}y + \frac{1}{2}z = 2. Hint: answer should include a span of 22 vectors not in the same or opposite direction

Solution

First we will solve the homogeneous linear equation 32x+23y+12z=0\frac{3}{2}x + \frac{2}{3}y + \frac{1}{2}z = 0. We expect two directions of freedom, because there are three variables. So we can make two different assumptions to get vectors which are guaranteed to not be in the same direction.

An example of some convenient assumptions (to simplify the math) are: x=0x = 0, y=3y = 3 and x=2x = 2, y=0y = 0. Under the first assumption, we have 12z=2\frac{1}{2}z = -2, so z=4z = -4. Under the second assumption, we have 12z=3\frac{1}{2}z = -3, so z=6z = -6. Hence the two different non-zero solutions we get are:

  • [x,y,z]=[0,3,4][x, y, z] = [0, 3, -4]

  • [x,y,z]=[2,0,6][x, y, z] = [2, 0, -6]

Hence, the vector space of solutions to this inhomogeneous linear equation is V=span([0,3,4],[2,0,6]) V = \operatorname{span}([0, 3, -4], [2, 0, -6])

The next step is to find a particular solution to the inhomogeneous equation 32x+23y+12z=2\frac{3}{2}x + \frac{2}{3}y + \frac{1}{2}z = 2. Once again, there are infinitely many of these, so we will make an assumption that simplifies the math. Let x=0x = 0 and y=0y = 0. Then z=4z = 4 is the only solution. So [0,0,4][0, 0, 4] is a particular solution.

Hence, the general solution is: 32x+23y+12z=2    [x,y,z]=[0,0,4]+p[0,3,4]+q[2,0,6] for some pQ,qQ \frac{3}{2}x + \frac{2}{3}y + \frac{1}{2}z = 2 \iff [x, y, z] = [0, 0, 4] + p[0, 3, -4] + q[2, 0, -6]\text{ for some }p\in\mathbf{Q}, q\in\mathbf{Q}

Linear Systems

We are now ready to combine our knowledge to solve systems of equations. A system of equations consists of multiple equations which may share unknowns, and all the equations are simultaneously true. For example: 2x+3y=73x+2y=8 \begin{aligned} 2x + 3y &= 7 \\ 3x + 2y &= 8 \\ \end{aligned} is a system of equations with two unknowns (xx and yy) and two equations.

When each equation is a linear equation, we call the system a linear system. In general, the solutions to linear systems are like the solutions to linear equations: there can be no solutions, or the solutions can be described as a particular solution plus a vector space. Therefore, we should expect to be able to solve such systems in a very similar way to how we solved individual linear equations.

First, we want to rewrite the system as a homogeneous linear system, which means to set the right hand sides to 00: 2x+3y=03x+2y=0 \begin{aligned} 2x + 3y &= 0 \\ 3x + 2y &= 0 \\ \end{aligned}

To solve this homogeneous linear system, we can take the intersection of the two vector spaces determined by each equation (the intersection of two vector spaces which share a 00 is also a vector space). In this case, the first equation is solved by the vector space span([3,2])\operatorname{span}([3, -2]), and the second equation is solved by the vector space span([2,3])\operatorname{span}([2, -3]). The intersection of these vector spaces is the zero-dimensional vector space {[0,0]}\{[0, 0]\}.

We now need to find a particular solution to the system. One way to do this is by substituting each equation into the next, and creating new equations if it is necessary. In this case, we can rearrange 2x+3y=72x + 3y = 7 into y=23x+73y = -\frac{2}{3}x + \frac{7}{3}. Substituting this into the second equation gives 3x43x+143=83x - \frac{4}{3}x + \frac{14}{3} = 8, and simplifying this gives us 53x=103\frac{5}{3}x = \frac{10}{3}, which is a linear equation easily solved for xxx=2x = 2.

Now we can substitute this back into the equation y=23x+73y = -\frac{2}{3}x + \frac{7}{3} to find that y=33=1y = \frac{3}{3} = 1. Thus, a particular solution is [1,2][1, 2]. Since the vector space that solves the homogeneous linear system consists of just the zero vector, that means this particular solution is the unique solution; i.e. x=2x = 2 and y=1y = 1 is the only rational solution to the system.