Suppose we have a function . Then the chain rule is:
We can rewrite this using an alternate notation:
Now if we were to change and , then we have the chain rule to be:
Why do we want this alternate notation? Because it relates to the chain rule for 2 variable functions.
1st Case of Chain Rule for 2 Variable Functions
Suppose we have , , and , then the chain rule (derivative of in respect to ) is:
2nd Case of Chain Rule for 2 Variable Functions
Suppose we have ,,, then there are 2 chain rules.
The derivative of in respect to is:
The derivative of in respect to is:
Using a Tree Diagram for Chain Rule
Tree diagrams are very useful when finding the chain rule for multivariable functions with more than 2 variables.
For example, suppose we have and , and we want to find .
We can write the tree diagram below like this:

Then we will multiply all the connected derivatives, and sum them up to have:






