7
$\begingroup$

I started reading Quantum Field Theory for the Gifted Amateur by Lancaster & Blundell, and I have a conceptual question regarding their motivation of the Lagrangian formalism. They start by considering the trajectory of a classical particle and do the following reasoning:

Given the average kinetic energy $$ \overline{T} = \frac{1}{\tau} \int_0^\tau \frac{1}{2} m [\dot{x}(t)]^2 dt, $$ and the average potential energy $$ \overline{V} = \frac{1}{\tau} \int_0^\tau V[x(t)]dt,$$ one can compute their functional derivatives $$ \frac{\delta \overline{V}}{\delta x(t)} = \frac{V'[x(t)]}{\tau},\qquad\frac{\delta \overline{T}}{\delta x(t)} = -\frac{m\ddot{x}}{\tau},\tag{1.21}$$ and with Newton's laws, $$m\ddot{x} = - dV/dx,$$ find $$ \frac{\delta \overline{V}}{\delta x(t)} = \frac{\delta \overline{T}}{\delta x(t)}.\tag{1.22}$$ The authors then proceed to say,

This means that if you slightly deviate from the classical trajectory, then both the average kinetic energy and the average potential energy will increase by the same amount.

I tried to construct a simple example to gain some intuition about this but so far I have not been successful.

  1. My first idea was to think about a mass sitting on a surface in a gravitational field. If I slightly move that mass, say to the left and then back, its average kinetic energy will increase but its average potential energy will not. So it seems to contradict the above statement.

  2. Also if I think about a point-mass falling to the ground that has an infinitesimal bump in its straight falling line, I find that only the average kinetic energy will increase but the average potential energy stays the same.

Can you help me find the error in my reasoning or provide a good example? Any help is much appreciated.

$\endgroup$
5
  • $\begingroup$ What is the reason for the negative sign in the functional derivative of the kinetic energy? $\endgroup$ Commented Oct 8, 2021 at 20:32
  • $\begingroup$ I can’t give an intuitive explanation, its just what comes out when you compute the functional derivative. Its derived in the book one page before this section. $\endgroup$ Commented Oct 8, 2021 at 20:38
  • $\begingroup$ Notification: I have added multiple sections to the answer that I wrote (and that you accepted). The new sections expand on the existing exposition. The diagrams in the answer were pre-existing diagrams; your question provided an opportunity to present this demonstration here on physicsSE. I would very much like to know whether you concur with the entire exposition, or whether you are cautious about some parts of it. $\endgroup$
    – Cleonis
    Commented Oct 10, 2021 at 19:25
  • $\begingroup$ @Cleonis Firstly, thank you very much for writing such an extensive answer. I need some time to work through all the parts, then I will reply to your request. $\endgroup$ Commented Oct 11, 2021 at 15:12
  • 2
    $\begingroup$ The functional derivative $\frac{\delta}{\delta x(t)}$ means that you change the trajectory of the particle, not its position. In particular you change the trajectory to one that is different from (but arbitrarily close to) the classical one. Your idea 1 only changes the position not the trajectory. Your idea 2 might work but you need to see how you make your variation maintain the boundary condition while also being smooth (the variations need to be at least $C^2(\Omega)$). I'd argue that the deviation in the bump will stay there longer and thus have that value of potential contribute more. $\endgroup$
    – Chaotic
    Commented Oct 15, 2021 at 17:35

2 Answers 2

14
$\begingroup$

About the general concept of 'motivation of the Lagrangian formalism'.

We have that textbook authors have converged on presenting Hamilton's stationary action as the motivation of the Lagrangian formalism, so in this answer, I will discuss Hamilton's stationary action.

That said: the approach to classical mechanics introduced by Joseph Louis Lagrange predates Hamilton's stationary action. If Hamilton's stationary action would have never been introduced the development of Lagrangian mechanics would not have been impacted; Lagrangian mechanics has the same capabilities either way. What Lagrange introduced was the systematic use of generalized coordinates, in combination with expressing the physics taking place in terms of energy. In that sense Lagrangian mechanics is independent of Hamilton's stationary action.


The following exposition consists of demonstrations for the purpose of gaining intuition.

To make this a general answer I will start with a short exposition of calculus of variations.

Differential calculus and calculus of variations both have a unit of operation. In differential calculus, the unit of operation is a pair of points, the line through those two points is tangent to the curve you are looking to solve for. In calculus of variations the unit of operation is a triplet of points.

Diagram 1 is an animated GIF that represents that unit of operation. (The animated GIF is composited from screenshots of an interactive diagram.)

Unit of operation of calculus of variations

Diagram 1 - Unit of operation

In diagram 1 the dashed line is a parabola. This parabola represents height as a function of time, of an object that has been thrown upwards. A uniform force is acting in the downward direction. To obtain the simplest possible parabola the value of the (uniform) acceleration is set at 2 $m/s^2$

The outer points ($t_1$ and $t_3$) are treated as fixed; variation is executed by varying the position coordinate of the middle point.

In the diagram two adjacent time intervals are evaluated; $t_{1,2}$ and $t_{2,3}$

The sweet spot is the point where the change in velocity is such that the difference in kinetic energy matches the difference in potential energy.

(In this diagram the case of a uniform force is presented, which means that in this diagram the potential increases linear with height, hence in this diagram $\Delta E_p$ is a constant. In the general case, the force changes as a function of the position coordinate. Then the potential is not linear, and the value of $\Delta E_p$ changes as a function of the position coordinate.)

The unit of operation is valid down to infinitisimal scale

Diagram 2 - valid at any scale

Diagram 2 illustrates that the logic of this unit of operation is valid at any scale, down to infinitesimally short time intervals.

Concatenation of unit of operation

Diagram 3 - Concatenation

Diagram 3 illustrates the concatenation of the unit of operation over the entire domain of the function you are looking to solve for.

The analogy with the unit of operation of differential calculus: you concatenate the unit of operation over the domain of the function you are looking to solve for, and then you take the limit of infinitesimally small intervals.

An example of a source where this idea of the concatenation of the unit of operation is discussed is the book 'Calculus of Variations' by Gelfand and Fomin. Gelfand and Fomin demonstrate that the Euler-Lagrange equation can be derived using the functional derivative concept.


In your question, you describe that Lancaster and Blundell present expressions for 'average kinetic energy' and 'average potential energy', but diagrams 1, 2, and 3 illustrate that the logic can be stated such that it applies at infinitesimal scale. From down at infinitesimal scale the logic propagates out to the evaluation of the entire trajectory.


The next stage is to state explicitly where we obtain the criterion that must be satisfied.

To that end, we derive the Work-Energy theorem.

The starting point is $F=ma$. Both sides of $F-ma$ are integrated with respect to position, from starting point $s_0$ to endpoint $s$

$$ \int_{s_0}^s F \ ds = \int_{s_0}^s ma \ ds \tag{1.1} $$

The right-hand side $\int_{s_0}^s ma \ ds$ can be processed further. The reason for that: acceleration (as a function of time) and position (as a function of time) are not independent; acceleration is the second derivative of position. The work-energy theorem capitalizes on that relation.

In the course of the derivation the following relations will be used:

$$ v = \frac{ds}{dt} \quad \Leftrightarrow \quad ds = v \ dt \tag{1.2} $$

$$ a = \frac{dv}{dt} \quad \Leftrightarrow \quad dv = a \ dt \tag{1.3} $$


For the time being, I will not write the factor $m$, it is a multiplicative factor that is just carried from step to step. In the final expression, I will include the factor $m$ again. The integral for acceleration from a starting point $s_0$ to a final point $s$

$$ \int_{s_0}^s a \ ds \tag{1.4} $$

Use (1.2) to change the differential from $ds$ to $dt$. Since the differential is changed the limits change accordingly.

$$ \int_{t_0}^t a \ v \ dt \tag{1.5} $$

Change the order:

$$ \int_{t_0}^t v \ a \ dt \tag{1.6} $$

Change of differential according to (1.3), with corresponding change of limits.

$$ \int_{v_0}^v v \ dv \tag{1.7} $$

So we have:

$$ \int_{s_0}^s a \ ds = \tfrac{1}{2}v^2 - \tfrac{1}{2}v_0^2 \tag{1.8} $$

Here (1.8) is worked out for the entities position, velocity, and acceleration. Note that this result generalizes. What counts is the pattern of of first and second derivatives.

Using the result (1.8) for (1.1) gives the Work-Energy theorem:

$$ \int_{s_0}^s F \ ds = \tfrac{1}{2}mv^2 - \tfrac{1}{2}mv_0^2 \tag{1.9} $$

We have that potential energy is defined as the negative of the integral of force over distance.

$$ E_p = -\int_{s_0}^s F \ ds \tag{1.10} $$

Hence:

$$ \Delta E_k = -\Delta E_p \tag{1.11} $$


This gives us our criterion explicitly. Over any interval, the amount of change of kinetic energy must match the amount of change of potential energy.

Below, between the horizontal lines, is a group of three statements. I present these three statements as a unit to emphasize the tight interconnection; while the statements are different mathematically, the physics content of these three is the same.


$$ \begin{array}{rcl} F & = & ma \\ \int_{s_0}^s F \ ds & = & \tfrac{1}{2}mv^2 - \tfrac{1}{2}mv_0^2 \\ -\Delta E_p & = & \Delta E_k \end{array} $$


The validity of (1.11) extends down to infinitesimal change:

$$ d(E_k) = d(-E_p) \tag{1.12} $$



Evaluation of area

Diagram 4 - Evaluation of area

Diagram 4 demonstrates an area evaluation property that we get for free. We get it for free in the sense that it doesn't require any additional assumption; it follows logically.

The two time intervals $t_{1,2}$ and $t_{2,3}$ are displayed in three of the four subpanels: upper-left, upper-right, and lower-left. The time intervals are set up to be of equal duration.

In the upper-right sub-panel: the height of each bar represents the value of the energy corresponding to that time interval. The number displayed in red is the summed area of the two red bars; the number displayed in green is the summed area of the two green bars.

Reminder: the energies that are displayed in this diagram are hypothetical energies. The variation sweep is hypothetical: at each position of the movable point, the diagram shows what the energies would be if the object would move along that particular trial trajectory.

Stating the two different kinds of change explicitly:

  • Rate of change of actual energy as an object is moving along the true trajectory.
  • Rate of change of hypothetical energy (as a function of variation sweep)

In motion along the true trajectory one form of energy is transformed into the other; the energies are counter-changing; $\Delta E_k = -\Delta E_p$

As illustrated in diagram 4: in sweeping out variation the (hypothetical) energies are co-changing.


Rate of change

In the case of Hamilton's stationary action, the variation sweep is variation of position coordinate. This means that the derivative of Hamilton's action with respect to variation boils down to taking the derivative with respect to position.

For emphasis: taking the derivative (of Hamilton's action) with respect to variation results in an expression for derivative with respect to position.

$$ \frac{d(E_k)}{ds} = \frac{d(E_p)}{ds} \tag{1.13} $$

The true energy satisfies (1.13) Therefore when the variation sweep is at the sweet spot the trial trajectory coincides with the true trajectory


Subtraction

In the hypothetical variation sweep the energies are co-changing, so to identify the point where they change at the same rate one energy is subtracted from the other. (By convention it is the potential energy that gets the minus sign.)

Given that we are subtracting the potential energy from the kinetic energy: in the lower-left subpanel the green area is displayed as the area below the coordinate's zero line. We can treat the green area as a signed area; area below the zero point of the coordinate system is counted as negative.

The lower-right sub-panel represents in blue the result of the subtraction.

The motion of the blue dot over the diagram represents how the value 'area($E_k-E_p$)' is responding to variation sweep.

The two time intervals $t_{1,2}$ and $t_{2,3}$ are set to be equal in duration. Because of that the time factor drops out of the evaluation. Hence the outcome of the evaluation is determined by the value of the energy only.

At the point where the variation hits the sweet spot the value of 'area($E_k-E_p$)' is stationary. That means: at the point where the variation hits the sweet spot the rate of change of summed red area matches the rate of change of summed green area.

Summarizing:
There is a point in the variation sweep where the rate of change of (hypothetical) kinetic energy matches the rate of change of (hypothetical) potential energy. If the time intervals are set to be of equal duration then it follows mathematically that the rate of change of summed red area matches the rate of change of summed green area.

Integral

The logic that is demonstrated in diagram 4 is valid at any scale, down to infinitesimal.

When concatenating units of operation along the length of the overall time the bars end up exactly adjacent; no overlap.

This explains why the integral of the quantity ($E_k - E_p$) has the property that it is stationary when the variation hits the sweet spot. The value of ($E_k - E_p$) already has that property at the infinitesimal scale, and from there it propagates to the value of the integral of ($E_k - E_p$).


The crucial point: the above reasoning does not require making any additional physical assumption. Hence: in any situation where the Work-Energy theorem holds good it follows that Hamilton's stationary action will hold good.




Narrowing down to what is necessary

The Euler-Lagrange equation is very compact. The Euler-Lagrange equation is an equation that has been narrowed down to what is necessary. You don't need more; the Euler-Lagrange equation is sufficient.

In other words: in retrospect, we can see that all elements that were removed during the process of deriving the Euler-Lagrange equation were unnecessary elements.

The standing convention is to state the functional as an integral, but at the end the integral is no longer there; the Euler-Lagrange equation is a differential equation.

The unit of operation of diagram 1 explains why the Euler-Lagrange equation is a differential equation: the unit of operation is valid at infinitesimal scale. The logic operates at the infinitesimal scale, and from there, it propagates out to the trajectory as a whole.

The integral that is present at the start is redundant and is accordingly removed in the process of deriving the Euler-Lagrange equation




The relation between variational calculus and differential calculus

To make the comparison I take a simple case that is a natural case for the application of variational calculus: the catenary.

As we know, mathematically the curve of a hanging chain is described by the hyperbolic cosine; the hyperbolic cosine gives the height as a function of the horizontal coordinate.

We have with the catenary that the force that is acting is perpendicular to the horizontal. And if the length of the catenary is changed then the change in the curve is change perpendicular to the horizontal.

Variational calculus pivots the direction of the differentiation. Variational calculus is a form of differential calculus where instead of differentiating with respect to $x$ (the horizontal coordinate), the differentiation is with respect to $y$, in such a way that you do end up with a function that gives the height of the catenary as a function of the horizontal coordinate.




The meaning of the 'stationary' criterion

I have noticed that authors of physics textbooks tend to have a cavalier attitude towards the question: "What is the meaning of the 'stationary' criterion?" The authors will acknowledge that the name 'stationary action' is better, but will then express that to actually use that name would be pedantic.

The purpose of the following discussion is to show that understanding the meaning of the 'stationary' criterion is key.

Finding the point of equal slope

Diagram 5 - The blue curve has a point where it is stationary

In Diagram 5 the red curve and the green curve are both ascending functions.
Red: $f(x) = \frac{1}{2} x^2$
Green: $ g(x) = ln(x) $

Problem: at what x-coordinate do the functions $f(x)$ and $g(x)$ have the same slope?
Phrased differently: at what x-coordinate is the rate of change of $f(x)$ equal to the rate of change of $g(x)$?

The direct way to answer that question is to start by taking the derivative of each function, and then solve for the point where the two derivatives have the same value:

$$ \frac{f(x)}{dx} = \frac{g(x)}{dx} \tag{3.1} $$

$$ \frac{\tfrac{1}{2}x^2}{dx} = \frac{ln(x)}{dx} \tag{3.2} $$

$$ x = \frac{1}{x} \tag{3.3} $$

There is also a more convoluted way to address the problem:
I will refer to this more convoluted way as constructing a 'mathematical action':
Create a third function by subtracting the second function from the first function.

$$ h(x) = f(x) - g(x) \tag{3.4} $$

$$ h(x) = \tfrac{1}{2} x^2 - ln(x) \tag{3.5} $$

In diagram 5 the blue curve represents $h(x)$, the 'mathematical action'.

In diagram 4 the motion of the blue dot in response to variation sweep traces a curve, and that curve has a stationary point. That stationary point is the point where the rate of change of kinetic energy matches the rate of change of potential energy.



The interactive diagrams (that were the source of the frames of the animated GIF diagrams) are available on my own website. A link to my website is available on my stackexchange profile page.

$\endgroup$
7
  • $\begingroup$ Thank you very much for this extensive answer! The point I am still unclear about is the sign flip between variation space and physical space, why exactly does that exist? $\endgroup$ Commented Oct 15, 2021 at 13:08
  • $\begingroup$ @AaronDaniel I added a section 'variation space and physical space'. (Presumably the same could have been said in more concise way. I'm not good at brevity.) My intention is to rework diagram 4 from a 2 subpanel diagram to a 4 subpanel diagram, and then the value of 'area ($E_k-E_p$)' can be demonstrated in a subpanel of its own. $\endgroup$
    – Cleonis
    Commented Oct 15, 2021 at 17:23
  • $\begingroup$ @AaronDaniel I have replaced diagram 4 with an extended version. The diagram now includes display that as variation is swept out the potential energy and kinetic energy change together. The rate of change of the energies as a function of variation sweep has the same magnitude as the rate of change of energies (with respect to the position coordinate) as a particle is moving along the true trajectory. $\endgroup$
    – Cleonis
    Commented Oct 17, 2021 at 19:39
  • 1
    $\begingroup$ @AaronDaniel I rewrote multiple sections, capitalizing on the extended version of diagram 4. (I also felt the rewrite allowed removal of several sections, so I did that.) If you feel the rewrite is an improvement then please let me know. $\endgroup$
    – Cleonis
    Commented Oct 22, 2021 at 23:16
  • $\begingroup$ Your answer deserves more votes, very good at explaining the topic from scratch. However, the problem I am having is: what does the potential express? Is it the force by shortest distance between the two points or the force multiply the distance of the path under observation. What I mean is: energy is conserved so what would defer between other non least action paths i,e, the potential energy is slightly changing to kinetic (change is conserved). What I speculate is that you mean potential energy in this case is the energy which will be done when the force acts on the lease $\endgroup$
    – mohamed
    Commented Feb 28, 2022 at 18:51
3
$\begingroup$
  1. The problem with first OP's example is that if a mass $m$ is sitting still in a gravitational field, there obviously has to be an additional supporting force, which needs to be included in the energy consideration. This explains OP's first example.

  2. Next consider that $m$ is free falling. If we add an infinitesimal horizontal one-sided bump on the actual position curve, a la Fig. 1 below, then the average potential energy is indeed not affected.

    What complicate matters is that the virtual velocity is required to be continuous, and the difference from the actual velocity should be infinitesimal. In other words, the one-sided bump should be sufficiently smooth.

    On one hand, if the actual velocity is purely vertical, then the kinetic energy will remain unchanged to first order in the infinitesimal parameter.

    On the other hand, if the actual velocity is not purely vertical, it means that the velocity sometimes will be faster and sometimes slower than the actual trajectory, which leaves the average kinetic energy the same.

    More generally, a detailed calculation will of course confirm the claim by Lancaster & Blundell in every possible situation.

        t_i
        |   
        |          
    t_1 |___ t_2 
            |   
            |   
    t_4  ___|t_3
        |
        |   
        |
        t_f
    

    $\uparrow$ Fig. 1. An infinitesimal horizontal 1-sided bump of the position curve of infinitesimal size $\delta x={\cal O}(\epsilon)$.

  3. If you like reading the author's motivating example for the Lagrangian formalism and the stationary action principle (SAP), you may also enjoy this somewhat related Phys.SE post about the virial theorem.

$\endgroup$
10
  • $\begingroup$ But shouldn't author's statement "This means that if you slightly deviate from the classical trajectory, then both the average kinetic energy and the average potential energy will increase by the same amount." wrong if we consider only a horizontal deviation of the path (normal to the pull of gravity)? (because only the average kinetic energy is changed) $\endgroup$ Commented May 14, 2023 at 14:32
  • $\begingroup$ I updated the answer. $\endgroup$
    – Qmechanic
    Commented May 15, 2023 at 9:11
  • $\begingroup$ Why do you state "that the velocity sometimes will be faster and sometimes slower than the actual trajectory"? A horizontal bump can only add or deduct velocity but not both separately at different times. $\endgroup$ Commented May 15, 2023 at 10:07
  • $\begingroup$ I updated the answer. $\endgroup$
    – Qmechanic
    Commented May 15, 2023 at 10:42
  • $\begingroup$ But still, additional K.E. required to introduce the bump will be added to the total, hence increasing the average, as I don't recall having a type of negative K.E. (no pun intended) $\endgroup$ Commented May 15, 2023 at 11:49

Not the answer you're looking for? Browse other questions tagged or ask your own question.