Sometimes called CPET (Cardio Pulmonary Exercise Testing), or VO2 max testing. Despite this
being a very commonplace test, it is actually very hard to do properly. This is because, in reality,
there are a multitude of factors that can conspire against a diligent scientist that can go wrong.
Unfortunately this type of testing has been made to look simple with a much more convenient
technology in recent times called Breath by Breath (BxB) CPET. See picture below. Note how
much less cumbersome this looks, is to work with for the scientist and use for the athlete.
It is only fair to hope that life can be made easier with changing technology, this however has
camouflaged the reality of the significant errors that result from this BxB methodology. Too great
to be ignored or in fact useful for athletic measurements.
This methodology was initially proposed in the late 1960’s by Beaver and Wasserman of Harbor-UCLA
and later with Brian Whipp became the gurus and advocates of BxB measurements. It was hoped
that the intra-breath information of BBB measurements would yield information about the dynamics
of muscle O2 uptake. (Ref. John Hoppe and Andrew Huszczuk Ph.D.)
Unfortunately, at the time, the technology didn’t exist to do this properly. Now that the technology
does exist and we can measure this information with more detailed sensors, we’ve learnt that the
methodology is flawed. In fact the many problems and magnitude of errors makes this system unfit
for the purpose of tracking athletic performance (despite the many manufacturer’s claims).
Lets examine why…
Firstly let us summarise the issues and expand on this understanding later.
This is a typical resting flow versus time graph. See below.
You may not think that this is a very noisy signal right. Just smooth it and work with the smoothed
curve? That is exactly what mixing chambers do. But BxB systems do not have this luxury. They
must sample this curve at least 200 times per second and hence the data they are using is every
single noisy point. (also this graph is using older more forgiving technology). Newer technology
(Hans Rudolph Pneumotach…a gold standard well above most) will have far more variations.
This graph is for rested breathing but in reality, for exercise, this graph is even noisier,
varies significantly in amplitude for each breath, is faster and harder to track and is very
inconsistent. This makes it very hard to determine when an actual breath starts and
finishes… certainly an issue if you need to look at segments within each breath as you do
in BxB…where you need to know precisely where each section of each breath is for time
alignment. Also the graphs for O2 and CO2 concentrations are also very noisy with the added
issue that they vary within the breath. Typically the O2 values are high at the start of the breath
and low at the end, with the CO2 being the opposite.
(ref. L.E. Armstrong and D.L. Costill. Variability of respiration and metabolism 1985)
The first sensor signal to be received is usually the flow (Ve or Vi). This is because generally it is used as
the reference point but also as it is normally closest to the subject. The O2 and CO2 must wait their
turn as the sample line takes a little longer to take the sample from the mouth to the sensors
and their electronics. (Again in Mixing chamber systems this is much more forgiving because they only
need to synchronise this once per breath and being a little out is no great sin. Also as the tubing bore
is about 25 times wider than a BxB sample line, transport time is easy to calculate using simple physics,
is not subject to issues of restricted flow due to sample line humidity and other issues we’ll discuss shortly).
And here we are now at a very significant point where we must now multiply the exact portion of breath
for each Flow, O2 and CO2. Now if you’ve been following this discussion at all, you’ll realise we have
a. very noisy flow, O2 and CO2 signals
b. these signals have been sampled about 200 times per breath (to reconstruct the breath properly)
c. the signals are misaligned
d. we must multiply the exact same sections of this breath.
If we say a breath is one second long for simplicity, then we want to multiply every 1/200th of
breath Flow, O2 and CO2 with each other. Lets just look at one of these points, say sample at time
0.500 to 0.505. This means we must realign sections of Flow(0.500s to 0.505s), O2(0.500s to 0.505s)
and CO2(0.500s to 0.505s). But in such a noisy signal how can we be sure we have aligned these at all?
We need to be sure where each breath starts and finishes. Its just not possible given how much the data is
changing in real time. So manufacturers just produce an algorithm to guess this and curve fit. This is why
none of them will give you the algorithm, not because there is some genius in their knowledge (this kind
of signal processing is very well understood) but because they employ fudge factors to make all this
work…and they still cant make it work if you explore published findings. So what does this mean to our
information? Well if we are multiplying the data as follows for example Flow(0.505s), O2(0.540),
CO2(0.545), which is very likely because we have misaligned data, then we are just creating new data
based on our sampling methods that never existed. If you feel that perhaps these are trivial changes to the
data, then read on. You will realise that the slightest error in say O2 will result in a very significant error
in VO2. (in fact about 50x the O2 error). Read on.
Below is a spreadsheet developed by us to track mathematically an error in any one or
multiple VO2 parameters and their effect on VO2 error. We normally use these to see how
much error any metabolic system has based on manufacturer’s specifications.
VO2 calculator Redback Biotek – Error sum of squares for absolute error (2)
Lets just see what happens if we insert an O2 error of about 5% relative and a Flow error of the same.
This I’m sure you’ll agree looking at the above graph is pretty generous, as it could be much more with
all the probable causes in error already discussed (with much more to come).
So with reference values set at say:
FiO2=21%, FeO2=17.5%, FiCO2=0.04, FeCO2=3.8, Ve=137L/min.
Now lets add the 5% relative error to FeO2 and Ve only.
This would change these values to:
FeO2= 18.3% , Ve=143.8L/min
Its not looking too drastic at all at this stage. However using our error algorithm above,
this amounts to a
VO2 error = 33.9%
If you cant follow or dont believe the algorithm, simply do the VO2 calculation yourself with these
new values and see what the new VO2 will be.
Also keep in mind we have not added any error to CO2 or any other sensors. For example
humidity can play havoc on this measurement if we dont have an excellent drying system. Any if you
test many athletes at once, this perfect drying system will certainly have its work cut out.
(Dr. Chris Gore et. al, Australian Institute of Sport. Error contributions to VO2.)
Now one could say, that we could smooth these values by averaging and this is very true. But now you
have created a very noisy signal that you need to smooth and average. This will never properly reconstruct
the data to something useful for athletic use, where we know that very small changes of say 2-5% in VO2
are very large improvements for an athlete even over the course of one to two years. Added to the fact that
not only is it noisy but seriously erroneous based on misaligned data.
To get closer to the culprit in VO2 errors we simply need to examine the effect of O2 errors
on the VO2 calculation. This holds true for mixing chamber as well as BxB systems.
If we take the error calculating spreadsheet above and look at FeO2=17.5% as previously discussed,
what happens if we have even just 0.1% absolute error in the O2 sensor. (pretty common accuracy)
So FeO2=17.6%, put this to the VO2 equation and you’ll find a VO2 error of about 4%. Too high an
increment for tracking athlete improvements. Add the typical flow sensor error of 2-3% and you have
quite a lot of error already.
0.1% O2 error = 4% VO2 error
2.5% Flow error = 2.5% VO2 error
So its not enough to have a mixing chamber system, you must have a very accurate O2 sensor also!
Its goes without saying that you can add the error of your calibration gas to this error. So for this reason,
any calibration gas that has an error of more than 0.02% absolute is not useful. (VO2 error=about 0.8%)
Some gases supplied to many labs have certification at 5% relative (This is roughly 1% absolute).
If 0.1% gives an error of about 4% VO2, how can we accept a gas with a 1.0% error (ten times worse)?
O2 Cal gas error 0.02% = 0.8% VO2 error
O2 Cal gas error 0.1% = 4.0% VO2 error
O2 Cal gas error 1.0% = 33.2% VO2 error
Some people may think that sensors measure the percentage of O2 or CO2. This is not true. In fact, all a sensor
can hope to measure is the amount of O2 or CO2. At least with sensors to date. So it responds to the amount of
O2 or CO2 presented to the sensor. This means that we must keep the flow through the sensor as constant
as possible. So that the changes in O2 or CO2 can only be the result in the change in concentration.
This is not a problem for Mixing chamber systems as the mixing chamber dampens any flow variations
from the breathing tube. Unfortunately, in the case of BxB systems, the sample line is constantly experiencing
large flow changes. This is due to the normal flow of expired followed by inspired air happening right at the
point of sampling. This means it is well-nigh impossible to maintain a constant flow into the O2 and CO2 sensors.
This makes for a very erroneous and unsettled O2 and CO2 values (keep in mind sensors take time…about
0.1 to 3.0 seconds, just to settle…how can you ever get a true reading of the situation). Unfortunately, you now
must also mutiply this oscillating signal of O2 to Flow as per above. As if the situation wasnt bad enough
already when we didnt assume these issues, we’re really making a bad situation very much worse.
To make matters worse, the sample line is not just subject to unstable flow, but it is also
subject to large swings in concentration, due to the pressure changes at the mouthpeice.
To explain, as the breath is expired, the sample line and sensors see the human expired gas
but when an inspiration occurs, the pressure reverses and the sample line and sensors are
now looking at the room air coming into the mouthpeice. This means that not only are the sensors
given no chance to settle but they are also having to make these large changes from room air
(21% O2) to expired gas (15% O2), in a very short space of time. (Here the mixing chamber wins
again as the expired gas goes from a narrow tube to a large volume of mixing chamber and any pressure
variations are damped significantly, giving the sensors a view of really only expired gas and not
Mixing chamber…see below.
In the special case where running is involved (but most movements will cause these issues),
we have an added overlayed error in the signal caused by movement mechanics. This means in
exercise we have an extra noisy flow signal (and possibly…not yet known…O2 and CO2). See below.
Again, this noise is absorbed in a Mixing chamber system. The BxB method is really now looking like
an exceptionally noisy and erroneous methodology compared to other methods.
Daley MA, (2013) Impact Loading and Locomotor-Respiratory Coordination Significantly Influence Breathing Dynamics in Running Humans.
As you can see there are many significant disadvantages of BxB systems over other methodologies,
including Mixing chamber. In essence, it is not reliable for athletic measurements. If however, you
can live with errors in the order of 15 to 30%, then you’re probably OK with BxB.
Finally, Mixing chamber systems are not perfect but they have far more chance of measuring the actual
values originally created by the human subject. The main issue with them (apart from being more awkward
to implement) is that the mixing chamber is not perfectly one breath in volume, which leads to some
minor errors. In fact if someone could invent a mixing chamber that adapted its volume to one breath
for every individual, we’d be much closer to a perfect system.