The Weibull function is applied extensively in the life sciences and engineering but underused in agriculture. The function was consequently adapted to include parameters and metrics that increase its utility for characterizing agricultural processes. The parameters included initial and final dependent variables (Y0 and YF, respectively), initial independent variable (x0), a scale constant (k), and a shape constant (c). The primary metrics included mode, integral average, domain, skewness, and kurtosis. Nested within the Weibull function are the Mitscherlich and Rayleigh functions where c is fixed at 1 and 2, respectively. At least one of the three models provided an excellent fit to six example agricultural datasets, as evidenced by large adjusted coefficient of determination (RA2 ≥ 0.9266), small normalized mean bias error (MBEN ≤ 1.49%), and small normalized standard error of regression (SERN ≤ 8.08%). The Mitscherlich function provided the most probable (PX) representation of corn (Zea mays L.) yield (PM = 87.2%); Rayleigh was most probable for soil organic carbon depth profile (PR = 96.4%); and Weibull was most probable for corn seedling emergence (PW = 100%), nitrous oxide emissions (PW = 100%), nitrogen mineralization (PW = 58.4%), and soil water desorption (PW = 100%). The Weibull fit to the desorption data was also equivalent to those of the well-established van Genuchten and Groenevelt–Grant desorption models. It was concluded that the adapted Weibull function has good potential for widespread and informative application to agricultural data and processes.
1. Introduction
Agricultural processes are amongst the most complex known, owing to their myriad of interacting chemical, physical, and biological factors. As a result, description and characterization of agricultural processes rely largely on fitted polynomial splines and empirical functions, rather than on theoretical models derived from first principles. Although this may be less than ideal, fitted splines and empirical functions can still be very useful. For example, splines and functions can often simulate the highly complex process of crop seedling emergence and can thereby provide objective estimates of important seed performance indicators such as time of first emergence, duration of emergence, maximum emergence rate, and average emergence rate. Fitted splines and empirical functions can also model strongly non-linear agricultural processes, such as nitrogen dynamics and soil water retention/release, which then serve as essential regulating functions in numerical crop growth and soil gas/water/solute balance models.
However, splines and empirical functions are frequently not equal when applied to agricultural data. Splines often perform unrealistically when applied to data that are irregularly spaced, change rapidly in value, exhibit outliers, and contain appreciable experimental and (or) random error, which are all “hallmark traits” of agricultural datasets. Empirical functions, in contrast, are not unduly perturbed by the above data traits; and when regression fitted, they are particularly adept at separating out the underlying data signal from random noise. As a result, empirical functions are often preferred over splines for describing and characterizing agricultural datasets.
One well-established empirical function that has potentially widespread agricultural applications is the so-called Weibull function. The function was first promoted in a publication by W. Weibull (Weibull 1951) and it has increased in popularity ever since, especially for characterizing extreme event processes. Some common life science and engineering applications include characterizing biological growth, ocean currents, particle size distributions, and failure of electrical insulation; determining breaking strength of natural fibers and building materials; assessing yield force, fatigue, and brittle failure of metals; estimating likelihood of infrequent events such as earthquakes, floods, gales, and tsunamis; and failure analysis of equipment such as lightbulbs, automotive parts, electronics, and industrial machinery (Weibull 1951; Monahan 2006; Abernethy 2010; Chu 2013; Brown and Mayer 1988; Mahanta and Borah 2014). Although the Weibull function is quintessentially empirical (Weibull 1951), it is nonetheless closely related to the power law distribution of flaw sizes, classical fracture mechanics, and weakest link theory (Zok 2017; Quinn and Quinn 2010).
The main popularizing features of the Weibull model are its relative simplicity, extensive versatility, and ability to perform well with small datasets and extreme data values (e.g., Abernethy 2010; Chu and Ke 2012; NCSS, Weibull 2020). More specifically, the standard Weibull function has only three fitting parameters, but it can still exhibit varying degrees of left skew, symmetry, right skew, leptokurtosis, mesokurtosis, and platykurtosis; and it can accommodate a variable inflection point, a variable y-axis intercept, and a variable x-axis threshold. In addition, Chu and Ke (2012) found that estimates of Weibull parameters can remain useably accurate even with datasets as small as 5–10 points. None of the other commonly used empirical functions (e.g., Gompertz, Logistic, Richards, Beta, Gumbel, Normal, Lognormal, etc.) can match the Weibull combination of simplicity, flexibility, and range of application.
Despite the above advantages, Weibull applications to agricultural processes appear limited to a relatively small number of studies using specialized (and restrictive) equation forms for characterizing crop yield and biomass production (e.g., Karadavut and Tozluca 2005; Karadavut et al. 2010), seed germination (e.g., Brown and Mayer 1988; Gardarin et al. 2011), and seedling emergence (e.g., Aboutalebian et al. 2017; Izquierdo et al. 2013; Navarro et al. 2013; Gan et al. 1996). The objectives of this study were consequently to (i) derive an adapted Weibull function that is applicable to a wide range of agricultural processes; (ii) describe Weibull function characteristics, parameters, and metrics that may be useful in agricultural applications; and (iii) present six illustrative Weibull characterizations of agricultural datasets.
We start by deriving an adapted Weibull function (Section 2.1.), then illustrate the primary Weibull characteristics (Section 2.2.), describe potentially useful function parameters and metrics (Section 2.3.), identify nested special case and related functions (Section 2.4.), describe a convenient but rigorous method for determining Weibull parameters (Section 2.5.), and finally, present six example applications to diverse agricultural datasets (Section 3.). Although the Weibull model may have theoretical justification for some processes, it is applied here as a purely empirical function for parameterizing and characterizing agricultural processes and not for making time/space/quantity predictions or extrapolations beyond fitted datasets.
2. Materials and Methods
2.1. Derivation of an adapted Weibull function
The Weibull relationship includes a “derivative function” and a “cumulative function”. The derivative function can be written as a “diminishing response” relationship:
where Y is the dependent variable, x is the independent variable, c is a shape parameter affecting function symmetry, α is a scale or proportionality parameter affecting function domain, x0 is an initial/threshold/minimum allowable x-value, and YF is the final (end-point) Y-value as x becomes large. By “diminishing response”, we mean that dY/dx → 0 as Y → YF, regardless of other parameter values.
Given eq. 1, an adapted Weibull cumulative function is obtained by separating variables and integrating between (x0,Y0) and (x,Y):
which leads toand ultimately simplifies towhere Y = Y0 at x = x0, and . Equation 4 is effectively a five-parameter Weibull cumulative function, as it contains five independent parameters (YF, Y0, x0, k, and c) that can be (in any combination) specified, independently measured, or determined by curve fitting to Y vs. x measurements (elaborated later). In addition, eq. 4 accommodates both YF > Y0 and YF < Y0 (Section 2.2.). The c parameter in eq. 4 is occasionally referred to as the “Weibull slope” (Abernethy 2010) because in traditional engineering analysis, Y vs. x data were plotted as ln[−ln(Y)] vs. ln(x), c.f.which yields a straight line with slope = c and Y-axis intercept = ln(k) = cln(kW) if the data follow a Weibull function. The kW parameter is sometimes referred to as the “Weibull scale constant”, or the “Weibull rate constant” if the x-axis represents time.
2.2. Essential characteristics of the five-parameter Weibull function
The essential characteristics of the five-parameter Weibull derivative and cumulative functions are illustrated in Fig. 1. The derivative function is negative monotone and unbounded (dY/dx → ∞ as x → x0) when c < 1; negative monotone and bounded (dY/dx finite at x = x0) when c = 1; bell shaped and right skewed when 1 < c < 3.6; bell shaped and near symmetrical when c = 3.6; and bell shaped and left skewed when c > 3.6 (Fig. 1a). The corresponding cumulative function has an inflection when c > 1; no inflection when c ≤ 1; and decreasing k causes increasing function domain but decreasing average slope (Fig. 1b). Note also that constant k, x0, YF, and Y0 causes all Y vs. x relationships to intersect at a common point regardless of c value; and that Y vs. x is increasing when YF > Y0, but decreasing when YF < Y0 (Fig. 1b).
An interesting feature of the Weibull function is its ability to indicate the likelihood (or chance) of an event or condition happening at any particular x value, given that the event or condition has not already happened (known in engineering as the Weibull “hazard” function). In processes where c ≠ 1, the likelihood of an event or condition occurring changes with x (these processes are said to have “memory”), whereas in processes with c = 1, the likelihood does not change with x (these processes are said to be “memoryless”) (Glen 2020a). Furthermore, when c < 1, likelihood of the event or condition happening decreases at a decreasing rate with increasing x; when c = 1, likelihood is constant (independent of x); when 1 < c < 2, the likelihood increases at a decreasing rate with increasing x; when c = 2, likelihood increases linearly with increasing x; and when c > 2, likelihood increases at an increasing rate with increasing x (see e.g., figs. 2–8 in Abernethy 2010). Seedling emergence is an example of an agricultural process which may or may not exhibit memory. If a Weibull fit to measured seedling emergence (Y) vs. time (x) yielded c > 1, the likelihood of emergence increased as time increased, presumably due to viable seed and favourable seedbed conditions (e.g., amenable soil temperature, moisture, density, fertility, etc.). If the fitted function yielded c < 1, likelihood of emergence decreased with increasing time, suggesting low viability seed and (or) poor seedbed conditions. If c = 1 was obtained, likelihood of seedling emergence was constant through time and not affected by seed quality or seedbed conditions. The memory property is illustrated further in Section 3.
2.3. Parameters and metrics obtainable from the adapted Weibull function
As mentioned above, the parameters of the adapted (five-parameter) Weibull derivative and cumulative functions (i.e., YF, Y0, x0, k, and c) may be obtained by any combination of specification, independent measurement, and estimation by curve fitting to Y vs. x data. Potentially useful derivative function metrics include mode , median , skewness (SF), kurtosis (KF), and integral average Cumulative function metrics of potential use include mean , inflection , quantiles , and domain . These metrics are briefly derived and discussed in Appendix A, summarized in Table 1, and illustrated in Fig. 2. This study focuses on the YF, Y0, x0, and c parameters and on the xD, xM, RM, AI, SF, and KF metrics.
Table 1.
Selected metrics of an adapted five-parameter Weibull function (see also Appendix A, Table 2, and Fig. 2).
2.4. Special cases of the Weibull function and related functions
The five-parameter Weibull function collapses to four-parameter Mitscherlich and Rayleigh functions when c = 1 and 2, respectively (e.g., Lai and Xie 2006). The four-parameter Mitscherlich functions (which are effectively exponential functions) are given by
where is the Mitscherlich scale constant, and the shape constant is fixed at c = 1. Distinguishing features of the Mitscherlich functions include no inflection, mode located at , and fixed Fisher skewness and kurtosis values of and , respectively (Weisstein 2004). Some traditional agricultural applications of Mitscherlich functions include description of nitrogen mineralization (e.g., Yang et al. 2020), characterization of nitrous oxide and carbon dioxide emissions from soil (e.g., Hoben et al. 2011; Gillis and Price 2011), depiction of crop and weed seed germination and (or) emergence (e.g., Brown and Mayer 1988; Gill et al. 1996), modeling crop growth and (or) yield responses to fertilizer rates (e.g., Srivastava et al. 2006), and simulating annual rainfall (e.g., Harmsen 2000) and water infiltration rate (e.g., Horton 1940).It should also be noted that the four-parameter Mitscherlich cumulative function (eq. 7) is a generalization of a single-pool, first-order reaction kinetics model developed by Beauchamp et al. (1986) for describing mineralization of soil nitrogen (N) in laboratory incubation studies:
where (mg N·kg−1 soil) is the amount of nitrogen mineralized at time t (d), (mg N·kg−1) is the total amount of nitrogen available to mineralize (pool of mineralizable soil nitrogen), β (d−1) is the mineralization rate constant, and (mg N·kg−1) is either the amount of mineralized N already present in the soil at t = 0, or the amount of “easily mineralizable” N that mineralized rapidly in the early stages of the incubation before measurements started. As eq. 8 assumes implicitly that t0 = 0 (i.e., x0 = 0 in eq. 7), then plots of vs. t intersect the y-axis at (see e.g., Figure 4 in Beauchamp et al. 1986). Note also that the mineralization “half time” or “half life”, t1/2 (i.e., the time required for half of to be mineralized), is a special case of the median x (, eq. A6) or the 0.5 quantile x (, eq. A23):since x0 = 0, c = 1 and β = k.
The four-parameter Rayleigh functions are
where is the Rayleigh scale constant, and the shape constant is fixed at c = 2. The Rayleigh functions have variable mode, median, mean, and inflection (as with Weibull) but fixed Fisher skewness and kurtosis of and , respectively (Weisstein 2020). To the authors’ knowledge, Rayleigh functions have not been applied to agricultural processes but are used extensively for characterizing communication signals, wind speed, wave height, sound and (or) light radiation, electronics longevity, and magnetic resonance images (e.g., Glen 2020b).In addition to the above special cases, Weibull functions provide close approximations to the Lognormal distribution when c = 2.5 (NCSS, Weibull 2020), the Normal distribution when c = 3.6023494 (Cousineau 2011; Lai and Xie 2006; NCSS, Weibull 2020; Fig. 1a), and the Gumbel distribution when c is large (Cousineau 2011). This obviously serves to further extend the flexibility and applications of Weibull functions.
2.5. Determining Weibull parameters
2.5.1. Parameter estimation methods
The main approaches for estimating Weibull parameters from Y vs. x data include method of moments, maximum likelihood estimation, and model-data curve fitting using Weibull probability plots (eq. 5) or iterative non-linear least-squares regression (Cousineau 2009; Abernethy 2010; Lai and Xie 2006; Quinn and Quinn 2010; Evans et al. 2019; NCSS, Weibull 2020). Of these, maximum likelihood and curve fitting via regression provide objective (and often the most accurate) parameter estimates (NCSS, Weibull 2020). However, regression may provide more accurate estimates of than maximum likelihood (NCSS, Weibull 2020), as well as more accurate estimates of all Weibull parameters when the dataset is small (Chu and Ke 2012). Notwithstanding that commercial software packages (e.g., Relyence, Weibull 2020; ReliaSoft, Weibull 7.0 2020; Reliability Workbench, Weibull Analysis 2020; NCSS, Weibull 2020) provide a variety of methods for estimating Weibull parameters, model-data curve fitting via regression may be the most widely accessible because it resides as a built-in application in most computer spreadsheets. Iterative non-linear least-squares regression was used here by applying the Solver® algorithm in the Excel® spreadsheet to minimize the sum of squared errors between the adapted Weibull cumulative function (eq. 4) and Y vs. x data. One or more of YF, Y0, x0, k, and c were treated as curve-fitting parameters, and the Weibull scale parameter, kW, was separated from the shape parameter, c, to improve parameter identifiability (i.e., k was used instead of kWc, Banks and Joyner 2017). Initial guess values for the curve-fitting parameters were obtained by manual “trial-and-error” matching of model prediction to Y vs. x data and by plotting the data according to eq. 5 (when possible) to estimate c from slope and k from intercept. Whenever possible, the number of parameters used for curve fitting was ≤ half the number of data points to increase the likelihood of obtaining unique parameter values. The options used in the Solver fitting algorithm included generalized reduced gradient non-linear numerical iteration, slope calculation using central derivatives, automatic scaling, and a 10−10 convergence criterion.
2.5.2. Assessing model-data fits
The accuracy and validity of model-data fits were assessed using various “goodness of fit” indicators, including adjusted coefficient of determination, normalized mean bias error, normalized standard error of regression (e.g., Archontoulis and Miguez 2015), and regression of measured Y (y-axis) against model-predicted Y (x-axis) (Piňeiro et al. 2008).
Adjusted coefficient of determination for non-linear least-squares regression (RA2) is given by
normalized mean bias error (MBEN) byand normalized standard error of regression (SERN) bywhere is the regression sum of squared errors, is the regression total sum of squares, and are, respectively, the model-predicted and measured Y values at each x-value, is the mean of the measured Y values, B is the number of data points (measurements), and L is the number of model fitting parameters (excluding intercept). The RA2 metric indicates degree to which the fitted model predicts (or explains) systematic variation in the data, and accounts for the number of model fitting parameters (L) (e.g., Archontoulis and Miguez 2015). The MBEN (also known as “normalized mean prediction error”, MPEN) indicates degree of systematic model bias, with positive values indicating net overestimate of the data by the fitted model, and negative values indicating net underestimate (e.g., Archontoulis and Miguez 2015; Schjønning et al. 2017). The SERN indicates the “predictive power” of a fitted model with L fitting parameters, where SERN = 0 indicates perfect prediction (i.e., no discrepancies between predicted and measured values). Given that SERN is functionally related to normalized root mean square error, RMSEN, i.e.,the RMSEN predictive power categories of Jamieson et al. (1991) can be used as conservative SERN categories, i.e., 0 ≤ SERN < 10% indicates excellent model prediction of the data; 10% ≤ SERN ≤ 20% signifies good prediction; 20% < SERN ≤ 30% indicates fair prediction; and SERN > 30% signifies poor prediction.
According to the analysis of Piňeiro et al. (2008), model representations of data are viable only when the slope and intercept of measured Y () regressed against model-predicted Y ( are not significantly different from unity and zero, respectively. When this occurs, the 95% confidence limits on slope fall on either side of unity, and the 95% confidence limits on intercept fall on either side of zero (at P < 0.05 significance level). If the regression slope is significantly different from unity (i.e., 95% confidence interval excludes unity), the model may be compromised due to “inconsistency” (with the data); and if the regression intercept is significantly different from zero (i.e., 95% confidence interval excludes zero), the model may be compromised due to “bias”. Generally speaking, the confidence limit “band” (i.e., upper confidence limit minus lower limit) becomes narrower and more symmetrical about the vs. regression line as fit between model and data improves.
2.5.3. Selecting the most suitable function or model
As the Mitscherlich and Rayleigh models are simpler special cases of the Weibull model (i.e., they are “nested” within the Weibull model because they have fixed c values), it is advisable to determine which of the three models provides the best balance between goodness of fit to the data and model simplicity, or “parsimony” (e.g., Johnson and Omland 2004; Vandekerckhove et al. 2015). This was achieved here using the two-model partial F test and the corrected Akaike information criterion (AICC).
The two-model partial F test (FP) may be given as (e.g., Archontoulis and Miguez 2015):
where SSEW and K are, respectively, the sum of squared errors and number of fitting parameters for the Weibull model, B is the number of Y vs. x measurements, FW-X is F test significance level, F.DIST.RT is the right-tailed F probability distribution (as represented in Excel®), and SSEX and J are, respectively, the sum of squared errors and number of fitting parameters for the Mitscherlich or Rayleigh models. If FW-M ≤ 0.05, the Weibull fit is significantly better (at P < 0.05) than the Mitscherlich fit; and if FW-R ≤ 0.05, the Weibull fit is significantly better (at P < 0.05) than the Rayleigh fit. Note that the FW-X test cannot compare models with an equal number of fitting parameters (K = J).For ordinary least-squares regression, the AICC is given by (Banks and Joyner 2017)
where SSEX is the regression sum of squared errors for the fitted model (i.e., Weibull, Mitscherlich, or Rayleigh), B is the number of data points, and L is the number of model parameters used as least-squares fitting variables (as c is fixed in Mitscherlich and Rayleigh, these models will usually have one less fitting variable than the Weibull model). The first term on the right of eq. 17 represents the model’s “goodness of fit”, and the last two terms represent a “parsimony penalty” (Banks and Joyner 2017; Vandekerckhove et al. 2015). The parsimony penalty increases with increasing number of model fitting variables (increasing model complexity), and it is designed to compensate for model over-fitting; i.e., spurious improvements in model-data fit caused by adding fitting variables that fit largely to the data’s random noise, rather than to the data’s underlying signal (Johnson and Omland 2004; Vandekerckhove et al. 2015). Generally speaking, the model providing the lowest AICC value is considered the “best estimator” of the data (Banks and Joyner 2017; Vandekerckhove et al. 2015). Note also that AICC applies for both nested and un-nested models, and that two or more models can be compared simultaneously (see e.g., Banks and Joyner 2017).
The lowest AICC alone does not indicate how much more probable the “best estimator” model is relative to the other models, especially if differences among AICC values are small. To alleviate this deficiency, likelihood of being the “best estimator” model can be assessed using normalized probabilities, PX, derived from differences among AICC weights (Banks and Joyner 2017; Vandekerckhove et al. 2015), i.e.,
where are normalized probabilities (%), and are the corresponding AICC weights for the Weibull, Mitscherlich, and Rayleigh models, respectively. The AICC weights are in turn given by (Banks and Joyner 2017)where , andwhere , , and are the values of the Weibull, Mitscherlich, and Rayleigh models, respectively, and is the smallest value of the three models. The relative magnitudes among PX values indicate if the AICC values are sufficiently different to select any one model as being better than the others for estimating or representing a particular Y vs. x dataset (Banks and Joyner 2017).Model suitability can also be assessed by comparing a plot of the model-predicted derivative function, , to a finite difference estimate of the actual derivative function, , where
and B is the number of measured data pairs. The comparison can be informal (e.g., visual) or quantified using goodness of fit criteria (e.g., eqs. 12–14, 18–20). Although such comparisons are absent (or rare) in the literature, they can be critically important as good correlation between the model-predicted and estimated derivative function implies model compatibility with underlying biological–physical–chemical processes. To be useful, this approach obviously requires relatively low random variability in the measurements.
3. Example Applications
3.1. Seedling emergence
Seedling emergence typically occurs several days after planting of crop seeds or plow-down of weed seeds. Cumulative seedling emergence versus time graphs usually have a sigmoid or concave shape (e.g., Haj Seyed Hadi and Gonzales-Andujar 2009; Forcella et al. 2000), which is determined largely by seed viability and (or) dormancy and seedbed conditions. Both the time delay and emergence versus time patterns of seedling emergence are amenable to characterization using the Weibull–Mitscherlich–Rayleigh models and are illustrated here for emergence of grain corn (Zea mays L.) seedlings. Figure 3 shows corn emergence data and fitted models, where Y0 and x0 were specified constants, YF and k were fitted for the Mitscherlich and Rayleigh models, and YF, k, and c were fitted for the Weibull model. Table 3 gives 95% confidence limits for slope and intercept of vs. ; Table 4 gives parameter values and fit metrics.
The Weibull and Rayleigh models produced better visual fits than Mitscherlich to both cumulative emergence (Fig. 3a) and estimated emergence rate (Fig. 3b). Note in particular that Weibull and Rayleigh tracked the estimated emergence rate data quite well, whereas Mitscherlich did not (Fig. 3b), suggesting that the emergence mechanism was not a Mitscherlich-type first-order exponential. The fit statistics for Weibull were excellent (RA2 = 1.0000, MBEN = −3.56 × 10−5%, SERN = 0.05%), whereas those for Rayleigh were somewhat poorer (RA2 = 0.9988, MBEN = 0.14%, SERN = 2.17%), and those for Mitscherlich were substantially poorer (RA2 = 0.9617, MBEN = 0.59%, SERN = 12.4%) (Table 4). In addition, the 95% confidence limits for slope and intercept of YiM vs. YiP were very narrow for Weibull, whereas those for Mitscherlich and Rayleigh were much wider (Table 3). Evidently, Weibull’s excellent model-data fit was sufficient to over-ride its parsimony penalty (due to having one more fitting parameter), as the Weibull fit was highly significant (FW-M < 0.001, FW-R < 0.001), and Weibull was by far the most probable of the three models (PW ≈ 100%, PR ≈ 0%, PM ≈ 0%). The fitted Weibull model indicated a final seedling emergence (YF) of 104%, a final emergence time (xF) of 8.3 d after planting, a maximum emergence rate (RM) of 77.3% d−1 at day 6 after planting (xM), and an integral average emergence rate (AI) of 31.1% d−1 over the 3.3 d (xD) duration of seedling emergence (Table 4). Note also that since c = 2.3 was obtained from the Weibull fit (Table 4), likelihood of corn seedling emergence increased with time at an increasing rate, suggesting viable non-dormant seed and favourable seedbed conditions. The Weibull-fitted emergence rate (derivative) function (Fig. 3b) was moderately right skewed (SF = 0.4536), but only slightly less tailed (KF = −0.0353) than a normal distribution (Table 2).
Table 2.
Proposed skewness (SF) and kurtosis (KF) categories based on the inclusive graphic classifications of Folk (1980).
3.2. Nitrous oxide emissions
Nitrous oxide (N2O) emissions from agricultural soil occur primarily as a result of microbial nitrification and denitrification of applied nitrogen sources, such as fertilizers and organic amendments (e.g., Signor and Cerri 2013). Although cumulative N2O emission versus time curves can assume a wide variety of shapes, by far the most common are concave and sigmoid forms, which are readily characterized using the Weibull–Mitscherlich–Rayleigh models. Figure 4 shows cumulative N2O emissions and fitted models for corn production on a clay loam soil that had received starter fertilizer (8–32–16) on Julian day 136 and side-dress nitrogen fertilizer (28% UAN) on Julian day 170 (see Drury et al. 2021 for details). Because there were more than twice as many data points (21) as fitting parameters (4–5), Y0, YF, x0, and k were fitted for the Mitscherlich and Rayleigh models, and Y0, YF, x0, k, and c were fitted for Weibull. Tables 3 and 5 give the parameter values and fit metrics.
Table 3.
Confidence limits (CL, 95%) on slope and y-axis intercept of measured dependent variable (YiM) regressed against model-predicted dependent variable (YiP) for the Weibull, Mitscherlich, Rayleigh, van Genuchten, and Groenevelt–Grant models.
The measured N2O emissions formed a sigmoid cumulative curve (Fig. 4a) and a bell-shaped derivative, or emission rate, curve (Fig. 4b). The Weibull and Rayleigh models produced visually good model-data fits, but Mitscherlich was clearly not competitive. The 95% confidence limits for slope and intercept of YiM vs. YiP (Table 3), and the RA2, MBEN, SERN, FW-M, and FW-R values (Table 5), further showed that the Weibull fit was excellent and significantly better than Rayleigh and Mitscherlich. As a result, Weibull was highly probable (PW = 100%) despite its parsimony penalty, while Rayleigh and Mitscherlich were highly improbable (PR ≈ 0%, PM ≈ 0%) (Table 5). The fitted Weibull model indicated maximum cumulative N2O emission (YF) of 16 769 g N2O-N·ha−1 at Julian day 231 (xF), a maximum emission rate (RM) of 424 g N2O-N·ha−1·d−1 at Julian day 178 (xM), and an average emission rate (AI) of 170 g N2O-N·ha−1·d−1 which occurred over a 98 d emission period (xD) (Table 5). Note as well that since c = 3.2669 was obtained (Table 5), likelihood of emission increased with time, and emissions were almost normally distributed (SF = 0.0872, KF = −0.2884, Table 2).
3.3. Yield change
Corn is highly sensitive to the soil environment, and annual yields decline when root zone fertility and biophysical health start to deteriorate (e.g., Bennett et al. 2012). Figure 5 shows annual grain yields and fitted models for unfertilized monoculture corn grown under no-tillage on a sandy loam soil after a one-time application (in 2012) of yard waste compost (experimental details given in Reynolds et al. 2020). Parameters Y0 and x0 were specified constants, whereas YF and k were fitted for Mitscherlich and Rayleigh, and YF, k, and c were fitted for Weibull. Parameter values and fit metrics appear in Tables 3 and 6.
The measured yields decline with time (Fig. 5a), which presumably reflects deteriorating soil fertility and health as the oxidizing compost loses its ability to supply nutrients and maintain a good biophysical environment. The data-estimated rate of yield decline (eq. 21, Fig. 5b) was too scattered to be useful, and it may reflect annual variation in weather. The fitted Weibull and Mitscherlich models predicted yield (Fig. 5a) and yield change (Fig. 5b) curves which were similar and convex in shape. The fitted Rayleigh model, in contrast, produced a sigmoid curve for yield (Fig. 5a) and a right-skewed bell curve for yield change (Fig. 5b), which was expected as this model’s shape parameter is fixed at c = 2 (eqs. 10 and 11; Figs. 1a, 1b). The RA2 and SERN fit metrics (Table 6) and confidence limits for YiM vs. YiP (Table 3) were similar between Weibull and Mitscherlich, and better than those for Rayleigh. However, the Weibull fit was not significantly better than the others (FW-M = 0.7576, FW-R = 0.1946); and this combined with the Weibull parsimony penalty, caused Mitscherlich to be most probable (PM = 87.2%), with Rayleigh second (PR = 10.1%), and Weibull third (PW = 2.7%). The fitted Mitscherlich model indicated that unfertilized grain yield declined over a period of about 9.8 yr (xD) and stabilized at about 7.5 t·ha−1 (YF). The model also indicated a maximum rate of yield decline of 6.9 t·ha−1·yr−1 (RM) during the first year after compost addition (xM = 2012), and an average rate of yield decline of 0.75 t·ha−1·yr−1 (AI) over the 9.8 yr yield stabilization period (xD). Since c = 1 for Mitcherlich (Table 6), yield decline was predicted to be memoryless (i.e., likelihood of yield decline was constant and independent of time). As expected, the Mitscherlich rate of yield change (derivative) function (Fig. 5b) was strongly right skewed (SF = 2) and substantially more tailed (KF = 6) than a normal distribution (Table 2).
3.4. Nitrogen mineralization
Potential supply and release dynamics of crop-available nitrogen are often determined using laboratory incubations, wherein nitrogen release vs. time is measured for soil amended with fertilizers or organic materials (e.g., Yang et al. 2020). Nitrogen release curves can have a variety of shapes, but they are most often sigmoid or monotone, and thereby amenable to characterization using the Weibull–Mitscherlich–Rayleigh family of models. Figure 6 shows example mineralized nitrogen (N) release data (from Yang et al. 2020) and model fits for soil amended with crimson clover shoots. Because there were more than twice as many data points (11) as fitting parameters (4–5), Y0, YF, x0, and k were fitted for Mitscherlich and Rayleigh, and Y0, YF, x0, k, and c were fitted for Weibull. Tables 3 and 7 give parameter values and fit metrics.
All three models generated visually reasonable fits to cumulative N released vs. time data (Fig. 6a); however, only Weibull and Mitscherlich produced plausible estimates of mineralization rate (dN/dt) vs. time (Fig. 6b). Hence, Rayleigh was deemed non-competitive and not considered further. The Weibull fit was most probable of the three models (PW = 58.37%, PM = 41.63%, PR = 0.0%) (Table 7), and its 95% confidence limits for slope and intercept of YiM vs. YiP were narrowest (Table 3). The Weibull fit also had excellent metrics (RA2 = 0.9988, MBEN = 5.92 × 10−8 %, SERN = 1.49%) which were better than Mitscherlich (RA2 = 0.9979, MBEN = 0.097%, SERN = 1.96%) (Table 7). The two models may actually be equivalent; however, as Weibull’s probability, fit metrics, and confidence limits were only slightly better than Mitscherlich, and the partial F-test for Weibull vs. Mitscherlich was not significant (FW-M = 0.0669). In addition, Weibull and Mitscherlich gave similar values for Y0, YF, k, c, RM, and AI (Table 7), and both indicated that mineralization was effectively memoryless (i.e., c ≈ 1, Table 7), strongly right skewed (SF = 1.8–2, Tables 2 and 7), and moderate to strongly leptokurtic (KF = 4.5–6, Tables 2 and 7). Note also that although both models track N mineralization rate (dN/dt vs. t) equally well (Fig. 6b), Weibull indicates that dN/dt = 0 at t = 0 (since c = 1.09 > 1), whereas Mitscherlich indicates that dN/dt = RM = 2.45% d−1 at t = 0 (since c = 1) (Table 7). Hence, the better model of the two may actually be the one that gives the most physically plausible representation of N mineralization rate at zero and near-zero time, given that most of the other parameters and metrics were similar.
3.5. Soil organic carbon depth profile
Soil organic carbon content (SOC) is perhaps the single most important soil attribute, as it affects crop productivity, most soil properties, and most soil quality and health indicators (e.g., Gregorich et al. 1997). Accurate and detailed characterization of SOC depth profiles can, therefore, be critically important for analytical and numerical models describing carbon sequestration, water transmission and storage, solute transport, crop growth, and environmental impact. Figure 7 gives an example SOC profile and corresponding Weibull–Mitscherlich–Rayleigh fits for a clay loam soil under a long-term corn–oat–alfalfa–alfalfa rotation. Parameters Y0 and x0 were specified constants, whereas YF and k were fitted for Mitscherlich and Rayleigh, and YF, k, and c were fitted for Weibull. The corresponding parameter values and fit metrics are in Tables 3 and 8.
The SOC data profile is clearly sigmoidal (Fig. 7a) with a bell-shaped derivative function (Fig. 7b) — hence, well fitted by the Weibull and Rayleigh models but poorly fitted by Mitscherlich. The model-data fit metrics were excellent for Weibull (RA2 = 0.9950, MBEN = −0.30%, SERN = 4.36%), whereas Rayleigh was a close second (RA2 = 0.9890, MBEN = −1.17%, SERN = 6.45%), and Mitscherlich was a distant third (RA2 = 0.7827, MBEN = −2.44%, SERN = 28.69%) (Table 8). However, the Weibull fit was not significantly better (FW-M = 0.0685, FW-R = 0.3171); and when parsimony is taken into account, Rayleigh was most probable by far (PR = 96.41%), with Weibull a distant second (PW = 3.53%), and Mitscherlich a very distant third (PM = 0.06%) (Table 8). Although this clearly indicates that Rayleigh has the best balance of fit and parsimony, there may be other factors to consider. For example, Weibull not only produced better fit metrics than Rayleigh but also narrower confidence limits for slope and intercept of YiM vs. YiP (Table 3), and a better visual fit to the −d(SOC)/dz vs. z data (Fig. 7b). Hence, choosing the most appropriate model may require subjective reasoning (e.g., visual model-data fits) as well as objective criteria (fit metrics). In any case, the Rayleigh (most probable) model indicated a minimum SOC (YF) of 0.21 wt. % at 83 cm depth (xF), maximum rate of SOC change of 0.08 wt. %·cm−1 at 23 cm depth (xM), and an average rate of SOC change (AI) of 0.03 wt. %·cm−1 over a 78.4 cm depth range (xD) (Table 8). Since c = 2 for Rayleigh, its derivative function for SOC (Fig. 7b) was moderately right skewed (SF = 0.6311) and slightly more tailed (KF = 0.2451) than a normal distribution (Table 2).
3.6. Soil water desorption curve
The soil water desorption curve, θ(h), describes the decrease in soil volumetric water content, θ [L3·L−3], with increasing soil water tension head, h [L]. It is a fundamental hydraulic property that underpins and regulates virtually all mechanistic descriptions of transmission and storage of water and gases in soils and other geologic porous media. Accurate representations of the θ(h) curve are consequently essential for valid and realistic characterizations of a wide range of agriculturally relevant processes such as infiltration, drainage, irrigation scheduling, tillage, trafficking, and depth profiles of root zone temperature, aeration, and moisture. Due to the extreme complexity of soil–water dynamics, θ(h) is typically represented by one of several empirical or semi-empirical equations; but to the authors’ knowledge, Weibull–Mitscherlich–Rayleigh expressions have never been applied. Figure 8 gives an example of Weibull, Mitscherlich, and Rayleigh model fits to a θ(h) curve from an agricultural sandy loam soil, with (x0,Y0) = (hS,θS) = (0.1,0.52), where θS is measured water content at saturation (0.52 m3·m−3), and hS is estimated tension head at saturation (0.1 cm). Fitting parameters included YF and k for Mitscherlich and Rayleigh, and YF, k, and c for Weibull. The resulting parameter values and fit metrics appear in Tables 3 and 9.
As is common for soil, the θ vs. h data produced a sigmoid, diminishing response curve (Fig. 8a, h on log10 scale), whereas −dθ/dh vs. h produced a concave curve (Fig. 8b, both θ and h on log10 scales). Although all three models could produce the same basic shapes as the data, it is clear that only Weibull was flexible enough (because of its variable c parameter) to achieve accurate and realistic model-data fits. Specifically, the Weibull line went through, or close to, every data point, whereas Mitscherlich and Rayleigh deviated systematically and substantially (Figs. 8a, 8b). Weibull consequently had much more favourable YiM vs. YiP confidence limits (Table 3) and fit metrics (Table 9) than Mitscherlich and Rayleigh; and as a result, PW was 100% and both FW-M and FW-R were <0.001 (Table 9). Hence, Weibull was highly probable and produced accurate fits to the θ(h) and −dθ/dh data, whereas Mitscherlich and Rayleigh were both improbable and inaccurate. The Weibull fit indicated a minimum water content (YF) of 0.12 m3·m−3 at a tension head of 17 558 cm (xF), a maximum desorption rate (RM) of 2.23·cm−1 at 0.1 cm tension head (xM), and an integral average desorption rate (AI) of 2.28 × 10−5·cm−1 over a 17 557.9 cm desorption range (xD) (Table 9). Since c = 0.4720 was obtained (Table 9), the desorption curve was extremely right skewed (SF = 7.5) and extremely leptokurtic (KF = 113.1) (Tables 2 and 8).
Given its apparent success, Weibull was compared with two of the most popular and versatile functions for describing the θ(h) relationship — the “van Genuchten” equations (van Genuchten 1980) and the “Groenevelt-Grant” equations (Groenevelt and Grant 2004). The empirical van Genuchten θ(h) and dθ/dh equations are given by (van Genuchten 1980)
where α [L−1] and n [—] are empirical curve-fitting parameters, m = 1 − (1/n) [—], θR [L3·L−3] is residual soil water content (treated as a fitting parameter), and θS [L3·L−3] is saturated soil water content (usually specified or measured). The empirical Groenevelt–Grant θ(h) and dθ/dh equations can be written as (Groenevelt and Grant 2004)where k0 [L], k1 [L3·L−3], and p [—] are empirical curve-fitting parameters, and (hA,θA) is a specified water content-tension head coordinate. Equations 22.1 and 23.1 were fitted to the θ(h) data in Fig. 8a using the protocol in Section 2.5.1., with θS = θA = 0.52 m3·m−3, and hA = 0.1 cm. It should also be noted that eqs. 22.2 and 23.2 are traditionally referred to as the “differential water capacity” or “specific moisture capacity” relationship.The van Genuchten and Groenevelt–Grant functions produced nearly identical visual fits to θ(h) and −dθ/dh (Figs. 9a, 9b), as well as similar fit metrics (Tables 3 and 10). The fits were not as good as Weibull; however, as their fit metrics and confidence limits for YiM vs. YiP were less favourable (Tables 3 and 10), and the models did not “track” the wet and dry ends of the −dθ/dh vs. h relationship as well as Weibull (Fig. 9b). As a result, Weibull was by far the most probable of the three models with PW = 99.72%, whereas van Genuchten and Groenevelt–Grant were PvG = 0.04% and PG-G = 0.24%, respectively (Table 10). Better tracking of −dθ/dh vs. h by Weibull (at least for this example) is particularly interesting, as it implies potentially better representations of water transmission and storage processes than the other two models, as well as more accurate estimates of the differential water capacity term (dθ/dh) in the physically based Richards equation for soil moisture flow (Richards 1931).
4. Discussion
Although the above agricultural applications are limited in scope, they nonetheless provide a good demonstration of the potential and utility of the adapted Weibull–Mitscherlich–Rayleigh family of functions. In every case, at least one of the three models provided a highly plausible model-data fit (Figs. 3–9) with excellent fit metrics (RA2 ≥ 0.9266, |MBEN| ≤ 1.49%, SERN ≤ 8.08%; Tables 4–10), and 95% confidence intervals that were narrow and near-symmetric about the YiM vs. YiP regression line (Table 3). In addition, the partial F test (FP) showed when Weibull produced a significantly better fit (P < 0.05) than the simpler Mitscherlich and Rayleigh functions; and the normalized suitability metric (PX) provided clear probability rankings among the fitted models (Tables 4–10). However, as has been noted by others (e.g., Archontoulis and Miguez 2015), there is currently no single metric (or suite of metrics) for definitive determination of the “best fitting”, “most appropriate” or “most probable” model. It also needs to be remembered that PX is a relative measure only, and therefore, cannot determine the best model in an absolute sense. In other words, some other model (e.g., Gompertz, Logistic, Gumbel, etc.) may provide better representations of the datasets than the Weibull–Mitscherlich–Rayleigh family. At present, estimating the best absolute representation of a dataset requires fitting a wide range of diverse (but plausible) models, and then applying the PX metric as above (e.g., Banks and Joyner 2017).
Table 4.
Selected parameter values and associated metrics for Weibull, Mitscherlich, and Rayleigh models fitted to corn emergence data (Figs. 3a, 3b).
Table 5.
Selected parameter values and associated metrics for Weibull, Mitscherlich, and Rayleigh models fitted to nitrous oxide emissions from a clay loam soil (Figs. 4a, 4b).
Table 6.
Parameter values and associated metrics for Weibull, Mitscherlich, and Rayleigh models fitted to decline in grain yield of unfertilized monoculture corn after a one-time application of compost (Figs. 5a, 5b).
Table 7.
Parameter values and associated metrics for Weibull, Mitscherlich, and Rayleigh models fitted to laboratory-incubated nitrogen mineralization data (Figs. 6a, 6b).
Table 8.
Parameter values and associated metrics for Weibull, Mitscherlich, and Rayleigh models fitted to soil organic carbon content (SOC) profile data (Figs. 7a, 7b).
Table 9.
Parameter values and associated metrics for Weibull, Mitscherlich, and Rayleigh models fitted to soil water desorption curve, θ(h), data (Figs. 8a, 8b).
Table 10.
Parameter values and associated metrics for the Weibull, van Genuchten, and Groenevelt–Grant models fitted to soil water desorption curve, θ(h), data (Figs. 9a, 9b).
Of the six example datasets, Mitscherlich was deemed most suitable for change in corn grain yield (Fig. 5, PM = 87.2%), Rayleigh was best for SOC distribution (Fig. 7, PR = 96.4%), and Weibull was best for corn seedling emergence (Fig. 3, PW = 100%), nitrous oxide emissions (Fig. 4, PW = 100%), nitrogen mineralization (Fig. 6, PW = 58.4%), and soil water desorption (Fig. 9, PW = 100% and Fig. 10, PW = 99.7%). The greater success rate for Weibull is perhaps not surprising, as it has one more fitting parameter (c) and thereby much greater flexibility. However, goodness of fit and parsimony metrics may not necessarily be the only criteria for deciding model suitability. Perhaps consideration should also be given to how well the fitted model tracks the data derivative function (dY/dx), or to the model that yields more physically realistic end points for the derivative function. For example, the derivative function for SOC distribution (Fig. 7b) was not tracked as well by the best-fit Rayleigh model (PR = 96.4%) as it was by the second-best Weibull model (PW = 3.5%). Also, zero initial nitrogen mineralization rate indicated by the best-fit Weibull model (PW = 58.4%, Fig. 6b) may not be as physically realistic as maximum initial mineralization rate indicated by second-best Mitscherlich (PM = 41.6%, Fig. 6b). It may even be appropriate in some cases to select a model primarily on its ability to track the data derivative function, or to provide physically plausible derivative function end points, with goodness of fit and parsimony metrics reduced to secondary consideration. This possibility warrants further investigation.
The results of this study suggest that the time/space/quantity behaviour of a wide range of agricultural processes is amenable to characterization and parameterization using the Weibull–Mitscherlich–Rayleigh family of functions. Some likely (or already partially demonstrated) candidates include seed germination and seedling emergence versus time or thermal time (e.g., Fig. 3); greenhouse gas production and emissions from soil versus time, soil moisture, or fertilizer source/rate/timing/placement (e.g., Fig. 4); crop yield and biomass production versus time, soil fertility, rainfall, temperature, or soil biophysical condition (e.g., Fig. 5); nitrogen mineralization dynamics versus time, temperature, or substrate composition (e.g., Fig. 6); SOC sequestration versus time, management practice, or depth (e.g., Fig. 7); transmission and storage of soil air and water (e.g., Fig. 8); and soil microbial population dynamics versus time or depth below surface. Further investigation of the usability of Weibull–Mitscherlich–Rayleigh functions for characterizing agricultural data and processes appears well justified.
5. Conclusions
The Weibull–Mitscherlich–Rayleigh family of functions was adapted to increase their utility for characterizing agricultural processes (Figs. 1 and 2; Tables 1 and 2). Using non-linear least-squares curve fitting, goodness of model-data fit metrics, and model selection metrics, it was demonstrated that the adapted functions were capable of producing accurate and unbiased fits to a wide range of agricultural data including corn seedling emergence, nitrous oxide emissions, change in corn grain yield, nitrogen mineralization, SOC distribution, and soil water desorption (Figs. 3–9). The fitted functions in turn provided several parameters (e.g., Y0, YF, x0, k, and c) and metrics (e.g., xF, xD, xM, RM, AI, SF, and KF) that were useful or potentially useful for characterizing agricultural data (Appendix A, Tables 1 and 2). It was therefore concluded that the adapted Weibull–Mitscherlich–Rayleigh family of functions has substantial potential for informative application to a wide range of agricultural data and processes.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as an actual or potential conflict of interest.
Acknowledgements
We gratefully acknowledge W. Calder, J. Gignac, J. Huffman, M. Reeb, and various summer students for data collection; Essex-Windsor Solid Waste Authority, Windsor, Ontario for provision of yard waste compost (Section 3.3.); and the AAFC-Harrow farm crew for ongoing operation and maintenance of the field sites.
References
Appendices
Appendix A
Potentially useful metrics of the adapted Weibull derivative and cumulative functions are derived below, illustrated in Fig. 2, and summarized in Table 1.
A1. .
Weibull derivative function: mode, median, skewness, kurtosis, and integral average
The derivative function mode gives the coordinates, , and value of the derivative function peak, (Fig. 2a). The mode is obtained by setting and :
then solving for :and then back substituting into eqs. 1 and 4 to producewhereInterestingly, YM (eq. A4) is independent of x and k. Note also that when c = 1, occurs at ; and when 0 < c < 1, as (see e.g., Fig. 1a).
The median bisects the area under the derivative function curve into two equal halves (Fig. 2a); and is obtained by setting , then equating a normalized form of eq. 4 to the 50th percentile, i.e.,
then solving for :and then back-substituting into eq. 1 to obtain:whereAs might be expected, the median is less than the mode when the derivative function is left skewed, equal to the mode when the derivative function is symmetrical, and greater than the mode when the derivative function is right skewed (Fig. 2a).
The Fisher population skewness (SF) and kurtosis (KF) of the derivative function are given by (e.g., Chu 2013):
andwhere is the Gamma distribution (available as a built-in function in most computer spreadsheets — given as GAMMA(x) in Excel®). The SF is a measure of function symmetry. Negative skewness indicates left skew (excess of small x-values relative to a normal distribution), positive skewness indicates right skew (excess of large x-values relative to a normal distribution), and zero skewness indicates a symmetrical (normal) distribution (National Institute of Standards and Technology 2013). The KF, in contrast, is a measure of the degree of function tailing relative to a normal distribution. Positive kurtosis (leptokurtic derivative function) indicates more/heavier tailing (more small and large “outlier” x-values) than a normal distribution, and negative kurtosis (platykurtic derivative function) indicates less/lighter tailing (fewer small and large outlier x-values) than a normal distribution (National Institute of Standards and Technology 2013). The kurtosis of a normal distribution is zero when KF is defined as in eq. A10. Table 2 gives proposed SF and KF categories based on the inclusive graphic classifications of Folk (1980). As an example, the derivative function in Fig. 2a is moderately right skewed (SF = 0.9620) and slightly leptokurtic (KF = 1.0440) — indicating that it has both a moderate excess of large x-values and slightly more outlier x-values than the derivative function of a normal distribution.
The integral average of the Weibull derivative function is obtained using the fundamental theorem of integral calculus, i.e.,
which readily simplifies tosince:The integral average of the derivative function describes a rectangle of length and height which has the same area as the area under the derivative function curve between and (Fig. 2a). The value can consequently be viewed as a mean rate of change of Y with respect to x (i.e., mean dY/dx) between and .
A2. .
Weibull cumulative function: mean, inflection, quantiles, and domain
The mean, is given by (Lai and Xie 2006; NCSS, Weibull 2020):
where is the Gamma function (Fig. 2b). The value is an average x of the cumulative function weighted according to the corresponding derivative function (e.g., Glen 2020c), i.e.,The lies to the left of the modal x ( for negative-skewed (left-tailed) derivative functions and to the right of for positive-skewed (right-tailed) derivative functions (Fig. 2a). Note also that although can be back-substituted into the derivative function (eq. 1), the resulting value applies only at and is, therefore, not the same as (Fig. 2a).
The inflection, , is derived in the same fashion as the derivative function mode, leading to
and hence is identical to . The inflection locates the point where the cumulative function changes from convex (upward bending) to concave (downward bending) (Fig. 2b), thus demarking the maximum slope of Y vs. x (i.e., maximum dY/dx), and thereby the peak (mode) of the corresponding derivative function (Fig. 2a).The quantiles, are specified points on the cumulative function that represent fractions of the function maximum, e.g., 0.2|YF-Y0|, 0.6|YF-Y0|, etc. Quantiles are obtained by setting , then equating a normalized form of the cumulative function to the Qth quantile, i.e.,
then solving for :and then back-substituting into eq. 4:Therefore, (for example) is given by
and hence, the 0.5 quantile (Fig. 2b) also locates the median (Fig. 2a). Note as well that time, t, Y(t), and rate (dY/dt) at the 0.5 quantile are sometimes used as diagnostic metrics in seed germination, seedling emergence, and mineralization studies (e.g., Gardarin et al. 2011; Gan et al. 1996; Yang et al. 2020). That is, 0.5 quantile time is given by (eq. A23); germination, emergence, or mineralization rate at is given byand cumulative germination, emergence, or mineralization at is given by (eq. A21).
The domain, xD, of our adapted Weibull function is defined here as
where is the initial (minimum) x-value, and is the final (end-point) x-value (Fig. 2b). The corresponding Weibull function range, YR, is therefore YR = YF − Y0, where YF = Y(xF) and Y0 = Y(x0). It is clear from eq. 4 and Fig. 2b that the adapted Weibull cumulative function is finite and defined at but asymptotic to the x-axis at (i.e., x → ∞ as ). Hence, is readily obtained, but must be estimated. We estimate here by equating it to the 0.9999 quantile in eq. A20, i.e., which is the x-value that corresponds toorOur estimated consequently approximates to ≥99.99 % of (Fig. 2b). Note also that the derivative function departs from zero at and returns to zero at (Fig. 2a).