1 Introduction

Transport sector plays a significant role in the overall development of a nation’s economy. Road transport is the prime motorized mode of transport linking the remote and hilly areas with rest of the country. The State Transport Undertakings (STUs), controlled by the respective state government, are the imperative mode of passenger mobility in public road transport sector.

Since STUs are public utility service with a social objective, it is essential to regularly monitor their performance, specifically with a view to identifying appropriate measures including proper investment and pricing policy and to improve their output efficiency. In public transport sector, efficiency measurement is the first step in the evaluation of individual performance of STUs. This study is an attempt in this direction to assess the relative technical efficiency of STUs in India.

Passenger road transportation is a “service business” and evaluating the efficiency of a service business is a complex matter. Transport efficiency is often more difficult to evaluate than manufacturing business efficiency, because it is difficult to determine the efficient amount of resources required to produce various service outputs. The manufacturing standard can be used to identify operating inefficiencies through classical cost accounting variance analyses. However, in service organization like road passenger transportation system, it is difficult to identify the specific resources required to provide a specific service output.

The purpose of this article is to evaluate the performance of STUs by providing them with a mathematical technique to analyse the efficiency with which service is rendered. The paper attempts to estimate technical efficiency of the STUs, sets benchmark for inefficient STUs, and suggests alternative actions that would make them relatively efficient. The paper is organized as follows: in Section 2 methodology is given. Empirical Results and discussions are given in Section 3, followed by conclusions in the last.

2 Methodology

This paper measures the technical efficiencies of the STUs. The technical efficiency refers to the extent to which a STU can produce maximum output from its chosen combination of factor inputs.

The mathematical relationship between inputs and outputs in transport sector is not known clearly, so STU efficiency is operationalized using Data Envelopment Analysis (DEA). It is a non-parametric linear programming model that estimates the magnitude of departure from efficiency frontiers for each STU. The DEA model is used to measure the OTE. The DEA is initially proposed by Charnes, Cooper and Rhodes [2]. DEA measures the relative technical efficiency of a group of decision-making units (DMUs) by simultaneously evaluating multiple inputs and outputs common to each unit; each DMU is thus assigned an efficiency score. The DEA model is a family of fractional linear programs; each linear program measures the relative efficiency of a particular DMU. Even though the modeling is nonlinear but under appropriate transformations the efficiency rating can be derived from an equivalent linear program (Charnes and Cooper [4]).

DEA is chosen over other methods because

  • ➢ It handles multiple inputs and multiple outputs;

  • ➢ It does not require a prior weights (as in index numbers);

  • ➢ It emphasizes individual observations rather than statistical estimates (as in regression analysis);

  • ➢ It is a dynamic analytical decision-making tool that not only provides a “snapshot” of the current efficiency of the DMU compared with the group, but also indicates possibilities for improving relative efficiency;

  • ➢ It uses benchmarking approach to measure STU efficiency relative to others in their group.

  • ➢ It can assist in identifying best-practice or efficient STUs and inefficient STUs within the group.

  • ➢ The DEA results can allow policy makers to develop policies that can assist the relatively inefficient STUs to improve their performance.

2.1 Algorithm

  • First Step: Selection of the Homogeneous DMUs

    We measure the OTE of 35 STUs using data from CIRT [5] for the year 2004–05. A list of these 35 selected STUs is given in the Appendix A.1.

  • Second Step: Selection of Input and Output Variables

    To evaluate the relative efficiency of the STUs, four inputs, viz., Fleet size (FS), Total Staff (TS), Fuel consumption (FC) and Accident per lakh kilometers (APLK) and three outputs, namely, Bus Utilisation (BU), Passenger kilometers (Pass-Kms) and Load Factor (LF) are considered.

2.2 Inputs

  1. 1.

    Fleet Size (number of buses in hundred) comprises the average number of buses on-road in a STU; it is representative of the capital input.

  2. 2.

    Total Staff (numbers in thousand) refers to the total number of employees worked in a STU; it is representative of the labour input.

  3. 3.

    Fuel Consumption refers to the fuel consumed (in ten thousand kilolitres) which is measured by dividing total earned kilometer by fuel average; it is representative of the material input.

  4. 4.

    Accident per lakh kilometers is important parameter of safety in bus operation.

2.3 Outputs

  1. 1.

    Bus Utilisation (in kilometers) is defined as kilometers done per bus on road per day. It is calculated from dividing total effective kilometers done on a day by total buses on road on that day.

  2. 2.

    Passenger-kilometers (in Billions) is a measure of service utilization which represents the cumulative sum of the distances ridden by each passenger. It is normally calculated by summation of the passenger load times the distance between individual bus stops.

  3. 3.

    Load Factor is the percent of the ratio of passengers actually carried versus the total passenger seating capacity.

The details of the observed data for the selected STUs of the input and output variables are shown in Table 1. There is a perceptible variation in the inputs and the outputs across STUs. All the inputs used are in some cases hundred times larger than that used by other STU. The variations in outputs produced are not so high except Pass-Kms.

Table 1 Observed data of the sample STUs in India (2004–05)
  • Third Step: Fourth Step: Selection of the model

    In this study, CCR input-oriented model has been employed, i.e., how much resources can be reduced without changing the outputs produced to make STUs efficient (Charnes et al. [4]). In order to decompose the overall technical efficiency (OTE) into pure technical efficiency (PTE) and scale efficiency (SE), BCC input-oriented model is also applied to the data. Descriptive statistics of the results are given in Tables 2 and 5.

    Table 2 Resulting efficiency scores of STUs by DEA model
  • Fourth Step: Calculating the overall technical efficiency (OTE) of STU

    To describe DEA efficiency evaluation, assume that the performance of the homogeneous set of n decision making units (DMU j ; j = 1…n) be measured by DEA. The performance of DMU j is characterized by a production process of m inputs (x ij ; i = 1…m) to yield s outputs (y rj ; r = 1… s). According to Charnes et al. [2], the ratio of the virtual output to the virtual input of any DMU k is to be maximized with the condition that the ratio of virtual output to virtual input of every DMU should be less than or equal to unity.

    Mathematically,

    $$ {\hbox{Max}}\,{{\hbox{E}}_k} = \frac{{\sum\limits_{{r = 1}}^s {{u_{{rk}}}{y_{{rk}}}} }}{{\sum\limits_{{i = 1}}^m {{v_{{ik}}}{x_{{ik}}}} }} $$

    subject to

    $$ \begin{array}{*{20}{c}} {\frac{{\sum\limits_{{r = 1}}^s {{u_{{rk}}}{y_{{rj}}}} }}{{\sum\limits_{{i = 1}}^m {{v_{{ik}}}{x_{{ij}}}} }} \leqslant 1\,\forall {\hbox{j = 1,2,}} \ldots {\hbox{,n}}} \hfill \\{\frac{{{u_{{rk}}}}}{{\sum\limits_{{i = 1}}^m {{v_{{ik}}}{x_{{ik}}}} }} \geqslant \,\varepsilon \,\forall \,{\hbox{r = 1,2,}} \ldots {\hbox{,s}}} \hfill \\{\frac{{{v_{{ik}}}}}{{\sum\limits_{{i = 1}}^m {{v_{{ik}}}{x_{{ik}}}} }} \geqslant \,\varepsilon \,\forall {\hbox{i = 1,2,}} \ldots {\hbox{,m}}} \hfill \\\end{array} $$
    (1)

    where y rk is the amount of the rth output produced by the kth DMU; x ik is the amount of the ith input used by the kth DMU; u rk is the weight given to the rthoutput of the kth DMU; v ik is the weight given to the ith input of the kth DMU; n is the no. of DMUs ; s is the no. of outputs; m is the no. of inputs and ε is a non-Archimedean (infinitesimal) constant.

    The model (1) is popularly known as the classical CCR ratio model named after Charnes, Cooper and Rhodes. The theory of fractional linear programming [3] makes it possible to replace (1) with an equivalent linear programming problem by imposing the condition \( \sum\limits_{{i = 1}}^m {{v_{{ik}}}{x_{{ik}}}} = 1 \) which provides:

    $$ {\hbox{Max}}\,{{\hbox{w}}_k} = \sum\limits_{{r = 1}}^s {{u_{{rk}}}{y_{{rk}}}} $$

    subject to

    $$ \begin{array}{*{20}{c}} {\sum\limits_{{i = 1}}^m {{v_{{ik}}}{x_{{ik}}}} = 1} \hfill & {} \hfill \\{\sum\limits_{{r = 1}}^s {{u_{{rk}}}{y_{{rj}}}} - \sum\limits_{{i = 1}}^m {{v_{{ik}}}{x_{{ij}}}} \, \leqslant \,0} \hfill & {\forall j = 1,2,...,n} \hfill \\{{u_{{rk}}}\, \geqslant \,\,\varepsilon } \hfill & {\forall r = 1,2,...,s} \hfill \\{{v_{{ik}}}\,\, \geqslant \,\,\varepsilon } \hfill & {\forall i = 1,2,...,m} \hfill \\\end{array} $$
    (2)

    Model (2) is interpreted that the objective is to maximize virtual output of DMU k subject to unit virtual input of DMU k while maintaining the condition that virtual output cannot exceed virtual input for every DMU. This is known as CCR multiplier model whose dual LPP is

    $$ {\hbox{Min}}\,{z_k} = {\theta_k} - \varepsilon \sum\limits_{{r = 1}}^s {S_{{rk}}^{ + } - \varepsilon \sum\limits_{{i = 1}}^m {S_{{ik}}^{ - }} } $$

    subject to

    $$ \begin{array}{*{20}{c}} {\sum\limits_{{j = 1}}^n {{\lambda_{{jk}}}{y_{{rj}}} - S_{{rk}}^{ + } = {y_{{rk}}}} } \hfill & {\forall r = 1 \ldots \ldots \ldots .s} \hfill \\{\sum\limits_{{j = 1}}^n {{\lambda_{{jk}}}{x_{{ij}}} + S_{{ik}}^{ - } = {\theta_k}{x_{{ik}}}} } \hfill & {\forall i = 1 \ldots \ldots \ldots .m} \hfill \\{{\lambda_{{jk}}} \geqslant 0} \hfill & {\forall j = 1 \ldots \ldots \ldots .n} \hfill \\{{\theta_k}is\,unrestricted\,in\,sign} \hfill & {} \hfill \\{S_{{rk}}^{ + },S_{{ik}}^{ - } \geqslant 0;r = 1 \ldots ..s,i = 1 \ldots \ldots m} \hfill & {} \hfill \\\end{array} $$
    (3)

    where \( S_{{rk}}^{ + } \) is slack in the rth output of the kth DMU; \( S_{{ik}}^{ - } \) is slack in the ith input of the kth DMU; \( {\lambda_{{jk\,}}}\,\prime s \) are non negative dual variables and θ k (scalar) is the (proportional) reduction applied to all inputs of DMU k to improve efficiency. This reduction is applied simultaneously to all inputs and results in a radial movement towards the envelopment surface. This is popularly known as CCR envelopment model.

    The interpretation of the results of the envelopment model (3) can be summarized as:

    • The kth STU is Pareto efficient if

      1. (a)

        \( \theta_k^{*} = 1 \)

      2. (b)

        All slacks are zero, i.e., \( S_{{rk}}^{{ + *}} \) and \( S_{{ik}}^{{ - *}} \) = 0 for every r and i.

    The non-zero slacks and (or) \( \theta_k^{*} \leqslant 1 \) identify the sources and amount of any inefficiency that may exist in the DMU k . If the optimal value \( \lambda_{{jk}}^{*} \) of λ jk is non zero then jth DMU represents the reference set (peers) of the kth DMU.

  • Fifth Step: Calculate OTE of every sample STU. The detailed information of the results is given in Table 2

  • Sixth Step: Calculating the pure technical efficiency (PTE) and scale efficiency (SE) of STU: Another version of DEA is BCC model given by Banker, Charnes and Cooper [1]. The primary difference between BCC model and CCR model is the convexity constraint, which represents the returns to scale. Returns to scale reflects the extent to which a proportional increase in all inputs increases outputs. In the BCC model λ jk ’s are now restricted to \( \sum\limits_{{j = 1}}^n {{\lambda_{{jk}}}} = 1 \) which is known as convexity constraint. Technical efficiency assessed by BCC model is pure technical efficiency because it has net of any scale effect. The impact of scale-size on efficiency of a DMU is measured by scale efficiency.

    $$ \begin{array}{*{20}{c}} {Scale\,Efficiency\,of\,the\,{k^{{th}}}\,DMU = \frac{{Overall\,Technical\,Efficiency\,of\,the\,{k^{{th}}}\,DMU}}{{Pure\,Technical\,Efficiency\,of\,the\,{k^{{th}}}\,DMU}}} \\{ = \frac{{CCR\,Efficiency\,Score\,of\,the\,{k^{{th}}}\,DMU}}{{BCC\,Efficiency\,Score\,of\,the\,{k^{{th}}}\,DMU}}} \\\end{array} $$
  • The overall technical efficiency (OTE) of a DMU can never exceed its pure technical efficiency (PTE). All the three efficiencies (overall technical, pure technical and scale) are bounded by zero and one.

  • Seventh Step: Calculate PTE and SE of every sample STU. The detailed information of DEA results is given in Table 5.

3 Empirical results and discussions

Table 2 presents the information on OTE, reference set, peer weights and reference count (peer count) of the sample STUs for the year 2004–05. The DEA analysis evaluates the set of STUs which construct the production frontier. The STUs having values of the OTE score equal to 1.00 are form the efficient frontier and those having the values less than 1.00 are less efficient relative to the STUs on the frontier. The lower the efficiency score, the higher scope for the potential reduction in inputs (while maintaining the existing level of outputs) relative to the best practice STUs.

The results indicate that out of 35 STUs, 14 STUs (40%) are relatively efficient (efficiency score =1) while remaining 21 STUS are relatively inefficient (efficiency score <1). These fourteen efficient STUs are APSRTC (S1), RSRTC (S5), KnSRTC (S6), NWKnSRTC (S7), NSKnSRTC (S8), STHAR (S9), KUM-1 (S13), MDU (S15), SLM (S16), VPM-1 (S17), OSRTC (S23), TRPTC (S25), MZST (S26), and KMTU (S34). These STUs are on the best-practice frontier and thus form the “reference set”, i.e., these STUs can set an example of good operating practice for the remaining 21 inefficient STUs to emulate. HRTC (S24) is the most technical inefficient STU. Among the inefficient STUs, 7 STUs have the efficiency scores above the average efficiency scores.

The average of OTE scores works out to be 0.833. This reveals that an average STU can reduce its resources by 16.74% to obtain the existing level of outputs.

We use the frequency of efficient STUs in the reference set (i.e., peer count) to discriminate among them. The higher peer count represents the extent of robustness of that STU compared with other efficient STUs. In other words, a STU with higher peer count is likely to be a STU which is efficient with respect to a large number of factors and is probably a good example of a “global leader” or a STU with a high robustness. Efficient STUs that appear seldom in the reference set are likely to possess a very uncommon input/output mix so when the peer count is low, one can safely conclude that the STU is somewhat of an odd unit and cannot be treated as a good example to be followed. On the basis of robustness of efficiency scores, the STUs on the frontier are classified as:

  1. 1. High robustness:

    MZST (S26, peer count =13) and VPM-1 (S17, peer count =12) are high robust STU and can be considered as global leaders in terms of OTE.

  2. 2. Middle robustness:

    APSRTC (S1), KnSRTC (S6), SLM (S16), OSRTC (S23), TRPTC (S25) and KMTU (S34) are classified in the middle robust group.

  3. 3. Low robustness:

    RSRTC (S5), NWKnSRTC (S7), NSKnSRTC (S8), STHAR (S9), KUM-1 (S13) and MDU (S15) are graded in the low robust group in terms of OTE.

3.1 Input/Output targets for inefficient STUs

When a STU is inefficient, DEA allows to set the targets for its inputs and outputs so that it can improve its performance. Thus, each of the inefficient STU can become overall efficient by adjusting its operation to the associated target point determined by the efficient STUs that define its reference frontier. According to model, the targets of the inefficient STUs are as follows:

$$ \begin{array}{*{20}{c}} {{\hbox{For}}\,{\hbox{outputs:}}} \hfill & {} \hfill \\{} \hfill & {\overline {{y_{{rk}}}} = {y_{{rk}}} + S_{{rk}}^{{ + *}} = \sum\limits_{{j = 1}}^n {\lambda_{{jk}}^{*}{y_{{rj}}}} } \hfill \\{{\hbox{For}}\,{\hbox{inputs:}}} \hfill & {} \hfill \\{} \hfill & {\overline {{x_{{ik}}}} = \theta_k^{*}{x_{{ik}}} - S_{{ik}}^{{ - *}} = \sum\limits_{{j = 1}}^n {\lambda_{{jk}}^{*}{x_{{ij}}}} } \hfill \\\end{array} $$
(4)

where \( \overline {{y_{{rk}}}} \) (r = 1, 2, 3) and \( \overline {{x_{{ik}}}} \) (i = 1, 2, 3, 4) are the target outputs and inputs respectively for the kth STU; y rk and x ik are the actual outputs and inputs respectively of the kth STU; \( \theta_k^{*} \) = optimal efficiency score of the kth STU; \( S_{{ik}}^{{ - *\,}} \) is the optimal input slack of the kth STU for i =1…4; and \( S_{{rk}}^{{ + *\,}} \) is the optimal output slack of the kth STU for r = 1…3. The optimal input and output slacks for every inefficient STU are shown in Table 3.

Table 3 Slacks in inputs/outputs

Table 4 presents the target values of all inputs and outputs for inefficient STUs along with percentage reduction in inputs and percentage expansion in outputs. It can be observed from the table that an average STU has a significant scope to reduce the inputs and expand the outputs, relative to the best practice STU. A perusal of the Table, it can be observed that on average, approximately 30% of FS, 37.75% of TS, 23.93 of FC, 41.24% of APLK can be reduced and 19.34% of BU, 26.14% of LF can be expanded if all the inefficient STUs operate at the level of efficient STUs. The results reveal that in order to become efficient, the worst inefficient STU, i.e., HRTC (S24, TE score = 49.94%), can reduce its FS by 51.18%, TS, FC, APLK by 50.06%, and expand LF by 11.09% relative to the best practice STU.

Table 4 Target values of input and output variables under CCR input model

3.2 Pure technical efficiency

CCR model is based on the assumption of constant returns to scale (CRS) which does not consider the scale-size of STU to be relevant in assessing technical efficiency. Therefore, in order to know whether inefficiency in any STU is due to inefficient production operation or due to unfavorable conditions displayed by the size of STU, BCC input model is also applied.

BCC efficiency (PTE) is always greater or equal to CCR efficiency (OTE). Hence, number of STUs on the frontier under BCC model is always greater or equal to the number of STUs on the frontier under CCR model.

Table 5 provides details about DEA results drawn from this model. It is evident from the Table that out of 35 STUs, 18 STUs are pure technical efficient (BCC score =1), i.e., none of these have scope to further reduce inputs (maintaining same output level) while remaining 17 STUs are relatively inefficient (score <1). PTE measures how efficiently inputs are converted into output(s) irrespective of the size of the STUs. The average of pure technical efficiency is worked out to be 0.875; this means that given the scale of operation, on average, STUs can reduce its inputs by 12.5% of its observed level without detriment to its output levels.

Table 5 OTE, PTE and SE

Pure technical efficiency is concerned with the efficiency in converting inputs to outputs for the given the scale-size of STUs, whereby we observe that S11, S19, S31 and S33 are CCR technical inefficient but pure technical efficient. This clearly evinces that these STUs are able to convert its inputs into outputs with 100% efficiency, but their overall efficiency (OTE) is low due to their scale-size disadvantageous (low scale efficiency).

3.2.1 Scale efficiency (SE)

A comparison of the results for CCR and BCC gives an assessment of whether the size of a STU has an influence on its OTE. Scale efficiency (SE) is the ratio of OTE to PTE scores. If the value of SE score is one, then the STU is apparently operating at optimal scale. If the value is less than one, then the STU appears either small or big relative to its optimum scale-size. Table 5 represents the SE score of the STUs at fourth column. Results show that out of 35 STUs, 15 STUs are scale efficient while remaining 20 STUs are scale inefficient. The average of scale efficiency is 0.953. It indicates that an average STU may be able to decrease its inputs by 4.7% beyond its best practice targets under variable returns to scale, if it were to operate at constant returns to scale.

4 Conclusions and policy implementations

This paper measures technical efficiency (OTE) of 35 STUs in India through DEA methodology. The study finds that 14 STUs have the maximum degree of efficiency. The overall mean TE of the STUs is 83.26%, indicating that on average 16.74% of the technical potential of the STUs is not in use. This implies that these STUs have the scope of producing the same output with the inputs 16.74% lesser than their existing level. The most efficient STUs are MZST and VPM-1 while HRTC is the most inefficient STU.

The targets setting results show that all the inputs have the significant scope to reduce. The model suggests that on average, non-frontier STUs may be able to reduce Fleet Size by 30%, Total Staff by 37.75%, Fuel Consumption by 2.93%, APKL by 41.24%, and to expand BU by 19.34%, LF by 26.14%, relative to the best practice STUs.

The results of BCC model show that out of 35 STUs, 18 STUs are pure technical efficient as they efficiently convert their inputs into the output. However, 4 STUs of them are technical inefficient due to scale-size effect. S31 has the least scale efficiency score (51.9%), implying that S31 has the maximum effect of scale-size on its efficiency score.