Skip to main content

Diagnostic value of the appendicitis inflammatory response (AIR) score. A systematic review and meta-analysis

Abstract

Background

Clinical scoring algorithms are cost efficient in patients with suspicion of acute appendicitis. This is a systematic review and meta-analysis of the diagnostic properties of the Appendicitis Inflammatory Response (AIR) score compared with the Alvarado score.

Methods

The PubMed, EMBASE, Web of Science and Google Scholar databases were searched for reports on the diagnostic properties of the AIR score from 2008 to July 18, 2024. A meta-analysis of the receiver operating characteristic (ROC) area and the sensitivity and specificity for all and advanced appendicitis patients was performed. Advanced appendicitis was defined as perforated or gangrenous appendicitis or appendicitis abscess or phlegmon or if described as complicated appendicitis. The risk of bias was estimated via the QUADAS-2 tool. The ROC areas of the AIR score and the Alvarado score were compared.

Results

A total of 26 reports with a total of 15.699 patients were included. The area under the ROC curve for the AIR score was 0.86 (95% CI 0.83–0.88) for all patients with appendicitis and 0.93 (CI 0.91–0.96) for those with advanced appendicitis, which was greater than the corresponding areas for the Alvarado score (0.79, CI 0.76; 0.81) and 0.88, CI 0.82; 0.95), respectively.

At > 4 points, the sensitivity was 0.91 (CI 0.88; 0.94) for all patients with appendicitis and 0.95 (CI 0.94; 0.97) for those with advanced appendicitis. At > 3 points, the sensitivity was 0.95 (0.90; 0.97) for all patients with appendicitis and 0.99 (0.97; 0.99) for those with advanced appendicitis.

At > 8 points, the specificity was 0.98 (0.97; 0.99) for all patients with appendicitis and 0.99 (0.97; 0.99) for those with advanced appendicitis. The included studies had a low risk for bias and low heterogeneity.

Conclusion

The AIR score has a better diagnostic capacity than the Alvarado score does. The AIR score is a safe and efficient basis for risk-stratified management of patients suspected of having appendicitis.

Background

Acute appendicitis is one of the most important differential diagnoses in patients presenting with acute abdominal pain at emergency departments. Patients with acute abdominal pain and suspicion of appendicitis can present with a wide spectrum of symptoms and signs and severity of the condition. The diagnosis is challenging, and the diagnostic process, especially the use of routine diagnostic imaging, is controversial [1, 2]

Initial management is usually based on the clinical presentation and basic laboratory tests. In the triage, patients with advanced appendicitis need to be recognised and given immediate attention, whereas patients with a presentation suggesting noncomplicated appendicitis, which may be self-limiting, are not an immediate medical emergency [3].

Initial management can be facilitated and made more efficient and safer by the use of clinical scoring systems based on symptoms, signs and simple inflammatory markers, which are part of the routine primary workup. The scoring system can define strata with high, indeterminate and low risk of appendicitis. These three zones can be the basis for an optimal and structured risk stratified pathway. This can involve immediate resuscitation and early diagnostic laparoscopy for patients in the high-risk zone, selective imaging or repeat scoring after observation for the indeterminate zone or early discharge with planned follow-up for the low-risk zone. Such risk-stratified management is safe and more cost efficient than unselective and routine use of imaging [4, 5].

Diagnostic tests are usually analysed from a binary point of view, often trying to define the point with an optimal balance between sensitivity and specificity. Indeterminate results are either not included in the analysis or counted as negative. This can lead to a loss of diagnostic information and bias. The use of three test zones, with one zone with high sensitivity and another with high specificity and an indeterminate zone in between, has been proposed as a solution. This model is closer to clinical reality [6, 7].

Many simple and user-friendly scoring systems have been presented, but few meta-analyses exist [8,9,10,11]. This report is a systematic review and meta-analysis of the diagnostic properties of the Appendicitis Inflammatory Response (AIR) score (Table 1), which is the second most cited appendicitis scoring system next to the Alvarado score [12]. The results are analysed and presented from the perspective of three risk zones.

Table 1 Appendicitis inflammatory response (AIR) score

Methods

We followed the recommendations of the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) 2020 statement [13].

Literature search

The PubMed, EMBASE and Web of Science databases were searched for relevant reports, using the search terms (("AIR score" OR "AIRscore" OR "inflammatory response score" OR "appendicitis inflammatory response") AND “appendicitis”). The search started in 2008, the year when the AIR score was published, and stopped on July 18th, 2024. More elaborate search profiles gave the same results as this simple profile. We also included all reports that had a reference to the original publication of the AIR score, according to Google Scholar. Additionally, a direct search was performed on the basis of the authors’ personal knowledge of the literature. After removal of duplicates, the two authors independently performed title and abstract screening as well as full-text screening. Disagreements were discussed and resolved between the authors.

Inclusion and exclusion criteria

Inclusion We included studies reporting the outcome of patients assessed for suspicion of appendicitis in strata according to the AIR-score riskzones, with high (> 8 points) and low risk (≤ 4, ≤ 3 or ≤ 2 points). Some of the included studies reported results from only one of these cut-off points.

Exclusion Reports that used very wide criteria for histopathological diagnosis were excluded. The risk scoring strategy is intended for unselected patients with suspicion of appendicitis, typically having a prevalence of appendicitis of approximately 30%. Given that a high prevalence of disease may have an impact on sensitivity and specificity [14], we excluded reports with a greater than 90% prevalence of appendicitis. Case reports, duplicates and reports that did not present results that could be extracted or used for the statistical analyses were excluded. The selection process is shown in the PRISMA flowchart in Fig. 1.

Fig. 1
figure 1

The PRISMA flowchart for the study

Data extraction

We extracted relevant information such as study identification (i.e., author and year of publication), study characteristics (i.e., study design, age of the participants, and study period), outcomes (appendicitis and not appendicitis), and use of a reference standard (PAD or follow-up). We noted the number of outcomes according to the low and high diagnostic thresholds of the AIR score (i.e., true/false and positive/negative cases). If not reported, the numbers were derived from the reported statistics. If available, we extracted the corresponding numbers separately for advanced appendicitis. Advanced appendicitis was defined as perforated or gangrenous appendicitis or appendicitis abscess or phlegmon or as complicated appendicitis. The histopathological diagnosis of a normal appendix was made by the authors of the original studies. Two investigators independently assessed the quality of each included study according to the QUADAS-2 tool [15]. Any disagreements were resolved by discussion between the investigators.

Statistical analysis

For each report, the results were extracted as the number of patients in four strata—true/false and positive/negative. This was performed for all patients with appendicitis and, if available, for those with advanced appendicitis. If available, data were extracted for cut-off points > 3, > 4 and > 8. One study reported results with a cut-off point > 2 only. The performance at ruling out appendicitis in the low-risk strata was tested from the sensitivity in the low- versus the combined intermediate- and high-risk strata. The performance in ruling out appendicitis in the high-risk strata was tested in terms of specificity in the high-risk strata versus the combined intermediate- and low-risk strata. For each study, we calculated the sensitivity and specificity for the low and high cut-off points separately and estimated the corresponding pooled results in a meta-analysis. The ROC area for the AIR score and, if available from the Alvarado score, was extracted from each report and compared statistically from the standardised mean difference. Separate analyses were performed for all patients with appendicitis and those with advanced appendicitis. Heterogeneity between studies was assessed by the I2 index. We used STATA software (Version 17.0, StataCorp, College Station, TX) for all the statistical analyses. The meta-analysis was performed via the “metadta” and “metan” commands.

Results

The literature search yielded 58 hits in PubMed, 94 hits in EMBASE and 130 hits in Web of Science. We also included 558 references to the original publication of the AIR score listed in Google Scholar. After exclusion of duplicates, reports unrelated to the AIR score and reports from which we could not extract any useful data, 139 potential references were retrieved for assessment of eligibility. After the full texts were assessed, another 109 reports were excluded, leaving 26 reports for the final analysis, with a total of 15.699 included patients. One study reported separate results for males and females, which explains 27 sets of data (Fig. 1, Table 2).

Table 2 Characteristics of the studies included in the meta-analysis

Characteristics of the included studies

The final selected reports come from many countries and settings (Table 2) [16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42]. The majority of these studies are prospective and include consecutive patients. Approximately half of the reports are based on patients assessed for suspicion of appendicitis, and the remaining include patients operated on for suspicion of appendicitis. The median prevalence of appendicitis was 56.6% (range 12.5–89.2%).

Most reports include patients of both sexes and all ages or adults. One study involving children reported results in which > 4 was used as the low cut-off point [28] and > 3 was used as the low cut-off point in a complementary report [29]. Three reports included only pregnant women [19, 33, 36].

For the operated patients, the final diagnosis was based on histopathology in all the reports (Table 3). For the unoperated patients, the diagnosis was based on radiologic examination in two studies and results after between 2 weeks and 6 months of follow-up in 13 studies (Table 3). The criteria used for the histopathological diagnosis of appendicitis were transmural inflammation in eight reports, neutrophil invasion to the muscularis propria in six reports and various other criteria in three reports (Table 3). Reports accepting the presence of neutrophils in the lumen or invasion limited to the mucosa or lymphoid hyperplasia as criteria were excluded. The remaining reports did not specify the histopathological criteria used.

Table 3 Diagnostic criteria used

The area under the ROC curve (AUC) for the AIR score was reported in 23 studies for all patients with appendicitis (one study reporting separate results for men and women) and in 7 studies for patients with advanced appendicitis. Some 22 studies reported ROC areas for both the AIR and the Alvarado score for all patients with appendicitis and three for those with advanced appendicitis.

The number of true/false and positive/negative patients related to the low (> 2, > 3 or > 4 points) and high (> 8 points) AIR score cut-off points were extracted from the reports, and the corresponding sensitivity and specificity were calculated. Separate analyses were performed for all patients and for those with advanced appendicitis. Three studies reported results for both > 3 and > 4 points as the low cut-off points [18, 25, 28, 29]. One study reported results for men and women separately, using two different cut-off points [20].

Quality assessment

The included studies were generally of high quality. In the QUADAS-2 assessment, 13 of the 26 studies had a high risk of bias and concern regarding applicability in the domain of patient selection due to the exclusion of nonoperated and/or nonpregnant patients (Table 4). Some 16 studies had a high risk of bias and concern regarding applicability in the domain flow and timing, mainly because different reference tests were applied (histopathology of operated patients vs follow-up of nonoperated patients). Fourteen studies were unclear with respect to both risk of bias and concern regarding applicability in the domain reference standard, mainly because they were unclear or did not specify histopathological criteria.

Table 4 QUADAS-2 adjudgments of included studies

Diagnostic accuracy

Overall diagnostic accuracy

A classifier's performance can be described by the area under the ROC curve. For the AIR score, the pooled ROC area for all cases of appendicitis was 0.86 (95% CI 0.83; 0.88) in 23 studies and 0.93 (CI 0.91; 0.96) for advanced appendicitis in 7 studies.

The AIR score and the Alvarado score were both reported in 19 studies. For these paired reports, the pooled ROC area for all cases of appendicitis was significantly larger for the AIR score (0.85, CI 0.82; 0.89) than for the Alvarado score (0.79, CI 0.76; 0.81, p < 0.001). For advanced appendicitis, the pooled ROC area was 0.96 (CI 0.94; 0.98) for the AIR score compared with 0.88 (CI 0.82; 0.95) for the Alvarado score in three reports (p < 0.001). An analysis of the weighted mean difference in the ROC area for the AIR and the Alvarado score revealed a significantly larger ROC area for the AIR score (Fig. 2).

Fig. 2
figure 2

Standardised mean difference for the paired ROC areas. The pooled ROC area for all patients with appendicitis was 0.85 (CI 0.82; 0.88) for the AIR score and 0.79 (CI 0.76; 0.81) for the Alvarado score, with a difference of 0.06 (CI 0.03; 0.09, p < 0.001). The corresponding result for advanced appendicitis was 0.96 (CI 0.94; 0.98) for the AIR score compared with 0.88 (CI 0.82; 0.95) for the Alvarado score in three reports, a difference of 0.08 (CI 0.00;0.15), p = 0.03. Gudjonsdottir [28] included only children. CI is Confidence Interval

Diagnostic accuracy at the low cut-off point

At the low cut-off point, the aim of the AIR score is to obtain high sensitivity, especially for advanced appendicitis. A score > 4 points had a pooled sensitivity of 0.91 (CI 0.88; 0.94) for all patients with appendicitis (Fig. 3) and a pooled sensitivity of 0.95 (0.94; 0.97) for patients with advanced appendicitis vs those without appendicitis (Fig. 4). At a cut-off point of > 3, the pooled sensitivity was 0.95 (0.90; 0.97) for all patients with appendicitis and 0.99 (0.97; 0.99) for those with advanced appendicitis. The corresponding pooled specificities were 0.63 (0.55; 0.70) and 0.71 (0.64; 0.77) at a cut-off > 4 and 0.47 (0.42; 0.53) and 0.46 (0.40; 0.51) at a cut-off > 3, respectively.

Fig. 3
figure 3

Sensitivity and specificity for all patients with appendicitis at the low cut-off point. The aim at this cut-off is high sensitivity. Gudjonsdottir [28] C included only children and Moltubak [36] P included only pregnant women. Bhanghu [20] presented results for females and males separately. CI is Confidence Interval

Fig. 4
figure 4

Sensitivity and specificity for advanced appendicitis at the low cut-off point. Gudjonsdottir [28] C included only children and Moltubak [36] P included only pregnant women. CI is Confidence Interval

Diagnostic accuracy at the high cut-off point

At the high cut-off point, the aim is to identify patients with appendicitis, especially advanced appendicitis, with high specificity. At a cut-off point > 8, the pooled specificity of the AIR score was 0.98 (0.97; 0.99) for all patients with appendicitis and 0.99 (0.97; 0.99) for those with advanced appendicitis (Fig. 5). The corresponding pooled sensitivities were 0.30 (0.21; 0.42) and 0.43 (0.29; 0.59), respectively.

Fig. 5
figure 5

Sensitivity and specificity for all patients and for those with advanced appendicitis at the high cut-off point (> 8 points). Gudjonsdottir [28] C included only children and Moltubak [36] P included only pregnant women. CI is Confidence Interval

Discussion

The traditional understanding of appendicitis as a single entity, where inflammation progresses and eventually leads to perforation if untreated, is currently questioned. Accumulating evidence suggests that simple and advanced appendicitis represent two different entities and that simple appendicitis may resolve without treatment [3, 43,44,45]. As a consequence, management aimed at early diagnosis and immediate surgical treatment to prevent perforation is now replaced by more diversified management. The main aim is early detection and treatment of patients with advanced appendicitis from the large majority of patients suspected of having appendicitis. If advanced appendicitis is unlikely, a second evaluation after a short period of observation is safe and cost efficient compared with routine diagnostic imaging [5].

The use of an algorithm based on a clinical scoring system can make this management more objective and efficient. The AIR score is the second most cited appendicitis risk score, next to the Alvarado score [12]. The AIR score is based on four inflammatory variables and two signs of peritoneal irritation (Table 1). It was designed with a special focus on identifying patients with advanced appendicitis [16].

The AIR score-based algorithm uses two cut-offs to define three groups of patients with high, medium and low risk of appendicitis (Fig. 6). The aim of the high-risk cut-off is to identify patients with appendicitis with high specificity. In this meta-analysis, approximately 25% of all patients with appendicitis were assigned to the high-risk group, with a specificity of 0.98 and a prevalence of appendicitis of 91%. These patients need urgent surgical evaluation and probable abdominal diagnostic exploration. At such a high prevalence, imaging cannot rule out appendicitis but will only yield a high proportion of false negative results [46]. However, imaging may be indicated for differential diagnosis if other inflammatory conditions need to be ruled out.

Fig. 6
figure 6

The AIR score algorithm

Conversely, the aim of the low cut-off is to define a group of patients with a very low probability of advanced appendicitis, where reexamination after observation can be motivated [5]. In this meta-analysis, the low cut-off had a sensitivity for advanced appendicitis of 0.95 at a score > 4 and 0.99 at a score > 3. With an AIR score ≤ 4, approximately 40% of all the patients were assigned to the low-risk group, with a prevalence of advanced appendicitis of 1%. At an AIR score ≤ 3, the proportion of patients with advanced appendicitis was only 0.3%. At such a low prevalence, imaging would yield many false positive results and an increased risk for negative appendectomies [46]. A planned reexamination after a short period of expectant management is a safe alternative in this group of patients [5].

Study limitations

Although histopathology is the gold standard for the diagnosis of appendicitis, the final diagnosis is strongly dependent on the criteria for the findings on histopathology [47]. Some studies reported that the diagnosis was based on histopathological examination but did not declare the criteria used. Three studies were excluded because they had too wide criteria, such as lymhoid hyperplasia or the presence of neutrophils in the mucosa or even in the appendix lumen.

The inclusion of patients varied from patients with abdominal pain and suspicion of appendicitis to patients who underwent surgery for suspicion of appendicitis. This is reflected in the strong variation in the prevalence of appendicitis. As the estimates of the diagnostic values are influenced by the prevalence [14], we excluded all reports with a prevalence of appendicitis over 90%. However, the high median prevalence is still higher than that in an emergency department where the AIR score should be used for risk stratification of unsorted patients with abdominal pain and suspicion of appendicitis, which typically has a prevalence of approximately 30%.

Conclusion

This meta-analysis revealed that the AIR score has a significantly better diagnostic performance than the Alvarado score does, as shown by the pooled ROC area. The diagnostic properties of the AIR score at the low and high cut-off points suggest that it is a safe and suitable basis for risk-stratified management of patients suspected of having appendicitis.

Availability of data and materials

No datasets were generated or analysed during the current study.

References

  1. Podda M, Andersson R, Boermeester M, Coccolini F, Sartelli M, Moore EE, Sugrue M, Abu-Zidan F, Tolonen M, Damaskos D, Kluger Y, Soreide K, Pisanu A, Augustin G, Latifi R, Kelly M, Leppaniemi A, Fraga GP, Ten Broek R, Tan E, Van Goor H, Chiara O, Maier RV, Pata F, De Simone B, Ordoñez CA, Ansaloni L, Catena F, Di Saverio S. Do young patients with high clinical suspicion of appendicitis truly need cross-sectional imaging? Proceedings from a highly controversial debate among the experts’ panel of 2020 WSES Jerusalem guidelines. J Trauma Acute Care Surg. 2021;90(5):e101–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1097/TA.0000000000003097. (PMID: 33496553).

    Article  PubMed  Google Scholar 

  2. Song H, Lee S, Park JH, Kim HY, Min HD, Jeon JJ, Lee KH. LOCAT Group. Can patient triaging with clinical scoring systems reduce CT use in adolescents and young adults suspected of having appendicitis? Radiology. 2021;300(2):350–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1148/radiol.2021203884.

    Article  PubMed  Google Scholar 

  3. Andersson RE. The natural history and traditional management of appendicitis revisited: spontaneous resolution and predominance of prehospital perforations imply that a correct diagnosis is more important than an early diagnosis. World J Surg. 2007;31(1):86–92. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00268-006-0056-y.

    Article  PubMed  Google Scholar 

  4. Jennings R, Guo H, Goldin A, Wright DR. Cost-effectiveness of imaging protocols for suspected appendicitis. Pediatrics. 2020;145(2):e20191352. https://doiorg.publicaciones.saludcastillayleon.es/10.1542/peds.2019-1352.

    Article  PubMed  Google Scholar 

  5. Andersson M, Kolodziej B, Andersson RE. STRAPPSCORE study group. Randomised clinical trial of appendicitis inflammatory response score-based management of patients with suspected appendicitis. Br J Surg. 2017;104(11):1451–61. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/bjs.10637.

    Article  CAS  PubMed  Google Scholar 

  6. Feinstein AR. The inadequacy of binary models for the clinical reality of three-zone diagnostic decisions. J Clin Epidemiol. 1990;43:109–13.

    Article  CAS  PubMed  Google Scholar 

  7. Coste J, Pouchot J. A grey zone for quantitative diagnostic and screening tests. Int J Epidemiol. 2003;32(2):304–13. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/ije/dyg054. (PMID: 12714554).

    Article  PubMed  Google Scholar 

  8. Frountzas M, Stergios K, Kopsini D, Schizas D, Kontzoglou K, Toutouzas K. Alvarado or RIPASA score for diagnosis of acute appendicitis? A meta-analysis of randomised trials. Int J Surg. 2018;56:307–14. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.ijsu.2018.07.003. (Epub 2018 Jul 12 PMID: 30017607).

    Article  PubMed  Google Scholar 

  9. Ebell MH, Shinholser J. What are the most clinically useful cut-offs for the alvarado and pediatric appendicitis scores? A systematic review. Ann Emerg Med. 2014;64(4):365–72.

    Article  PubMed  Google Scholar 

  10. Favara G, Maugeri A, Barchitta M, Ventura A, Basile G, Agodi A. Comparison of RIPASA and ALVARADO scores for risk assessment of acute appendicitis: a systematic review and meta-analysis. PLoS ONE. 2022;17(9):e0275427. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/journal.pone.0275427.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Bai S, Hu S, Zhang Y, Guo S, Zhu R, Zeng J. The value of the Alvarado score for the diagnosis of acute appendicitis in children: a systematic review and meta-analysis. J Pediatr Surg. 2023;58(10):1886–92.

    Article  PubMed  Google Scholar 

  12. Abdella Bahta NN, Zeinert P, Rosenberg J, Fonnes S. The Alvarado score is the most impactful diagnostic tool for appendicitis: a bibliometric analysis. J Surg Res. 2023;291:557–66. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jss.2023.06.037.

    Article  PubMed  Google Scholar 

  13. Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, Shamseer L, Tetzlaff JM, Akl EA, Brennan SE, Chou R, Glanville J, Grimshaw JM, Hróbjartsson A, Lalu MM, Li T, Loder EW, Mayo-Wilson E, McDonald S, McGuinness LA, Stewart LA, Thomas J, Tricco AC, Welch VA, Whiting P, Moher D. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. 2021;29(372):n71. https://doiorg.publicaciones.saludcastillayleon.es/10.1136/bmj.n71.

    Article  Google Scholar 

  14. Murad MH, Lin L, Chu H, Hasan B, Alsibai RA, Abbas AS, Mustafa RA, Wang Z. The association of sensitivity and specificity with disease prevalence: analysis of 6909 studies of diagnostic test accuracy. CMAJ. 2023;195(27):E925–31. https://doiorg.publicaciones.saludcastillayleon.es/10.1503/cmaj.221802.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, Leeflang MM, Sterne JA, Bossuyt PM. QUADAS-2 Group. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–36. https://doiorg.publicaciones.saludcastillayleon.es/10.7326/0003-4819-155-8-201110180-00009.

    Article  PubMed  Google Scholar 

  16. Andersson M, Andersson RE. The appendicitis inflammatory response score: a tool for the diagnosis of acute appendicitis that outperforms the Alvarado score. World J Surg. 2008;32(8):1843–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00268-008-9649-y. (PMID: 18553045).

    Article  PubMed  Google Scholar 

  17. Andersson M, Rubér M, Ekerfelt C, Hallgren HB, Olaison G, Andersson RE. Can new inflammatory markers improve the diagnosis of acute appendicitis? World J Surg. 2014;38(11):2777–83. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00268-014-2708-7. (PMID: 25099684).

    Article  PubMed  Google Scholar 

  18. Andersson M, Kolodziej B, Andersson RE. Validation of the appendicitis inflammatory response (AIR) score. World J Surg. 2021;45(7):2081–91. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00268-021-06042-2.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Bardakçi O, Bahçecioğlu İB, Tatli F, Özgönül A, Güldür ME, Uzunköy A. Does one of the two most commonly used scoring systems have a decisive advantage over the other in diagnosing acute appendicitis in pregnant women? Medicine (Baltimore). 2023;102(17):e33596. https://doiorg.publicaciones.saludcastillayleon.es/10.1097/MD.0000000000033596.

    Article  PubMed  Google Scholar 

  20. Bhangu A. RIFT study group on behalf of the west midlands research collaborative. Evaluation of appendicitis risk prediction models in adults with suspected appendicitis. Br J Surg. 2020;107(1):73–86. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/bjs.11440.

    Article  CAS  PubMed  Google Scholar 

  21. Bolívar-Rodríguez MA, Osuna-Wong BA, Calderón-Alvarado AB, Matus-Rojas J, Dehesa-López E, Peraza-Garay FJ. Análisis comparativo de escalas diagnósticas de apendicitis aguda: alvarado, RIPASA y AIR. Cir Cir. 2019;86(2):169–74. https://doiorg.publicaciones.saludcastillayleon.es/10.24875/CIRU.M18000029.

    Article  Google Scholar 

  22. Chisthi MM, Surendran A, Narayanan JT. RIPASA and air scoring systems are superior to alvarado scoring in acute appendicitis: diagnostic accuracy study. Ann Med Surg (Lond). 2020;24(59):138–42. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.amsu.2020.09.029.PMID:33024555;PMCID:PMC7527659.

    Article  Google Scholar 

  23. de Castro SM, Ünlü C, Steller EP, van Wagensveld BA, Vrouenraets BC. Evaluation of the appendicitis inflammatory response score for patients with acute appendicitis. World J Surg. 2012;36(7):1540–5. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00268-012-1521-4.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Elshakhs S, Abdelsamie M, Fareed A, Abuomar MH. Reliability of the adult appendicitis score in diagnosing acute appendicitis. Menoufia Med J. 2019;32:544–8.

    Article  Google Scholar 

  25. Fagan PVB, Stanfield B, Korduke O, Henderson N, Kodeda K. Validation of the appendicitis inflammatory response score in a provincial population. Int Surg J. 2020;7(11):3540–5. https://doiorg.publicaciones.saludcastillayleon.es/10.18203/2349-2902.isj20204655.

    Article  Google Scholar 

  26. Gan DEY, Nik Mahmood NRK, Chuah JA, Hayati F. Performance and diagnostic accuracy of scoring systems in adult patients with suspected appendicitis. Langenbecks Arch Surg. 2023;408(1):267. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00423-023-02991-5. (PMID: 37410251).

    Article  PubMed  Google Scholar 

  27. Gopalam PR, Saila Suman Konidala MV. Comparison of acute inflammatory score and Alvarado score in diagnosis of acute appendicitis at a tertiary care hospital. Int Surg J. 2017;4(12):4034–8. https://doiorg.publicaciones.saludcastillayleon.es/10.18203/2349-2902.isj20175405.

    Article  Google Scholar 

  28. Gudjonsdottir J, Marklund E, Hagander L, Salö M. Clinical prediction scores for pediatric appendicitis. Eur J Pediatr Surg. 2021;31(3):252–60. https://doiorg.publicaciones.saludcastillayleon.es/10.1055/s-0040-1710534. (Epub 2020 May 26 PMID: 32455443).

    Article  PubMed  Google Scholar 

  29. Gudjonsdottir J, Andersson RE. Validation of clinical prediction scores for pediatric appendicitis: update with new cut-off values for the AIR score. Eur J Pediatr Surg. 2023;33(2):176–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1055/a-1932-8468. (Epub 2022 Aug 29 PMID: 36037844).

    Article  PubMed  Google Scholar 

  30. Hassan M, Jeilani M, Saad AA, Iqbal S, Boshnaq M. Evaluation of Alvarado score and appendicitis inflammatory response score as diagnostic tools for acute appendicitis. Int Surg J. 2022;9(12):1937–42. https://doiorg.publicaciones.saludcastillayleon.es/10.18203/2349-2902.isj20223152.

    Article  Google Scholar 

  31. Jose T, Rajesh PS. Appendicitis inflammatory response score in comparison to Alvarado score in acute appendicitis. Surg J (N Y). 2021;7(3):e127–31. https://doiorg.publicaciones.saludcastillayleon.es/10.1055/s-0041-1731446.

    Article  PubMed  Google Scholar 

  32. Kabir SMU, Bucholc M, Walker CA, Sogaolu OO, Zeeshan S, Sugrue M. Quality outcomes in appendicitis care: identifying opportunities to improve care. Life (Basel). 2020;10(12):358. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/life10120358.

    Article  PubMed  Google Scholar 

  33. Kahana N, Boaz E, Neymark M, Gilshtein H, Dror YF, Benjaminov O, Reissman P, Tankel J. Magnetic resonance imaging for acute appendicitis in pregnancy: can clinical scores predict when imaging is needed? Eur J Trauma Emerg Surg. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00068-024-02538-3.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Khudhar HE, Mohammed Jasim D. Accuracy of diagnostic scales of acute appendicitis: Alvarado, RIPASA and AIR in comparison to histopathology. JMSP. 2023;8(4):256–73.

    Google Scholar 

  35. Kollár D, McCartan DP, Bourke M, Cross KS, Dowdall J. Predicting acute appendicitis? A comparison of the Alvarado score, the appendicitis inflammatory response score and clinical assessment. World J Surg. 2015;39(1):104–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00268-014-2794-6. (PMID: 25245432).

    Article  PubMed  Google Scholar 

  36. Moltubak E, Landerholm K, Blomberg M, Andersson RE. Validation of the appendicitis inflammatory response (AIR) score during pregnancy: a nested case-control study. Acad Emerg Med. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/acem.14915.

    Article  PubMed  Google Scholar 

  37. Patil S, Harwal R, Harwal S, Kamthane S. Appendicitis inflammatory response score: a novel scoring system for acute appendicitis. Int Surg J. 2017;4(3):1065–70. https://doiorg.publicaciones.saludcastillayleon.es/10.18203/2349-2902.isj20170863.

    Article  Google Scholar 

  38. Rohat A, Doganay F, Akoglu EU, Akoglu H, Uçar AB, Kurt E, Turan CA. Predictive value of scoring systems for the diagnosis of acute appendicitis in emergency department patients: is there an accurate one? Hong Kong J Emerg Med. 2020;27(5):262–9.

    Article  Google Scholar 

  39. Safaee M, Eshraghi Samani R, Talebzadeh H, Moeini Sam MS. Evaluation and comparison of the results of the Alvarado scoring system with acute inflammatory response score in the diagnosis of acute appendicitis based on the pathological evidence. Immunopathol Persa. 2024;10(2):e40606. https://doiorg.publicaciones.saludcastillayleon.es/10.34172/ipp.2024.40606.

    Article  Google Scholar 

  40. Sammalkorpi HE, Mentula P, Leppäniemi A. A new adult appendicitis score improves diagnostic accuracy of acute appendicitis–a prospective study. BMC Gastroenterol. 2014;26(14):114. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/1471-230X-14-114.PMID:24970111;PMCID:PMC4087125.

    Article  Google Scholar 

  41. Scott AJ, Mason SE, Arunakirinathan M, Reissis Y, Kinross JM, Smith JJ. Risk stratification by the appendicitis inflammatory response score to guide decision-making in patients with suspected appendicitis. Br J Surg. 2015;102(5):563–72. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/bjs.9773. (Epub 2015 Mar 2 PMID: 25727811).

    Article  CAS  PubMed  Google Scholar 

  42. Walia DS, et al. A comparative study of Alvarado, Ripasa and Airs scoring systems in the diagnosis of acute appendicitis. Europ J Molecul Clinic Med. 2022;9(3):369–79.

    Google Scholar 

  43. Andersson RE, Agiorgiti M, Bendtsen M. Spontaneous resolution of uncomplicated appendicitis may explain increase in proportion of complicated appendicitis during Covid-19 pandemic: a systematic review and meta-analysis. World J Surg. 2023;47(8):1901–16. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00268-023-07027-z.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Park HC, Kim MJ, Lee BH. Randomised clinical trial of antibiotic therapy for uncomplicated appendicitis. Br J Surg. 2017;104(13):1785–90. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/bjs.10660.

    Article  CAS  PubMed  Google Scholar 

  45. Salminen P, Sippola S, Haijanen J, Nordström P, Rantanen T, Rautio T, Sallinen V, Löyttyniemi E, Hurme S, Tammilehto V, Laukkarinen J, Savolainen H, Meriläinen S, Leppäniemi A, Grönroos J. Antibiotics versus placebo in adults with CT-confirmed uncomplicated acute appendicitis (APPAC III): randomised double-blind superiority trial. Br J Surg. 2022;109(6):503–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/bjs/znac086.PMID:35576384;PMCID:PMC10364767.

    Article  PubMed  PubMed Central  Google Scholar 

  46. van Randen A, Bipat S, Zwinderman AH, Ubbink DT, Stoker J, Boermeester MA. Acute appendicitis: meta-analysis of diagnostic performance of CT and graded compression US related to prevalence of disease. Radiology. 2008;249(1):97–106. https://doiorg.publicaciones.saludcastillayleon.es/10.1148/radiol.2483071652. (Epub 2008 Aug 5 PMID: 18682583).

    Article  PubMed  Google Scholar 

  47. Carr NJ. The pathology of acute appendicitis. Ann Diagn Pathol. 2000;4(1):46–58. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/s1092-9134(00)90011-x.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable

Funding

Open access funding provided by Linköping University.

Author information

Authors and Affiliations

Authors

Contributions

Both authors took part in the extraction of the data. RA made most of the analysis of the data. RA prepared all figures and tables. RA wrote the first version of the manuscript. JS made the first quality evaluation according to the QUADAS-2 tool. Both authors independently assessed the quality of each included study. Any disagreements were resolved by discussion between the investigators. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Roland E. Andersson.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Andersson, R.E., Stark, J. Diagnostic value of the appendicitis inflammatory response (AIR) score. A systematic review and meta-analysis. World J Emerg Surg 20, 12 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13017-025-00582-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13017-025-00582-x

Keywords