Neutrality Boundary Robustness for Meta-Analysis

doi:10.63096/medtigo3062417

Neutrality Boundary Robustness for Meta-Analysis

Thomas F. Heston
http://orcid.org/0000-0002-5655-2512

Thomas F. Heston
¹

Author Affiliations

medtigo J Med. |

Date - Received: Dec 31, 2025,

Accepted: Jan 04, 2026,

Published: Feb 26, 2026.

https://doi.org/10.63096/medtigo3062417

Abstract

Background: Meta-analyses conventionally report pooled effect sizes, confidence intervals, and p-values. These measures address statistical significance and precision, but not how far the results are from a “no benefit” (nb) point, as measured on a standardized robustness scale. The Neutrality Boundary Framework provides a robust metric, nb, that ranges from 0 to 1 for individual studies, but its use in meta-analysis has not been formally described. A nb of 0 indicates that results are at the neutrality boundary, where no relationship between treatment and outcome is present; a nb of 1 indicates that results show a strong relationship between treatment and outcome.
Methodology: A sample of published clinical trials was selected for evaluation. The distribution of nb across all trials was summarized, and the correlation between nb and p-values was examined.
Results: Across 161 trials, the median nb was 0.147 (range 0.000 to 0.902). Using standardized robustness bands derived from prior validation work based on simulation and trial data, 35.4% of trials showed weak robustness (nb less than 0.075), 24.8% moderate robustness (nb between 0.075 and 0.227), and 39.8% strong robustness (nb greater than or equal to 0.227). Trials with binary 2-by-2 outcomes tended to have lower nb values than trials with continuous outcomes. The robust metric nb was moderately correlated with p-values (r = 0.35, p less than 0.001). Although a moderate level of correlation was present, p-values explained only about 12% of the variation in nb (r-squared = 0.12), indicating that nb captures substantial additional information about distance from neutrality that p-values alone do not provide.
Conclusion: The distribution of trial-level nb values provides a simple way to describe robustness across a collection of studies and offers a cross-design view of how far the overall evidence base lies from therapeutic neutrality. Incorporating robust measures alongside p-values in routine evidence synthesis may provide a more complete picture of the strength and reliability of the evidence.

Keywords

Meta-analysis, Neutrality boundary framework, Statistical robustness, Evidence synthesis, Statistical fragility, Complete statistical evidence.

Introduction

Meta-analysis is the dominant tool for quantitative evidence synthesis in clinical research.[1] Standard practice focuses on three primary outputs: a pooled effect estimate (e.g., risk ratio, hazard ratio, mean difference), a confidence interval quantifying uncertainty, and a hypothesis test p-value against a null hypothesis of no effect.[2] Together, these answers: “Is there statistically detectable evidence against the null, and what is the estimated magnitude and precision of that effect?” However, p-values are inherently threshold-dependent, defined by an arbitrary cutoff (typically alpha=0.05).[3] This creates a fundamental problem: two meta-analyses with p-values of 0.049 and 0.001 are both labeled “statistically significant,” yet they represent vastly different levels of evidence quality. The first result sits precariously at the significance boundary and could easily flip with minor changes to the data, while the second shows robust separation from the null hypothesis. Current meta-analytic practice provides no standardized method for distinguishing between these scenarios or for quantifying how far pooled results lie from therapeutic neutrality on a scale comparable across different study designs and outcome types.

Several approaches have attempted to address the limitations of p-value-only reporting in meta-analysis. Confidence intervals provide information about precision and the magnitude of the effect. Prediction intervals estimate the range of effects in future studies.[4] Heterogeneity statistics (I-squared, tau-squared) quantify variability across studies. GRADE assessments evaluate overall evidence quality through multiple dimensions.[5] More recently, fragility indices have been extended to meta-analysis to measure how many outcome changes would be needed to flip statistical significance.[6] Each of these methods addresses essential aspects of evidence quality and has contributed to a more nuanced interpretation of meta-analytic results.

However, these existing approaches fall short in one critical dimension: none provides a threshold-free, standardized measure of how far pooled results are from therapeutic neutrality that works consistently across diverse study designs. Confidence intervals depend on arbitrary significance levels.[7] Prediction intervals estimate the range of true effects in future studies and can indicate whether those effects might be null or beneficial.[8,9] However, prediction intervals do not provide a standardized measure of how far the pooled effect lies from neutrality. The neutrality boundary framework (NBF) addresses this gap by quantifying distance from neutrality on a standardized 0-1 scale. Fragility indices are, by definition, threshold-dependent, measuring only changes needed to cross the alpha=0.05 boundary. GRADE assessments, while comprehensive, do not quantify the extent to which the results deviate from no effect on a standardized 0–1 scale. What remains absent is a metric that answers the question: “Regardless of whether p is above or below 0.05, how far are these results from a ‘no effect’ point, measured consistently across binary, continuous, survival, and other outcome types?”

NBF addresses this gap by providing a robustness index, nb, that ranges from 0 to 1 and measures geometric distance from therapeutic neutrality.[10] The NBF has been successfully developed and applied to individual clinical trials across multiple study designs, including independent binary outcome trials, matched binary outcome trials, diagnostic test evaluations, continuous outcomes, correlation analyses, and survival analyses.[11]

The NBF defines robustness using the general formula: nb = |T – T0| / (|T – T0| + S), where T is a test statistic or effect measure (for example, a log risk ratio or a standardized mean difference), T0 is the point of therapeutic neutrality (for example, log risk ratio = 0 or correlation r = 0), and S is a scale parameter that reflects within-study variability (for example, a standard error or a pooled standard deviation). The metric is threshold-free (independent of any chosen significance level), design-agnostic (works across different study types on a standardized 0-1 scale), and interpretable (values near 0 indicate proximity to neutrality; higher values indicate stronger separation from neutrality). However, its extension to meta-analytic evidence synthesis has not been formalized. Meta-analyses currently lack a standardized method for applying NBF principles when synthesizing evidence across studies with varying designs, sample sizes, and effect sizes.

The objective of this study was to extend the NBF to meta-analysis through two complementary approaches. The first approach, implemented empirically in this paper, involves summarizing trial-level nb distributions across heterogeneous study designs to characterize the overall robustness of an evidence base. The second approach, formalized theoretically, defines a pooled robustness measure (nb_meta) for future use when study-level effect estimates and their variances are available. Together, these approaches provide meta-analysts and clinicians with a standardized method to assess how far synthesized evidence lies from therapeutic neutrality, complementing traditional p-values and addressing the critical gap in current meta-analytic practice.

Methodology

Study design: This study used two complementary approaches to assess robustness in meta-analysis using the NBF. The first approach involved descriptive synthesis of trial-level nb values across a heterogeneous sample of clinical trials. The second approach formalized a pooled robustness measure (nb_meta) for future implementation when study-level effect estimates and variances are available.

Data source: The analysis used a convenience sample of 161 clinical trials for which NBF analyses had already been completed. For each trial, the following data were available: DOI or PubMed identifier, publication year, clinical specialty, study design category, p-value, and NBF robustness metric nb. Trials were classified into six design categories: binary 2-by-2 independent, binary 2-by-2 matched, continuous-difference, continuous-means, survival outcomes, and contingency tables. Trial-level nb values had been computed using design-specific NBF methods appropriate for each study type. As the majority of trials were two-arm, independent binary outcome trials, the NBF metric utilized was the Risk Quotient (RQ). For 2×2 contingency tables, the RQ = |ad – bc| / (N^2 / 4), which quantifies how far results are from independence (the point where treatment and outcome are unrelated, i.e., a relative risk of one).[12] All nb values range from 0 to 1, where values near 0 indicate proximity to therapeutic neutrality and values near 1 indicate strong separation from neutrality.

Meta-analysis robustness framework: When study-level effect estimates are available, a pooled robustness metric can be defined using NBF principles. The pooled robustness measure nb_meta is defined as:

nb_meta = |T_pooled| / (|T_pooled| + SE_pooled)

Here, T_pooled is the pooled effect estimate expressed on a neutrality-centered scale (neutrality at T = 0), and SE_pooled is the standard error of the pooled estimate under the specified meta-analytic model (fixed-effects or random-effects).

This definition yields values between 0 and 1. When the pooled effect is large relative to its pooled uncertainty, nb_meta-approaches 1, indicating strong separation from therapeutic neutrality across the evidence base. When the pooled effect is small relative to pooled uncertainty, nb_meta-approaches 0, indicating proximity to neutrality. Computing nb_meta requires individual-study effect estimates and standard errors (or equivalent variance information) so that T_pooled and SE_pooled can be obtained from a standard meta-analysis.

To demonstrate the practical application of nb_meta, a published diagnostic meta-analysis of cadmium-zinc-telluride (CZT) SPECT for coronary artery disease detection was selected as a worked example.[13] This meta-analysis included 11 studies with complete 2-by-2 diagnostic accuracy data, enabling calculation of individual study robustness using the Diagnostic Neutrality Boundary metric and subsequent pooling using the nb_meta formula described above.

Statistical analysis: The distribution of nb across all 161 trials was summarized using the median, range, and mean. Trials were categorized into robustness bands based on empirically derived thresholds from prior validation work: weak robustness (nb<0.075), moderate robustness (0.075≤ nb<0.227), and strong robustness (nb≥0.227). The number and proportion of trials in each band were calculated. The Pearson correlation coefficient between nb and -log10(p) was computed to examine the relationship between robustness and statistical significance. The -log10(p) transformation converts p-values to a scale where larger values indicate stronger evidence against the null hypothesis. Trials with p-values reported as zero were excluded from correlation analysis. SAMPL guidelines were followed.[14]

Ethics statement: This study is a secondary analysis of aggregate statistical results reported in previously published medical literature. No individual-level data, identifiable private information, or identifiable biospecimens were obtained, accessed, or analyzed. Under 45 CFR 46, this project does not constitute human subjects research, and institutional review board oversight was not required.

Results

Dataset overview: The dataset contained 161 clinical trials with complete p-value and nb information. By design class, binary 2-by-2 independent trials comprised the majority (n=133, 82.6%), followed by continuous-difference (n=15, 9.3%), continuous-means (n=7, 4.3%), survival outcomes (n=3, 1.9%), contingency tables (n=2, 1.2%), and binary 2-by-2 matched (n=1, 0.6%). Trials spanned multiple specialties, including autoimmunity, cardiology, endocrinology, gastroenterology, infectious diseases, internal medicine, nephrology, neurology, oncology, pediatrics, psychiatry, public health, pulmonary medicine, and surgery.

Overall robustness distribution: Across all 161 trials, the robustness metric nb had a median of 0.147 (range 0.000 to 0.902) and a mean of 0.247. By robustness band, 57 trials (35.4%) showed weak robustness, 40 trials (24.8%) showed moderate robustness, and 64 trials (39.8%) showed strong robustness. Overall, most trials exhibited weak or moderate robustness, indicating that their estimated effects remained relatively close to therapeutic neutrality. Table 1 summarizes these statistics.

Statistic	Value
Number of trials (N)	161
Median nb	0.147
Mean nb	0.247
Range of nb	0.000-0.902
Weak robustness (nb<0.075)	57 (35.4%)
Moderate robustness (0.075≤nb<0.227)	40 (24.8%)
Strong robustness (nb≥0.227)	64 (39.8%)

Table 1: Overall distribution of robustness metric nb across all trials

Relationship between robustness and p-values: The empirical relationship between robustness (nb) and p-values was examined using -log10(p) as a measure of p-value strength. The correlation coefficient between nb and -log10(p) was r=0.35 (Pearson, p<0.001, n=137). Twenty-four trials with p-values reported as zero or below machine precision were excluded from this correlation analysis. This pattern shows that robustness, as quantified by nb, is related to, but not reducible to, statistical significance. Only 12% of the variation in nb is explained by the p-value (r-squared = 12.2%). This confirms that the geometric distance from neutrality (nb) is quantitatively different from the raw incompatibility of the data with a null hypothesis (p-value). Continuous-outcome designs tended to show higher robustness (larger nb) than binary designs, consistent with higher information per subject in continuous endpoints.

Empirical validation of pooled robustness: To demonstrate the practical implementation of the proposed nb_meta pooling metric, the Neutrality Boundary Framework was applied to a published diagnostic meta-analysis evaluating cadmium-zinc-telluride (CZT) SPECT for coronary artery disease detection.[13] This meta-analysis included 11 studies with complete 2×2 diagnostic accuracy data, enabling calculation of individual study robustness (nb) via the Diagnostic Neutrality Boundary (DNB) metric and subsequent pooling.[15-25] Because all cells were nonzero, DOR, SE[ln(DOR)], and DNB were calculated directly from the 2×2 counts without continuity corrections.

Individual study robustness calculations: For each study, the diagnostic odds ratio (DOR) was computed from the reconstructed 2×2 table:

DOR = (TP × TN) / (FP × FN)

The DNB metric was then calculated as:

DNB = |ln(DOR)| / (|ln(DOR)| + SE[ln(DOR)])

where SE[ln(DOR)] = √(1/TP + 1/FN + 1/FP + 1/TN)

The DOR and DNB results are shown in Table 2.

Study	TP	FP	TN	FN	DOR	ln(DOR)	SE	DNB
Giubbini R et al., 2021	21	10	20	3	14.000	2.639	0.729	0.784
Nishiyama Y et al., 2014	46	4	18	8	25.875	3.253	0.673	0.829
Wang J et al., 2021	28	2	14	5	39.200	3.669	0.898	0.803
Ben Bouallegue F et al., 2015	9	3	13	1	39.000	3.664	1.233	0.748
Han S et al., 2018	16	8	45	7	12.857	2.554	0.594	0.811
Nkoulou R et al., 2016	1	2	14	4	1.750	0.560	1.350	0.293
Shiraishi S et al., 2015	12	9	32	2	21.333	3.060	0.852	0.782
Shiraishi S et al., 2020	62	16	31	16	7.508	2.016	0.416	0.829
Miyagawa M et al., 2017	10	1	13	4	32.500	3.481	1.195	0.745
Gimelli A et al., 2012a	77	4	4	13	5.923	1.779	0.768	0.698
Gimelli A et al., 2012b	88	3	6	8	22.000	3.091	0.798	0.795

Table 2: Diagnostic accuracy data and individual study robustness for 11 CZT SPECT studies.

Legend: The 2-by-2 diagnostic accuracy data (TP, FP, TN, FN) for each study were used to calculate the diagnostic odds ratio (DOR), its natural log and standard error, and the Diagnostic Neutrality Boundary (DNB), which is the NBF robustness metric for diagnostic studies. TP = true positives; FP = false positives; TN = true negatives; FN = false negatives; SE = standard error of ln(DOR); DNB = nb for diagnostic studies.

Pooled robustness calculation: Following the NBF formula for pooled robustness, nb_meta was calculated in two steps. First, the pooled effect estimate was obtained by weighing each study’s ln(DOR) by its precision, with more precise studies (those with smaller standard errors) receiving greater weight. The weight for each study was calculated as 1 divided by the square of its standard error. Second, the pooled effect estimate and its corresponding standard error were entered into the NBF formula to obtain nb_meta = (pooled effect) / (pooled effect + pooled standard error). The pooled effect calculation is shown in Table 3.

Study	ln(DOR)	SE	Weight (1/SE²)	ln(DOR) × Weight
Giubbini R et al., 2021	2.639	0.729	1.882	4.966
Nishiyama Y et al., 2014	3.253	0.673	2.208	7.182
Wang J et al., 2021	3.669	0.898	1.240	4.550
Ben Bouallegue F et al., 2015	3.664	1.233	0.658	2.410
Han S et al., 2018	2.554	0.594	2.834	7.238
Nkoulou R et al., 2016	0.560	1.350	0.549	0.307
Shiraishi S et al., 2015	3.060	0.852	1.378	4.215
Shiraishi S et al., 2020	2.016	0.416	5.778	11.649
Miyagawa M et al., 2017	3.481	1.195	0.700	2.438
Gimelli A et al., 2012a	1.779	0.768	1.695	3.016
Gimelli A et al., 2012b	3.091	0.798	1.570	4.854
Total	—	—	20.492	52.826

Table 3: Inverse-variance weighted pooling of ln(DOR)

Legend: Each study’s ln(DOR) was weighted by 1 divided by SE squared. The total weight sum (20.492) and weighted ln(DOR) sum (52.826) were used to calculate the pooled effect (52.826/20.492 = 2.578) and the pooled standard error (1/sqrt(20.492)=0.221). These are used to obtain nb_meta = 2.578 / (2.578 + 0.221) = 0.921. DOR = diagnostic odds ratio, SE = standard error of ln(DOR).

Interpretation: The pooled robustness of nb_meta = 0.92 indicates that the CZT SPECT diagnostic evidence is strongly separated from therapeutic neutrality (DOR = 1), consistent with the meta-analysis conclusion that CZT SPECT demonstrates reliable diagnostic accuracy for coronary artery disease. On the 0–1 NBF scale, values approaching 1 reflect maximal separation from neutrality; the observed value of 0.92 indicates that the pooled effect estimate is large relative to its uncertainty.

Notably, one study exhibited substantially lower robustness (nb = 0.29), reflecting its small sample size (n = 21) and a near-neutral diagnostic odds ratio (DOR = 1.75) [21]. The inverse-variance weighting appropriately down-weights this imprecise estimate in the pooled ln(DOR), demonstrating a key advantage of the nb_meta approach: studies contributing weak or unstable evidence receive proportionally less influence on the pooled estimate.

This empirical demonstration confirms that nb_meta can be computed directly from published meta-analysis forest plots without requiring access to individual patient data, supporting its feasibility for routine implementation in systematic reviews.

Discussion

The central finding of this study is that robustness, as quantified by the NBF metric nb, can be synthesized across heterogeneous clinical trials to provide a standardized assessment of how far an evidence base lies from therapeutic neutrality. Across 161 trials spanning multiple study designs and clinical specialties, the median nb was 0.147, with most trials (60.2%) exhibiting weak or moderate robustness. This indicates that most trial results remained relatively close to the point of no effect, even when statistically significant. The moderate correlation between nb and p-values (r=0.35) demonstrates that robustness and statistical significance measure related but distinct dimensions of evidence quality. Only 12% of the variation in nb is explained by p-values, confirming that robustness provides substantial additional information beyond what p-values alone convey.

This study makes three contributions to meta-analytic methodology. First, it demonstrates empirical robustness synthesis using trial-level nb distributions across heterogeneous study designs, showing that robustness can be meaningfully summarized without requiring uniform outcome types or effect size metrics. Second, it formalizes the theoretical framework for pooled meta-analytic robustness (nb_meta in the methods section) for future implementation once study-level effect estimates and variances become available. Third, it establishes robustness as a complementary dimension of meta-analytic evidence alongside pooled p-values, providing a threshold-free assessment of evidence quality that works consistently across binary, continuous, survival, and other outcome types.

A common objection is that confidence intervals already contain the necessary information about distance from the no-effect point. Confidence intervals indicate whether the interval includes the null and how wide the compatible range of effects is, so they are a good measure of precision. However, they do not provide a standardized 0-to-1 summary of the distance from neutrality relative to its uncertainty, and they are not directly comparable across different effect scales and study designs. In contrast, nb and nb_meta take the same basic ingredients (effect estimate and its variability) and convert them into a single robustness score between 0 and 1. This score reflects both the distance of the estimate from the no-effect value and its noise, on a common scale that is interpretable and directly comparable across diverse designs. In this sense, nb and nb_meta do not duplicate the confidence interval; they extract and standardize the distance-from-neutrality information that the interval only displays in a plot or on the original effect scale.

Classical meta-analyses emphasize pooled effects, confidence intervals, and p-values, sometimes supplemented by prediction intervals or GRADE assessments.[26] Fragility indices for meta-analysis quantify how many outcome changes would flip pooled significance. NBF robustness differs in three fundamental ways. It is threshold-free, remaining independent of the arbitrary alpha=0.05 cutoff that defines statistical significance. It is explicitly geometric, measuring distance from neutrality relative to variability rather than probability under a null hypothesis. It is cross-design, providing a unified 0-to-1 scale across binary, continuous, survival, and other outcome types where traditional effect sizes are not directly comparable. By integrating nb alongside p-values, meta-analysts can complement significance testing with a standardized robustness layer that answers a fundamentally different question: how far are these results from no effect?

In practice, meta-analysts could report the median nb and proportion of trials exceeding robustness thresholds alongside pooled effects. They could examine design-specific or domain-specific robustness patterns to identify whether certain study types consistently show stronger or weaker separation from neutrality. Systematically low nb values (e.g., nb < 0.227) would indicate proximity to neutrality even when p < 0.05, suggesting that statistically significant findings may lack practical separation from no effect. Consistently high nb values (e.g., nb > 0.227) would indicate robust separation from neutrality, strengthening confidence in the magnitude of the observed effects. This approach overlays a robustness map atop standard meta-analytic outputs without replacing them, providing complementary information for interpreting evidence and making clinical decisions.

Limitations: This analysis used a convenient sample of trials with completed NBF analyses, assembled for methodological illustration rather than systematic review of a specific clinical question. Results demonstrate feasibility and interpretive principles but should not be interpreted as clinical evidence for any intervention. The sample was heterogeneous by design, spanning multiple clinical specialties and study types, which enabled demonstration of cross-design robustness synthesis but limits generalizability to any single clinical domain. Directionality (benefit versus harm) was not emphasized in this analysis. The focus was on absolute distance from neutrality. However, in applied settings, nb can be paired with the sign of the underlying effect to distinguish beneficial from harmful deviations from null. The pooled robustness measure nb_meta was introduced as a conceptual extension of NBF to meta-analytic effect sizes, but empirical validation and software implementation are left for future work. The weak, moderate, and strong robustness bands are based on empirically derived provisional recommendations from 118 real trials and 1 million simulated trials. The NBF formula structure supports the use of standard thresholds across designs by placing all robustness metrics on a shared 0-to-1 scale. However, these bands remain provisional and warrant further validation in specific clinical domains. Finally, the observed correlation between nb and p-values is descriptive. Patterns may differ across datasets, particularly when distributions of study designs, sample sizes, or effect magnitudes differ.

Future work should focus on implementing nb_meta using extracted effect sizes from published meta-analyses, validating robustness thresholds in domain-specific contexts, and developing software tools to facilitate routine calculation and reporting of robustness metrics alongside traditional meta-analytic outputs. As robustness assessment becomes integrated into standard evidence synthesis workflows, the paucity of threshold-free, cross-design metrics in current practice will become increasingly apparent, and the value of complementing p-values with geometric distance measures will become more widely recognized.

Conclusion

Neutrality boundary robustness provides a simple, interpretable, threshold-free measure of distance from therapeutic neutrality. This study demonstrates that nb extends naturally to meta-analytic evidence through descriptive synthesis of trial-level nb distributions and the definition and worked example of pooled nb_meta based on meta-analytic effects and cross-study dispersion. Confidence intervals and p-values remain essential, but they do not, by themselves, provide a standardized, cross-trial 0-to-1 measure of how far results lie from the no-effect point. The NBF fills this gap by translating effect size and uncertainty into a common robustness scale.

Integrating robust metrics alongside p-values and confidence intervals yields a more complete picture of evidential strength. The Neutrality Boundary Framework, extended to meta-analysis, offers a threshold-free, design-agnostic approach to assessing evidence quality across individual trials and synthesized bodies of evidence, helping clinicians distinguish results that merely cross a significance threshold from those that are truly far from neutrality.

References

Haidich AB. Meta-analysis in medical research. Hippokratia. 2010;14(Suppl 1):29-37.
Meta-analysis in medical research
Liberati A, Altman DG, Tetzlaff J, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate healthcare interventions: explanation and elaboration. BMJ. 2009;339:b2700. doi:10.1136/bmj.b2700
PubMed | Crossref | Google Scholar
Andrade C. The P Value and Statistical Significance: Misunderstandings, Explanations, Challenges, and Alternatives. Indian J Psychol Med. 2019;41(3):210-215. doi:10.4103/IJPSYM.IJPSYM_193_19
PubMed | Crossref | Google Scholar
Hespanhol L, Vallio CS, Costa LM, Saragiotto BT. Understanding and interpreting confidence and credible intervals around effect estimates. Braz J Phys Ther. 2019;23(4):290-301. doi:10.1016/j.bjpt.2018.12.006
PubMed | Crossref | Google Scholar
Balshem H, Helfand M, Schünemann HJ, et al. GRADE guidelines: 3. Rating the quality of evidence. J Clin Epidemiol. 2011;64(4):401-406. doi:10.1016/j.jclinepi.2010.07.015
PubMed | Crossref | Google Scholar
Xing A, Xing X, Murad MH, Lin L. Evaluating the properties of the fragility index of meta-analyses. BMC Med Res Methodol. 2025;25(1):212. doi:10.1186/s12874-025-02648-5
PubMed | Crossref | Google Scholar
Feinstein AR. P-values and confidence intervals: two sides of the same unsatisfactory coin. J Clin Epidemiol. 1998;51(4):355-360. doi:10.1016/s0895-4356(97)00295-3
PubMed | Crossref | Google Scholar
IntHout J, Ioannidis JP, Rovers MM, Goeman JJ. Plea for routinely presenting prediction intervals in meta-analysis. BMJ Open. 2016;6(7):e010247. doi:10.1136/bmjopen-2015-010247
PubMed | Crossref | Google Scholar
Seehra J, Stonehouse-Smith D, Pandis N. Prediction intervals reporting in orthodontic meta-analyses. Eur J Orthod. 2021;43(5):596-600. doi:10.1093/ejo/cjab037
PubMed | Crossref | Google Scholar
Heston TF. The Neutrality Boundary Framework: Quantifying Statistical Robustness Geometrically. arXiv. 2025. doi:10.48550/arXiv.2511.00982
Crossref | Google Scholar
Heston TF. Fragility of the Fragility Index: When Calculation Becomes Impossible. SSRN. 2025. doi:10.2139/ssrn.5589130
Crossref | Google Scholar
Heston TF: Redefining the Risk Quotient: A Generalized Framework for Fragility Analysis Across Study Designs. SSRN. 2025. doi:10.2139/ssrn.5531578
Crossref | Google Scholar
Arham M, Dad A, Bin Abdul Malik MH, et al. Diagnostic accuracy of discovery NM 530c CZT SPECT for myocardial perfusion imaging in coronary artery disease: a systematic review and meta-analysis. Int J Cardiovasc Imaging. 2025;41(12):2283-2297. doi:10.1007/s10554-025-03475-x
PubMed | Crossref | Google Scholar
Lang TA, Altman DG. Basic statistical reporting for articles published in biomedical journals: the “Statistical Analyses and Methods in the Published Literature” or the SAMPL Guidelines. Int J Nurs Stud. 2015;52(1):5-9. doi:10.1016/j.ijnurstu.2014.09.006
PubMed | Crossref | Google Scholar
Han S, Kim YH, Ahn JM, et al. Feasibility of dynamic stress 201Tl/rest 99mTc-tetrofosmin single photon emission computed tomography for quantification of myocardial perfusion reserve in patients with stable coronary artery disease. Eur J Nucl Med Mol Imaging. 2018;45(12):2173-2180. doi:10.1007/s00259-018-4057-5
PubMed | Crossref | Google Scholar
Giubbini R, Bertoli M, Durmo R, et al. Comparison between N13NH3-PET and 99mTc-Tetrofosmin-CZT SPECT in the evaluation of absolute myocardial blood flow and flow reserve. J Nucl Cardiol. 2021;28(5):1906-1918. doi:10.1007/s12350-019-01939-x
PubMed | Crossref | Google Scholar
Nishiyama Y, Miyagawa M, Kawaguchi N, et al. Combined supine and prone myocardial perfusion single-photon emission computed tomography with a cadmium zinc telluride camera for detection of coronary artery disease. Circ J. 2014;78(5):1169-1175. doi:10.1253/circj.cj-13-1316
PubMed | Crossref | Google Scholar
Wang J, Li S, Chen W, Chen Y, Pang Z, Li J. Diagnostic efficiency of quantification of myocardial blood flow and coronary flow reserve with CZT dynamic SPECT imaging for patients with suspected coronary artery disease: a comparative study with traditional semi-quantitative evaluation. Cardiovasc Diagn Ther. 2021;11(1):56-67. doi:10.21037/cdt-20-728
PubMed | Crossref | Google Scholar
Shiraishi S, Tsuda N, Sakamoto F, et al. Clinical usefulness of quantification of myocardial blood flow and flow reserve using CZT-SPECT for detecting coronary artery disease in patients with normal stress perfusion imaging. J Cardiol. 2020;75(4):400-409. doi:10.1016/j.jjcc.2019.09.006
PubMed | Crossref | Google Scholar
Ben Bouallègue F, Roubille F, Lattuca B, et al. SPECT Myocardial Perfusion Reserve in Patients with Multivessel Coronary Disease: Correlation with Angiographic Findings and Invasive Fractional Flow Reserve Measurements. J Nucl Med. 2015;56(11):1712-1717. doi:10.2967/jnumed.114.143164
PubMed | Crossref | Google Scholar
Nkoulou R, Fuchs TA, Pazhenkottil AP, et al. Absolute Myocardial Blood Flow and Flow Reserve Assessed by Gated SPECT with Cadmium-Zinc-Telluride Detectors Using 99mTc-Tetrofosmin: Head-to-Head Comparison with 13N-Ammonia PET. J Nucl Med. 2016;57(12):1887-1892. doi:10.2967/jnumed.115.165498
PubMed | Crossref | Google Scholar
Shiraishi S, Sakamoto F, Tsuda N, et al. Prediction of left main or 3-vessel disease using myocardial perfusion reserve on dynamic thallium-201 single-photon emission computed tomography with a semiconductor gamma camera. Circ J. 2015;79(3):623-631. doi:10.1253/circj.CJ-14-0932
PubMed | Crossref | Google Scholar
Miyagawa M, Nishiyama Y, Uetani T, et al. Estimation of myocardial flow reserve utilizing an ultrafast cardiac SPECT: Comparison with coronary angiography, fractional flow reserve, and the SYNTAX score. Int J Cardiol. 2017;244:347-353. doi:10.1016/j.ijcard.2017.06.012
PubMed | Crossref | Google Scholar
Gimelli A, Bottai M, Giorgetti A, Genovesi D, Filidei E, Marzullo P. Evaluation of ischaemia in obese patients: feasibility and accuracy of a low-dose protocol with a cadmium-zinc telluride camera. Eur J Nucl Med Mol Imaging. 2012;39(8):1254-1261. doi:10.1007/s00259-012-2161-5
PubMed | Crossref | Google Scholar
Gimelli A, Bottai M, Genovesi D, Giorgetti A, Di Martino F, Marzullo P. High diagnostic accuracy of low-dose gated-SPECT with solid-state ultrafast detectors: preliminary clinical results. Eur J Nucl Med Mol Imaging. 2012;39(1):83-90. doi:10.1007/s00259-011-1918-6
PubMed | Crossref | Google Scholar
Schünemann HJ, Higgins JP, Vist GE, et al. Chapter 14: Completing ‘Summary of findings’ tables and grading the certainty of the evidence. In Cochrane Handbook for Systematic Reviews of Interventions version 6.5. Cochrane; 2024.
Chapter 14: Completing ‘Summary of findings’ tables and grading the certainty of the evidence

Acknowledgments

Not reported

Funding

No external funding was received.

Author Information

Thomas F. Heston
Department of Family Medicine, University of Washington, USA
Department of Medical Education and Clinical Sciences, Washington State University, USA
Email: tomhestonmd@gmail.com

Author Contribution

The author independently designed the study; collected, analyzed, and interpreted the data; drafted and critically revised the manuscript; approved the final version for publication; and accepts full responsibility for all aspects of the work.

Ethical Approval

Not applicable

Conflict of Interest Statement

The author declares no conflicts of interest.

Guarantor

None

DOI

https://doi.org/10.63096/medtigo3062417

Cite this Article

Heston TF. Neutrality Boundary Robustness for Meta-Analysis. medtigo J Med. 2026;4(1):e3062417. doi:10.63096/medtigo3062417 Crossref

Neutrality Boundary Robustness for Meta-Analysis

Abstract

Keywords

Introduction

Methodology

Results

Discussion

Conclusion

References

Acknowledgments

Funding

Author Information

Author Contribution

Ethical Approval

Conflict of Interest Statement

Guarantor

DOI

Cite this Article

Table of Contents

Quick links

Category

Neutrality Boundary Robustness for Meta-Analysis

Abstract

Table of Contents

Category

Keywords

Introduction

Methodology

Results

Discussion

Conclusion

References

Acknowledgments

Funding

Author Information

Author Contribution

Ethical Approval

Conflict of Interest Statement

Guarantor

DOI

Cite this Article

Manuscript Submission Form

Manuscript Submission Form

Manuscript Submission Form

Manuscript Submission Form

Manuscript Submission Form

Manuscript Submission Form