To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Incorporating patient preferences into health technology assessment represents a promising avenue for its enhancement. However, the complexity and diversity of methods for evaluating preferences, coupled with uncertainties regarding their impact on decision-making processes, present significant challenges to their effective integration. Consequently, the objective of this study was to evaluate the application of the MaxDiff analysis methodology for determining patient preferences, with a focus on pharmaceutical services as a case study.
Methods
The experimental design for scaling MaxDiff was developed using Sawtooth Lighthouse software. The design parameters included 12 pharmaceutical services. Descriptive statistics, hierarchical Bayesian analysis, latent class analysis, and logistic regression methods were employed to generate preference results.
Results
The most preferred services were identified as follows: the creation of a personalized list of safe medications to address patient needs, with a quantitative contribution of 0.5; and the provision of rapid tests (for influenza and Helicobacter) and stroke risk assessment (quantitative contribution of 0.31). In contrast, patients did not prioritize services related to contraception selection or weight control and weight loss program development. Descriptive statistics, hierarchical Bayesian analysis, and logistic regression methods sequentially identified the most preferred services. Latent class analysis revealed two distinct consumer segments: Segment 1 (39%) and Segment 2 (60%), which differed in their sociodemographic characteristics and preferred services.
Conclusions
The obtained results can be utilized as patient-based evidence to understand patients’ unmet needs in the assessment of technologies such as pharmaceutical services, taking into account the local context. Despite the simplicity of the MaxDiff analysis methodology, the resource intensity and the level of knowledge among specialists and patients regarding preference collection methods remain significant barriers to its implementation.
Over the years many researchers have sought to understand the decision-making process of CONITEC, the Brazilian health technology assessment (HTA) agency. The emergence of high-cost technologies, such as monoclonal antibodies (mAbs), has heightened the importance of delineating the decision-making criteria. This study aimed to elucidate the criteria employed by CONITEC for the incorporation of mAbs in Brazil.
Methods
All CONITEC reports published between 2019 and 2024 on mAbs were included. Descriptive statistics were utilized to summarize the data. The statistical significance between outcomes and covariates was assessed using the Kruskal-Wallis test for continuous variables and the chi-square test for categorical variables. Logistic regressions were produced to evaluate the impact of each covariable on the recommendations. Results with a p-value of less than 0.10 were considered statistically significant due to the small sample size. All analyses were conducted using R software.
Results
After cleaning the database, 53 reports were included, encompassing 37 mAbs evaluated for 45 indications. Sixteen submissions (30.1%) received a positive recommendation. Efficacy (p=0.009) and the preliminary decisions (p<0.001) were significantly associated with the final recommendation. Further analysis showed that an incremental cost-effectiveness ratio (ICER) below BRL150,000 (USD65,020) was associated with a higher chance of a positive recommendation (p=0.050). Logistic regression revealed that the logarithm of the ICER was significantly associated with recommendations (odds ratio 0.618; p=0.031), indicating that increasing the ICER by 2.7 times was associated with a 38.2 percent lower chance of listing.
Conclusions
Efficacy is important for decision-makers in Brazil. However, the results of economic analyses also influence the recommendation of mAbs. An ICER above BRL150,000 (USD65,020) per quality-adjusted life year significantly reduced the chances of a mAb being listed.
The healthcare and rehabilitation processes for individuals with low back pain impose significant costs on healthcare systems worldwide. Understanding the economic burden of this condition can help policymakers implement preventive strategies and promote a more rational allocation of health resources. The aim of this study was to investigate the costs of managing low back pain in Brazil between 2010 and 2019.
Methods
This study evaluated the costs of low back pain from the perspective of the Brazilian Unified Health System (SUS) in the outpatient setting. Data were collected from the Ambulatory Information System and analyzed by sex, age group, and health condition (low back pain). Ambulatory care data were presented as number of visits, total annual cost, and number of procedures performed. In addition, linear regression analysis using generalized linear models (gamma distribution with identity link) was performed to examine the association between total outpatient care costs and predictors such as age, gender, and race.
Results
Between 2010 and 2019, the SUS spent more than USD189 million to treat low back pain in adults. During this period, more than 16 million physical therapy sessions, 7,000 surgeries, and one million imaging studies were performed. These procedures accounted for approximately 85 percent of total spending on low back pain in Brazil. Despite the high number of physical therapy sessions, significant expenditures were still observed for surgical procedures. Regression analysis showed that men had higher costs per procedure than women.
Conclusions
The study revealed significant costs for low back pain in Brazil, with higher expenditures for men and those aged 34 to 63 years. Major expenditures were associated with surgery and physical therapy, whereas the prescription of imaging studies decreased, which is in line with recommendations. These findings highlight the need for targeted strategies to effectively manage resources for low back pain in the ambulatory care setting.
Diabetes is a condition that affects public health based on its increased incidence, prevalence, morbidity, and mortality. According to the latest Atlas of the International Diabetes Federation, it is estimated that there are 537 million adults with the condition; for Brazil, the estimate is 16 million. Investment in diabetes and its complications exceeds USD42 billion in Brazil.
Methods
This national study was conducted from 1 July to 22 August 2024. Interviews were conducted online with 1,843 Brazilians over 18 years of age with diabetes. The study’s objective was to identify barriers to diabetes treatment in Brazil after the incorporation of health technologies in the Unified Health System (SUS) aimed at diabetes. Data were processed according to the profile of reported diabetes diagnosis in the National Health Survey 2019, conducted by the Brazilian Institute of Geography and Statistics.
Results
The preliminary study indicated that 56 percent of those diagnosed with diabetes are over 60 years of age, 58 percent have elementary education, 54 percent have a monthly family income of up to two minimum wages, 75 percent have type 2 diabetes, 67 percent say they have hypertension, and 63 percent have high cholesterol. Obesity was reported by 39 percent, 82 percent use oral medication, 67 percent undergo tests through the SUS, and 84 percent obtain free medication through the SUS. Of those interviewed, 60 percent cite the availability of medication as the main factor in monitoring the condition, followed by attention from doctors (53%).
Conclusions
The aging population poses challenges for diabetes control in Brazil and will further strain the public health system. Current barriers, such as bottlenecks in care and lack of medicines and doctors, are likely to worsen. Vulnerable groups with lower levels of education and income are currently the most affected. With the increasing age of the population, the system will spend more resources and may have difficulty sustaining itself in a few years.
Skin-related neglected tropical diseases (skin NTDs) are very prevalent in endemic areas. Resources to manage them are very scarce. The World Health Organization’s Skin NTDs app is designed to help frontline health workers in identifying skin NTDs (n=13) and common skin conditions (n=24). A beta version including artificial intelligence (AI) was developed, and its accuracy and usability was assessed in real life conditions.
Methods
The Skin NTDs app usability and user experience was assessed in frontline healthcare workers (n=38) in Kenya. Participants answered the user Mobile App Rating Scale (uMARS) questionnaire. Focus group discussions (n=4) and semi-structured interviews (n=15) were used to get an in-depth understanding of the user experience. To assess accuracy of the AI algorithm, 40 participants from five counties in Kenya used the app for five months, uploading photographs of the skin lesion (n=605) to an external platform. AI algorithm accuracy was calculated based on the gold standard of consensus diagnosis reached by three independent dermatologists.
Results
The Skin NTDs app received high scores on the uMARS questionnaire (mean app quality 3.82/5 and perceived impact 4.1/5; n=38). Focus group discussions and interview responses aligned with the uMARS findings, reinforcing the positive assessment of the app. It helped to empower professionals, increased their knowledge about skin diseases, and improved their communications skills with patients. The app was time saving and reduced referral of patients to specialists. Some features to be improved were identified. Overall accuracy of the app was found to be 80 percent in diseases where AI has been trained with a higher number of photos of endemic skin conditions.
Conclusions
The Skin NTDs app showed commendable quality and holds potential to be scaled up and implemented at the national level in Kenya and globally. It performs well as a clinical decision support system to help frontline healthcare workers identify potential skin diseases that patients suffer from and to reduce the number of referrals to dermatologists in contexts where there is a lack of specialized professionals.
Chile has been conducting various efforts toward reforming the healthcare system, yet significant discrepancies persist in the equitable access to and utilization of healthcare resources. The aim of this scoping review was to map the existing literature on hearing health care and treatments for hearing loss (HL) in Chile to identify local determinants that may contribute to stratification of access to hearing health care.
Methods
The Joanna Briggs Institute guidance for scoping reviews was followed. The PCC criteria (Population, Concept, Context) was used to guide development of the search strategy. Searches were conducted in MEDLINE (PubMed), the Cochrane Library, and Science Direct databases and supplemented by a manual search. Searches were limited to publications from 2000 to June 2023, with no restrictions on language or publication type. Two independent reviewers screened all retrieved references, assessed the eligibility following the PCC criteria, and charted data of the eligible publications. Disagreements were solved through discussion with a third reviewer. A structured narrative synthesis of findings was conducted.
Results
The search yielded 506 unique records for screening, of which 104 full-text publications were assessed and 39 were included in the review. The evidence revealed that the treatment of HL is publicly financed for children younger than four years (any type of HL, degree moderate or higher), the elderly aged 65 years or older (bilateral HL requiring a hearing aid) and people at least four years of age diagnosed with total deafness (cochlear implant candidates). However, for individuals aged between four and 64 years who have HL but are not totally deaf, there is not a clear rehabilitation pathway or dedicated public reimbursement.
Conclusions
Age and degree of HL might be the most prominent determinants of access to hearing health care in Chile. Diagnosis and treatment of people aged four to 64 years with moderate to severe HL might incur out-of-pocket expenses or even have no access to care for HL. These findings highlight the need for public health policies to promote equitable access to hearing health care in Chile.
Rare diseases (RDs) present unique challenges in health technology assessment (HTA) due to small, heterogeneous populations and limited clinical data, which complicate economic modeling and delay access to innovative interventions. This survey, conducted by the HTAi Rare Diseases Interest Group, explored key challenges in economic modeling for RDs, the application of solutions, and the use of evidence assessment frameworks in HTA practice.
Methods
A mixed-method survey was developed with multiple-choice, Likert-scale, and open-text questions using the SurveyMonkey platform. Responses were collected from members of HTAi and the International Network of Agencies for Health Technology Assessment at conference sessions. Additional respondents were recruited via professional networks and direct emails. Options exceeding 50 percent of responses and rate weighted averages guided the interpretation and identification of areas of agreement.
Results
In total, 36 individuals across all continents, mostly from consultancy and HTA agencies, responded to the survey. Key challenges in economic modeling included insufficient quality of clinical trials (duration and sample size), scarcity of robust data, and lack of health-related quality of life metrics. The most common solutions included using proxy diseases, pooled data, and qualitative research. Broader impacts were integrated through societal perspectives, decision modifiers, and deliberative processes. GRADE was the most commonly cited explicit evidence assessment framework. In some cases, implicit approaches dominated HTA practice where challenges such as single-arm trials and reliance on real-world evidence prevailed, though patient-reported outcomes had gained some acceptance.
Conclusions
Scarcity and quality of clinical data remain significant barriers to economic modeling for RDs. Solutions such as qualitative methods and pooling data show promise. The broader impacts of RDs are increasingly being considered, but further research is essential to refine methods, embrace non-traditional data, and develop appropriate frameworks to assess evidence for RDs in HTA.
Chronic lymphocytic leukemia (CLL) is a malignant hematologic disorder that affects older adults. CLL is the most common leukemia in adults in the Western world. The genomic landscape of CLL is heterogeneous. The aim of this report was to identify the clinically relevant molecular alterations in CLL and define the role of targeted next-generation sequencing (NGS) in routine care.
Methods
The evaluation included an analysis of systematic reviews, meta-analyses, clinical guidelines, and relevant molecular alterations using OncoKB™ and TOPOGRAPH classifications. Approvals from the French National Authority for Health Transparency Committee or compassionate use decisions from the French National Agency for Medicines and Health Products Safety were also considered.
Results
Targeted NGS in CLL detected molecular alterations to determine prognosis and guide treatment decisions with a higher sensitivity than Sanger sequencing. The recommended analyses include:
• TP53 and IGHV mutations before first-line treatment;
• TP53 mutation before treatment modification in relapse cases;
• TP53, BTK, and PLCG2 mutations after treatment with BTK inhibitors;
• TP53 and BCL2 mutations after treatment with BCL2 inhibitors; or
• TP53, BTK, PLCG2, and BCL2 mutations after treatment with both BTK and BCL2 inhibitors.
Conclusions
Targeted NGS is essential for managing CLL. The gene panel will be updated dynamically based on new scientific evidence and regulatory approvals.
Internationally, there is a growing drive to integrate patient preference information (PPI) into health technology assessment (HTA) decision-making. However, it is unclear how PPI is used in real-world contexts and the extent to which it can reshape and influence HTA decisions. Understanding how PPI can be meaningfully used in HTA is fundamental for supporting its practical application and increasing trust in its use.
Methods
The aim was to enhance understanding of how HTA committees interpret and incorporate PPI into decision-making. Mock deliberation workshops were conducted by members of the HTAi Patient Preference Project Subcommittee with experts from four HTA agencies. For the workshops, a mock evidence package was developed that included PPI evidence alongside key clinical and economic evidence to simulate decision-making processes. The mock deliberations were analyzed using thematic analysis.
Results
The mock deliberation workshops (25 participants) highlighted both opportunities and challenges in integrating PPI into HTA processes. While PPI was acknowledged as valuable evidence, its influence on decision-making in the scenarios varied from being cited by a few participants as informing the decision to most participants indicating the evidence was not persuasive. Participants pointed out the need for more structured guidance on how to incorporate PPI alongside clinical and economic evidence. The findings highlighted the importance of standardizing the path to integration, as well as the content and format of PPI, to support consistent interpretation and application in HTA.
Conclusions
By examining how PPI is weighed alongside clinical and economic evidence, the project offered valuable insights for stakeholders aiming to generate more pertinent PPI studies for incorporation into the decision-making process. The key takeaways for HTA agencies included strategies for effectively balancing PPI with clinical and economic evidence, highlighting a shift toward incorporating more diverse and inclusive forms of evidence.
Oncology economic modeling often struggles with estimating clinical benefits like overall survival (OS) and progression-free survival (PFS), especially with short-duration trials involving extrapolating survival curves. This study analyzed methods used by the Brazilian National Committee for Health Technology Incorporation (CONITEC) in health technology assessment (HTA) to adapt economic models to evolving healthcare needs.
Methods
A descriptive analysis was conducted on the economic evaluations from CONITEC’s recommendation reports on oncology drugs published between January 2019 and December 2023. Three independent reviewers extracted data on outcomes, survival curve distributions, validation methods, and time horizons. The study focused on identifying patterns in modeling practices and explored whether these methods integrated with the expanding scope of HTA, including the use of non-traditional data sources and adaptive techniques.
Results
Thirty CONITEC reports were analyzed, evaluating 32 oncology treatments across 45 indications. The most common economic models were partitioned survival analysis (PartSA) (58%) and Markov models (33%). The Weibull distribution was the most frequent in OS extrapolations (50% for PartSA; 84% for Markov). PFS extrapolations showed a preference for exponential distribution in PartSA (35%) and Weibull in Markov models (54%). Methods for model validation were reported in 69 percent of Markov models and 100 percent of PartSA models. Extrapolation horizons ranged from 10 years (75% of models) to 30 years (31%).
Conclusions
The analysis highlighted the reliance on traditional models, such as PartSA and Markov, for modeling long-term cancer progression. However, these models may not fully capture treatment complexity. Emerging methods and the integration of diverse data sources, such as real-world data, are vital for improving HTA relevance and addressing the evolving demands of healthcare systems and next-generation evidence.
Hereditary breast and ovarian cancer, linked to BRCA gene mutations, poses significant health risks. Australian breast and ovarian cancer mortality rates emphasize the public health importance of effective screening. This study evaluated the cost effectiveness of population-wide BRCA screening and its equity implications, focusing on reducing health disparities across socioeconomic groups through early detection and prevention strategies.
Methods
A cost-effectiveness analysis was conducted comparing population-wide BRCA screening to family-history-based testing for Australian women aged 30 to 50 years. A decision-analytic model included a decision tree and two Markov models simulating outcomes over a lifetime horizon. Transition probabilities captured risk-reducing interventions. Outcomes included quality-adjusted life years (QALYs) and costs in 2024 AUD, discounted at five percent. Socioeconomic quintiles were defined using Socio-Economic Indexes for Areas, and distributional impacts were assessed with net health benefits and equally distributed equivalent health (EDEH) using the Atkinson index. Model inputs were derived from census data and empirical studies. Opportunity costs used UK data as a proxy.
Results
Screening at the age of 30 years was cost saving (incremental cost-effectiveness ratio [ICER] −AUD8,331 [−USD5,482] per QALY), while screening at ages 35 and 40 years was cost effective (ICERs: AUD28,231 [USD18,577] and AUD40,086 [USD26,384] per QALY, respectively). Screening at ages 45 and 50 years exceeded the threshold (ICERs: AUD78,666 [USD51,778] and AUD155,937 [USD102,640] per QALY). Distributional analysis showed screening at the age of 30 years increased average quality-adjusted life expectancy and EDEH, reducing health inequality between socioeconomic groups. In contrast, screening at older ages resulted in diminished benefits and exacerbated inequalities, highlighting the importance of early intervention in achieving both cost-effectiveness and equity goals.
Conclusions
Population-wide BRCA screening was cost effective for Australian women at younger ages, particularly at the age of 30 years, and reduced health inequalities. Policymakers should prioritize early screening to address hereditary breast and ovarian cancer, promote health equity, and alleviate disease burden. Further strategies to enhance access among disadvantaged groups could amplify these benefits, advancing a more equitable healthcare system.
Health technology assessment (HTA) often involves complex cost-effectiveness analyses (CEA) that can be challenging for non-experts. This study demonstrated the use of artificial-intelligence (AI)-powered prompts to simplify CEA processes. By integrating tools for generating efficiency frontiers and net-benefit analyses, this approach enables stakeholders, such as non-modeling HTA specialists or decision-makers, to understand therapeutic scenarios and make informed adjustments to analyses.
Methods
The prompts were created using ChatGPT-4o and tested for usability and reproducibility in ChatGPT-4o and the free version 3.5. The prompts were designed to automate key steps in economic analysis, including calculating net monetary and health benefits, performing cost-effectiveness analyses applying dominance and extended dominance concepts, and generating the efficient frontier plot. Ten HTA experts with no modeling experience evaluated the prompts using predefined scenarios with hypothetical datasets. The results of both versions of ChatGPT were compared to the expected results. The usability and accuracy of the prompts were assessed during the evaluation.
Results
ChatGPT-4o achieved 100 percent accuracy in calculating net health and monetary benefits (NHB and NMB) and correctly applied the concepts of dominance and extended dominance in 83 percent of cases. It calculated the incremental cost-effectiveness ratio (ICER) for non-dominated therapies in 50 percent of situations and successfully generated the efficiency frontier graphic in half of cases. In contrast, ChatGPT 3.5 achieved 50 percent accuracy for NHB and NMB calculations, only 17 percent for applying dominance and extended dominance concepts, and failed to calculate ICERs or generate the efficiency frontier plot as expected.
Conclusions
AI-powered prompts simplified cost-effectiveness analyses by enabling non-technical stakeholders to create and adjust efficiency frontiers and benefit analyses. ChatGPT-4o demonstrated improved reliability. Limitations in ChatGPT 3.5, particularly with ICER calculation and graphical outputs, indicated the need to adapt prompts to specific AI tools. Future developments may increase the robustness and usability of these types of prompts across platforms.
Generative artificial intelligence (AI) holds promise in aiding development of health economic models. Our objective was to explore the feasibility of using generative AI to replicate health economic models based on previously published models.
Methods
We replicated a Markov model of ulcerative colitis described in the literature using a two-step approach. First, we used Python for large language model interactions and utilized ValueGen.AI, a GPT-4-based platform with multi-agent pipelines (CrewAI, LangChain, and OpenAI libraries), to extract model structure and parameters from the source. These parameters were implemented in R’s heemod package to construct and run the Markov model. Next, we repeated the experiment using a more detailed technical report of the same model. We evaluated generative AI’s performance by comparing its conceptualization and parameterization with the original sources.
Results
Using the publication, the generative AI platform effectively extracted costs and quality-of-life inputs linked to health states. However, for health states and transition probabilities, initial attempts were less successful due to limited descriptions in the text, resulting in misinterpretations and conflicting health states. Performance improved considerably when we used the detailed technical report, which offered clearer and more structured information. Generative AI could not successfully extract all transition probability calculations for the defined model cycle length based on the formulas provided in the documents.
Conclusions
Generative AI holds significant potential for replicating previously published health economic models. However, challenges remain in capturing detailed model parameters, particularly when description of modeling approach lacks clarity and transparency. Improving standardized reporting practices within the health economics and outcomes research field is needed to enable generative AI to better support stakeholders during HTA processes.
Liquid biopsy (LB) is a minimally invasive technique that enables analysis of circulating tumor DNA (ctDNA) by next generation sequencing (NGS) and has several potential applications in oncology. The main objective was to describe the diagnostic accuracy of LB over tissue biopsy for screening, diagnosis, and tumor mutation profiling in breast, lung, and colorectal cancer, including at different cancer stages.
Methods
Initially, a literature overview to select systematic reviews with meta-analyses or health technology assessment reports was conducted. When no data were obtained, a systematic review was conducted to select individual studies. For both procedures, a systematic literature search, two-step study selection, quality appraisal and a narrative synthesis were conducted.
Results
The only study in the screening setting involved colorectal cancer (sensitivity 14%, specificity 90%). Regarding initial cancer diagnosis, LB in breast cancer had a sensitivity ranging from 31 to 74 percent and a specificity of 66 to 92 percent, with inconsistent results by stage. For lung cancer, LB had a sensitivity of 22 to 86 percent and a specificity of 42 to 91 percent, with improved utility in advanced stages. Colorectal cancer studies using multicomponent tests showed sensitivities of 83 to 93 percent and specificities of more than 95%. LB ctDNA analysis in molecular profiling of advanced cancer had high accuracy in breast (PIK3CA), lung (epidermal growth factor receptor mutations), and colorectal cancer (NRAS/KRAS), with values for sensitivity of up to 83 percent and for specificity of around 90 percent.
Conclusions
No results were available for screening in breast and lung cancer. Limited evidence for colorectal cancer showed low sensitivity but high specificity for LB over tissue biopsy. Initial cancer diagnosis results were heterogeneous, with inconsistencies in breast and colorectal stages but improvements in advanced lung cancer. Tumor profiling provided the strongest evidence, with high diagnostic utility relative to tissue biopsy across all three cancer types.
Hospital malnutrition is caused by metabolic imbalance and is associated with increased rates of morbidity and mortality and unfavorable prognosis for hospitalized patients. In 2001, severe malnutrition was detected in 12.5 percent of hospitalized patients in Brazil. The aim of this study was to evaluate evidence on the efficacy and safety of oral nutritional supplements for malnutrition in hospitalized patients.
Methods
Search strategies developed for the PubMed, Embase, LILACS and Cochrane Library databases were used to search and select the available evidence. The included evidence compared oral nutritional supplementation with standard hospital care or placebo. A total of 47 clinical trials were selected. For mortality, a meta-analysis was performed combining the effect measures of 40 clinical trials. R software was used with the meta package installed. The revised Cochrane risk-of-bias tool was used to assess the risk of bias.
Results
For the primary outcomes of mortality (relative risk [RR] 0.94, 95% CI: 0.77, 1.15), length of hospital stay, and number of hospital readmissions (RR 0.97, 95% CI: 0.74, 1.28), oral supplementation offered no benefit, compared with the control group. Regarding infectious and non-infectious complications, there was a significant reduction in the number of patients with complications in the supplemented group, compared with the control group (RR 0.71, 95% CI: 0.59, 0.86). Regarding the risk-of-bias assessment, most of the studies had a high risk of bias and limitations in almost all areas.
Conclusions
The designs of the selected studies were quite heterogeneous in terms of follow-up time, population, and sampling, among others, and several doubts were raised about the methodology used in the clinical trials. These aspects were reflected in the low quality and high risk of bias of the primary studies, which proved to be an important limitation.
Generative artificial intelligence (AI) is transforming the generation of real-world evidence in health care. This study compares chemotherapy recommendations for women with node-positive breast cancer, as provided by a Delphi panel of experts, with those generated by ChatGPT and Copilot. The objective was to assess concordance and discrepancies between AI-generated and expert-driven recommendations.
Methods
A Delphi panel of 10 independent breast cancer experts, blinded to each other’s responses, participated in an online survey to evaluate chemotherapy recommendations for early stage, node-positive breast cancer, stratified by menopausal status and Oncotype DX Breast Recurrence Score® (RS: <14, 14 to 25, >25). ChatGPT and Copilot, using automatic prompt engineering, addressed the same scenarios as agent-based oncologists. Expert recommendations were summarized as arithmetic means, while AI responses were analyzed for concordance. A one-sample t-test compared mean estimates between the Delphi panel and each AI tool.
Results
Chemotherapy recommendation rates for premenopausal women by the Delphi panel, ChatGPT, and Copilot were: RS less than 14 (82.2, 2.5, 10); RS 14 to 25 (96.7, 22.5, 30); RS greater than 25 (100, 80, 60). For postmenopausal women, these were: RS less than 14 (3.3, 1.5, 15); RS 14 to 25 (17.8, 17.5, 35); RS greater than 25 (94.4, 72.5, 65). Significant differences were observed, especially in lower RS ranges. Differences in means between the Delphi panel and AI for postmenopausal with a RS greater than 25 and premenopausal with a RS less than or equal to 25 were statistically significant (p<0.05).
Conclusions
The recommendations from AI demonstrated significant divergence from the “gold standard” panel, especially for premenopausal patients. The observed deviations from Brazilian experts’ recommendations likely stem from differences in local practices, compared with international published data, which may have influenced AI outputs. These findings emphasize the continued importance of human panels to account for regional variations in clinical decision-making.
HIV-1 compromises the immune system, leading to immunodeficiency. Standard treatment involves the use of antiretroviral therapies; however, the development of resistance is common, highlighting the need for alternative therapeutic options. This study evaluated the therapeutic value and cost effectiveness of fostemsavir in combination with optimized background therapy (OBT), compared with OBT alone for patients living with multidrug-resistant HIV-1.
Methods
A cost-effectiveness analysis was conducted to estimate the incremental cost-effectiveness ratio (ICER) of fostemsavir combined with OBT for people living with HIV-1 resistant to at least four classes of antiretroviral therapies, compared with OBT alone. The analysis was performed from the perspective of the Brazilian Unified Health System (SUS), considering a one-year treatment horizon. For OBT, it was assumed that the regimen could include the simultaneous use of up to six medications. The efficiency frontier methodology was applied to determine a proportional price based on the cost and therapeutic response benefit.
Results
Fostemsavir demonstrated an incremental effectiveness of 54 percent in therapeutic response rate (viral load reduction). Over a one-year horizon, the cost-effectiveness analysis revealed a cost of BRL163,402.21 (USD27,233.70) for fostemsavir combined with OBT, compared with BRL23,271.60 (USD3,878.60) for OBT alone. The incremental cost of using fostemsavir was BRL260,127.37 (USD43,354.56) per therapeutic response. Based on the efficiency frontier approach, the incremental cost of fostemsavir should not exceed BRL19,819.83 (USD3,303.30), calculated based on the amount currently spent by the SUS plus the 54 percent additional benefit.
Conclusions
The use of fostemsavir combined with OBT demonstrates a superior virologic response rate, compared with OBT. Efficiency frontier analysis suggested a maximum justified cost, providing a framework for pricing decisions. This study highlighted the importance of balancing innovation with cost effectiveness in healthcare decision-making.
Generic medicines form the backbone of affordable health care in low- and middle-income countries (LMICs), constituting a significant portion of pharmaceutical consumption. However, issues such as fragmented pricing policies and insufficient regulatory frameworks hinder equitable access and affordability. This study evaluated pricing models that address these challenges by integrating best practices, including tiered pricing mechanisms and enhanced transparency. The aim was to guide policymakers toward sustainable solutions that improve medicine affordability, equity, and market resilience.
Methods
A comparative analysis was conducted across 20 countries, chosen for their diverse healthcare profiles and potential to be best practice examples for LMIC contexts. Key elements analyzed included pricing mechanisms (e.g., tiered pricing, external reference pricing, internal reference pricing), regulatory frameworks, and incentives for local production. Data were gathered from government reports, healthcare provider feedback, and industry insights. A policy evaluation framework was applied to assess each model’s effectiveness in addressing affordability, accessibility, and equity. The analysis prioritized generalizability to LMICs while accounting for contextual adaptations such as regional economic challenges and varying levels of regulatory capacity.
Supported by TUBITAK 2224-A Program for participation in HTAi 2025; it had no role in the study conduct.
Results
The findings revealed that tiered pricing structures significantly enhanced affordability by incentivizing market competition and reducing prices as generics enter. Transparent regulatory practices, including centralized databases and consistent price reviews, mitigated regional price disparities and increased trust among stakeholders. Countries fostering domestic production reduced import dependencies, stabilizing supply chains and ensuring medicine availability. These strategies, when applied collectively, have great potential to improve equitable access to generics in resource-constrained settings. The results highlighted the importance of adapting global best practices to the economic and healthcare realities of LMICs, emphasizing sustainability and resilience in pharmaceutical markets.
Conclusions
Integrating tiered pricing frameworks, regulatory transparency, and local production incentives provided a pragmatic pathway to improve generic medicine access in LMICs. These approaches align with global best practices and address affordability and equity challenges. Policymakers should prioritize such strategies while leveraging stakeholder collaboration. Future research should examine dynamic pricing models and their long-term impacts on market stability and healthcare outcomes.
Impaired social functioning is commonly observed in youth at clinical high risk (CHR) for psychosis. Interpersonal synchrony, defined as the temporal alignment of movement between interacting partners, is a key component of successful social interactions. This study aimed to investigate interpersonal head synchrony in naturalistic virtual settings among CHR individuals using automated video analysis tools.
Methods
We analyzed short video recordings from virtual clinical interviews involving 116 participants including 50 CHR participants, 36 individuals with sub-threshold positive symptoms (SUB), and 30 healthy controls (HC). Vertical head movement time series were extracted using an open-access video-based head-tracking tool. Interpersonal head synchrony was computed using Windowed Cross-Correlation to assess group differences and associations with clinical symptoms and functioning.
Results
CHR participants showed significantly reduced strength of synchrony compared to HC (β = −0.05, 95% CI [−0.09, −0.02], p = .004), although 14% of variance in strength of synchrony was attributable to assessor identity. No significant group differences were found for delay of synchrony. Within the CHR group, delay of synchrony was positively associated with social anhedonia (r = 0.29). Strength of synchrony correlated with better social (r = 0.33) and role (r = 0.28) functioning.
Conclusion
Our findings suggest that impaired interpersonal head synchrony is already present in the psychosis-risk state and relates to negative symptoms and social and role functioning. These findings support the utility of nonverbal synchrony as a potential biomarker and demonstrate the feasibility of automated tools and virtual assessments to study social processes in at-risk populations.