Introduction
Stroke is a leading cause of death and disability worldwide, with its burden rising over the past three decades [Reference Feigin1, Reference Saini, Guada and Yavagal2]. Globally, ischaemic stroke accounts for approximately 80–85% of all strokes, while intracerebral haemorrhage (ICH) comprises most of the remainder [Reference Feigin1, Reference Saini, Guada and Yavagal2]. The burden of stroke varies geographically, being highest in middle- and low-income countries [Reference Feigin1, Reference Saini, Guada and Yavagal2]. While traditional risk factors such as hypertension, diabetes, and dyslipidaemia account for a significant proportion of stroke cases, they do not fully explain the observed variation in stroke incidence [Reference Feigin1, Reference Saini, Guada and Yavagal2]. Emerging evidence suggests that various infections, including both acute and chronic conditions like herpes zoster and hepatitis C virus (HCV), may also play a role in stroke pathogenesis [Reference Yahav3, Reference Clarke4]. However, the impact of hepatitis B virus (HBV) on stroke risk is yet to be determined [Reference Hamilton5].
HBV is a hepatotropic virus transmitted via blood and bodily fluids, affecting an estimated 3.3% of the global population [6]. Chronic infection develops in about 90% of infants infected perinatally but only 5%–10% of adults [6]. Like stroke, HBV is more prevalent in low-income countries, mainly due to low vaccination coverage and perinatal transmission [6]. Chronic HBV is a leading cause of liver cirrhosis and hepatocellular carcinoma, the most prevalent liver cancer [6].
A biological link between chronic HBV and stroke may involve an interplay between chronic systemic inflammation and liver damage. Persistent inflammation may promote atherosclerosis and endothelial dysfunction, increasing the risk of ischaemic events [Reference Wu7], while also weakening vessel walls and predisposing to haemorrhage [Reference Wang and Tsai8]. Liver dysfunction can further contribute to coagulopathy and thrombocytopenia, increasing haemorrhagic risk [Reference Chen9], but may also lower lipid levels, potentially reducing ischaemic risk [Reference Feigin1].
Evidence regarding the association between HBV and stroke is limited, inconsistent, and confined to endemic regions in Asia. A large cohort study from China reported no association between HBV and overall stroke or ischaemic stroke, but noted an increased risk for ICH [Reference Hamilton5]. Conversely, two other studies from Korea [Reference Sung10] and Taiwan [Reference Tseng11] found a decreased risk for ischaemic stroke among HBV-infected patients. Variations in the adjustment for potential confounders, methods of HBV exposure ascertainment, population characteristics, and the presence of liver dysfunction may explain these heterogeneous results. To our knowledge, no study has investigated the association between the hepatitis D virus (HDV), which co-infects individuals with HBV and is considered the most severe form of viral hepatitis [Reference Negro and Lok12].
Given these uncertainties, we aimed to assess the relationship between chronic HBV infection and the risk of stroke, as well as its main subtypes, leveraging the comprehensive and long-standing database of Clalit Health Services (CHS).
Materials and methods
Source of data
This retrospective cohort study is based on data from the CHS database, the largest health maintenance organization (HMO) in Israel. CHS provides inclusive health care for more than half of the Israeli population (~4.9 million). Health care coverage in Israel is mandatory according to the National Health Insurance Law (1995) and is provided by four groups akin to not-for-profit HMOs, which are charged with providing a broad package of benefits stipulated by the government. The four HMOs are both health-care insurers and providers, thus financing and supplying health services. Membership in a specific HMO is voluntary, and members can freely switch to another HMO. All members of the different HMOs have a similar health insurance plan and similar access to health services, including low medication co-payments. CHS maintains a database that receives data from multiple sources, including records of primary care physicians, community specialty clinics, hospitalizations, laboratories, and pharmacies. A registry of chronic disease diagnoses is compiled from these data sources. Diagnoses are captured in the registry by diagnosis-specific algorithms, employing International Classification of Diseases, Ninth Revision (ICD-9) code reading, laboratory test results, and disease-specific drug usage. A record is kept of the data sources and dates used to establish the diagnosis, with the earliest recorded date, from any source, considered to be the defining date of diagnosis. Designed for the purposes of administrative and clinical management, the database is available for clinical studies. Relevant high-quality studies have been conducted based on data from the CHS database [Reference Saliba13, Reference Saliba14].
Study population and design
The population of this retrospective cohort study included patients who tested positive for HBsAg (hepatitis B surface antigen), which constitutes the exposed group, and a propensity score-matched comparative group who tested negative for HBsAg, which constitutes the unexposed group. To select the study population, we first identified all adults 20 years or older from the CHS database who had undergone an HBsAg test with valid results between 1 January 2005 and 31 December 2023. Individuals with a prior positive HBsAg test were excluded. For the exposed group, the index date was defined as the date of the first documented HBsAg-positive test. For the unexposed group, the index date was defined as the date of the first documented HBsAg-negative test among individuals with only negative test results.
From the eligible HBsAg-negative subjects, we randomly selected a subset of 500 000 subjects for computational efficiency. We used logistic regression to calculate the predicted probability (propensity score) of being positive for HBsAg for both this subset and the exposed subjects using various factors, including demographic variables, baseline comorbidities, medication use, and medical services use. Patients in the positive HBsAg group were propensity score-matched to patients in the negative HBsAg group in a 1:4 ratio using a greedy matching algorithm with a maximum acceptable difference of 0.001 in the propensity score between matched groups (calliper width of 0.001).
The identified HBV patients (exposed group) and their matched comparative non-HBV controls (unexposed group) were followed from the index date until the first occurrence of study outcome (ICH or ischaemic stroke), death, or end of follow-up (31 December 2024), whichever came first.
Definition of outcome and exposure
The study outcomes included overall stroke, defined as a composite of ischaemic stroke and ICH, as well as each of these components examined separately. Ischaemic stroke was defined as a primary discharge diagnosis with ICD-9 codes 433.x1, 434.x1, and 436. While these codes reliably ascertain acute ischaemic stroke events as a whole [Reference McCormick15], they do not permit specific subtyping (e.g., lacunar, large-artery atherosclerosis, or cardioembolic), which requires additional neuroimaging and clinical information [Reference Mendelson and Prabhakaran16].
ICH was defined as a primary discharge diagnosis with ICD-9 code 431. Previous research by our group on the CHS database revealed a positive predictive value of 86.4% for the diagnosis of ICH using ICD-9 code 431 [Reference Saliba13].
Chronic HBV infection was defined as having at least one positive serological HBsAg test result, obtained from either hospital or community settings. Clinical diagnosis of chronicity requires HBsAg positivity for at least 6 months. However, HBsAg positivity alone is a recognized pragmatic approach to define chronic HBV infection in population-based research [Reference Hamilton5, Reference Sung10]. This approach reflects the low prevalence of acute HBV infection among adults in the general population, particularly in regions with low HBV incidence, such as Israel, where most HBsAg-positive adults are chronic carriers rather than acutely infected [6, Reference Rosner17].
Study variables
For each patient, we extracted baseline variables from the CHS computerized database. These included sociodemographic information, such as age, sex, population sector (Jewish or Arab), socioeconomic status (SES) based on the SES scores of the clinic neighbourhood as defined by the Israeli Central Bureau of Statistics, and district in the CHS. SES and district had missing values (7.9% and 0.3% respectively); hence, these variables were treated as categorical variables that include a category of missing values. In addition, we gathered data on cardiovascular risk factors and comorbidities, including: smoking status, hypertension, diabetes mellitus, obesity, hyperlipidaemia, family history of cardiovascular disease, alcohol abuse, drug abuse, cardiac valvular disease, intermittent peripheral vascular disease (PVD), carotid stenosis, aortic aneurysm, atrial fibrillation, ischaemic heart disease, congestive heart failure, prior stroke, chronic renal disease, chronic obstructive lung disease (COPD), and malignancy. We also included the following medications: antiplatelets, anticoagulants, and statins. Furthermore, we collected data on health behaviour markers and medical service utilization, including performance of faecal occult blood test and influenza vaccinations within 2 years before cohort entry, pneumococcal vaccinations within 5 years before cohort entry, hospital admission within the year before cohort entry, and the number of visits to a primary care physician within the year before cohort entry. We also retrieved baseline alanine aminotransferase (ALT) levels and the status of hepatitis D (HDV) virus coinfection, which was defined by the documentation of hepatitis D antibodies or RNA PCR serological tests.
Statistical methods
All statistical analyses were performed using IBM SPSS Statistics, version 28.0 (IBM Corp., Armonk, NY, USA) and SAS software, version 9.3 (SAS Institute Inc., Cary, NC, USA). Descriptive statistics were used to summarize study variables according to HBV exposure status. Continuous variables with normal distributions are presented as means ± standard deviation (SD), and categorical variables are presented as frequencies and percentages. To evaluate the balance of baseline characteristics between the HBV and non-HBV groups in both the full cohort and the propensity score-matched sample, we used the standardized mean difference (SMD). An SMD of 0.1 or less indicates a negligible difference in the measured variables between groups [Reference Austin18]. To assess the association between HBV and the study outcomes, we used a Cox proportional hazards regression model stratified by the matching indicator. Results are reported as hazard ratios (HRs) with 95% confidence intervals (CIs). Potential effect modification by sex and dichotomized age (at the median) was evaluated by including interaction terms in the Cox regression models.
We performed three sensitivity analyses; First, to estimate the association with the first event of stroke, we restricted the analysis to subjects without a prior stroke diagnosis. Second, we repeated the analysis after excluding participants who had ever been diagnosed with HCV. Additionally, we limited the analysis to confirmed chronic HBV cases, defined as at least two positive HBsAg tests obtained ≥6 months apart.
We also evaluated the association between HBV infection and study outcomes in two specific contexts: in individuals coinfected with HDV, and in those with HBV infection accompanied by elevated ALT levels (> 40 U/L) as a marker of disease activity.
Results
Overall, 1 782 110 subjects were eligible for inclusion in the study, comprising 20 660 HBV and 1 761 448 non-HBV subjects. From the non-HBV group, 500 000 subjects were randomly selected and used for matching. Using this subset, a propensity score-matched sample of 20 544 HBV and 82 176 non-HBV subjects was selected and included in the analyses. A flow chart for the selection of the study population appears in Figure 1.
Study flow diagram depicting the selection of the study population.
Abbreviations: HBV, hepatitis B virus; HBsAg, hepatitis B surface antigen; HCV.

Table 1 shows a comparison of baseline characteristics between the HBV and the non-HBV groups before and after matching. Before matching, subjects in the HBV group were older, more likely to be men, and more likely to be from a low SES background. They were also more likely to have diabetes, to take antiplatelets, and to perform faecal occult blood tests. After matching, all differences between the HBV and non-HBV groups were balanced with SMDs less than 0.1.
Baseline characteristics of the study population before and after propensity score-matching

Abbreviations: CHF, congestive heart failure, COPD, chronic obstructive pulmonary disease; CVDs, cardiovascular diseases; SMD, standardized mean difference.
a Socioeconomic status had missing values in 7.9% of subjects before matching and in 12.9% after matching.
b District code had missing values in 0.3% of subjects before matching and in 1.1% after matching.
c Midyear intervals; years are divided with no overlap.
During the 1 105 834 person-years of follow-up, 2 189 subjects experienced a stroke (Table 2). HBV patients had a slightly higher incidence rate of overall stroke compared to controls (2.13 vs. 1.94 per 1 000 person-years), yielding a non-statistically significant HR of 1.09 (95% CI 0.98–1.22). For ischaemic stroke, we observed 1 944 events with identical incidence rates in both groups (1.76 per 1 000 person-years) with a HR of 1.01 (95% CI 0.89–1.14). However, HBV was significantly associated with increased risk of ICH: with a total of 245 events, the incidence rate was nearly twice as high in the HBV group compared to non-HBV controls (0.37 vs. 0.19 per 1 000 person-years), resulting in a statistically significant HR of 1.82 (95% CI 1.35–2.45).
Descriptive statistics, incidence rates, and hazard ratios (HRs) for the association between hepatitis B virus (HBV) and study outcomes (N = 102 720)

Abbreviation: CI, confidence interval.
a Overall stroke was defined as the composite outcome of ischaemic and intracerebral haemorrhagic (ICH) strokes.
The results were consistent across the sensitivity analyses: among subjects without a prior history of stroke (Supplementary Table S1), after excluding subjects diagnosed with HCV during follow-up (Supplementary Table S2), and when limiting the analysis to confirmed chronic HBV cases defined by at least two positive HBsAg tests obtained ≥6 months apart (Supplementary Table S3).
Subgroup analysis stratified by median age and sex showed an increased risk of ischaemic stroke in subjects younger than 40 years (P for interaction = 0.006) and in females (P for interaction = 0.079) (Figure 2).
Forest plots for the association between hepatitis B virus (HBV) and overall stroke*, ischaemic stroke, and intracerebral haemorrhage (ICH), showing the hazard ratios (HRs) from the subgroup analyses by: (a) sex and by (b) median age.
* Overall stroke was defined as the composite outcome of ischaemic and ICH strokes.

The results remained unchanged when HBV infection was accompanied by an elevated ALT level (Supplementary Table S4). An exploratory analysis showed that HBV with HDV coinfection was associated with increased risk of ischaemic stroke with an HR of 1.83 (95% CI, 1.00–3.34), whereas HBV without HDV infection was not associated with increased risk of ischaemic stroke with an HR of 0.99 (95% CI, 0.87–1.12) (Table 3).
Descriptive statistics, incidence rates, and hazard ratios (HRs) for the association between HBV infection with and without hepatitis D virus (HDV) coinfection and study outcomes (N = 102 720)

Abbreviation: CI, confidence interval.
a Overall stroke was defined as the composite outcome of ischaemic and intracerebral haemorrhagic (ICH) strokes.
b The non-HBV category is distinct from each HBV category, corresponding to the original propensity score-matched groups.
Discussion
Our results show no significant association between HBV infection and overall stroke or ischaemic stroke. However, HBV infection was associated with a significantly increased risk of ICH. These findings remained robust across sensitivity analyses. Subgroup analysis further suggests that the association between HBV and ischaemic stroke risk is more pronounced and increased in subjects younger than 40 years and in females. In addition, HBV infection in the presence of HDV coinfection was associated with increased risk of ischaemic stroke.
Our results align with a large cohort study based on prospective biobank databases that found no association between HBV and overall stroke or ischaemic stroke, but identified an increased risk for ICH with an HR of 1.28 (95% CI, 1.06–1.55) [Reference Hamilton5]. Consistent with our findings, these results remained robust in a sensitivity analysis restricted to subjects without a history of prior cardiovascular disease. However, unlike our study, the study found no interaction effects for age or sex [Reference Hamilton5]. Similarly, a Korean study using a prospective biobank of male public servants reported an increased risk of ICH with an adjusted HR of 1.33 (95% CI, 1.15–1.53) [Reference Sung10]. In contrast to our findings, this study reported a decreased risk for ischaemic stroke with an adjusted HR of 0.79 (95% CI, 0.68–0.90) [Reference Sung10]. A decreased risk of ischaemic stroke was also reported in two retrospective cohort studies from Taiwan, which were based on insurance claims [Reference Tseng11, Reference Kuo19]. Kuo et al. [Reference Kuo19] found an adjusted HR of 0.61 (95% CI, 0.56–0.67), along with reduced mortality risk. Tseng et al. [Reference Tseng11], using the same database but focusing specifically on diabetic patients, reported an adjusted HR of 0.77 (95% CI, 0.66–0.89) [Reference Tseng11]. These contradictory results may stem from methodological limitations, including the use of diagnostic codes rather than laboratory confirmation for HBV exposure and the limited availability of potential confounders in claims data [Reference Tseng11, Reference Kuo19]. Alternatively, these differences might reflect unique characteristics of the Taiwanese study population [Reference Zhang20].
Biologically, chronic HBV infection can lead to an increased risk of ICH through three interrelated pathways. First, thrombocytopenia is common in chronic HBV infection, often occurring in early stages even before the onset of cirrhosis [Reference Wu7, Reference Chen9]. This reduction in platelet count is associated with hepatic dysfunction, as demonstrated in both clinical and Mendelian randomization studies [Reference Wu7, Reference Chen9]. HBV also directly affects bone marrow, further contributing to low platelet counts and increasing bleeding risk [Reference Hamilton5, Reference Wang and Tsai8, Reference Sung10]. Second, impaired synthesis of coagulation factors occurs as liver function deteriorates, resulting in a complex, unstable haemostatic balance that predisposes to haemorrhagic events [Reference Feigin1, Reference Wang and Tsai8, Reference Tripodi and Mannucci21]. Third, emerging evidence suggests that chronic HBV infection may contribute to vascular fragility and endothelial dysfunction through sustained pro-inflammatory states and oxidative stress, potentially weakening cerebral vessels [Reference Chen9]. Chronic HBV infection is also linked with lower serum lipid levels, including cholesterol, likely due to altered hepatic function and changes in lipid metabolism, which may further increase the risk for ICH [Reference Hamilton5, Reference Razi22]. However, the impact of these metabolic changes on atherogenesis or ischaemic stroke remains uncertain and may vary by sex and age [Reference Hamilton5, Reference Wu7, Reference Razi22, Reference Yun, Kim and Kim23].
In our study, about 3.2% of HBV patients were coinfected with HDV, consistent with previous reports from Israel [Reference Hazzan24]. Our analysis showed that HBV subjects coinfected with HDV were at increased risk for ischaemic stroke. Chronic coinfection with HDV is characterized by more aggressive liver disease, faster progression to cirrhosis, and a stronger, more sustained inflammatory response [Reference Negro and Lok12]. Sustained inflammation is independently associated with increased ischaemic stroke risk [Reference Wu7, Reference Negro and Lok12]. However, direct evidence linking HDV to stroke remains lacking.
Our study faces several limitations. First, as an observational study based on an administrative database, residual confounding remains a concern, since we could not control for information not coded in patients’ files, such as personality traits, lifestyle habits, and general health conditions that may be associated with both HBV activity and stroke development. However, to address concerns raised in previous studies regarding inadequate model adjustment, we employed propensity score-matching based on numerous potential confounders, including sociodemographic factors, cardiovascular comorbidities, cardiovascular risk factors, selected medications, chronic diseases, and healthcare utilization markers.
Second, regarding the potential for misclassification bias, we based the HBV exposure assessment on HBsAg laboratory test results. HBsAg testing is highly sensitive and specific and is considered the cornerstone of HBV diagnosis and screening [Reference Amini25]. In contrast, previous studies relied on point-of-care rapid diagnostic tests, which have lower validity [Reference Hamilton5] or ICD diagnostic codes [Reference Tseng11, Reference Kuo19]. Moreover, the diagnosis of ICH was found to be valid within our database [Reference Saliba13]. Nevertheless, misclassification bias remains a concern, and the expected non-differential misclassification would bias the results towards the null, although unlikely enough to explain the results for overall and ischaemic strokes. We also recognize that subjects with negative HBsAg could acquire HBV infection during follow-up. However, this risk is minimal given Israel’s low HBV incidence (~1–2/100 000 annually in adults) [Reference Rosner17].
Third, we did not relate our findings to antiviral treatments or the severity of chronic HBV infection, although we performed an analysis by ALT baseline levels as a proxy marker for inflammation activity. However, we acknowledge that ALT is a nonspecific marker that may be elevated for reasons other than HBV [Reference Ghany26].
Fourth, due to our reliance on ICD coding, we were unable to specifically address ischaemic stroke aetiologies. Future studies could explore whether HBV differentially affects particular stroke mechanisms.
Finally, although our results pertain to a specific healthcare system, the CHS covers more than 50% of the Israeli population, including diverse groups.
In conclusion, our findings suggest that HBV is not associated with an increased risk of overall or ischaemic stroke but is significantly associated with a higher risk of ICH. Yet, this study suggests a higher risk for ischaemic stroke in specific subgroups, including females, younger patients, and chronic coinfection with hepatitis D. Further studies are needed to investigate the relationship between those BV and stroke, including its subtypes.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/S095026882610154X.
Data availability statement
Individual-level data cannot be publicly available due to legal restrictions. All data relevant to this analysis were presented in the paper.
Author contribution
A.Y., D.R., and W.S. wrote the manuscript; A.Y., D.R., N.T., E.A., and W.S. designed the research; A.Y. and N.S. analysed the data; A.Y., D.R., and A.A. conducted the research; and W.S. supervised.
Competing interests
The authors declare that they have no conflicts of interest, financial or otherwise, related to this study. No financial support was received for this research.
Ethical standard
The Institutional Review Board (IRB) of Lady Davis Medical Centre approved the study (Approval No. CMC-25-0101). The IRB granted a waiver of informed consent due to the retrospective nature of the study. The current study follows the Strengthening Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline.




