Hostname: page-component-77f85d65b8-2tv5m Total loading time: 0 Render date: 2026-03-29T12:23:21.800Z Has data issue: false hasContentIssue false

Copy number variation profiling in the genome of crossbred dairy cattle from Pakistan

Published online by Cambridge University Press:  31 July 2025

Fakhar Un Nisa*
Affiliation:
Institute of Animal and Dairy Sciences, Faculty of Animal Husbandry, University of Agriculture, Faisalabad, Pakistan
Muhammad Asif
Affiliation:
Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan Pakistan Institute of Engineering & Applied Sciences (PIEAS), Islamabad, Pakistan
Qamar Un Nisa
Affiliation:
Department of Veterinary Pathology, University of Veterinary and Animal Sciences, Lahore, Pakistan
Rubab Zahra Naqvi
Affiliation:
Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan Pakistan Institute of Engineering & Applied Sciences (PIEAS), Islamabad, Pakistan
Muhammad Saif Ur Rehman
Affiliation:
Institute of Animal and Dairy Sciences, Faculty of Animal Husbandry, University of Agriculture, Faisalabad, Pakistan
Zahid Mukhtar*
Affiliation:
Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan Pakistan Institute of Engineering & Applied Sciences (PIEAS), Islamabad, Pakistan
*
Corresponding authors: Fakhar Un Nisa; Emails: dr.fakharunnisa@gmail.com, fakhar.nisa@uaf.edu.pk; Zahid Mukhtar; Email: zahidmukhtar@yahoo.com
Corresponding authors: Fakhar Un Nisa; Emails: dr.fakharunnisa@gmail.com, fakhar.nisa@uaf.edu.pk; Zahid Mukhtar; Email: zahidmukhtar@yahoo.com
Rights & Permissions [Opens in a new window]

Abstract

The investigation of structural variants that may govern complex traits has significant importance. This is particularly true for the crossbred dairy cattle of Pakistan, which are deemed ideal for achieving optimal milk production and enhanced environmental adaptability in tropical conditions. This research detected and described copy number variation regions (CNVR) within the crossbred cattle genome. A GGP_HDv3_C chip containing 139,376 SNPs was utilized to genotype a cohort of 81 animals. In this study, 1055 CNVs were obtained after quality control, distributed across animals and encompassing all autosomes. From these, 268 CNVRs were detected, which covered 31.03 megabases, representing approximately 1.24% of the bovine genome. Functional analysis of these regions yielded 97 genes primarily associated with the immune and defense systems. Additionally, other observed categories encompassed production, health and reproduction. These findings enhanced the CNV map of bovines, offering the variant identification linked to traits subject to selection in both crossbred and indicine breeds of cattle.

Information

Type
Animal Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Box plot showing CNV lengths (kb) across four categories (0, 1, 3, 4). The boxes represent the interquartile range (IQR), with medians marked as horizontal lines; whiskers extend to 1.5×IQR and dots represent outliers. CNV categories vary in data distribution, with category 0 showing fewer data points and higher outliers compared to categories 3 and 4.

Figure 1

Table 1. Summary of CNV based on size in kilobases (kb)

Figure 2

Figure 2. Line plot showing the distribution of CNVs across chromosomes, categorized by CNV types (0, 1, 3, 4). Each line represents the number of CNVs per chromosome, with distinct colours indicating different CNV values. The plot highlights variation in CNV counts across chromosomes and types.

Figure 3

Figure 3. Prevalence and type of CNVs across autosomes for four types (0, 1, 3, 4) represented as boxplots (lengths in kb) and lines with points (frequency per chromosome). Boxplots show medians, interquartile ranges and whiskers, while numbers on points indicate CNV counts.

Figure 4

Table 2. No. of CNVRs, proportional length of CNVRs on each autosome using HandyCNV and CNVRuler

Figure 5

Figure 4. Distribution of CNVs (a) and CNVRs (b) in different distance categories (0–100 kb, 101–200 kb, 201–300 kb, 301–400 kb, 401–500 kb and >500 kb) across autosomes. (a) The majority of CNVs (44%) fall within the 0–100 kb range. (b) Detailed breakdown of CNVR size ranges, highlighting that 65.67% of the CNVRs are within the 0–100 kb category, followed by 16.79% in the 101–200 kb range.

Figure 6

Figure 5. Genome-wide CNVR map illustrating the distribution of CNVRs across autosomes. Each horizontal bar represents a chromosome, with CNVRs categorized as gains (red), losses (green) and mixed events (black) based on their type. The percentages indicate the proportion of the chromosomes covered by CNVRs. The physical positions of CNVRs are displayed along the x-axis in megabase pairs (Mbp).

Figure 7

Table 3. Top 10 highly common genes and their association with economic traits

Supplementary material: File

Nisa et al. supplementary material

Nisa et al. supplementary material
Download Nisa et al. supplementary material(File)
File 2.7 MB