Teleost fishes have independently colonized polar regions multiple times, facing many physiological and biochemical challenges due to frigid temperatures. Although increased gene copy numbers can contribute to adaptive evolution in extreme environments, it remains unclear which categories of genes exhibit increased copy numbers associated with polar colonization. Using 104 species of ray-finned fishes, we systematically identified genes with a significant correlation between copy number and polar colonization after phylogenetic correction. Several genes encoding extracellular glycoproteins, including zona pellucida (ZP) proteins, which increase their copy number in Antarctic notothenioid fishes, exhibited elevated copy numbers across multiple polar fish lineages. Additionally, some genes reported to be highly expressed under cold stress, such as cold-inducible RNA-binding protein (CIRBP), had significantly increased copy numbers in polar fishes. Further analysis will provide a fundamental basis for understanding the role of gene duplication in polar adaptations.
INTRODUCTION
Convergent evolution of phenotypic traits across independent lineages in similar environments is commonly observed in nature, highlighting the role of natural selection and indicating that phenotypic evolutionary pathways may be predictable under certain ecological conditions (Schluter, 2000; Losos, 2011; Martin and Orgogozo, 2013). Extreme environments, in particular, provide valuable insights into the nature of adaptive evolution due to strong selective pressures (Hotaling et al., 2023). Polar regions, the Arctic and Antarctic, are among the most extreme environments, experiencing pronounced seasonal variation in daylight and temperature (Clarke and Harris, 2003). The Arctic is often defined by the 10°C July isotherm, whereas the Antarctic is defined by the Antarctic Polar Front (Clarke and Harris, 2003; Møller et al., 2005). In polar oceans, water temperatures drop below the freezing point of body fluids for many teleost fishes (Scholander et al., 1957; Christiansen et al., 1995; DeVries and Cheng, 2005). Frigid temperatures can disrupt various cellular processes, such as enzyme catalytic rates, protein structure, and membrane fluidity (Fields, 2001; Pörtner et al., 2007). Despite these harsh conditions, various teleost lineages have successfully colonized polar regions (Eastman, 2005; Møller et al., 2005; Mecklenburg et al., 2011). However, previous studies on molecular mechanisms have focused on a limited number of lineages or genes (DeVries and Cheng, 2005; Beers et al., 2015; Fields et al., 2015; Logan and Buckley, 2015; Daane and Detrich, 2022), leaving those underlying convergent adaptation to polar regions largely unknown.
One of the few examples of molecular mechanisms that enable polar colonization is de novo acquisition of antifreeze proteins (AFPs) (DeVries and Wohlschlag, 1969; DeVries, 1971; Duman and DeVries, 1974, 1976; Near et al., 2012; Bista et al., 2023). Polar fishes have evolved several structurally distinct types of AFPs, including AFP types I, II, and III, and antifreeze glycoproteins (AFGPs), in diverse lineages (Chen et al., 1997; Cheng, 1998; Fletcher et al., 2001; DeVries and Cheng, 2005; Deng et al., 2010; Baalsrud et al., 2018; Rives et al., 2024). These proteins lower the freezing point of blood and body fluids by adsorbing to ice surfaces and inhibiting ice growth, thereby preventing the organisms from freezing (DeVries, 1971; Raymond and DeVries, 1977; Yeh and Feeney, 1996). While de novo acquisition of AFPs is a notable example of genetic innovation in polar fishes, previous studies suggest that larvae may have additional mechanisms for freezing resistance (Cziko et al., 2006). Furthermore, because polar environments affect cellular processes beyond just freezing resistance, genes other than AFPs may also contribute to adaptation.
Gene duplication is one of the major processes that expands the genetic repertoire in teleost fishes (Brawand et al., 2014; Glasauer and Neuhauss, 2014). It can lead to phenotypic changes through various mechanisms such as increased protein dosage, alternative regulatory expression, and neofunctionalization with subsequent amino acid mutations (Magadum et al., 2013). Duplication of certain genes can enhance adaptation to different environments (Kondrashov, 2012). For example, increased copy numbers of fatty acid desaturase genes in freshwater fishes can compensate for lack of docosahexaenoic acid (DHA) (Ishikawa et al., 2019). In Antarctic notothenioid fishes, genes encoding zona pellucida (ZP) proteins show increased copy numbers (Chen et al., 2008, 2019; Kim et al., 2019; Daane and Detrich, 2022; Lu et al., 2022). ZP proteins constitute a fibrillar extracellular matrix surrounding eggs (Litscher and Wassarman, 2018, 2020). ZP proteins from Antarctic notothenioids exhibit novel ice melting-promoting (IMP) activities in eggs, in addition to lowering freezing points (Cao et al., 2016). However, we do not know how prevalent convergent gene duplication is in colonization of polar regions.
In this study, we sought to comprehensively identify genes with increased copy numbers among multiple polar fish lineages. Using 104 teleost genomes, we identified 20 genes showing significant copy number increases in polar fishes after phylogenetic correction. These included genes encoding extracellular glycoproteins and genes upregulated under cold stress. We also investigated whether copy number increases occurred only in polar regions or also along latitudinal clines, and we clarified the timing of gene duplication by estimating phylogenetic trees.
MATERIALS AND METHODS
Comprehensive detection of genes with higher copy numbers in polar fishes
To investigate the association between copy numbers and colonization of polar regions for each orthologous gene, we used OrthoDB v11, a database providing orthologous relationships and functional annotations of genes from representative genomes, based on their longest isoforms (Waterhouse et al., 2013; Kuznetsov et al., 2023). Species classified in the infraclass Teleostei were selected from those available in OrthoDB via NCBI taxonomy IDs using taxize 0.9.101 (Chamberlain and Szöcs, 2013). A phylogenetic tree for ray-finned fishes was obtained using the fishtree_phylogeny function in fishtree 0.3.4 (Chang et al., 2019), selecting only those species for which a phylogenetic tree was available. Latitudinal distribution ranges for selected species were retrieved from FishBase ver. 02/2024 (Froese and Pauly, 1994), and only species with these ranges available were selected. Of the 104 species selected, 23 were classified as polar fishes, based on the checklist of polar fishes by Møller et al. (2005), who defined them as those found in Arctic or Antarctic regions (Møller et al., 2005). The remaining 81 species were classified as non-polar fishes (see Supplementary Table S1 (zs240098_TableS1.xlsx)). For each of the selected polar and non-polar fishes, we calculated copy numbers of orthologous genes using the Actinopterygii level of orthology (Kuznetsov et al., 2023).
To identify orthologous genes with increased copy numbers in diverse polar fish lineages, we performed family-level downsampling. Specifically, we performed the downsampling to reduce the effect of extensively sequenced lineages (e.g., Salmonidae) when comparing copy numbers between polar and non-polar fishes. Amino acid sequences of the longest isoforms for all genes in selected species were obtained from OrthoDB. These sequences were used as inputs to calculate BUSCO (Benchmarking Universal Single-Copy Orthologs) completeness scores using BUSCO v5.6.1 with the actinopterygii_odb10 database (Manni et al., 2021). The species with the highest BUSCO completeness score in each family was selected from polar fishes when available, and from non-polar fishes otherwise (see Supplementary Table S1 (zs240098_TableS1.xlsx)). Using the remaining 12 polar and 53 non-polar fishes after downsampling, we selected orthologous genes that met the following criteria: (i) the average copy number in polar fish was more than twice that in non-polar fish, and (ii) more than half of polar fish had at least two copies.
To test the hypothesis that copy numbers of the 54 orthologous genes are associated with colonization of polar regions by pre-downsampling species, we performed Bayesian inference for a generalized linear mixed model (GLMM) with a Poisson distribution using MCMCglmm 2.36 (Hadfield, 2010) with pre-downsampling of 104 species. Copy numbers of orthologous genes were used as the response variable, with habitat type (polar or non-polar) as the predictor. To account for phylogenetic relationships, a covariance structure derived from the phylogenetic tree, pruned to include only pre-downsampling species using ape 5.8 (Paradis and Schliep, 2019), was incorporated as a random effect. The MCMC process was conducted for 10,005,000 iterations with inverse Wishart priors (parameters V = 1 and mu = 0.002). The initial 5000 samples were excluded as burn-in, with subsequent samples every 1000 iterations being used to calculate p-values (pMCMC_polar). These p-values were adjusted for multiple tests using the Benjamini-Hochberg procedure, with a false discovery rate (FDR) below 0.1 considered statistically significant.
Gene ontology (GO) enrichment analysis
To assess functional information of the detected 54 orthologous genes, we performed gene ontology (GO) enrichment analysis for biological function, cellular component, and molecular function, respectively. For orthologous genes present in at least one of the 65 downsampled species, GO IDs assigned per orthologous gene (i.e., per orthologous group, not per individual gene) were retrieved from OrthoDB. To identify GO IDs overrepresented among detected orthologous genes compared to all orthologous genes in downsampled species, we performed hypergeometric tests. More specifically, we calculated a p-value for each GO ID present in detected orthologous genes using phyper function in R as follows:
where n is the number of detected orthologous genes, ni is the number of detected orthologous genes with GO ID i, N is the number of all orthologous genes, and Ni is the number of all orthologous genes with GO ID i. These p-values were adjusted for multiple tests using the Benjamini-Hochberg procedure. We considered GO IDs with an FDR below 0.1 and at least two detected orthologous genes as being significantly enriched. We also performed GO enrichment analysis for the identified 20 orthologous genes that showed significant associations between copy number and colonization of polar regions.
Analysis of the association between gene copy number and latitudinal clines
To examine whether copy number increased not only in polar regions, but also at higher latitudes in general, we tested the association between copy number and maximum absolute latitudes of distribution ranges for the detected 20 orthologous genes that showed significant correlations with polar colonization. We performed GLMM analysis using maximum absolute latitudes of distribution ranges as the predictor instead of habitat type (polar or non-polar) and calculated p-values (pMCMC_latitude). These p-values were adjusted for multiple tests using the Benjamini-Hochberg procedure. FDRs below 0.1 were considered statistically significant.
Phylogenetic tree estimation
To clarify the timing of gene duplication, we estimated phylogenetic trees for the identified 20 orthologous genes. Protein sequences of the longest isoform for each gene were obtained from NCBI. These sequences were aligned using MAFFT v7.525 in L-INS-i mode (Katoh et al., 2002; Katoh and Standley, 2013). Aligned sequences were trimmed using trimAl v1.5.rev0 with the automated1 option to remove gappy sites (Capella-Gutiérrez et al., 2009). GeneRax v2.0.4 (Morel et al., 2020) was used to infer rooted phylogenetic trees. The trimmed alignment, the pruned species tree from fishtree, and the initial tree estimated by IQ-TREE v2.3.6 with the LG + G substitution model (Minh et al., 2020) were used as inputs. Estimation was performed using the LG + G model, accounting for duplication and loss. Phylogenetic trees were visualized using ggtreeExtra v1.12.0 (Xu et al., 2021).
Fig. 1.
Copy numbers of orthologous genes significantly increased in polar fishes. A phylogenetic tree for polar and non-polar fishes, including data on maximum latitudes of species ranges and copy numbers of the 20 orthologous genes identified. Polar fishes are labeled in light blue.

For the three identified ZP orthologous genes (428128at7898, 409056at7898, and 191156at7898), we included some ZP protein sequences from an Antarctic notothenioid fish, Dissostichus mawsoni, (DmZPAX1, DmZPC5, and DmZPC1) in the phylogenetic analysis. These ZP proteins from D. mawsoni, which have been experimentally verified to possess IMP activity (Cao et al., 2016), were included to clarify orthologous relationships between identified ZP genes and known antifreeze ZP proteins. Protein sequences for DmZPAX1 (AIO03056.1) and DmZPC1 (AJW66345.1) were downloaded from NCBI, while the sequence for DmZPC5 was obtained from Supplementary Figure 5 (zs240098_FigS1-S21.pdf) in Cao et al. (2016).
Data accessibility
Original data can be accessed from the cited sources. Code for the analysis is available at https://github.com/mkrg01/gene_dup_polar.
Table 1.
Orthologous genes significantly increased in polar fishes.

RESULTS
Genes that show increased copy numbers in polar fish lineages
We first screened for genes with increased copy numbers in polar fish lineages compared to non-polar fish lineages by family-level downsampling of 65 species. Our analysis detected 54 orthologous genes with increased copy numbers in multiple polar fish lineages, such as genes encoding the low choriolytic enzyme-like, pepsin A-like, E3 ubiquitin-protein ligase TRIM39-like, G2/M phase-specific E3 ubiquitin-protein ligase-like, and caspase-8-like (see Supplementary Figure S1 (zs240098_FigS1-S21.pdf); Supplementary Tables S2 (zs240098_TableS2.xlsx), S3 (zs240098_TableS3.xlsx)). GO enrichment analysis for these 54 orthologous genes showed that genes related to proteolysis, regulation of apoptotic process, protein ubiquitination, metal ion binding, zinc ion binding, and protein dimerization activity were significantly overrepresented (Benjamini-Hochberg FDR < 0.1 and N ≥ 2; see Supplementary Table S4 (zs240098_TableS4.xlsx)). Of the 54 orthologous genes, 20 showed a statistically significant association between copy number and colonization of polar regions even after phylogenetic correction (pMCMC, FDR < 0.1; Fig. 1, Table 1). The gene encoding the cold-inducible RNA-binding protein (CIRBP) showed the most significant association. Genes encoding ZP proteins, which show increased copy numbers in Antarctic notothenioid fishes (Cao et al., 2016), were among the detected genes. In addition to ZP genes, genes predicted to encode glycoproteins (leucine-rich repeat extensin-like protein 2, mucin-2, and uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like) and those regulating mucin release (tetraspanin-8-like) were also included. Genes related to extracellular regions and protein dimerization activity were significantly enriched in the GO enrichment analysis (Benjamini-Hochberg FDR < 0.1 and N ≥ 2; see Supplementary Table S5 (zs240098_TableS5.xlsx)).
Fig. 2.
Scatter plots of maximum absolute latitudes and copy numbers of the 20 orthologous genes selected. Each dot represents polar or non-polar fish before downsampling. Polar fishes are plotted in light blue. adj_p_pol, adjusted pMCMC_polar; adj_p_lat, adjusted pMCMC_ latitude.

Fig. 3.
An estimated phylogenetic tree of OG 191156at7898 (zona pellucida sperm-binding protein 3-like (ZP3B)) after downsampling, including DmZPC5. Accession IDs from polar fishes are labeled in light blue, and DmZPC5 in red. In addition, polar fish species with each sequence and their clade names are highlighted in light blue on the right, whereas DmZPC5 is highlighted in red for clarity. The scale of branch lengths indicates the expected number of amino acid substitutions per amino acid site. This figure shows part of the phylogenetic tree. Please see Supplementary Figure S2 (zs240098_FigS1-S21.pdf) for the complete tree.

Genes with and without latitudinal clines
We examined whether the identified 20 orthologous genes with increased copy number in polar regions were also associated with latitudinal clines. Of the 20 orthologous genes, 14 also showed a statistically significant association between copy number and maximum absolute latitude of distribution ranges, whereas six genes did not (pMCMC, FDR < 0.1; Fig. 2, Table 1). Genes with increased copy numbers in polar regions, but not significantly associated with latitudes, such as genes encoding the trihelix transcription factor GT-3a-like (Fig. 2M) and transcription factor Adf-1-like (Fig. 2N), showed exceptionally high copy numbers in some polar fishes, but not in non-polar fishes inhabiting high latitudes (Fig. 2).
Phylogenetic trees estimated for the identified 20 genes
Our phylogenetic analysis clearly showed that gene duplication occurred in multiple lineages of polar fish for the identified 20 orthologous genes (Fig. 3, and see Supplementary Figures S2–S21 (zs240098_FigS1-S21.pdf)). Notable examples are three orthologous genes encoding ZP proteins (zona pellucida protein AX 1 (ZPAX1), zona pellucida sperm-binding protein 3-like (ZP3F), and zona pellucida sperm-binding protein 3-like (ZP3B)). In all three ZP genes, multiple gene duplications occurred not only in Antarctic notothenioids, but also in several Arctic fish lineages, such as Cyclopterus lumpus (Cyclopteridae), Pungitius pungitius (Gasterosteidae), Salmo salar (Salmonidae), and Clupea harengus (Clupeidae) (Fig. 3, and see Supplementary Figures S2–S4 (zs240098_FigS1-S21.pdf)). Conversely, no clear evidence of duplication for the three ZP orthologous genes was detected in some polar fish lineages, such as Perca fluviatilis (Percidae), Hippoglossus stenolepis (Pleuronectidae), Gadus morhua (Gadidae), and Esox lucius (Esocidae) (Fig. 3, and see Supplementary Figures S2–S4 (zs240098_FigS1-S21.pdf)). The three identified ZP genes were homologous to previously identified ZP proteins with antifreeze properties (ZPAX1 to DmZPAX1, ZP3F to DmZPC1, and ZP3B to DmZPC5) (Fig. 3, and see Supplementary Figures S2–S4 (zs240098_FigS1-S21.pdf)). In addition to ZP genes, gene duplication occurred in multiple Arctic and Antarctic fish lineages in several orthologous genes, such as those encoding NAD(P)H dehydrogenase (see Supplementary Figure S6 (zs240098_FigS1-S21.pdf)), trypsin-like (see Supplementary Figure S8 (zs240098_FigS1-S21.pdf)), leucine-rich repeat extensin-like protein 2 (see Supplementary Figure S9 (zs240098_FigS1-S21.pdf)), and probable E3 ubiquitin-protein ligase HERC6 (see Supplementary Figure S12 (zs240098_FigS1-S21.pdf)).
DISCUSSION
In this study, we identified 54 genes with increased copy number in multiple polar fish lineages. GO enrichment analysis revealed that genes involved in proteolysis, ubiquitination, and apoptosis were overrepresented among these genes. At low temperatures, food consumption rates and digestive enzyme activities decrease (Ahmad et al., 2014). Therefore, the higher copy number of genes related to protein digestive enzymes, such as genes encoding pepsin A-like and trypsin-like, may be adaptive to compensate for their deficiency in cold polar oceans. The increased copy number of genes encoding the low choriolytic enzyme-like, which can digest ZP proteins in the egg extracellular matrix during hatching (Yasumasu et al., 2010; Kawaguchi et al., 2013; Sano et al., 2014), may also be adaptive to digest abundant ZP proteins in polar fish eggs. In addition to low digestive enzyme activities, cold stress is thought to denature or misfold many proteins (Place et al., 2004; Logan and Buckley, 2015). Moreover, polar fishes are thought to experience high oxidative stress, likely due to increased mitochondrial abundance and lipid polyunsaturation (Abele and Puntarulo, 2004; Guderley, 2004; Todgham et al., 2017). Thus, higher copy numbers of genes related to ubiquitination, such as genes encoding the E3 ubiquitin-protein ligase TRIM39-like, G2/M phase-specific E3 ubiquitin-protein ligase-like, and probable E3 ubiquitin-protein ligase HERC6, may help to remove proteins denatured or misfolded by cold and oxidative stress, to maintain homeostasis. Similarly, increased copy numbers of apoptosis-related genes, such as the gene encoding caspase-8-like, may facilitate removal of cells unable to maintain homeostasis under stresses in polar regions. Overall, copy number increases in genes involved in proteolysis, ubiquitination, and apoptosis may promote colonization of polar regions in teleosts.
Among the 20 genes showing a significant association between copy number and polar colonization, genes related to the extracellular region were significantly enriched. In particular, we identified three orthologous genes encoding ZP proteins that showed significant associations between copy numbers and polar colonization. ZP proteins are glycosylated proteins constituting the fibrillar extracellular matrix surrounding eggs (Litscher and Wassarman, 2018, 2020). Increases in copy numbers of ZP genes in teleost fishes are thought to enhance physical protection of eggs in harsh environments, such as alkaline environments (cyprinid fish Leuciscus waleckii) (Xu et al., 2017) and near-freezing conditions (the Antarctic notothenioid fish, D. mawsoni) (Chen et al., 2008). A previous study experimentally confirmed that some ZP proteins from Antarctic notothenioid fishes, including DmZPAX1, DmZPC1, and DmZPC5 from D. mawsoni, depressed melting and freezing points in eggs (Cao et al., 2016). Our phylogenetic analysis clearly showed that the identified three orthologous ZP genes were homologous to previously identified ZP proteins with antifreeze properties (ZPAX1 to DmZPAX1, ZP3F to DmZPC1, and ZP3B to DmZPC5) (Fig. 3, and see Supplementary Figures S2–S4 (zs240098_FigS1-S21.pdf)). Gene duplications occurred repeatedly in these ZP genes not only in Antarctic notothenioids, but also in multiple Arctic fish lineages (Fig. 3, and see Supplementary Figures S2–S4 (zs240098_FigS1-S21.pdf)). These results indicate that eggs of multiple Arctic fish lineages have independently acquired antifreeze properties through mechanisms similar to Antarctic notothenioids. In some polar fish lineages, however, no duplication was observed in these three ZP genes (Fig. 3, and see Supplementary Figures S2–S4 (zs240098_FigS1-S21.pdf)). This may be partly because some species, such as P. fluviatilis and E. lucius, spawn in warmer seasons (Casselman and Lewis, 1997; Gillet and Dubois, 2007), where freeze tolerance is likely unnecessary. Furthermore, eggs of G. morhua are reported to exhibit freeze tolerance (Valerio et al., 1992), suggesting additional molecular adaptations beyond the copy number increase in these three ZP genes. We also identified significant associations in genes predicted to encode glycoproteins (leucine-rich repeat extensin-like protein 2, mucin-2, and uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like), and a gene encoding a membrane protein regulating mucin release (tetraspanin-8-like) (Wojnacki et al., 2023). In teleosts, skin mucus helps to protect against physical and chemical attacks and alters its viscosity in response to cold stress (Guardiola et al., 2015; Sanahuja et al., 2018), suggesting that increased copy numbers of these genes may also contribute to physical and chemical adaptation to subzero environments.
We also identified significant associations between copy number and polar colonization in genes potentially related to cold stress adaptation, such as genes encoding the cold-inducible RNA-binding protein (CIRBP), NAD(P)H dehydrogenase, probable E3 ubiquitin-protein ligase HERC6, and trypsin-like. A gene encoding CIRBP increases its expression in response to cold stress (Gracey et al., 2004; Chou et al., 2008; Rebl et al., 2013; Liu et al., 2020; Ma et al., 2022) and stabilizes mRNA by binding to its 3′-untranslated regions (Zhong and Huang, 2017). A gene encoding NADH dehydrogenase that is upregulated under cold stress (Malek et al., 2004; Qian and Xue, 2016) likely compensates for decreased ATP production under reduced kinetic energy in a cold environment (O'Brien, 2011; Coppe et al., 2013). Genes involved in ubiquitination are upregulated in response to cold stress (Gracey et al., 2004; Long et al., 2013; Liu et al., 2020) and are highly expressed in Antarctic notothenioid fishes (Todgham et al., 2007; Shin et al., 2012). They probably remove proteins misfolded or damaged by cold stress besides oxidative stress, as described above (Logan and Buckley, 2015; Todgham et al., 2017). Activity of digestive enzymes, including trypsin, decreases at low temperatures (Ahmad et al., 2014). Elevated gene expression or reduced activity under cold stress suggests that increased copy numbers of these four genes in polar fishes may be adaptive to compensate for shortages of their products caused by extreme cold stress. In addition, these four orthologous genes were significantly associated with latitudinal clines and showed higher copy numbers at higher latitudes, particularly for the gene encoding CIRBP (Fig. 2). As sea surface temperatures tend to decrease at higher latitudes (Pinet, 2006), increased copy numbers of these genes in species inhabiting higher latitudes, including non-polar regions, may also indicate their importance in adaptation to relatively moderate cold stress outside polar regions.
Of the 20 genes significantly associated with polar colonization, six did not show a significant association between copy number and maximum absolute latitude. Some genes, such as those encoding trihelix transcription factor GT-3a-like and transcription factor Adf-1-like, showed a pronounced increase specifically in some polar fishes, rather than a general increase among species inhabiting high latitudes (Fig. 2). Although functions of these genes in adaptations to polar regions are unclear, the exceptional copy number increase of these transcription factors likely contributes to adaptation unique to polar ecosystems.
In conclusion, we systematically identified genes with increased copy numbers among polar fish lineages. In particular, our analysis suggests that copy number increases in genes related to extracellular function are probably involved in polar adaptation. Although further functional analysis of increased copy numbers of these genes is needed, our findings provide a basis for understanding the role of copy number increases in colonization of polar regions.
ACKNOWLEDGMENTS
We thank Steven D. Aird ( https://www.sda-technical-editor.org) for editing the manuscript. This work was funded by KAKENHI 23KJ0575 from the Japan Society for the Promotion of Science. Computations were partially performed on the NIG supercomputer at ROIS National Institute of Genetics.
© 2025 Zoological Society of Japan
AUTHOR CONTRIBUTIONS
TN: conceptualization, data curation, formal analysis, funding acquisition, project administration, visualization, writing—original draft; AI: supervision, writing—review and editing.
SUPPLEMENTARY MATERIALS
Supplementary materials for this article are available online. (URL: https://doi.org/10.2108/zs240098)
Supplementary Table S1. (zs240098_TableS1.xlsx) Species information used in this study.
Supplementary Table S2. (zs240098_TableS2.xlsx) 54 orthologous genes identified before GLMM selection.
Supplementary Table S3. (zs240098_TableS3.xlsx) Copy numbers of the detected 54 orthologous genes for teleost species before downsampling.
Supplementary Table S4. (zs240098_TableS4.xlsx) Summary of GO enrichment analysis for the detected 54 orthologous genes.
Supplementary Table S5. (zs240098_TableS5.xlsx) Summary of GO enrichment analysis for the 20 orthologous genes that showed significant associations between copy numbers and polar colonization.
Supplementary Figure S1. (zs240098_FigS1-S21.pdf) Copy numbers of 54 orthologous genes identified before GLMM selection. A phylogenetic tree for polar and non-polar fishes, including copy numbers of the 54 genes.
Supplementary Figure S2. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 191156at7898 (zona pellucida sperm-binding protein 3-like [ZP3B]) after downsampling, including DmZPC5.
Supplementary Figure S3. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 428128at7898 (zona pellucida protein AX 1 [ZPAX1]) after downsampling, including DmZPAX1.
Supplementary Figure S4. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 409056at7898 (zona pellucida sperm-binding protein 3-like [ZP3F]) after downsampling, including DmZPC1.
Supplementary Figure S5. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 489137at7898 (cold-inducible RNA-binding protein [CIRBP]) after downsampling.
Supplementary Figure S6. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 299357at7898 (NAD[P]H dehydrogenase) after downsampling.
Supplementary Figure S7. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 465589at7898 (claudin-like protein ZF-A89) after downsampling.
Supplementary Figure S8. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 172968at7898 (trypsin-like) after downsampling.
Supplementary Figure S9. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 29671at7898 (leucine-rich repeat extensin-like protein 2) after downsampling.
Supplementary Figure S10. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 498742at7898 (general transcription factor II-I repeat domain-containing protein 2-like) after downsampling.
Supplementary Figure S11. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 454335at7898 (zinc finger MYM-type protein 1) after downsampling.
Supplementary Figure S12. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 488885at7898 (probable E3 ubiquitin-protein ligase HERC6) after downsampling.
Supplementary Figure S13. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 428606at7898 (late histone H2A.2.2-like) after downsampling.
Supplementary Figure S14. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 469863at7898 (trihelix transcription factor GT-3a-like) after downsampling.
Supplementary Figure S15. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 485260at7898 (transcription factor Adf-1-like) after downsampling.
Supplementary Figure S16. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 482472at7898 (tetraspanin-8-like) after downsampling.
Supplementary Figure S17. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 438657at7898 (mucin-2) after downsampling.
Supplementary Figure S18. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 500231at7898 (uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like) after downsampling.
Supplementary Figure S19. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 307673at7898 (LOC113088224) after downsampling.
Supplementary Figure S20. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 493746at7898 (uncharacterized protein LOC110015227 isoform X1) after downsampling.
Supplementary Figure S21. (zs240098_FigS1-S21.pdf) An estimated phylogenetic tree of OG 482447at7898 (protein ALP1-like) after downsampling.