Fatty acid profile (%) of the total fatty acids in R. toruloides strains | |||
|---|---|---|---|
CBS 14 | CBS 349 | CBS 6016T | |
C14:0 | 1.51 a ± 0.20 | 0.79 b ± 0.10 | 1.39 a ± 0.07 |
C16:0 | 22.7 a ± 0.01 | 14.3 b ± 0.58 | 19.0 c ± 0.72 |
C18:0 | 6.03 a ± 0.50 | 8.49 b ± 0.97 | 10.1 c ± 1.00 |
C24:0 | 0.77 a ± 0.22 | 0.47 b ± 0.10 | 0.27 c ± 0.05 |
C18:1 (n-9) | 47.8 a ± 0.14 | 55.5 b ± 1.08 | 49.0 a ± 1.17 |
C18:2 (n-6) | 9.94 a ± 1.05 | 10.3 b ± 1.29 | 10.1 b ± 3.16 |
C18:3 (n-3) | 1.07 a ± 0.30 | 1.43 b ± 0.35 | 0.89 a ± 0.51 |
Total SFA | 31.0 a ± 0.90 | 24.0 b ± 1.22 | 30.7 a ± 1.74 |
Total UFA | 58.8 a ± 1.49 | 67.2 b ± 1.83 | 60.0 a ± 2.50 |
Carotenoids | Quantity of Carotenoids in R. toruloides strains (µg/g d.w.) | ||
|---|---|---|---|
CBS 14 | CBS 349 | CBS 6016T | |
β-carotene | 0.83 ± 0.10 | 0.07 ± 0.003 | 0.39 ± 0.10 |
γ-carotene | 0.14 ± 0.01 | 0.01 ± 0.01 | 0.07 ± 0.002 |
Torularhodin | 41.89 ± 1.23 | 3.67 ± 0.09 | 32.68 ± 0.55 |
Torulene | 15.19 ± 1.70 | 1.10 ± 0.06 | 11.39 ± 1.11 |
Total | 58.0 ± 0.35 | 4.85 ± 0.03 | 44.52 ± 1.75 |
| Abbreviation: T – Type strain; d.w. – Dry weight | |||
| Further, our study revealed that CBS 14 and CBS 6016T reached notably higher concentrations of total carotenoids compared to CBS 349 (Fig. 2). The total carotenoid concentrations in the cultures of CBS 14, CBS 6016T and CBS 349 were 58.0 ± 0.35, 44.52 ± 1.75 and 4.85 ± 0.03 µg/g, respectively. | |||
Name (KO number) | Function/ E.C. number | CBS 14 Annotation-tags | CBS 349 Annotation-tags | CBS 6016 Annotation-tags |
|---|---|---|---|---|
Genes related to lipid metabolism | ||||
ACAC (K11262) ∼2230 aa | Acetyl-CoA carboxylase (E.C. 6.4.1.2) | CBS14_8968 (2233 aa) | CBS6016_15464 (2233 aa) | |
CBS349_2506 (1440 aa) CBS349_2508 (504 aa) CBS349_2509 (284 aa) (Fragmented) | CBS6016_280 (2267 aa) | |||
FAS1 (K00668) ∼1300 aa | Fatty-acyl-CoA synthase (E.C. 2.3.1.86) | CBS14_8939 (1329 aa) | CBS6016_15437 (1334 aa) | |
CBS349_2538 (N- and C-terminal truncated) (851 aa) | CBS6016_254 (1283 aa) | |||
FAS2 (K00667) ∼2900 aa | CBS14_6383 (2980 aa) | CBS6016_8784 (2980 aa) | ||
CBS349_2653 (2899 aa) | CBS6016_159 (2897 aa) | |||
FAA K01897 | long chain fatty acid CoA ligase (EC 6.2.1.3) FAA4 and FAA2 | CBS14_2650 (698 aa) | CBS6016_5854 (698 aa) | |
CBS14_5883 (705 aa) | CBS6016_2536 (705 aa) | |||
CBS14_1981 (649 aa) | CBS6016_3339 (706 aa) | |||
CBS_1997 (405 aa) | CBS6016_3355 (399 aa) | |||
CBS349_7135 (136 aa) | CBS6016_12215 (722 aa) | |||
CBS349_4677 (557 aa) | CBS6016_7567 (557 aa) | |||
CBS349_1802 (289 aa) | CBS6016_10896 (640 aa) | |||
CBS349_4691 (676 aa) | - | |||
Genes related to carotenoid metabolism | ||||
AL1 (CrtI) (K15745) | phytoene desaturase (EC1.3.99.30) | CBS14_2402 (558 aa) | CBS6016_13578 (521 aa) | |
CBS349_2451 (544 aa) | CBS6016_330 (507 aa) | |||
AL2 (CrtYB) (K17841) | Bifunctional: Phytoene synthase, (EC2.5.1.32) Lycopene cyclase (EC5.5.1.19) | CBS14_2399 T2940 (612 aa) | CBS6016_13575 T16127 (612 aa) | |
CBS349_2454 (118 aa) matches to the N-terminus of lycopene cyclase (crtY) domain | CBS6016_326 (600 aa) | |||
| Discussions | ||||
| In this study, the hybrid strain CBS 6016T exhibited biomass concentrations and lipid production levels comparable to one of its parental strains, CBS 14, rather than CBS 349. Analysis of the lipid profiles revealed that CBS 349 produced slightly higher amounts of unsaturated fatty acids (UFAs) compared to CBS 14 and CBS 6016T. This may be associated with the lower proportion of storage lipids compared to membrane lipids in this strain. A decrease of the proportion of unsaturated fatty acids with an increase of the total lipids has been observed before (Nagaraj, Burkina et al. 2022). The proportion of UFAs was higher than the SFAs in all three strains, in accordance with our previous results (Nagaraj, Burkina et al. 2022) (Nagaraj, Blomqvist et al. 2025). Zhang et al., also observed similar results in the fatty acid composition of some R. toruloides haploid and diploid strains (Zhang, Kamal et al. 2022). | ||||
| A similar pattern was observed for carotenoid production. With SC-CO2 extraction, we observed that instead of β-carotene, torularhodin was the major carotenoid in all three strains of R. toruloides which is in line with our previous results (Nagaraj, Blomqvist et al. 2025). Larocca et al., (Larocca, Martino et al. 2023) also observed similar results in their study where they confirmed torularhodin as the major carotenoid produced by Rhodotorula spp.. Moline et al., (Moliné, Flores et al. 2010) confirmed in their study that accumulation of torularhodin constitutes an important mechanism that improves the resistance of yeasts to UV-B. As with lipid production, the carotenoid content of CBS 6016T is similar to CBS 14 and differs significantly from CBS 349. | ||||
| To understand the genomic context and to identify genetic differences among the three strains, we performed a comparative genomic analysis with a focus on lipid and carotenoid metabolism. Based on protein-coding sequences inferred from transcript variants, we identified 69 lipid metabolism-associated KO identifiers in CBS 349, representing the lowest number among all strains, which is consistent with its reduced capacity for lipid accumulation (Fig. 1). However, the number of KO identifiers does not appear to correlate directly with lipid accumulation. CBS 14 was associated with 107 KO identifiers, whereas CBS 6016 was associated with 209, nearly double that of CBS 14, yet the lipid content in the latter is not significantly higher. Using KEGG Mapper to reconstruct lipid and carotenoid metabolic pathways for CBS 14, CBS 349, and CBS 6016, three major observations emerged. First, CBS 349 harbors a heavily truncated version of FAS1 and AL2. Second, the ACAC gene in CBS 349 is fragmented and third, double sets of all key functional genes are present in CBS 6016 (Table 3). Sequence alignments indicated that the gene duplications in CBS 6016 arise entirely from the lipid and carotenoid metabolism-related genes inherited from both parental strains, CBS 14 and CBS 349. An exception is a FAA homolog identified in CBS 349, which could not be found in CBS 6016 and may present a gene loss event (Table 3; supplementary tables S1 and S2). | ||||
| The truncated version of the fatty acid synthase FAS1 (851 aa) in CBS 349 was identified by BLASTP search using FAS1 of CBS 14, lacking both the N- and C-terminal regions found in the full-length versions (~ 1300 aa) in CBS 14 and CBS 6016. Similarly, the acetyl-CoA carboxylase (ACACA, ~ 2230 aa) is represented by multiple CDS entries in CBS 349 (including one of 1440 aa), suggesting either a truncated version or an alternative exon–intron structure compared to CBS 14 and CBS 6016. Both FAS1 possess the core catalytic domains required for fatty acid biosynthesis, including the fungal fatty acid synthase signature (IPR050830), acyl transferase domains (IPR001227, IPR016035), the aldolase-type TIM barrel (IPR013785), and the SAT domain (IPR032088). However, the N-terminal beta subunit domain (IPR041099) and the meander beta sheet domain (IPR040883) are absent in the truncated FAS1 of CBS 349. These missing domains may potentially affect the protein’s stability or efficiency. | ||||
| On the other hand, the FAS2 orthologs in all three strains are of comparable length (~ 2900 aa) and exhibit lengths characteristic of Type I fatty acid synthases: The fungal fatty acid synthase (FAS) system typically consists of two separate subunits, FAS1 and FAS2, encoded by distinct genes (Type 2). For instance, the Saccharomyces FAS consists of FAS1 (β-subunit, 2,051 amino acid residues) and FAS2 (α-subunit, 1887 residues) assemble into a dodecameric α₆β₆ complex with each subunit contributing specific catalytic domains for fatty acid biosynthesis (Schweizer and Hofmann 2004). In contrast, animals and some bacterial FAS encode all essential catalytic functions within a single polypeptide of ~ 2,500–3,300 amino acids (Type 1) (Smith 1994) (Smith, Witkowski et al. 2003) (Schweizer and Hofmann 2004) (Wang, Tang et al. 2011). The deduced length of FAS2 in CBS 14, 349, and 6016 is around 2900 amino acid residues, which may suggest a single-polypeptide FAS architecture. The results of the InterProScan indicates that CBS 14, 349, and 6016 have similar domain composition in their FAS2 sequences (Table 3) and key enzymatic domains required for de novo fatty acid biosynthesis, characteristic of Type I FAS synthases are present. Specifically, the sequence contains domains characteristic to ketoacyl synthase (KS) (IPR018201), malonyl/acetyltransferase (MAT) (IPR014043), ketoacyl reductase (KR) and enoyl reductase (ER) (inferred from the NAD(P)-binding domain (IPR036291), dehydratase (DH) (IPR029069), acyl carrier protein (ACP) (IPR009081), and phosphopantetheinyl transferase (PPT) required for ACP activation (IPR008278). The distribution of domains across FAS1 and FAS2 in R. toruloides differs markedly from the typical Type II fatty acid synthase (FAS) architecture found in S. cerevisiae. While S. cerevisiae shows a clear separation of functional domains between its α- and β FAS subunits (encoded by FAS2 and FAS1, respectively), many of these domains in R. toruloides are either shifted between subunits or duplicated (Table 3). The length of the analysed sequence (~ 2900 amino acid residues), the domain architecture, and the presence of functional domains may point to single-chain, multifunctional fatty acid synthase. Reflecting on the lower lipid accumulation that we observed in CBS 349 compared to both CBS 14 and the hybrid CBS 6016, a full-length FAS1 might improve fatty acid production, but does not seem to be essential. On the other hand, reduced lipid accumulation can also be due to the truncation of the acetyl-CoA carboxylase. This enzyme generates malonyl-CoA, which is a substrate of FAS to produce acyl-CoA. | ||||
| We identified seven long-chain fatty acid ligases (FAAs) in strain CBS 6016. FAAs are converting free fatty acids to Acyl-CoA and thus essential for the activation of fatty acids for fatty acid degradation or incorporation into phospholipids (Black and DiRusso 2007). S. cerevisiae has been shown to harbor four distinct FAA enzymes, each with specificity toward fatty acids of certain chain lengths and/or position of double bound (Johnson, Knoll et al. 1994) (Knoll, Johnson et al. 1995). Only two of them activate endogenous FAs (Johnson, Knoll et al. 1994). Among the FAAs of the investigated R. toruloides-strains, four FAA genes are homologs originally found in the CBS 14 strain, while three FAA gens are inherited from strain CBS 349 (Table 3). In CBS 349, we identified four FAA-like CDSs, but only two appear to encode a full-length protein. The others are either truncated or shorter in length, which may suggest a reduced capacity to degrade long chain fatty acids (FA) once stored in lipid bodies. S. cerevisiae strains deficient in FAA1 and FAA4 have been shown to have a reduced lipid synthesis, underscoring a function of FAAs in regulation of lipid synthesis (Black and DiRusso 2007). The deficiency of CBS 349 in FAAs may also explain its low ability of lipid synthesis. | ||||
| CBS 6016 had the highest number of genes involved in lipid metabolism, carrying genes from both, CBS 14 and CBS 349. However, the final lipid concentration was not significantly higher than in CBS 14. This may indicate that enhanced gene copy number does not necessarily result in higher lipid production. | ||||
| CBS 349 also showed a drastically reduced synthesis of carotenoids compared to the two other strains. The AL1 gene, encoding the enzyme responsible for converting phytoene to all-trans lycopene, was found in all three strains (supplementary figures S1-S12). In contrast, analysis of AL2 shows more differences: CBS 14 and CBS 6016 each harbor a full-length bifunctional phytoene synthase with 100% sequence identity. The CBS 6016 genome also contains a homolog sharing 96% identity, which may have originated from its parental strain, CBS 349. Phytoene synthase catalyzes not only the synthesis of phytoene from geranylgeranyl pyrophosphate (GGPP) but also mediates the cyclization of lycopene via γ-carotene to β-carotene or the conversion of 3,4-dehydrolycopene to torulene. The enzymatic pathway of torulene to torularhodin remains unresolved (Kot, Błażejak et al. 2016). Interestingly, CBS 349 appears to contain a severely truncated version of the AL2 gene (118 aa), matching only the N-terminal region of the lycopene cyclase domain. This may impair its functionality, likely contributing to the reduced carotenoid biosynthesis. Nevertheless, the detectable (albeit much lower) levels of β- and γ-carotene, torulene and torularhodin in CBS 349 suggest that R. toruloides CBS 349 either retains a partially functional carotenoid pathway through truncated AL2 or employs an alternative carotenoid biosynthesis pathway. | ||||
| Analysis of genes involved in lipid- and carotenoid metabolism showed that the hybrid CBS 6016 apparently got equally genetic information from both CBS 14 and CBS 349. However, in some cases CBS 6016 contains homologues to CBS 349, which encode for longer amino acids than in CBS 349. This is for instance the case for FAS1, the ACC-encoding gene, and two of the FAA-encoding genes. Because CBS 6016 is a hybrid of CBS 14 and CBS 349, our results indicate that CBS 349 underwent genetic changes after the hybridization event, i.e. during sub-cultivation in strain collections. Spontaneous mutations are common in all organisms including R. toruloides (Long, Behringer et al. 2016). However, in this case the extent of genetic changes seems to be more significant than in the other strains of the same species, which may need further consideration in future research. | ||||
| Our study shows that there is considerable genetic diversity within the species R. toruloides, which may explain previously observed strain variabilities on different substrates (Brandenburg et al. 2021, Chmielarz et al. 2021). | ||||
| Conclusions | ||||
| Our study revealed a substantial variability in accumulation of biomass, lipids, and carotenoids between strains of R. toruloides. The hybrid strain CBS 6016T displayed a lipid and carotenoid profile comparable to one of its parental strains, CBS 14, whereas CBS 349 exhibited slower growth and markedly lower accumulation of both lipids and carotenoids. Higher productivity in terms of biomass, lipids, and carotenoids appeared to be linked to genetic traits associated with CBS 14. However, this relationship was not linear, as the higher gene copy number in CBS 6016 did not translate into increased production. In contrast, genetic traits of CBS 349, representing mating type A2, appeared less favorable for high productivity. Still, as CBS 349 may have lost genetic information during long-term maintenance in the strain collection, this conclusion cannot be fully confirmed. Overall, our study highlights the substantial genetic variability within R. toruloides, providing a valuable foundation for future strain engineering and biotechnological applications. | ||||
GC | Gas chromatography |
|---|---|
PDA | Photodiode array |
PUFAs | Polyunsaturated fatty acids |
SFAs | Saturated fatty acids |
UFAs | Unsaturated fatty acids |
SC-CO2 | Supercritical carbon-dioxide |
TAGs | Triacylglycerols |
UHPLC CDS FAS | Ultra-high pressure liquid chromatography Coding sequences Fatty acid synthase |