1. Introduction
Pigs are significant economic mammals due to their efficient conversion of feed to meat, high reproductive rates, and the wide range of uses for various parts of the animal. They are a primary source of meat for many families and are also used for leather, fat, and other byproducts in various industries [
1]. Understanding the mechanisms involved in growth and reproduction is crucial for improving breeding/production strategies, enhancing meat quality, and optimizing growth rates in livestock [
2,
3]. In addition, genetics and genetic editing in pig breeding are critical areas of research that aim to enhance desirable traits, improve health, and increase productivity in swine [
4]. Except as economic animals, pigs are valuable models for human disease research because of their physiological and genetic similarities to humans [
5,
6]. Their organ systems and metabolic processes closely resemble those of humans, making them ideal for studying complex diseases such as diabetes, cardiovascular conditions, and obesity. Additionally, pigs can be genetically modified to create models that mimic specific human diseases, allowing for more targeted research. This similarity enhances the translational potential of findings from pig studies to human applications, facilitating the development of new therapies and interventions [
7].
Long non-coding RNAs (lncRNAs) are a class of RNA molecules, typically defined as transcripts longer than 200 nucleotides (nt) that are not translated into proteins [
8]. These lncRNAs can interact with DNA, mRNA, microRNAs, and proteins, thereby influencing key biological processes such as chromatin remodeling, transcription, and post-transcriptional regulation [
9]. As epigenetic regulators, lncRNAs play vital roles in modulating gene expression, cell differentiation/apoptosis, and growth/developmental processes [
10]. Moreover, lncRNAs are significantly implicated in the development of various diseases, particularly cancer, cardiovascular diseases, and neurological disorders. For instance, altered expression levels of specific lncRNAs have been associated with several neurodegenerative, neurodevelopmental, and mental health disorders [
11]. Understanding the mechanisms through which lncRNAs contribute to these diseases may pave the way for novel therapeutic targets and biomarkers, offering new avenues for treatment and diagnosis [
12,
13].
The mechanisms through which lncRNAs function are intricate and not yet fully elucidated. Based on their cellular localization, lncRNAs can be categorized into two main types: cytoplasmic lncRNAs and nuclear lncRNAs. Nuclear lncRNAs are often considered the dominant type due to their significant roles in regulating gene expression, chromatin structure, and transcriptional processes within the nucleus. They interact with chromosomes, chromatin domains, and specific genomic loci, influencing cellular functions and contributing to the complexity of gene regulation and nuclear function [
14]. In contrast, cytoplasmic lncRNAs exhibit a remarkable capacity to interact with various cellular structures (including ribosomes, the cytoskeleton, the endoplasmic reticulum, and mitochondria), allowing them to influence mRNA transport, stability, and translation [
8,
15]. Additionally, cytoplasmic lncRNAs play essential roles in modulating protein stability, post-translational modifications, and functional activities [
16]. The mechanistic diversity of cytoplasmic lncRNAs encompasses not only the regulation of cellular architecture but also the direct control of protein synthesis and function, highlighting their multifaceted contributions to cellular homeostasis.
lncRNAs exhibit differential expression across various tissues, with certain lncRNAs being abundant in specific tissues while either absent or present at lower levels in others [
17,
18]. For instance, some lncRNAs are predominantly expressed in the brain, influencing neuronal development and function, while others are enriched in muscle tissue, playing critical roles in muscle differentiation and regeneration [
19,
20]. Research has demonstrated that lncRNAs are pivotal in regulating essential economic traits such as muscle development and fat deposition, primarily through mechanisms involving epigenetic regulation and chromatin remodeling [
21,
22].
Transcriptomic analyses of the longissimus dorsi muscle have unveiled a regulatory network of lncRNAs involved in muscle fiber differentiation [
23]. Additionally, studies indicate that lncRNAs significantly influence fat deposition in the adipose tissue of Ningxiang pigs, particularly through the sphingolipid metabolic pathway [
24]. In the realm of reproductive traits, the expression of lncRNAs in porcine oocytes is closely associated with DNA methylation modifications [
25]. However, most existing research has focused on individual tissues, with limited exploration of lncRNAs within multi-tissue regulatory systems, such as the hypothalamic-pituitary-ovarian axis. The dynamic expression patterns of lncRNAs across multiple tissues and their conservation across species during oocyte development remain underexplored. This gap in systematic investigation constrains the potential applications of lncRNAs in pig breeding programs and other biomedical research utilizing pig as a disease model.
In contrast to lncRNAs, short non-coding RNAs, particularly microRNAs, display a higher degree of conservation across species due to their fundamental roles in gene regulation and cellular processes [
26,
27]. The lack of conservation of lncRNAs compared to miRNAs and mRNAs is often linked to their diverse regulatory roles in biological processes [
8,
28]. lncRNAs may not be conserved across species due to rapid evolution, functional redundancy, tissue-specific expression, changes in genomic context, and functional divergence. These factors allow lncRNAs to adapt to unique regulatory roles in different species, leading to significant sequence variability and a lack of conservation [
29]. Despite the significance of lncRNAs, there is a notable scarcity of studies investigating their cross-species conservation, which limits our understanding of their evolutionary significance and functional implications. Further research is crucial to elucidate the conservation patterns of lncRNAs and their impact on regulatory networks across species.
In this study, we aim to address the gaps in our understanding of tissue-specific and species-specific expression patterns of lncRNAs by leveraging publicly available transcriptomic datasets in conjunction with our own generated datasets. Furthermore, we seek to clarify the regulatory mechanisms, networks, and pathways associated with these lncRNAs.
2. Materials and Methods
2.1. Sample/Data Collection
A total of 607 RNA-seq datasets were obtained from the Sequence Read Archive (SRA;
https://www.ncbi.nlm.nih.gov/sra, accessed on 22 September 2022), encompassing 14 different tissues from pigs (
Supplemental Table S1). The tissue distribution included backfat (
n = 31), gallbladder (
n = 19), heart (
n = 28), ileum (
n = 28), jejunum (
n = 21), kidney (
n = 16), longissimus dorsi muscle (
n = 96), liver (
n = 64), lung (
n = 77), muscle (
n = 28), ovary (
n = 55), pituitary (
n = 26), muscle (
n = 72, including longissimus muscle and skeletal muscle), and spleen (
n = 46). Additionally, 53 porcine oocyte samples were collected from four healthy Large White × Landrace crossbred gilts (approximately 210 days old) that exhibited signs of estrus. To investigate the alterations in the expression patterns of long non-coding RNAs (lncRNAs) throughout follicular development, antral follicles (AF) were categorized into five distinct stages according to their diameters. These stages are designated as follows: AF1 (0.5–1.8 mm), AF2 (1.8–3 mm), AF3 (3–5 mm), AF4 (5–7 mm), and AF5 (>7 mm), and the transcriptomes of oocytes from these AFs were conducted on the single-cell RNA-seq [
30].
Human oocyte RNA-seq data were retrieved from the Gene Expression Omnibus (GEO) under accession number GSE107746, which included 20 samples. Mouse oocyte RNA-seq datasets were sourced from GEO accession numbers GSE141190, GSE70116, GSE116771, GSE74344, and GSE73803, comprising a total of 17 samples for analysis.
2.2. Quality Control, Read Alignment, and Transcriptome Assembly
The samples were transformed into paired-end FASTQ format using the fast-dump module of the SRA Toolkit (version 2.8.2). The quality of the raw sequencing data was assessed using FASTP software (version 0.23.2) with default parameters, which allowed for filtering and trimming of the reads to eliminate low-quality bases and adaptors [
31]. Following this, the clean data were mapped to the reference genome using HISAT2 software (version 2.1.0), and the resulting SAM files were converted to BAM format using Samtools (version v1.11).
RNA-seq data from each sample were processed with StringTie to generate transcript-level GTF files. These files were subsequently merged using StringTie’s merge function [
32], resulting in a unified GTF that includes all predicted transcripts. Transcript annotation was performed using gffcompare [
33], which identifies relationships between predicted and reference transcripts, such as antisense overlap, intronic location, and shared splice sites.
Transcripts were retained based on the following criteria: (i) antisense overlap with a reference transcript; (ii) being entirely intronic; (iii) sharing splice sites; (iv) exon overlap; (v) being located in intergenic regions; or (vi) sharing intron boundaries. The filtered GTF was then utilized with gffread to extract transcript sequences in FASTA format.
2.3. lncRNA Detection
Candidate sequences were evaluated for coding potential using three tools: CNCI [
34], CPC2, and FEELnc [
35]. Transcripts were classified as lncRNAs if they scored less than 0 in CNCI, less than or equal to 0 in CPC2, or fell below the machine-learning-derived threshold in FEELnc. Only sequences consistently identified as non-coding by these tools were retained for further analysis.
2.4. Descriptive Statistics of lncRNAs and Expression Quantification
We conducted descriptive analyses of lncRNAs expressed in each tissue, focusing on statistics related to their number, length distribution, and classification based on genomic location. According to the annotations provided by FEELnc, lncRNAs were categorized into four types based on their genomic context: exonic, intronic, upstream, and downstream. We then calculated the proportion of expressed lncRNAs in each category for every sample. Statistical differences in lncRNA positional distribution among tissues were determined using t-test.
Expression quantification of lncRNAs was performed using featureCounts [
36], resulting in a count matrix for downstream analysis. To minimize false positives, we retained only those lncRNAs with more than 10 reads in at least three samples of a single tissue type, ensuring robust and reproducible expression results.
2.5. t-SNE Clustering Analysis of lncRNAs
t-distributed stochastic neighbor embedding (t-SNE) was utilized for dimensionality reduction and clustering analysis to investigate the expression pattern relationships of lncRNAs across different samples [
37]. To further assess the correlation between tissues, Spearman′s rank correlation coefficient was calculated [
38].
2.6. Identification of Housekeeping and Tissue-Specific lncRNAs
Housekeeping lncRNAs were identified based on two criteria: (i) the average expression level of the lncRNA across all tissues exceeded the median expression level of all genes, and (ii) the coefficient of variation (CV) of its expression across tissues was less than 1. This approach allowed for the identification of lncRNAs that may play regulatory roles throughout the organism. These lncRNAs exhibit relatively high and stable expression across multiple tissues and are believed to be crucial in various biological processes.
Tissue-specific lncRNAs were identified based on the following conditions: (i) their expression level in a given tissue ranked within the top 25% of all lncRNAs; (ii) their expression in that tissue accounted for more than 50% of their total expression across all tissues; and (iii) their expression in that tissue was at least three times higher than in any other tissue. Such lncRNAs are likely involved in tissue-specific physiological or developmental processes.
To further explore the relationship between lncRNAs and mRNAs, a similar methodology was employed to identify housekeeping and tissue-specific mRNAs.
2.7. Identification of mRNAs and circRNAs in Oocytes
Sequence alignment was initially conducted using HISAT2 (version 2.1.0), followed by transcript reconstruction and expression estimation with StringTie (version 2.1.5). Gene- and transcript-level read counts were subsequently obtained using featureCounts (version 2.0.1). Utilizing previously established criteria for identifying housekeeping and tissue-specific mRNAs [
39], we successfully identified both categories within the oocyte transcriptome.
To further investigate the interactions between circular RNAs (circRNAs) and lncRNAs in oocytes, we systematically identified circRNAs from 55 oocyte samples. The same quality control and alignment procedures applied to lncRNAs were used to ensure consistency and comparability across datasets. CircRNAs were accurately detected using CIRI2 software (version 2.0.6) [
40], and their expression patterns and characteristics were analyzed to explore their potential interactions with lncRNAs in oocytes.
2.8. Construction of the ceRNA Network
To investigate the potential regulatory interactions among tissue-specific lncRNAs, miRNAs, and mRNAs, we constructed a competing endogenous RNA (ceRNA) network centered on tissue-specific lncRNAs. Here, we define the mRNAs predicted to interact with miRNAs in the lncRNA–miRNA–mRNA axis as the downstream target mRNAs of lncRNAs. lncRNA–miRNA and miRNA–mRNA interactions were predicted using miRanda (version 3.3a) with parameters score > 140 and energy < −20 kcal/mol. Only interactions supported by miRBase-annotated miRNA binding sites were retained. The circRNA–miRNA interactions were identified using the same procedure. Cytoscape (version 3.10.0) was used to integrate these interactions into a circRNA/lncRNA–miRNA–mRNA network.
Initially, sequence information for lncRNAs and mRNAs was extracted using BEDTools2 (version 2.23.0), while miRNA binding site sequences were sourced from miRBase (version v22.1) [
41]. miRanda was then employed to predict interactions between miRNAs and lncRNAs, as well as between miRNAs and mRNAs. Downstream target mRNAs were defined as protein-coding transcripts whose 3′ UTRs were predicted to bind miRNAs that also interacted with the identified lncRNAs or circRNAs, thereby representing potential ceRNA relationships in which lncRNAs/circRNAs act as molecular sponges for shared miRNAs. By evaluating binding scores and minimum free energy, we assessed the stability and potential strength of these interactions. Finally, to visualize the regulatory network, we selected the top five lncRNAs with the highest interaction scores and mapped their ceRNA relationships using Cytoscape software [
42].
2.9. Functional Enrichment Analysis of lncRNA-Associated Genes
In general, genes located within ±100 kb of the lncRNA locus and exhibiting a correlation coefficient greater than 0.6 with the lncRNA are considered as its potential cis-regulatory target genes. Using the annotation results from FEELnc, we identified the nearest potential parental protein-coding genes for each lncRNA to investigate their possible cis-regulatory roles. These genes were subsequently subjected to Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses to explore the functional regulatory network associated with lncRNAs.
GO enrichment analysis was conducted using the clusterProfiler R package (version 3.14) and visualized with ggplot2 (version 3.3.5) [
17], which revealed significant enrichment of lncRNA-associated genes across cellular components, biological processes, and molecular functions. For KEGG pathway enrichment analysis, we employed the ShinyGO (version 0.7.6) tool to identify overrepresented signaling and metabolic pathways [
43]. In both analyses, statistical significance was assessed using the hypergeometric test, with a
p-value threshold of <0.05 deemed significant.
2.10. Co-Expression Network Analysis
We employed Weighted Gene Co-expression Network Analysis (WGCNA) to investigate the co-expression patterns of lncRNAs across various tissues [
44]. The analysis was conducted using the WGCNA package in R (version 1.70). WGCNA-based clustering was performed using the average linkage method (hclust function), and the soft-thresholding power (β) was determined with the “pickSoftThreshold” function in WGCNA, which defines the strength of connections between lncRNAs.
We focused on modules with a correlation coefficient greater than 0.65, which were considered to exhibit significant tissue specificity in this study. After selecting an appropriate soft-thresholding power, we constructed the co-expression network and identified key tissue-specific modules using WGCNA. Additionally, we identified critical tissue-specific modules from these co-expression networks and utilized module membership (MM) and gene significance (GS) to determine the core lncRNAs, referred to as hub lncRNAs. Hub lncRNAs were defined based on pre-established thresholds, typically |GS| > 0.2 and |MM| > 0.8, indicating strong correlations between lncRNAs and the module characteristic genes.
2.11. Inter-Species Conservation Analysis of lncRNAs
To facilitate the conversion of genomic positions, we first downloaded the necessary files for genome conversion from the UCSC Genome Browser: hg38ToSusScr11.over.chain.gz and mm39ToSusScr11.over.chain.gz. These files contain the mapping relationships from the human (hg38) and mouse (mm39) genomes to the pig genome (susScr11).
Utilizing the Liftover tool from the UCSC Genome Browser [
45], we mapped the lncRNAs identified in the human (hg38) and mouse (mm39) genomes to their corresponding positions in the pig genome (susScr11), applying a parameter of “minMatch = 0.5” to ensure a minimum match rate.
Subsequently, we compared the resulting pig-genome-aligned lncRNAs with the lncRNA positions obtained from 15 different pig tissues. By examining the overlap of genomic positions, we identified lncRNAs that are conserved across species, which we refer to as inter-species conserved lncRNAs.
Short Time-series Expression Miner (STEM v1.3.13) was used with the following parameters: maximum model profiles = 50, minimum correlation coefficient = 0.7. Significant clusters were determined using Bonferroni-adjusted p < 0.05.
2.12. Statistical Analysis
Spearman′s correlation coefficient was utilized to compare the expression levels of lncRNAs across different tissues and time points for significant differences. In the context of descriptive statistics across multiple tissues, t-tests were employed to detect differences between tissue types. For GO analysis using the clusterProfiler R package (version 3.14) and KEGG analysis using ShinyGO (version 0.76), a significance threshold of p < 0.05 was applied. Additionally, Pearson′s correlation coefficient was used for correlation analyses among lncRNAs, mRNAs, and circRNAs.
Differential expression analysis of lncRNAs was performed in R using DESeq2 (v1.30.1). Raw counts from featureCounts were normalized by the median-of-ratios method. For pairwise comparisons among the five oocyte stages (AF1–AF5), a negative binomial Wald test was applied, and
p values were adjusted using the Benjamini–Hochberg procedure. lncRNAs with an adjusted
p value (FDR) < 0.05 and an absolute log2 fold change ≥ 1 were considered differentially expressed. Low-abundance lncRNAs were filtered prior to testing (read count > 10 in ≥ 3 samples of at least one stage/tissue, consistent with
Section 2.4).
2.13. Molecular Validation
To ensure the accuracy of the identified lncRNAs, we randomly selected four housekeeping lncRNAs, known for their stable expression across all cell types, for experimental validation. Validation was performed using spleen tissue, chosen due to its stable RNA quality and availability. We employed polymerase chain reaction (PCR) to amplify and confirm the presence of these four lncRNAs.
Total RNA was isolated from freshly dissected spleen tissues using TRIzol reagent following the manufacturer’s instructions. Genomic DNA was removed by on-column DNase I treatment (10 U, 15 min) to avoid DNA contamination. RNA concentration and purity were assessed by spectrophotometry (A260/280 between 1.9 and 2.1; A260/230 > 2.0), and integrity was evaluated by agarose gel electrophoresis. Only samples meeting these criteria were used for downstream analyses. The validation process involved several steps: First, the sequences of the housekeeping lncRNAs were extracted based on their known genomic locations, and specific primers were designed for each lncRNA (
Supplemental Table S2). After the primers were designed, the PCR mixture was prepared, following established PCR protocols while making necessary adjustments to suit the experimental requirements (
Supplemental Table S3). Once the PCR mixture was thoroughly mixed, it was transferred to a PCR thermal cycler, where temperature cycling was applied according to the specified program settings (
Supplemental Table S4). Finally, the PCR products were analyzed using gel electrophoresis, which allowed for the separation and visualization of the target fragments.
4. Discussion
In this study, we conducted a comprehensive analysis of lncRNAs across various porcine tissues and oocytes at different developmental stages. Our findings underscore the critical roles of lncRNAs in growth, development, and trait formation in pigs. We observed significant variations in lncRNA expression patterns among different tissues, with oocytes exhibiting the most distinct and pronounced lncRNA profiles. These results provide valuable theoretical insights into the regulatory mechanisms of lncRNAs, thereby deepening our understanding of porcine functional biology and development.
4.1. Expression Patterns of LncRNAs Across 15 Tissues
lncRNAs exhibit tissue-specific expression patterns, playing crucial roles in regulating gene expression, cellular differentiation, and development. Understanding lncRNAs profiles can provide insights into physiological functions, disease mechanisms, and potential therapeutic targets, enhancing precision medicine approaches for tissue-specific conditions [
46,
47].
lncRNAs are distributed throughout chromosomes, often situated near or within gene loci. While some lncRNAs exhibit tissue-specific expression, others play roles in regulating genes across different chromosomes [
48]. Their precise locations can significantly influence their functions, including chromatin remodeling, transcriptional regulation, and interactions with other RNA molecules. In our study, we observed that lncRNAs were most abundant on Chromosome 1 and least abundant on the Y chromosome. Additionally, lncRNA density was highest on Chromosome 9 and lowest on the Y chromosome (
Figure 1A,B).
Porcine Chromosome 1, one of the largest chromosomes in the pig genome, contains a substantial number of genes associated with various traits, including growth, reproduction, and immune response. Therefore, investigating gene functions and lncRNAs from this chromosome is vital for understanding their roles in economically important traits such as meat quality and disease resistance. Chromosome 9, although smaller than Chromosome 1, still harbors essential genetic information related to metabolic processes and growth regulation. Consequently, studying gene functions and lncRNAs from this chromosome will also provide valuable insights into traits such as fat deposition and carcass quality.
In contrast, the Y chromosome has a lower abundance of lncRNAs compared to the X chromosome and other chromosomes, primarily due to its smaller size and reduced gene content [
49]. The Y chromosome is highly specialized and has undergone significant degeneration throughout evolutionary history, resulting in the loss of many genes and genetic material. This reduction limits the potential for lncRNA transcription on the Y chromosome.
The strand orientation of lncRNAs is essential for understanding their functions, as it can influence their interactions with other RNA molecules or proteins. In the study, we revealed that approximately 25% of lncRNAs were located on the sense strand, while 75% were found on the antisense strand (
Figure 1E). Antisense lncRNAs can overlap with their target mRNAs, forming RNA:RNA complexes that can either stabilize or degrade the mRNA, depending on the context. For instance, an antisense lncRNA can have both overlapping and non-overlapping domains with its target mRNA, allowing for specific interactions and regulatory functions [
50]. Sense lncRNAs can act as decoys or scaffolds for proteins, influencing gene expression by recruiting transcriptional regulators or modifying chromatin structure.
lncRNAs often possess multiple functional domains that facilitate interactions with proteins, other RNA molecules, and DNA, thereby playing crucial roles in various biological processes. In our study, we observed that the length distribution of lncRNAs is consistent across different tissues, with a peak length ranging from 800 to 900 nt. The longest lncRNA identified in this study exceeded 10,000 nt. Longer lncRNAs have the potential to adopt complex secondary and tertiary structures, which are vital for their stability and functionality [
51]. The length of lncRNAs may contribute to the formation of these intricate structures, facilitating specific interactions with their molecular targets. The observed 800–900 nt range likely represents an optimal balance between providing sufficient length for effective interactions and maintaining the efficiency of transcription and processing [
51].
In our study, we found that the majority of lncRNAs are located within exonic regions, with intronic regions being the second most common location. This distribution may be attributed to the fact that exonic lncRNAs often play more direct regulatory roles in relation to the protein-coding genes with which they are associated. In contrast, intronic lncRNAs may primarily influence the splicing and processing of pre-mRNA, thereby contributing to the regulation of gene expression at different levels [
52].
Among all tissues analyzed, the ovary exhibited the lowest number of detected lncRNAs (2693), significantly fewer than other tissues. The reduced number of lncRNAs detected in oocytes may reflect their single-cell identity compared to heterogeneous tissues composed of multiple cell types. This feature highlights the transcriptional simplicity of oocytes at specific stages rather than a true reduction in regulatory complexity. Also, this may be related to the unique transcriptional state of oocytes, which are known to exist in either the surrounded nucleoli (SN) or non-surrounded nucleoli (NSN) state (3, 25). In contrast, the pituitary gland contained the highest number of lncRNAs (20,360). This discrepancy may be attributed to the fact that actively dividing or metabolically active cells require precise regulation of gene expression to support processes such as growth, differentiation, and responses to environmental signals. This heightened activity is associated with increased transcription of both protein-coding genes and lncRNAs [
51].
4.2. Evolutionary Conservation of Oocyte lncRNAs Across Species
The evolutionary conservation of oocyte lncRNAs varies significantly across different species, reflecting the extent to which specific lncRNA sequences and their functions have been preserved throughout evolutionary history. This conservation is crucial, as sequences that remain intact over time often indicate essential roles in critical cellular processes [
53].
While it is commonly understood that many lncRNAs experience weaker evolutionary constraints and exhibit lower levels of sequence conservation compared to protein-coding genes, a notable subset of lncRNAs demonstrates significant conservation across diverse species [
54]. In many species, including mammals, birds, and amphibians, certain oocyte lncRNAs exhibit a high degree of sequence similarity, indicating their importance in oocyte development and maturation. For instance, lncRNAs that regulate key genes involved in oocyte meiosis and folliculogenesis are often conserved, highlighting their critical roles in reproductive success [
55].
In our investigation of oocyte lncRNAs, we identified 1078 conserved lncRNAs, of which 810 displayed pig-specific or pig-preferred expression patterns, while 268 showed a preference for expression in humans. The conservation of lncRNAs suggests they may play vital roles in fundamental biological processes, including development, cell differentiation, and responses to environmental changes. The loss of such conserved lncRNAs could potentially have detrimental effects on organismal fitness, highlighting their significance in evolutionary biology [
53].
To further explore the potential functions of lncRNAs, we categorized housekeeping lncRNAs based on their genomic positioning and strand orientation. GO and KEGG enrichment analyses revealed that lncRNAs located in intronic regions and on the antisense strand are particularly enriched in pathways related to gene expression regulation, epigenetic modification, and embryonic development. These findings are consistent with previous research demonstrating that intronic lncRNAs play a vital role in regulating gene expression, alternative splicing, and RNA processing [
54]. Intronic lncRNAs influence transcriptional interference, maintain nuclear architecture, function as ceRNAs, and modulate epigenetic modifications, thereby contributing significantly to developmental processes and cellular functions. Similarly, lncRNAs located on the antisense strand regulate gene expression by modulating transcription, influencing alternative splicing, and recruiting chromatin modifiers. They can also act as ceRNAs, sequestering microRNAs and facilitating the nuclear retention of mRNAs, which ultimately impacts various cellular functions [
56].
4.3. Oocyte Development Stages
Oocytes exhibit the lowest levels of lncRNA expression among the analyzed tissues, primarily due to their specific developmental stage, streamlined regulatory mechanisms, and focus on essential functions necessary for fertilization and early embryonic development. This distinctive transcriptional landscape prioritizes critical processes, leading to reduced lncRNA activity compared to other tissues [
57,
58]. In our study, we observed no significant differences in lncRNA levels across the developmental stages, likely for the reasons mentioned above.
Using the Spearman correlation analysis, we found limited similarity in lncRNA expression profiles across the five developmental stages of oocyte maturation. These findings suggest that dynamic changes in lncRNA expression are essential for the proper regulation of oocyte development. The variations in lncRNA expression may be linked to specific developmental processes, signaling pathways, or cellular functions unique to each stage, underscoring the importance of lncRNAs in facilitating transitions between developmental phases and ensuring the successful maturation and functionality of oocytes [
56].
To investigate the expression dynamics of lncRNAs across the five developmental stages of porcine oocytes, we conducted a differential expression analysis among 5 stages and performed GO enrichment analysis of their predicted cis-regulatory target genes. The comparison between the AF2 (1.8–3 mm) and AF3 (3–5 mm) stages revealed significant enrichment of target genes associated with key biological processes, including female sex differentiation, the development of primary female sexual characteristics, and female gonad development. During the AF2 stage, follicles exhibit increased responsiveness to hormonal signals, particularly follicle-stimulating hormone (FSH). In pigs, follicles transition from the secondary follicle to the antral follicle when their diameter is less than 3 mm [
3]. At this stage, the oocyte is classified as a growing oocyte and does not resume meiosis in vitro. This period is crucial for selecting which follicles will continue to develop. In contrast, during the AF3 stage, the oocyte within the follicle is actively growing and maturing. Oocytes achieve full meiotic competence when their diameter reaches ≥3 mm, signifying that they have attained cytoplasmic maturity [
3].
4.4. Limitations of Our Study
Our study has several limitations. First, while we analyzed a broad range of porcine tissues, the primary focus was on 14 specific tissues and oocytes. Including additional tissue types, especially those involved in reproductive and developmental processes, could provide deeper insights into the functions of lncRNAs. Second, although our analysis incorporated a substantial number of RNA-seq datasets, the variability in sample sizes among different tissues may affect the robustness of our findings. Third, the conservation analysis of lncRNAs across species, specifically humans and mice, may be constrained by differences in evolutionary pressures and the functional roles of lncRNAs in various species. As a result, our findings may not fully capture the functional significance of conserved lncRNAs in pigs. Finally, the study primarily focused on differential expression patterns without thoroughly investigating the mechanistic roles of specific lncRNAs in regulating target genes or biological processes. Conducting functional validation of selected lncRNAs in spleen tissue would enhance the conclusions drawn from this research.