Single cell sequencing of the mouse anterior palate reveals mesenchymal heterogeneity
Abstract
Background
Cleft palate is one of the most prevalent birth defects. Mice are useful for studying palate development because of their morphological and genetic similarities to humans. In mice, palate development occurs between embryonic days (E)11.5 to 15.5. Single cell transcriptional profiles of palate cell populations have been a valuable resource for the craniofacial research community, but we lack a single cell transcriptional profile for anterior palate at E13.5, at the transition from proliferation to shelf elevation.
Results
A detailed single cell RNA sequencing analysis reveals heterogeneity in expression profiles of the cell populations of the E13.5 anterior palate. Hybridization chain reaction RNA fluorescent in situ hybridization (HCR RNA FISH) reveals epithelial populations segregate into layers. Mesenchymal populations spatially segregate into four domains. One of these mesenchymal populations expresses ligands and receptors distinct from the rest of the mesenchyme, suggesting that these cells have a unique function. RNA velocity analysis shows two terminal cell states that contribute to either the proximal or distal palatal regions emerge from a single progenitor pool.
Conclusion
This single cell resolution expression data and detailed analysis from E13.5 anterior palate provides a powerful resource for mechanistic insight into secondary palate morphogenesis for the craniofacial research community.
1 INTRODUCTION
Cleft palate is one of the most prevalent birth defects affecting 1 in every 1700 babies resulting from a failure of fusion of the palatal shelves during embryonic development.1 Perturbation of over 300 genes leads to cleft palate.2-4 Exposure to various pharmacological medications, nicotine, or alcohol increases incidence of cleft palate suggesting that palate development is especially susceptible to genetic and environmental insult.5-11 Children with cleft palate have difficulty with feeding, speech, and hearing.12 Surgical intervention is costly and places a large financial burden on affected families.13 Palatal development is a highly conserved process among mammals.14 Mice have traditionally been used as a model for studying palate closure. In mice, this process occurs between embryonic day (E)11.5 to E15.5.15-17 The secondary palate originates as paired outgrowths from either oral side of the developing maxillary prominences on E11.5.16, 17 The palate undergoes a period of vertical outgrowth from E11.5 to E13.5, increasing greatly in size due to high rates of cellular proliferation.18 At E14.0, the palatal shelves elevate above the tongue, grow horizontally at E14.5, adhere at E15.0, and fuse by E15.5.16 The precise molecular mechanisms controlling these transitions have not been completely characterized. Specifically, the transition from a period of high proliferation at E13.5 to a dynamic elevation above the tongue at E14.0 is particularly understudied.
Microarrays and bulk RNA sequencing can provide important information about gene expression allowing for novel pathway identification and hypothesis development. Previous work within the palate has utilized both methods to gain more information about the developmental landscape of palatogenesis. Affymetrix GeneChip microarrays have provided transcriptional profiles of whole palates at E13.5, E14.5, and E15.5.19 Laser capture microdissection combined with RNA-seq has allowed for more spatial compartmentalization. Researchers have used this technique to analyze differences in medial vs lateral and oral vs nasal compartments in E14.5 palatal shelves.20 However, these techniques still do not delineate gene expression differences among cells within a population or compartment. With the advent of single cell RNA sequencing (scRNA seq), we can obtain a more accurate transcriptional profile of different populations of cells that would reveal any heterogeneity. As a result, we now have single cell developmental atlases of many key timepoints in craniofacial development including upper lip and primary palate fusion,21 midfacial morphogenesis,22 soft palate myogenesis,23 tooth development,24 mandibular development,25, 26 and cranial suture formation.27-29 Although we now have bulk RNA sequencing of fused palatal shelves, scRNA-seq data from palatal precursor populations, posterior palate, and Shox2+ E13.5 palatal mesenchyme,20-23, 30 we do not have a complete atlas at a single cell resolution for the E13.5 anterior palate which would greatly increase our understanding of the complex gene regulatory processes at work.
Mesenchymal and epithelial heterogeneity has not been explored at this key timepoint prior to palatal elevation and horizontal outgrowth. Previous methods have not distinguished whether anterior palate marker genes such as Msx1 and Shox2 for example, which are used to delineate this population from the middle and posterior mesenchyme,31 are co-expressed among the same cells or occupy distinct cell populations. Gene summaries for both genes use the same language: “Msx1/Shox2 is expressed in the mesenchyme of the anterior region of the developing palate, in both nasal and oral sides”.32, 33 Similarly, most genes described with anterior palate expression are not fully parsed apart. scRNA seq sequencing allows us to look at the expression of these genes at a higher resolution than previous technologies have allowed. scRNA seq data further allows for various computational pipelines to be employed giving insights into ligand-receptor interactions, predicted transcription factor activity, and pseudotime analyses.
Here, we show unexpected complexity and heterogeneity among the mesenchymal cells of the E13.5 anterior palate. Using scRNA seq, we cluster the anterior palatal mesenchyme into four populations based on gene expression profiles. Hybridization chain reaction (HCR) confirms these populations and reveals spatial segregation into four domains: oral, lingual, proximal, and central. We find that anterior palate genes Msx1 and Shox2 mark distinct and transcriptionally separable populations within the anterior palate. Msx1 spans the proximal and central mesenchyme while Shox2 is restricted to the distal oral and lingual mesenchyme. Using cell chat, we identify a mesenchymal subpopulation (Mes_3/proximal) with distinct signaling profiles from the remaining mesenchyme. RNA velocity analysis shows the emergence of two terminal cell states from a single progenitor state within the developing palate contributing to either the proximal or distal palatal regions.
2 RESULTS
2.1 Workflow and cluster identification of single cell RNA sequencing of the E13.5 anterior palate
We performed scRNA seq on E13.5 WT anterior palates (Figure 1A) to investigate cell populations comprising the anterior palate at this key timepoint. Anterior palates were collected while leaving posterior palate tissue intact (Figures 1A and S1A-B). After filtering and quality assessment of data, we kept 22 238 cells from three embryos for downstream analysis. Sample preparations were labeled as wildtype (WT)_1, WT_2, and WT_3 and yielded 7959, 6994, and 7268 cells, respectively. Single cell population data is visualized in a Uniform Manifold Approximation and Projection (UMAP) wherein each individual dot represents a cell and shorter distances between dots constitute more similar transcriptomes between cells. Our sequencing produced six distinct clusters of cells (Figure 1B). Based on gene expression (Figure S2A-F), cell clusters were broadly defined as mesenchyme (Col1a2, Col3a1, Col1a1 expressing),34-38 proliferating (Top2a, Hist1h3c, Hist1h1b expressing),39, 40 epithelial (Epcam, Igfbp5, Pdgfa expressing),21, 41-43 endothelial (Pecam1, Plvap, Cdh5 expressing),44-46 immune (Fcer1g, Tyrobp, Cd52 expressing),47-49 and Schwann cells (Fabp7, Mpz, Sox10 expressing)50-53 (Figure 1C). The most prominent cell types were mesenchyme and proliferating mesenchyme cells followed by epithelial cells (Figures 1B and S2A-C). The endothelial, immune, and Schwann cell populations represent only a small fraction of the total cells (Figures 1B and S2A-C). While expected cell types were easily identified by gene expression, we hypothesized that higher resolution thresholding may reveal differences within cell populations that have not been previously identified.
2.2 Single cell RNA sequencing reveals heterogeneity within the E13.5 anterior palate
Cell heterogeneity among the cells within the embryonic palate remains poorly understood. To determine if cells within the E13.5 anterior palate are a diverse population, we subclustured three main populations: mesenchyme, proliferating mesenchyme, and epithelial cells at a resolution of 0.2 shared nearest neighbor (snn) within the Seurat pipeline. This produced three proliferating clusters, two epithelial clusters, and four mesenchymal clusters (Figure 2A,B). Cell cluster proportionality was consistent between sample preparations with all expected cell types appearing within each sample in expected ratios (Figure S3A,B). Top 10 markers of each of these subclusters are distinct showing clear segregation of cluster identities (Figures 2C and S4A-I). Top 15 markers of each cluster are listed in Table 1. We chose one-two distinct markers per cluster to determine spatial localization of cell populations within anterior palate tissue (Figure 2B). Subclustering of proliferating cells demonstrated various stages of the cell cycle. Prolif_1 is marked by conserved histone H3 variant Cenpa, which determines kinetochore position (Figures 2B,C, 3A and S4A). Prolif_2 is characterized by high levels of histones such as Hist1h3c and Hist1h1b indicating condensed chromatin (Figures 2B,C, 4B and S3B). Prolif_3 is marked by Ccne2 which regulates G1-S progression (Figures 2B,C, 3C and S4C). Epithelial cells cluster into two similar, but distinct groups. The top genes of Ep_1 are also highly expressed within Ep_2 with the exception of a few genes such as follistatin (Fst), insulin like growth factor binding protein 2 (Igfbp2), and transformation related protein 63 (Trp63) (Figures 2B,C, 3D,E and S4D). Ep_2 remains more transcriptionally distinct from Ep_1 through strong expression of several keratin (Krt) genes (Krt8, Krt15, Krt17, Krt18, Krt19) (Figures 2B,C, 3F,G and S4E).
Mes_l | Mes_2 | Mes_3 | Mes_4 | Prolif_1 | Prolif_2 |
---|---|---|---|---|---|
Col3a1, Shox2, Gpc6, Csrp2, Osr2, Ndnf, Pcdh17, Ptch1, Col8a1, Nrp1, Cped1, Peg10, Meis2, Twist1, Zfhx4 | Smpd3, Runx2, Zeb2, Col12a1, Sema6a, Msx1, Mef2c, Maf, Cxcl5, S100a6, Rorb, Npnt, Emp3, Cd200, Pou3f3 | Ogn, Capn6, Ptn, Mecom, Fn1, Rmst, Sfrp2, Pdzrn4, Slc1a3, Id2, Tpm1, Cdkn1c, Gm26771, Rspo3, Tsc22d1 | Epha3, Tcf4, Egfl6, Rgs2, Alx1, Pax3, Adamts5, Nppc, Wnt5a, Ddit4l, Prg4, Mfap4, Meis1, Crabp2, Limch1 | Cenpf, Cenpa, Ube2c, Tubb4b, Cenpe, Birc5, Tpx2, Cks2, Hmgb2, Cdc20, Ccnb2, Mki67, Nusap1, Ccnb1, Cdca8 | Hist1h3c, Hist1h1b, Hist1h2ap, Hist1h2ae, Hist1h2ab, Hist1h3e, Hist1h4d, Hist1h1e, Top2a, Hist1h1a, Hist2h2ac, Hist1h1d, Hist1h4h, Hist1h1c, 2810417H113Rik |
Prolif_3 | Ep_1 | Ep_2 | End | Imm | Schw |
---|---|---|---|---|---|
Rrm2, Clspn, Fam111a, Ccne2, Cdc6, Slfn9, Ung, Dtl, Mcm10, Rad51ap1, Dhfr, Uhrf1, Lig1, Gmnn, Hells | Igfbp5, Krt14, Epcam, Sox2, Dsp, Pdgfa, Pitx1, Fst, Dmrt2, Igfbp2, Siva1, Dapl1, Sema3d, Trp63, Fgf9 | Krt15, Krt17, Hspb1, Krt8, Dsp, Sfn, Epcam, Cldn7, Krt18, Krt19, Lgi2, Ezr, Krt14, Krt5, Ehf | Cdh5, Egfl7, Pecaml, Cldn5, Kdr, Flt1, Col18a1, Emcn, Sparcl1, Plxnd1, Cd34, Ecscr, S100a16, Plvap, Crip2 | Fcer1g, Tyrobp, Mrc1, C1qb, Pf4, Cd52, Lcp1, Coro1a, Laptm5, Cd74, C1qa, Lyz2, Ctss, Cx3cr1, C1qc | Phactr1, Ednrb, Fabp7, Egfl8, Sox10, Tm4sfl1, Moxd1, Cryab, Mpz, Erbb3, Cdh6, Foxd3, Gfra3, Npy, Slitrk2 |
To determine whether epithelial cell populations are restricted to different domains or if they are interspersed, we performed hybridization chain reaction RNA fluorescence in situ hybridization (HCR RNA FISH) with probes for Igfbp2 and Cldn7, top markers for Ep_1 and Ep_2, respectively. Cells expressing Igfbp2 and Cldn7 are stacked upon on another (Figure 3H-K). Igfbp2 cells line the inside of the epithelial layer contacting the mesenchyme while Cldn7 cells line the oral cavity laying on top of the Igfbp2 positive cells (Figure 3K).
Mesenchymal subclustering produced four new clusters: Mes_1-Mes_4. Mes_1 is characterized by high expression of Osr and Shox2 (Figures 2B,C, 3L and S4F). Mes_2 has high expression of Runx2 and Msx1 (Figures 2B,C, 3M and S4G). Mes_3 has high expression of Ogn and Mecom (Pdrm3) (Figures 2B,C, 3N and S4H). Mes_4 is characterized by high expression of Epha3 and Egfl6 (Figures 2B,C, 3O and S4I). To determine spatial domains of each mesenchymal cell population, we performed HCR with probes for Ogn, Runx2, Osr2, and Epha3. Osr2 marks the distal tip and nasal compartment of the palate (Figure 3P). Runx2 marks the population of mediolateral cells without significant overlap with neighboring compartments (Figure 3Q). The Ogn population is dorsal and proximal to the Runx2 population marking the most proximal palatal mesenchyme (Figure 3R). Epha3 overlaps in domain with Osr2 from the distal tip of the palate to the oral palatal mesenchyme (Figure 3S). Importantly, expression of both epithelial and mesenchyme marker genes remains consistent between cells from all three embryos (Figure S5A,B).
Proliferating subcluster markers were not independently labeled by HCR. However, it can be noted from observing expression of mesenchyme markers Osr2, Runx2, Ogn, and Epha3 on the FeaturePlots that the proliferating subclusters are comprised of all four mesenchymal subclusters (Figure 3L-O). This suggests that proliferation occurs within all mesenchymal domains at this stage.
2.3 Shox2 and Msx1 are independent populations
To determine how known markers of the anterior palate resolve at a single cell resolution, we examined expression of Msx1 and Shox2. Although both are considered markers of the anterior mesenchyme,32, 33, 54-56 our data shows Msx1 and Shox2 expression are restricted to opposite populations within the palatal mesenchyme (Figure 4A). Most mesenchymal clusters express both Shox2 and Msx1 to an extent; however, there is a clear delineation in their relative expression within those populations (Figure 4A,B). Mes_1 and Mes_4 express Shox2 more strongly while Mes_2 expresses Msx1 most strongly (Figure 4A,B). We performed HCR RNA FISH for cluster marker genes to determine if cell populations were spatially distinct from one another or if these different cell types were interspersed together. HCR revealed that Shox2 occupies the most distal nasal compartment of the palate while Msx1 is diffuse throughout the tissue with highest expression in the center of the palate shelf (Figure 4C).
Combining expression data with HCR RNA FISH, it is apparent that there are distinct domains within the palate mesenchyme. Ogn (Mes_3), Runx2 (Mes_2), and Epha3 (Mes_4) can be seen marking the oral, central, and proximal domains of the palate without significant overlap (Figure 5A). Shox2 and Osr2 span Mes_1 and Mes_4 marking the nasal mesenchyme, distal tip of the palate, and parts of the oral mesenchyme (Figures 3P-S and 5B). Mes_4 can be distinguished from Mes_1 through expression of Epha3 which is expressed exclusively along the oral side of the palatal mesenchyme (Figures 5A,B and S3S). Mes_1 does not express Epha3 and marks the nasal mesenchyme but not the distal tip of the palate (Figures 2B,C and 3L,P). Mes_4 marks the remaining oral and distal mesenchyme (Figures 2B, 3O,S and 5A,E). Shox2 is not highly expressed in Mes_2 or Mes_3 (Figure 2B). Therefore, Shox2-low populations likely mark more central and proximal areas of the palatal mesenchyme. Indeed, HCR RNA FISH for Runx2 or Msx1 shows high expression in a population of cells in the center of the palate, Mes_2 (Figure 5A,C). Msx1, like Shox2, marks multiple clusters (Figures 2B and 4A-C). Ogn delineates Mes_3 from Mes_2 marking the most dorsal and proximal population of cells within the palate (Figures 3R and 5A,B). Based on our data, we propose boundaries for these domains in a schematized palate (Figure 5D,E).
2.4 Mes_3 has unique signaling patterns distinct from other mesenchyme
We hypothesized that there may be functional differences between mesenchymal cell clusters in the anterior palate because cell populations are spatially distinct. To determine differences in signaling capacities between clusters, we utilized CellChat cellular communication analysis to discover dominant cellular communication patterns. The CellChat database uses literature curated resources to interpret scRNA seq data and infer statistically and biologically significant cellular communications.57 Visualizing the river plots of outgoing communication patterns of secreting cells, interpreted as ligand profiles of each cluster, Mes_1, Mes_2, and Mes_4 group together into Pattern 1, composed of Midkine (MK), Collagen, noncanonical Wnt (ncWnt), Ephrin-B (EPHB), Insulin-like growth factor (IGF), Periostin, Tenascin, Angiopoietin-like (ANGPTL), Activin, Semaphorin 5 (SEMA5), and Neurexin (NRXN). Mes_3 uniquely groups into Pattern 3, composed of pleiotrophin (PTN), Fibronectin 1 (FN1), Neural Cell adhesion molecule (NCAM), Myelin protein zeta (MPZ), Chemokine (C-X-C motif) ligand (CXCL), Thrombospondin (THBS), Vascular Cell adhesion protein (VCAM), Growth differentiation factor (GDF), and Neuronal growth regulator (NEGR). Both Ep clusters group together into Pattern 2, composed of Laminin, Platelet-derived growth factor (PDGF), Wnt, Cadherin (CDH), Cell adhesion molecule (CADM), Fibroblast growth factors (FGF), Agrin (AGRN), Desmosome, Notch, Junctional adhesion molecule (JAM), Semaphorin 3 (SEMA3), Nephronectin (NPNT), Hedgehog (HH), Heparin sulphate proteoglycan (HSPG), Leukemia inhibitory factor receptor (LIFR), Granulin (GRN), and Occludin (OCLN) (Figure 6A). Exploring incoming communication patterns of target cells, interpreted as receptors present within each cluster, we once again see that Mes_1, Mes_2, and Mes_4 group together (Pattern 1) while Mes_3 demonstrates a unique receptor profile by grouping alone into Pattern 4 (Figure 6B). Interestingly, we also see a split in receptors present between Ep_1 and Ep_2 (Figure 6B).
We further delineate the compositions of these patterns through dot plots for outgoing or incoming communication patterns (Figure 6C,D). Cells in Mes_3 specifically lack ligands associated with Collagen, ncWNT, EphA, EphB, IGF, Periostin, Tenascin, Macrophage migration inhibitory factor (MIF), SEMA6, and Activin that all other mesenchyme clusters express (Figure 6C). Conversely, Mes_3 cells express Thrombospondin (THBS), NRXN, GDF, and NEGR ligands that the rest of the mesenchyme does not (Figure 6C). All mesenchyme clusters except for Mes_3 express receptors associated with the FN1, ncWNT, EPHA, Laminin, PDGF, IGF, Tenascin, FGF, NPNT, THBS, HH, and HSPG pathways (Figure 6D). Mes_3, but not the rest of the mesenchyme, expresses receptors of SEMA3, Sema6, NRXN, and NEGR (Figure 6D). Together, these data show that one population of anterior mesenchymal palate cells sends and receives a different set of signals than the other mesenchymal cell populations.
2.5 CellRank trajectory analysis reveals two lineages within the E13.5 palate
The striking segregation of Shox2+ and Msx1+ cells within the palate led us to question whether distinct lineages can be traced within the E13.5 transcriptome. We utilized a pseudotime analysis using CellRank and RNA velocity to determine progenitor and terminal populations within the E13.5 anterior palate. Because some clusters spanned large distances over the UMAP, we first subclustered data at a higher resolution to minimize pseudotime analysis warping (Figure 7A). RNA Velocity shows mesenchyme populations originate from the proliferating cells (Prolif_2a) and move toward two distinct terminal states, Mes_1a and Mes_2a (Figure 7B). The initial and terminal states are highlighted (Figure 7C). A directed Partition-based graph abstraction (PAGA) denotes CellRank probabilities at each node with arrows indicating pseudotemporal ordering (Figure 7C). We visualize gene expression changes from initial to terminal states (i.e., from Prolif_2a to either Mes_1a or Mes_2b) using heatmaps (Figure 7D,E). These data show that different mesenchymal clusters are distinct populations of cells rather than differentiation stages of the same population.
3 DISCUSSION
Here we utilize advances in single cell sequencing and bioinformatic techniques to explore the intricacies of the E13.5 anterior palate transcriptome. Previously, excellent analysis of scRNA-seq has characterized the soft palate at E13.5, which makes up the posterior 1/3 of the structure.23 We chose to focus on the anterior palate to conserve resources for increasing sequencing depth and to gain meaningful information about potentially rare transcripts within these populations. We acknowledge that the middle palate is not captured within our dataset or other published datasets and remains an area of investigation for future studies. E13.5 is a crucial point in palatal development representing a period of rapid vertical growth of the palatal shelves just prior to palatal elevation and fusion and remains understudied. Using scRNA-seq to develop an atlas of the E13.5 anterior palate, we identify expected cell types in our dataset: mesenchymal, proliferating, epithelial, endothelial, immune, and Schwann cells. These populations express genes associated with the respective cell types. This confirms accuracy of our dissections and our sequencing at capturing our desired RNAs of interest.
We subcluster mesenchymal, proliferating, and epithelial populations to reveal unexpected diversity within these populations. Even at low resolution threshold clustering, we find that these populations subcluster into distinct identities with unique marker gene profiles. These marker genes map back to the respective clusters on the UMAPs. The proliferating clusters subclustered into three populations. Prolif_3, marked by Ccne2 are cells entering the cell cycle. Prolif_2, marked by transcription of histone components, are cells with condensed chromatin. Prolif_1, marked by CenpA, represents cells preparing to divide. These proliferating populations are not specific to any one cell type, but rather represent a mixture of cells from multiple clusters that are grouped together based cell cycle stage. We did not define domains for proliferating populations because their transcriptional profiles suggest that they contribute to multiple populations and therefore could not be distinguished by HCR RNA FISH. This aligns with previous studies that show relatively consistent proliferation across the palatal shelves at E13.5.58, 59 We chose to not regress out cell cycle in our analyses because mesenchyme is highly proliferative at this stage in development and regressing out cell cycle genes would remove valuable information from our dataset. Furthermore, retaining cell cycle information provides a resource to determine RNA velocity and CellRank in our analysis to determine progenitor pools.
HCR RNA FISH confirms scRNA-seq results and defines the spatial location of these clusters in relationship to one another. Marker genes from our scRNA-seq show the correct cell types within the palate confirming our clustering and assignments. Cells from the two epithelial subclusters captured appear to segregate into rows. Ep_1 cells marked by Igfbp2, comprise the bulk of the tissue and line the inside of the epithelia in contact with the mesenchyme. Ep_2 cells, marked by Cldn7, lie superficial to Ep_1 in contact with the oral cavity. Although CellChat analysis demonstrates epithelial cells express similar ligands and cluster into shared signaling pathways, we observe a separation into two patterns when looking at incoming (receptor) communication signals. This suggests that these cell populations respond to different signals.
Interestingly, cells from the mesenchymal subclusters spatially segregate from one another. We show distinct expression patterns of two genes that have historically served as equivalent markers of the anterior palate, Shox2 and Msx1, demonstrating one application of how our dataset can be used as a community resource.31-33 We find that these genes fall within separate clusters in our dataset and spatially segregate within the palate when viewed using HCR RNA FISH. Msx1 is expressed throughout the palatal mesenchyme, but HCR RNA FISH data clearly shows an Msx1-high expressing population, Mes_2, which lies in the middle of the palatal shelf. Conversely, Shox2 expression is restricted to the medial edge and distal tip of the palate and is not broadly expressed in other areas of the palate. Utilizing only one of these two markers may yield incomplete results when performing experiments in the anterior palate as neither completely marks the full extent of the tissue. We use HCR RNA FISH and scRNA-seq data to develop a spatial cluster map of the palate for this dataset so others can quickly identify several genes within a particular spatial domain within the anterior palate to bypass this problem. Using single cell resolution techniques, we can find improved markers of these populations to avoid confusion among researchers in the field.
Previous work has identified differences in signaling patterns between oral and lingual or medial and lateral compartments using bulk RNA-seq at E14.5.20 Our mesenchyme clusters segregate into oral, lingual, proximal, and central domains, our dataset can be used to identify good candidates for exploring these spatial differences at higher resolution at this earlier timepoint. Using the CellChat ligand receptor interaction explorer, we map signaling patterns of ligands and receptors expressed within each cluster. We identify that Mes_3, the proximally located cluster within the palate, has a unique ligand and receptor profile when compared to the remaining mesenchymal clusters. We used an CellRank and RNA velocity analysis to determine the origins and terminations of cell populations. We identified that cells originated from the proliferating clusters within our dataset and differentiated into either of two fates. Our results show mesenchymal clusters are not merely different developmental stages of the same cell population and are in fact distinct cell populations. The first being the cells that occupy the distal tip of the palate shelf and the other being the unique signaling population we identified using CellChat, Mes_3. This data shows two paths for proliferating cells as they differentiate: either contribute to the rapidly proliferating tip of the palatal shelf to aid in palatal elevation or supply cells to the base of the palate. Is there a functional importance of Mes_3 cells, which express a unique signaling profile? Because these cells line the proximal border of the palatal shelf, they are not likely to be contributing to palatal outgrowth and certainly will not play a direct role during palatal fusion. CellChat data shows Mes_3 cells lack expression of ligands and receptors from key developmental signaling pathways including Bone morphogenetic protein (BMP), Platelet-derived growth factor (PDGF), Fibroblast growth factor (FGF), Insulin-like growth factor (IGF), Ephrin-A (EphA), Ephrin-B (EphB), noncanonical Wnt (ncWnt), and Hedgehog (Hh) signaling which are present in at least one of the other mesenchyme clusters. Clearly Mes_3 is not a primary signaling center like the other mesenchymal clusters have been documented to be. One possibility is that this population of cells provides guidance cues for projecting neurons. Indeed, CellChat data shows that Mes_3 cellular communication patterns are partially defined by the presence of Neuronal growth regulator (NEGR) as well as Midkine (MK) and Pleiotrophin (PTN) which are members of the Neurite growth-promoting factor (NEGF) family. Interestingly, Mes_3 signaling pathways are dominated by several glycoprotein pathways such as Thrombospondin (THBS), Fibronectin (FN1), Neural cell adhesion molecule (NCAM), and Vascular cell adhesion molecule (VCAM). Although the exact mechanisms by which the secondary palate elevates have been contentious, it is generally agreed upon that there is horizontal outgrowth and remodeling from the medial wall of palatal shelves to give rise to the elevated palatal shelves by E14.5. These rapid structural changes can apply large biophysical forces on the tissue. Perhaps the glycoprotein rich Mes_3 serves as an “anchor” domain for the palate as it undergoes this sheer stress from remodeling. Future studies will illuminate why Mes_3 functionally segregates from the rest of the palatal mesenchyme and how this population contributes to palate development.
Curation of this dataset and others allows for advancement in the understanding of the etiology of cleft palate on both clinical and research fronts. De novo mutations found in patients with cleft palate can be queried within our dataset to determine embryonic expression in the mouse. Correlation of a cleft palate-associated genetic variant and expression at the time of plate closure would provide rationale for experimentation in the mouse to determine causation. Genes of interest can then be referenced to the provided spatial map to determine where to expect expression of the gene of interest. Similarly, researchers can use this dataset for identification of genes expressed in the palate and to determine if their gene of interest is expressed in the same cells as other genes of interest. Furthermore, researchers can determine whether genes of interest are co-expressed with other molecular pathway genes and marker genes to form new hypotheses. Arguably one of the most promising uses of this dataset will come from its integration with existing and future datasets. Temporal integration and lineage tracing with earlier and later timepoints will give key insights into the origins and fates of the cells that we have defined. Additionally, comparing the formation of the anterior palate with that of the posterior palate would similarly be quite valuable. This dataset adds to a growing body of work with the goal of unveiling the molecular underpinnings of palatal development.
4 EXPERIMENTAL PROCEDURES
4.1 Mice
C57BL/6J Mice originally obtained from The Jackson Laboratory and maintained as a colony in accordance with IACUC protocol #0139 at the University of Colorado Anschutz Medical Campus. Observation of the vaginal plug was considered embryonic day 0.5. On embryonic day 13.5, pregnant dams were euthanized by isofluorane followed by cervical dislocation and embryos were harvested. Heads were kept in ice-cold PBS for dissection. All three sequenced embryos were female.
4.2 Single cell RNA sequencing
In ice-cold PBS, heads were removed from embryos. The top of the head was cut flat, and the head was laid ventral. The secondary palate was exposed by removing the mandible and tongue. The anterior 1/3 of the palate was dissected away from the remainder of the head using forceps. Paired anterior palatal shelves were washed twice in PBS and transferred to 750 μL 0.25% Trypsin-EDTA for 10 min at 37C with frequent trituration. Trypsinization was stopped with addition of 750uL DMEM + 10% FBS (Invitrogen/Life Technologies). Cells were pipetted up and down 20 times to assist in cell dispersion, pelleted at 300 g for 3 min, and resuspended in PBS with 0.1% BSA before being filtered through a Flowmi 40 μM cell strainer. Single cell suspensions were confirmed by microscopy and cells were counted on a hemocytometer with trypan blue staining. Single cell suspensions were submitted to the University of Colorado Anschutz Medical Campus Genomics Shared Resource for droplet derivation and library preparation with the Chromium single cell 3’ NextGen V3.1 chemistry (10X Genomics). Sequencing was performed on an Illumina NovaSeq6000 with V1.5 chemistry. 7976, 6994, and 7268 cells were collected from three embryos (22 238 cells in total across all embryos) and sequenced at a depth of 100 000 reads per cell. Cells from different embryos were run separately and not pooled together during sequencing.
4.3 Data analysis
RNA-sequencing analysis was performed using the Seurat pipeline.60 Data was normalized using SingleCellTransform (SCT) and clustered at a resolution of 0.2 snn and integrated using the Seurat v4 integration method. Populations were identified based on marker gene expression. Heatmaps were generated based on top 10 marker genes and plotted across every cell within the dataset. Global intercellular communication patterns between the cells in a subset of the integrated SeuratObject containing mesenchymal and epithelial clusters was inferred using CellChat57, 61 following the tutorial. Number of outgoing and incoming communication patterns was selected based on an agreement in the drop of Cophenetic and Silhouette values. The integrated SeuratObject was subset to select WT mesenchymal and proliferating cells and converted to an h5ad object. This subset was analyzed with scVelo62 to determine RNA Velocity63 and CellRank64 to calculate the initial and terminal states based on the RNA Velocity, a directed partition-based graph abstraction (PAGA) based on the latent time inferred pseudotime, and the lineage driver genes.
4.4 In situ hybridization
E13.5 embryos were collected, and heads were fixed in 4% paraformaldehyde (PFA). Heads were dehydrated in 30% sucrose and embedded in OCT (Sakura, Tissue-Tek, Cat. 4583). Samples were sectioned at 12 μM thickness on a cryostat. Hybridization chain reaction (Molecular Instruments) was used for multiplexed RNA in situ hybridization according to manufacturer's protocol. Briefly, sectioned tissues were allowed to thaw at room temperature, fixed in 4% PFA, dehydrated, treated with 2 μg/mL Proteinase K for 2 min, re-fixed, and hybridized with 1.6 pmol of probe at 37°C overnight. The following day slides were washed in wash buffer and SSCT per manufacturers protocol and amplified with six picomole snap-cooled hairpins complementary to the probes overnight at room temperature. On day three, slides were washed in SSCT and mounted for imaging. Slides were imaged on a Zeiss LSM 980 inverted confocal laser scanning microscope.
AUTHOR CONTRIBUTIONS
Yunus H. Ozekin: Conceptualization (equal); data curation (equal); formal analysis (equal); investigation (equal); visualization (equal); writing – original draft (equal); writing – review and editing (equal). Rebecca O'Rourke: Data curation (supporting); formal analysis (supporting); writing – review and editing (supporting). Emily Anne Bates: Conceptualization (equal); funding acquisition (equal); investigation (equal); project administration (equal); writing – review and editing (equal).
ACKNOWLEDGMENTS
We would like to thank the University of Colorado Genomics Core for performing sequencing of samples and being a valuable resource.
FUNDING INFORMATION
This work was supported by the National Institute of Dental and Craniofacial Research NIH-NIDCR-R01DE025311 to Emily Anne Bates and NIH-T32GM141742-02S1 to Yunus H. Ozekin.
CONFLICT OF INTEREST STATEMENT
The authors declare no conflicts of interest.
Open Research
DATA AVAILABILITY STATEMENT
Raw and processed data has been made available via a NCBI GEO Submission (accession code GSE222205). Code is accessible via GitHub at https://github.com/yunusozekin/WT_E13.5_AntPalate_scRNAseq_Ozekin.git.