Manage a pathway registry#
Background#
Pathways in single-cell analysis represent the interconnected networks of molecular signaling cascades that govern critical cellular processes. They are of utmost importance as they offer a comprehensive understanding of the intricate regulatory mechanisms underlying cellular behavior, providing insights into disease pathogenesis, therapeutic responses, and the identification of potential targets for precision medicine and intervention strategies.
Managing pathways across different datasets is crucial in a biotech company to gain a comprehensive understanding of complex biological processes and facilitate efficient research and development.
In this notebook we are registering the 2023 GO Biological process pathway ontology with Lamin. Afterwards, we are linking the pathways to genes and conducting a pathway enrichment analysis on an interferon-beta treated dataset. Finally, we will demonstrate how to fetch datasets with pathway queries using Lamin.
Setup#
Warning
Please ensure that you have created or loaded a LaminDB instance before running the remaining part of this notebook!
Show code cell content
# A lamindb instance containing bionty schema (skip if you already loaded your instance)
!lamin init --storage ./enrichr --schema bionty
π‘ creating schemas: core==0.45.5 bionty==0.29.6
β
saved: User(id='DzTjkKse', handle='testuser1', email='testuser1@lamin.ai', name='Test User1', updated_at=2023-08-17 17:34:33)
β
saved: Storage(id='EJlRAlPa', root='/home/runner/work/lamin-usecases/lamin-usecases/docs/enrichr', type='local', updated_at=2023-08-17 17:34:33, created_by_id='DzTjkKse')
β
loaded instance: testuser1/enrichr
π‘ did not register local instance on hub (if you want, call `lamin register`)
import lamindb as ln
import lnschema_bionty as lb
lb.settings.species = "human" # globally set species
import gseapy as gp
import scanpy as sc
import matplotlib.pyplot as plt
from lamin_usecases import datasets as ds
lb.settings.species = "human" # globally set species
β
loaded instance: testuser1/enrichr (lamindb 0.50.7)
β
set species: Species(id='uHJU', name='human', taxon_id=9606, scientific_name='homo_sapiens', updated_at=2023-08-17 17:34:35, bionty_source_id='aSyL', created_by_id='DzTjkKse')
β
set species: Species(id='uHJU', name='human', taxon_id=9606, scientific_name='homo_sapiens', updated_at=2023-08-17 17:34:35, bionty_source_id='aSyL', created_by_id='DzTjkKse')
Fetch GO_Biological_Process_2023
pathways annotated with human genes using Enrichr#
First we fetch the βGO_Biological_Process_2023β pathways for humans using GSEApy which wraps GSEA and Enrichr.
go_bp = gp.get_library(name="GO_Biological_Process_2023", organism="Human")
print(f"Number of pathways {len(go_bp)}")
Number of pathways 5406
go_bp["ATF6-mediated Unfolded Protein Response (GO:0036500)"]
['MBTPS1', 'MBTPS2', 'XBP1', 'ATF6B', 'DDIT3', 'CREBZF']
Parse out the ontology_id from keys, convert into the format of {ontology_id: (name, genes)}
def parse_ontology_id_from_keys(key):
"""Parse out the ontology id.
"ATF6-mediated Unfolded Protein Response (GO:0036500)" -> ("GO:0036500", "ATF6-mediated Unfolded Protein Response")
"""
id = key.split(" ")[-1].replace("(", "").replace(")", "")
name = key.replace(f" ({id})", "")
return (id, name)
go_bp_parsed = {}
for key, genes in go_bp.items():
id, name = parse_ontology_id_from_keys(key)
go_bp_parsed[id] = (name, genes)
go_bp_parsed["GO:0036500"]
('ATF6-mediated Unfolded Protein Response',
['MBTPS1', 'MBTPS2', 'XBP1', 'ATF6B', 'DDIT3', 'CREBZF'])
Register pathway ontology in LaminDB#
pathway_bionty = lb.Pathway.bionty() # equals to bionty.Pathway()
pathway_bionty
Pathway
Species: all
Source: go, 2023-05-10
#terms: 47514
π Pathway.df(): ontology reference table
π Pathway.lookup(): autocompletion of terms
π― Pathway.search(): free text search of terms
β
Pathway.validate(): strictly validate values
π§ Pathway.inspect(): full inspection of values
π½ Pathway.standardize(): convert to standardized names
πͺ Pathway.diff(): difference between two versions
π Pathway.ontology: Pronto.Ontology object
Next, we register all the pathways and genes in LaminDB to finally link pathways to genes.
Register pathway terms#
To register the pathways we make use of .from_values
to directly parse the annotated GO pathway ontology IDs into LaminDB.
pathway_records = lb.Pathway.from_values(go_bp_parsed.keys(), lb.Pathway.ontology_id)
β
validated 5406 Pathway records from Bionty on ontology_id: GO:0044208, GO:0051084, GO:0006103, GO:0061158, GO:0070935, GO:0050427, GO:0042791, GO:0009452, GO:0036261, GO:0006370, GO:0015866, GO:0006167, GO:0046033, GO:0036500, GO:0006754, GO:0046034, GO:0042773, GO:0015867, GO:0086016, GO:0086067, ...
lb.Pathway.from_bionty(ontology_id="GO:0015868")
β
validated 1 Pathway record from Bionty on ontology_id: GO:0015868
Pathway(id='SMqshx3Y', name='purine ribonucleotide transport', ontology_id='GO:0015868', description='The Directed Movement Of A Purine Ribonucleotide, Any Compound Consisting Of A Purine Ribonucleoside (A Purine Organic Base Attached To A Ribose Sugar) Esterified With (Ortho)Phosphate, Into, Out Of Or Within A Cell.', bionty_source_id='pj5J', created_by_id='DzTjkKse')
ln.save(pathway_records, parents=False) # not recursing through parents
Register gene symbols#
Similarly, we use .from_values
for all Pathway associated genes to register them with LaminDB.
all_genes = {g for genes in go_bp.values() for g in genes}
gene_records = lb.Gene.from_values(all_genes, lb.Gene.symbol)
Show code cell output
π‘ using global setting species = human
β
validated 16808 Gene records from Bionty on symbol: ODAD2, PPP1R3B, PPP1R3B, NSUN3, TRMT6, KRT81, RFPL2, UACA, PSMD7, NFAM1, CTSS, SIM1, NAPRT, NAPRT, TXNL4A, P2RX5, KRT12, KRT12, PSMB6, NOVA2, ...
β ambiguous validation in Bionty for 1080 records: PPP1R3B, NAPRT, KRT12, TPBG, NDUFA11, YWHAE, TLCD2, PCDHB9, ADAMTS12, ARMC10, GRK1, SMCR8, PCDHB4, SYNRG, HLA-DMB, SCAMP3, ARHGEF10, CNOT3, SESN2, LYZL6, ...
β did not validate 77 Gene records for symbols: AFD1, AZF1, C10ORF71, C10ORF90, C11ORF65, C11ORF80, C12ORF29, C12ORF4, C12ORF50, C12ORF57, C15ORF62, C17ORF75, C17ORF97, C17ORF99, C18ORF25, C18ORF32, C19ORF12, C1ORF109, C1ORF112, C1ORF131, ...
gene_records[:3]
[Gene(id='F45SkWrTo04i', symbol='ODAD2', ensembl_gene_id='ENSG00000169126', ncbi_gene_ids='55130', biotype='protein_coding', description='outer dynein arm docking complex subunit 2 [Source:HGNC Symbol;Acc:HGNC:25583]', synonyms='CILD23|GUDU|FLJ10376|DKFZP434P1735|FLJ10817|ARMC4', species_id='uHJU', bionty_source_id='Rb6X', created_by_id='DzTjkKse'),
Gene(id='zDt8BSTf0A20', symbol='PPP1R3B', ensembl_gene_id='ENSG00000173281', ncbi_gene_ids='79660', biotype='protein_coding', description='protein phosphatase 1 regulatory subunit 3B [Source:HGNC Symbol;Acc:HGNC:14942]', synonyms='GL|FLJ14005|PPP1R4', species_id='uHJU', bionty_source_id='Rb6X', created_by_id='DzTjkKse'),
Gene(id='8o19rBNQZZr6', symbol='PPP1R3B', ensembl_gene_id='ENSG00000285343', ncbi_gene_ids='79660', biotype='protein_coding', description='protein phosphatase 1 regulatory subunit 3B [Source:HGNC Symbol;Acc:HGNC:14942]', synonyms='GL|FLJ14005|PPP1R4', species_id='uHJU', bionty_source_id='Rb6X', created_by_id='DzTjkKse')]
ln.save(gene_records);
Link pathway to genes#
Now that we are tracking all pathways and genes records, we can link both of them to make the pathways even more queryable.
gene_records_ids = {record.symbol: record for record in gene_records}
for pathway_record in pathway_records:
pathway_genes = go_bp_parsed.get(pathway_record.ontology_id)[1]
pathway_genes_records = [gene_records_ids.get(gene) for gene in pathway_genes]
pathway_record.genes.set(pathway_genes_records)
Now genes are linked to pathways:
pathway_record.genes.list("symbol")
['XIAP', 'CAST', 'CARD8', 'CARD18', 'CST7']
A interferon-beta treated dataset#
We will now conduct a pathway enrichment analysis on a small peripheral blood mononuclear cell dataset that is split into control and stimulated groups. The stimulated group was treated with interferon beta.
The dataset was initially obtained using From "SeuratData::ifnb"
.
Letβs load the dataset and look at the cell type annotations.
adata = ds.anndata_seurat_ifnb()
adata
AnnData object with n_obs Γ n_vars = 13999 Γ 14053
obs: 'orig.ident', 'nCount_RNA', 'nFeature_RNA', 'stim', 'seurat_annotations'
var: 'features'
uns: 'log1p'
adata.obs["seurat_annotations"].value_counts()
seurat_annotations
CD14 Mono 4362
CD4 Naive T 2504
CD4 Memory T 1762
CD16 Mono 1044
B 978
CD8 T 814
NK 633
T activated 619
DC 472
B Activated 388
Mk 236
pDC 132
Eryth 55
Name: count, dtype: int64
For simplicity, we subset to βB Activatedβ cells:
adata_ba = adata[adata.obs.seurat_annotations == "B Activated"].copy()
adata_ba
AnnData object with n_obs Γ n_vars = 388 Γ 14053
obs: 'orig.ident', 'nCount_RNA', 'nFeature_RNA', 'stim', 'seurat_annotations'
var: 'features'
uns: 'log1p'
Pathway enrichment analysis using Enrichr#
This analysis is based on: https://gseapy.readthedocs.io/en/master/singlecell_usecase.html
First, we compute differentially expressed genes using a Wilcoxon test between stimulated and control cells.
# compute differentially expressed genes
sc.tl.rank_genes_groups(
adata_ba,
groupby="stim",
use_raw=False,
method="wilcoxon",
groups=["STIM"],
reference="CTRL",
)
rank_genes_groups_df = sc.get.rank_genes_groups_df(adata_ba, "STIM")
rank_genes_groups_df.head()
names | scores | logfoldchanges | pvals | pvals_adj | |
---|---|---|---|---|---|
0 | ISG15 | 16.881584 | 5.923428 | 6.147295e-64 | 6.536230e-60 |
1 | ISG20 | 16.857113 | 4.167713 | 9.302256e-64 | 6.536230e-60 |
2 | IFIT3 | 14.587233 | 31.232290 | 3.386569e-48 | 1.586382e-44 |
3 | IFI6 | 14.128634 | 6.471180 | 2.530019e-45 | 8.888589e-42 |
4 | MX1 | 13.442097 | 6.241539 | 3.425901e-41 | 9.628837e-38 |
Next, we filter out up/down-regulated differentially expressed gene sets:
degs_up = rank_genes_groups_df[
(rank_genes_groups_df["logfoldchanges"] > 0)
& (rank_genes_groups_df["pvals_adj"] < 0.05)
]
degs_dw = rank_genes_groups_df[
(rank_genes_groups_df["logfoldchanges"] < 0)
& (rank_genes_groups_df["pvals_adj"] < 0.05)
]
degs_up.shape, degs_dw.shape
((89, 5), (47, 5))
Run pathway enrichment analysis on DEGs and plot top 10 pathways:
enr_up = gp.enrichr(degs_up.names, gene_sets="GO_Biological_Process_2023").res2d
gp.dotplot(enr_up, figsize=(2, 3), title="Up", cmap=plt.cm.autumn_r);
enr_dw = gp.enrichr(degs_dw.names, gene_sets="GO_Biological_Process_2023").res2d
gp.dotplot(enr_dw, figsize=(2, 3), title="Down", cmap=plt.cm.winter_r, size=10);
Track datasets containing annotated pathways in LaminDB#
Letβs enable tracking of the current notebook as the transform of this file:
ln.track()
π‘ notebook imports: gseapy==1.0.5 lamin_usecases==0.0.1 lamindb==0.50.7 lnschema_bionty==0.29.6 matplotlib==3.7.2 scanpy==1.9.3
β
saved: Transform(id='6oxEIEduvo6wz8', name='Manage a pathway registry', short_name='enrichr', stem_id='6oxEIEduvo6w', version='0', type=notebook, updated_at=2023-08-17 17:35:40, created_by_id='DzTjkKse')
β
saved: Run(id='aTaG4cz17LGPqTetRLw0', run_at=2023-08-17 17:35:40, transform_id='6oxEIEduvo6wz8', created_by_id='DzTjkKse')
We further create a File object to track the dataset.
file = ln.File.from_anndata(
adata_ba, description="seurat_ifnb_activated_Bcells", var_ref=lb.Gene.symbol
)
π‘ file will be copied to default storage upon `save()` with key `None` ('.lamindb/MEXvOFrrAMjeQ2TMGg7c.h5ad')
π‘ parsing feature names of X stored in slot 'var'
π‘ using global setting species = human
β
validated 10582 Gene records on symbol: EMC3, ATP6V1F, CDK16, BABAM1, TRPT1, FTSJ3, INTS10, EFHC2, EFHB, PET100, PLA2G4A, FIGLA, MRPL22, NSMCE4A, ELK3, AMFR, ARHGAP27, FBLN5, PPIL6, RPL36AL, ...
β
validated 2857 Gene records from Bionty on symbol: A1BG, A1BG-AS1, A2M, A2M-AS1, A4GALT, AAAS, AAAS, AACS, AAGAB, AAMDC, AAMP, AAR2, AARSD1, AASDH, AASS, AATF, AATF, ABCA1, ABCF2, ABCF3, ...
β ambiguous validation in Bionty for 182 records: AAAS, AATF, ABR, ABT1, ACAP1, ACTR1B, ADCK5, ADI1, ANKRD36, ARHGEF35, ARL17B, ASMTL, BDP1, BEX5, C16orf46, C17orf97, C1RL-AS1, C22orf15, C6orf136, C6orf47, ...
β did not validate 2258 Gene records for symbols: 7SK.2, AAED1, AARS, AC000003.2, AC002117.1, AC002310.12, AC002331.1, AC002451.3, AC002456.2, AC002467.7, AC003075.4, AC004067.5, AC004069.2, AC004158.2, AC004257.3, AC004381.6, AC004447.2, AC004463.6, AC004840.9, AC004854.4, ...
β ignoring non-validated features: 7SK.2,AAED1,AARS,AC000003.2,AC002117.1,AC002310.12,AC002331.1,AC002451.3,AC002456.2,AC002467.7,AC003075.4,AC004067.5,AC004069.2,AC004158.2,AC004257.3,AC004381.6,AC004447.2,AC004463.6,AC004840.9,AC004854.4,AC004893.11,AC004951.6,AC004988.1,AC005003.1,AC005037.3,AC005076.5,AC005082.12,AC005253.2,AC005264.2,AC005306.3,AC005498.3,AC005523.2,AC005618.6,AC005932.1,AC006116.20,AC006129.2,AC006129.4,AC006369.2,AC006942.4,AC006994.2,AC007036.6,AC007038.7,AC007040.6,AC007228.11,AC007228.9,AC007246.3,AC007292.3,AC007292.4,AC007381.2,AC007383.3,AC007386.2,AC007551.3,AC007620.3,AC007773.2,AC007879.2,AC008964.1,AC009005.2,AC009065.1,AC009120.10,AC009120.6,AC009133.12,AC009133.14,AC009299.3,AC009403.2,AC009404.2,AC009473.1,AC009495.2,AC009501.4,AC009506.1,AC009948.5,AC009950.2,AC009961.3,AC010226.4,AC010336.1,AC010642.1,AC010761.8,AC010883.5,AC010894.3,AC011484.1,AC011747.4,AC011899.9,AC011997.1,AC012358.8,AC012363.4,AC012368.1,AC012442.6,AC013264.2,AC013449.1,AC015849.19,AC015849.2,AC015987.2,AC016586.1,AC016629.8,AC016700.5,AC016757.3,AC016831.7,AC017002.1,AC017076.5,AC018755.1,AC018816.3,AC020571.3,AC021188.4,AC021860.1,AC022182.3,AC022400.2,AC022532.1,AC023490.1,AC023590.1,AC025171.1,AC025335.1,AC026202.3,AC026310.1,AC026806.2,AC027307.3,AC027763.2,AC040977.1,AC062017.1,AC062029.1,AC068039.4,AC068282.3,AC068831.10,AC069278.4,AC069363.1,AC073283.4,AC074117.10,AC074138.3,AC074183.4,AC074289.1,AC078883.3,AC079210.1,AC079305.10,AC079767.4,AC079922.3,AC083843.1,AC084018.1,AC084219.4,AC087294.2,AC090186.1,AC090587.5,AC091633.3,AC091729.9,AC092295.7,AC092301.3,AC092484.1,AC092580.4,AC093157.1,AC093323.3,AC093388.3,AC093391.2,AC093495.4,AC093609.1,AC093627.10,AC093627.9,AC093673.5,AC093818.1,AC096772.6,AC097495.2,AC097500.2,AC097713.4,AC097724.3,AC099552.4,AC099850.1,AC100830.3,AC104024.1,AC104532.4,AC104653.1,AC104654.2,AC104655.3,AC104820.2,AC105053.4,AC105760.2,AC108488.4,AC109333.10,AC109826.1,AC113189.5,AC115618.1,AC116366.6,AC120194.1,AC132192.1,AC132872.2,AC133528.2,AC135178.1,AC137932.1,AC137932.6,AC137934.1,AC139100.2,AC142528.1,AC144652.1,AC144831.1,AC147651.3,AC147651.4,AC159540.1,AC195454.1,ACN9,ACPL2,ACPP,ACRC,AD000090.2,ADC,ADCK3,ADCK4,ADPRHL2,ADRBK1,ADRBK2,ADSS,ADSSL1,AE000662.92,AES,AF001548.5,AF127936.5,AF127936.7,AF129408.17,AF131215.5,AF131217.1,AF165138.7,AF178030.2,AF196972.9,AF213884.2,AGPAT6,AGPAT9,AHSA2,AIM1,AIM1L,AJ006998.2,AKAP2,AL031663.2,AL133245.2,AL139099.1,AL161915.1,AL359091.2,AL590452.1,AL590822.2,AL591479.1,AL592284.1,AL592494.5,AL627171.2,AL627309.1,AL645728.1,AL928768.3,ALG1L,ALS2CR11,AMICA1,ANKRD32,AP000350.4,AP000476.1,AP000487.5,AP000640.2,AP000692.10,AP000695.4,AP000769.1,AP000769.7,AP000783.1,AP001046.5,AP001053.11,AP001055.6,AP001062.7,AP001189.4,AP001257.1,AP001258.4,AP001372.2,AP001412.1,AP001434.2,AP001437.1,AP001462.6,AP001469.9,AP001626.1,AP001816.1,AP003419.16,AP003733.1,AP003774.1,AP006621.5,APITD1,APOA1BP,APOPT1,ARL5B-AS1,ARNTL,ARNTL2,ASNA1,ASUN,ATE1-AS1,ATHL1,ATP5A1,ATP5B,ATP5C1,ATP5D,ATP5E,ATP5EP2,ATP5F1,ATP5G1,ATP5G2,ATP5G3,ATP5H,ATP5I,ATP5J,ATP5J2,ATP5L,ATP5L2,ATP5O,ATP5S,ATP5SL,ATPIF1,AZI1,B3GALTL,B3GNT1,BAI1,BAIAP2-AS1,BHLHB9,BOLA3-AS1,BRE,BTBD11,BZRAP1,BZRAP1-AS1,C10ORF68,C10orf11,C10orf111,C10orf118,C10orf12,C10orf128,C10orf2,C10orf25,C10orf32,C10orf35,C10orf54,C10orf76,C10orf91,C11orf1,C11orf30,C11orf31,C11orf35,C11orf48,C11orf49,C11orf57,C11orf63,C11orf73,C11orf74,C11orf82,C11orf83,C11orf84,C11orf95,C12orf10,C12orf23,C12orf39,C12orf44,C12orf45,C12orf49,C12orf5,C12orf61,C12orf65,C12orf66,C12orf68,C12orf73,C12orf74,C12orf79,C14orf1,C14orf142,C14orf159,C14orf166,C14orf182,C14orf2,C14orf23,C14orf37,C14orf64,C14orf79,C14orf80,C15orf38,C15orf41,C15orf53,C15orf57,C15orf65,C16orf13,C16orf45,C16orf52,C16orf58,C16orf59,C16orf62,C16orf70,C16orf72,C16orf80,C16orf91,C16orf93,C16orf98,C17orf103,C17orf104,C17orf51,C17orf53,C17orf59,C17orf62,C17orf70,C17orf72,C17orf85,C17orf89,C17orf96,C18orf8,C19orf10,C19orf24,C19orf35,C19orf40,C19orf43,C19orf48,C19orf52,C19orf54,C19orf55,C19orf59,C19orf60,C19orf66,C19orf68,C19orf70,C19orf77,C19orf82,C19orf83,C1QTNF9B-AS1,C1orf123,C1orf172,C1orf173,C1orf186,C1orf194,C1orf200,C1orf213,C1orf228,C1orf27,C1orf51,C1orf61,C1orf63,C1orf85,C1orf86,C20orf112,C20orf194,C20orf195,C20orf196,C20orf201,C20orf24,C20orf27,C21orf119,C21orf2,C21orf33,C21orf49,C21orf59,C22orf29,C22orf34,C22orf46,C2ORF15,C2orf40,C2orf43,C2orf44,C2orf47,C2orf62,C3orf17,C3orf58,C3orf67,C4orf21,C4orf26,C4orf27,C4orf29,C4orf32,C5orf28,C5orf30,C5orf42,C5orf45,C5orf51,C5orf55,C5orf56,C6orf1,C6orf100,C6orf106,C6orf203,C6orf211,C6orf223,C6orf229,C6orf25,C6orf48,C6orf57,C6orf99,C7orf13,C7orf26,C7orf43,C7orf49,C7orf55,C7orf55-LUC7L2,C7orf60,C7orf61,C7orf73,C8orf31,C8orf37,C8orf59,C9orf114,C9orf116,C9orf135,C9orf139,C9orf142,C9orf147,C9orf156,C9orf16,C9orf3,C9orf37,C9orf38,C9orf41,C9orf47,C9orf66,C9orf69,C9orf89,C9orf9,C9orf91,CAPN10-AS1,CARD17,CARKD,CARS,CASC4,CASC5,CASC7,CBWD1,CBWD2,CBWD3,CBWD5,CBWD6,CCBL1,CCBL2,CCDC101,CCDC104,CCDC109B,CCDC11,CCDC130,CCDC132,CCDC151,CCDC163P,CCDC173,CCDC176,CCDC23,CCDC41,CCDC53,CCDC58,CCDC64,CCDC84,CCDC94,CD3EAP,CD97,CEBPA-AS1,CEBPZ-AS1,CECR1,CECR5,CELF6.1,CENPBD1,CHKB-AS1,CIRH1A,CITF22-92A6.1,CLECL1,CLLU1OS,CLN3.1,COL4A3BP,CPSF3L,CRAMP1L,CRIPAK,CRYBG3.1,CRYM-AS1,CSRP2BP,CTA-14H9.5,CTA-204B4.2,CTA-211A9.5,CTA-217C2.1,CTA-250D10.23,CTA-292E10.6,CTA-293F17.1,CTA-29F11.1,CTA-363E6.6,CTA-384D8.34,CTA-384D8.35,CTA-384D8.36,CTA-445C9.14,CTA-445C9.15,CTAGE5,CTB-113D17.1,CTB-113I20.2,CTB-114C7.4,CTB-129O4.1,CTB-133G6.1,CTB-152G17.6,CTB-31O20.2,CTB-41I6.1,CTB-43P18.1,CTB-55O6.12,CTB-58E17.1,CTB-58E17.3,CTB-61M7.2,CTBP1-AS2,CTC-228N24.3,CTC-241F20.3,CTC-241N9.1,CTC-246B18.10,CTC-250I14.6,CTC-260E6.6,CTC-273B12.10,CTC-297N7.8,CTC-301O7.4,CTC-329D1.2,CTC-332L22.1,CTC-338M12.5,CTC-366B18.2,CTC-378H22.1,CTC-378H22.2,CTC-428G20.3,CTC-428G20.6,CTC-428H11.2,CTC-444N24.11,CTC-444N24.6,CTC-444N24.8,CTC-459F4.3,CTC-459I6.1,CTC-479C5.12,CTC-487M23.8,CTC-503J8.4,CTC-503J8.6,CTC-507E2.1,CTC-508F8.1,CTC-523E23.11,CTC-524C5.2,CTC-527H23.3,CTC-534A2.2,CTC-550B14.7,CTD-2006H14.2,CTD-2006K23.1,CTD-2007H13.3,CTD-2012K14.6,CTD-2012K14.7,CTD-2015H6.3,CTD-2017C7.1,CTD-2017D11.1,CTD-2020K17.1,CTD-2020K17.3,CTD-2020K17.4,CTD-2031P19.3,CTD-2035E11.3,CTD-2035E11.4,CTD-2035E11.5,CTD-2037K23.2,CTD-2044J15.2,CTD-2047H16.4,CTD-2054N24.2,CTD-2083E4.6,CTD-2085J24.4,CTD-2086O20.3,CTD-2162K18.4,CTD-2162K18.5,CTD-2165H16.4,CTD-2186M15.3,CTD-2192J16.22,CTD-2196E14.4,CTD-2196E14.9,CTD-2201E18.5,CTD-2210P24.4,CTD-2228K2.5,CTD-2231E14.8,CTD-2245F17.3,CTD-2256P15.2,CTD-2260A17.1,CTD-2260A17.2,CTD-2267D19.2,CTD-2270L9.4,CTD-2283N19.1,CTD-2287O16.5,CTD-2288F12.1,CTD-2302E22.4,CTD-2306M10.1,CTD-2319I12.2,CTD-2325M2.1,CTD-2336O2.1,CTD-2341M24.1,CTD-2349P21.9,CTD-2353F22.1,CTD-2357A8.3,CTD-2366F13.1,CTD-2368P22.1,CTD-2369P2.2,CTD-2371O3.2,CTD-2377D24.6,CTD-2514K5.2,CTD-2515A14.1,CTD-2516F10.2,CTD-2517O10.6,CTD-2521M24.5,CTD-2521M24.9,CTD-2528L19.6,CTD-2532N20.1,CTD-2538C1.2,CTD-2541M15.4,CTD-2544N14.3,CTD-2545M3.8,CTD-2547L16.1,CTD-2547L24.3,CTD-2547L24.4,CTD-2562J15.6,CTD-2568A17.5,CTD-2574D22.2,CTD-2576D5.4,CTD-2587H19.1,CTD-2587M2.1,CTD-2619J13.14,CTD-2630F21.1,CTD-3064H18.1,CTD-3065J16.9,CTD-3074O7.12,CTD-3074O7.5,CTD-3093M3.1,CTD-3105H18.14,CTD-3131K8.2,CTD-3138B18.5,CTD-3184A7.4,CTD-3185P2.1,CTD-3195I5.1,CTD-3195I5.5,CTD-3203P2.1,CTD-3222D19.7,CTGF,CXXC11,CXorf21,CXorf23,CXorf24,CXorf40A,CXorf40B,CXorf56,CXorf57,CYHR1,CYP17A1-AS1,DAK,DARS,DDX26B,DDX58,DFNA5,DFNB31,DGCR14,DHFRL1,DIEXF,DIRC2,DKFZP761J1410,DKFZP779L1853,DNAJC3-AS1,DOCK9-AS2,DOPEY1,DOPEY2,DSCR3,DYNLL1-AS1,DYX1C1,EFCAB4A,EFCAB4B,EFTUD1,EIF2S3L,EIF3J-AS1,ELMSAN1,EMR1,EMR2,EMR3,ENTHD2,EP400NL,EPRS,EPT1,ERBB2IP,ERO1L,ERO1LB,FAIM3,FAM101B,FAM102A,FAM102B,FAM103A1,FAM105A,FAM105B,FAM109A,FAM109B,FAM115A,FAM115C,FAM122A,FAM122B,FAM122C,FAM126A,FAM126B,FAM127A,FAM127B,FAM129A,FAM129B,FAM129C,FAM132A,FAM134A,FAM134B,FAM134C,FAM153B,FAM154B,FAM159A,FAM160A2,FAM160B1,FAM160B2,FAM173A,FAM173B,FAM175A,FAM175B,FAM178A,FAM179A,FAM179B,FAM188A,FAM188B,FAM189B,FAM192A,FAM195A,FAM195B,FAM198B,FAM19A2,FAM206A,FAM207A,FAM208A,FAM208B,FAM211A,FAM212A,FAM212B,FAM212B-AS1,FAM213A,FAM213B,FAM214A,FAM214B,FAM21A,FAM21B,FAM21C,FAM26F,FAM27A,FAM35A,FAM45A,FAM46A,FAM46C,FAM47E-STBD1.1,FAM49A,FAM49B,FAM57A,FAM58A,FAM60A,FAM63A,FAM63B,FAM65A,FAM65B,FAM65C,FAM69A,FAM71A,FAM71D,FAM73A,FAM73B,FAM84B,FAM86A,FAM86C1,FAM92A1,FAM96A,FAM96B,FBXO18,FCGR1B,FDX1L,FGFR1OP,FLJ00104,FLJ27365,FLVCR1-AS1,FOPNL,FRG1B,FTSJ2,FUK,FYB,GAREML,GARS,GATS,GATSL3,GBA,GBAS,GCN1L1,GLTPD1,GLTSCR1,GLTSCR1L,GLTSCR2,GMDS-AS1,GNB2L1,GNG5P2,GPR114,GPR56,GPR75-ASB3,GPR97,GRAMD3,GRASP,GS1-114I9.1,GS1-124K5.11,GS1-124K5.3,GS1-124K5.4,GS1-251I9.4,GSG2,GUCY1A3,GUCY1B3,GVQW1,GYLTL1B,H1F0,H1FX,H2AFJ,H2AFV,H2AFX,H2AFY,H2AFY2,H2AFZ,H3F3A,H3F3B,H3F3C,HARS,HDGFRP2,HDGFRP3,HDHD1,HEATR2,HEXDC,HIAT1,HIATL1,HIATL2,HIST1H1B,HIST1H1C,HIST1H1D,HIST1H1E,HIST1H2AC,HIST1H2AE,HIST1H2AG,HIST1H2AH,HIST1H2AI,HIST1H2AK,HIST1H2AL,HIST1H2AM,HIST1H2BC,HIST1H2BD,HIST1H2BF,HIST1H2BG,HIST1H2BH,HIST1H2BJ,HIST1H2BK,HIST1H2BN,HIST1H3A,HIST1H3D,HIST1H3E,HIST1H3H,HIST1H4C,HIST1H4D,HIST1H4E,HIST1H4H,HIST1H4I,HIST1H4J,HIST1H4K,HIST2H2AB,HIST2H2AC,HIST2H2BE,HIST2H2BF,HIST3H2BB,HIST4H4,HKR1,HMHA1,HN1,HN1L,HNRNPU-AS1,HRASLS2,HRSP12,IARS,ICK,ICT1,IER3IP1.1,IGJ,IKBKAP,IL10RB-AS1,IL8,ILF3-AS1,IMPAD1,INADL,IRG1,ITFG3,ITPR1-AS1,JHDM1D-AS1,KAL1,KARS,KB-1125A3.11,KB-1460A1.5,KB-1507C5.2,KB-1507C5.4,KB-1615E4.2,KB-1732A1.1,KB-173C10.2,KB-226F1.1,KB-431C1.4,KCNE1L,KDELC1,KDELC2,KIAA0020,KIAA0100,KIAA0101,KIAA0125,KIAA0141,KIAA0195,KIAA0196,KIAA0226,KIAA0226L,KIAA0247,KIAA0355,KIAA0368,KIAA0391.1,KIAA0430,KIAA0556,KIAA0754,KIAA0895,KIAA0895L,KIAA0907,KIAA0922,KIAA0947,KIAA1009,KIAA1033,KIAA1107,KIAA1109,KIAA1147,KIAA1211L,KIAA1279,KIAA1324,KIAA1324L,KIAA1377,KIAA1407,KIAA1429,KIAA1430,KIAA1432,KIAA1467,KIAA1468,KIAA1524,KIAA1551,KIAA1598,KIAA1683,KIAA1715,KIAA1731,KIAA1841,KIAA1919,KIAA2018,LA16c-358B7.3,LA16c-431H6.6,LA16c-OS12.2,LACE1,LARGE,LARS,LDOC1L,LEPRE1,LEPREL4,LHFP,LIMS3.1,LINC00094,LINC00116,LINC00152,LINC00167,LINC00176,LINC00341,LINC00493,LINC00568,LINC00610,LINC00617,LINC00657,LINC00672,LINC00694,LINC00843,LINC00883,LINC00884,LINC00893,LINC00894,LINC00909,LINC00936,LINC00959,LINC00969,LINC00984,LINC00998,LINC01024,LINC01059,LINC01136,LINC01137,LINS,LL0XNC01-116E7.2,LL0XNC01-7P3.1,LL21NC02-1C16.2,LL22NC03-2H8.5,LL22NC03-86G7.1,LOH12CR1,LOR,LPHN1,LRMP,LRRC16A,LRRC16B,LRRC29,LRRC3DN,LRRC6,LSMD1,LYRM5,MAATS1,MANEA-AS1,MARC2,MARCH1,MARCH2,MARCH3,MARCH5,MARCH6,MARCH7,MARCH8,MARCH9,MARS,MB21D1,MED14-AS1,MEF2BNB,MESDC1,MESDC2,METTL10,METTL12,METTL20,METTL21B,MFI2,MFSD4,MFSD7,MGEA5,MINA,MINOS1,MIR4435-1HG,MKL1,MKL2,MLK4,MLLT4,MLTK,MMP24-AS1,MPP5,MPP6,MRE11A,MRP63,MRPS17.1,MST4,MTERF,MTERFD1,MTERFD2,MTERFD3,MTL5,MUM1,MUT,MYEOV2,MYLPF,N6AMT2,NAA60.1,NAMPTL,NAPRT1,NARFL,NARG2,NARS,NAT6,NBPF16,NCBP2-AS2,NDNL2,NDUFA7.1,NDUFB8.1,NGFRAP1,NHP2L1,NOTCH2NL,NOV,NPPA-AS1,NRD1,NUPL1,NUPL2,OBFC1,ORAOV1,OSER1-AS1,PAPD4,PAPD5,PAPD7,PAXIP1-AS1,PCNX,PCNXL2,PCNXL3,PCNXL4,PDDC1,PET112,PHB,PIDD,PLA2G16,PPAP2A,PPAP2B,PPAP2C,PPAPDC1B,PPAPDC2,PPP2R4,PPP5D1,PQLC1,PQLC2,PQLC3,PRKCDBP,PRKRIR,PRMT10,PROSC,PRR24,PRUNE,PSMA2.1,PSMD5-AS1,PTPLA,PTPLAD1,PTPLAD2,PTPLB,PTRF,PVRL1,PVRL2,PVRL3,PVRL4,PWAR6,PYCRL,QARS,QTRTD1,RAB30-AS1,RAB7L1,RABL5,RARRES3,RARS,RFWD2,RGAG4,RLTPR,RNASEH1-AS1,RNF139-AS1,RNF219,RNMTL1,RP1-100J12.1,RP1-102H19.8,RP1-111C20.4,RP1-151F17.2,RP1-168L15.5,RP1-178F15.4,RP1-179N16.6,RP1-187B23.1,RP1-191J18.66,RP1-197B17.3,RP1-202O8.3,RP1-223E5.4,RP1-224A6.3,RP1-234P15.4,RP1-239B22.5,RP1-241P17.4,RP1-244F24.1,RP1-249H1.4,RP1-257A7.4,RP1-257A7.5,RP1-261D10.2,RP1-28H20.3,RP1-28O10.1,RP1-29C18.10,RP1-30M3.5,RP1-313I6.12,RP1-315G1.3,RP1-39G22.7,RP1-3J17.3,RP1-40E16.12,RP1-43E13.2,RP1-45N11.1,RP1-50J22.4,RP1-67K17.4,RP1-8B1.4,RP1-90J20.12,RP1-90J20.8,RP1-90J4.1,RP1-92O14.3,RP1-93H18.7,RP11-1000B6.3,RP11-1007O24.3,RP11-1008C21.1,RP11-1017G21.5,RP11-1020A11.2,RP11-102N12.3,RP11-103G8.2,RP11-104L21.3,RP11-105N14.1,RP11-1060J15.4,RP11-1070N10.3,RP11-1080G15.1,RP11-108M9.4,RP11-108M9.6,RP11-108P20.1,RP11-1094M14.11,RP11-109G23.3,RP11-10C24.1,RP11-10C24.3,RP11-10J5.1,RP11-10L12.4,RP11-10L7.1,RP11-110G21.1,RP11-110I1.12,RP11-110I1.13,RP11-110I1.14,RP11-1114A5.4,RP11-111M22.2,RP11-111M22.3,RP11-112J3.16,RP11-1143G9.4,RP11-1149M10.2,RP11-1151B14.4,RP11-115C21.2,RP11-117D22.2,RP11-1191J2.5,RP11-119B16.2,RP11-119D9.1,RP11-119F19.2,RP11-11N7.4,RP11-11N9.4,RP11-121A8.1,RP11-122G18.5,RP11-124N14.3,RP11-126K1.2,RP11-126K1.6,RP11-1275H24.1,RP11-1275H24.2,RP11-1277A3.1,RP11-1277A3.3,RP11-127B20.3,RP11-128M1.1,RP11-130F10.1,RP11-131L12.2,RP11-131L23.1,RP11-131N11.4,RP11-1348G14.5,RP11-134G8.7,RP11-134L10.1,RP11-134P9.3,RP11-137H2.4,RP11-138A9.1,RP11-138A9.2,RP11-138I1.2,RP11-138I1.4,RP11-138P22.1,RP11-1391J7.1,RP11-1398P2.1,RP11-1399P15.1,RP11-139H15.1,RP11-13P5.1,RP11-1407O15.2,RP11-140K17.3,RP11-141B14.1,RP11-143J12.2,RP11-143M1.4,RP11-145M9.4,RP11-146F11.1,RP11-147L13.2,RP11-14N7.2,RP11-152N13.5,RP11-154H23.3,RP11-156E8.1,RP11-157D23.2,RP11-157E21.1,RP11-158K1.3,RP11-159D12.2,RP11-159G9.5,RP11-159H10.3,RP11-15A1.3,RP11-15A1.7,RP11-160H22.5,RP11-160O5.1,RP11-161M6.2,RP11-161M6.3,RP11-162G10.5,RP11-164H13.1,RP11-165A20.3,RP11-165J3.6,RP11-166A12.1,RP11-166B2.1,RP11-166O4.6,RP11-166P13.3,RP11-166P13.4,RP11-169D4.2,RP11-169E6.1,RP11-169K16.9,RP11-16E12.1,RP11-16N11.2,RP11-171I2.5,RP11-173M1.8,RP11-175O19.4,RP11-176H8.1,RP11-178G16.4,RP11-178L8.7,RP11-179B2.2,RP11-17M16.1,RP11-17M16.2,RP11-181C3.1,RP11-182L21.6,RP11-185E8.1,RP11-188D8.1,RP11-18H21.1,RP11-18H7.1,RP11-190C22.8,RP11-190C22.9,RP11-191L17.1,RP11-192H23.4,RP11-192H23.6,RP11-195F19.5,RP11-195F19.9,RP11-196G18.24,RP11-202G18.1,RP11-203J24.8,RP11-204M4.2,RP11-206L10.2,RP11-212I21.4,RP11-212P7.2,RP11-214K3.24,RP11-214N9.1,RP11-214O1.2,RP11-214O1.3,RP11-215G15.5,RP11-215P8.3,RP11-216B9.6,RP11-217B1.2,RP11-217B7.3,RP11-218M22.1,RP11-219B17.1,RP11-219G17.4,RP11-220I1.1,RP11-221J22.2,RP11-222K16.2,RP11-223C24.1,RP11-227G15.8,RP11-228B15.4,RP11-229C3.2,RP11-22N19.2,RP11-22P6.2,RP11-22P6.3,RP11-236B18.5,RP11-23P13.6,RP11-23P13.7,RP11-243J16.7,RP11-244H3.4,RP11-245G13.2,RP11-245J9.4,RP11-24F11.2,RP11-252A24.3,RP11-252A24.7,RP11-254F7.2,RP11-256L11.3,RP11-258C19.7,RP11-258F1.1,RP11-259K5.2,RP11-259N19.1,RP11-25K19.1,RP11-260E18.1,RP11-262H14.1,RP11-262H14.3,RP11-262H14.4,RP11-265N6.1,RP11-265P11.2,RP11-266L9.5,RP11-267N12.3,RP11-268F1.3,RP11-273G15.2,RP11-274B18.2,RP11-275I4.1,RP11-275I4.2,RP11-276E17.2,RP11-277L2.3,RP11-277L2.4,RP11-277P12.20,RP11-279F6.3,RP11-279O9.4,RP11-284M14.1,RP11-284N8.3,RP11-285F7.2,RP11-288H12.4,RP11-288L9.1,RP11-290F20.3,RP11-290F5.1,RP11-291B21.2,RP11-293A21.1,RP11-293M10.5,RP11-293M10.6,RP11-294J22.6,RP11-295G20.2,RP11-295P9.3,RP11-296K13.4,RP11-296O14.3,RP11-297B17.3,RP11-297N6.4,RP11-298J20.3,RP11-299J3.8,RP11-2H8.2,RP11-301O19.1,RP11-302B13.5,RP11-304L19.5,RP11-305L7.1,RP11-307C12.12,RP11-307C12.13,RP11-308N19.1,RP11-309G3.3,RP11-312O7.2,RP11-313P13.5,RP11-314A20.2,RP11-314N13.3,RP11-315I20.1,RP11-315O6.1,RP11-317J10.2,RP11-318A15.2,RP11-318E3.9,RP11-319G6.1,RP11-31F15.2,RP11-320M2.1,RP11-323J4.1,RP11-324I22.4,RP11-325F22.2,RP11-326C3.11,RP11-326C3.12,RP11-326C3.15,RP11-326F20.5,RP11-326G21.1,RP11-326I11.3,RP11-326K13.4,RP11-327F22.2,RP11-327P2.5,RP11-328M4.2,RP11-332H14.2,RP11-333E1.1,RP11-334C17.5,RP11-336K24.12,RP11-337C18.8,RP11-337C18.9,RP11-338C15.3,RP11-338I21.1,RP11-339B21.11,RP11-33O4.1,RP11-342K6.4,RP11-342M1.3,RP11-343N15.1,RP11-343N15.5,RP11-344P13.6,RP11-345J4.5,RP11-345P4.9,RP11-347P5.1,RP11-348N5.7,RP11-349A22.5,RP11-350F4.2,RP11-350G8.5,RP11-350N15.5,RP11-351D16.3,RP11-351I24.1,RP11-353N4.4,RP11-354E11.2,RP11-354P11.3,RP11-355B11.2,RP11-356I2.1,RP11-356I2.4,RP11-356N1.2,RP11-357H14.17,RP11-35G9.3,RP11-360L9.7,RP11-362F19.1,RP11-363E6.3,RP11-366L5.1,RP11-367G18.1,RP11-367G6.3,RP11-367J11.3,RP11-367N14.2,RP11-368I7.4,RP11-372K14.2,RP11-373D23.3,RP11-378A13.2,RP11-378J18.3,RP11-378J18.8,RP11-379B18.5,RP11-379F4.9,RP11-379H18.1,RP11-37B2.1,RP11-37C7.3,RP11-381K20.2,RP11-382J12.1,RP11-383C5.3,RP11-383C5.4,RP11-383H13.1,RP11-383J24.6,RP11-384C4.7,RP11-384O8.1,RP11-385D13.3,RP11-385F5.5,RP11-385F7.1,RP11-386G11.10,RP11-386G11.5,RP11-386I14.4,RP11-388M20.1,RP11-38G5.4,RP11-38P22.2,RP11-390B4.5,RP11-390E23.6,RP11-390F4.3,RP11-390P2.4,RP11-392P7.6,RP11-393I2.4,RP11-395A13.2,RP11-395B7.4,RP11-395G23.3,RP11-397A16.2,RP11-398C13.6,RP11-398K22.12,RP11-400F19.6,RP11-400N13.3,RP11-401F2.3,RP11-401P9.4,RP11-401P9.5,RP11-402D21.2,RP11-402G3.3,RP11-403A21.2,RP11-403I13.4,RP11-403I13.5,RP11-403I13.8,RP11-404F10.2,RP11-406H21.2,RP11-407H12.8,RP11-407N17.5,RP11-408H1.3,RP11-410L14.2,RP11-412D9.4,RP11-413H22.2,RP11-415F23.2,RP11-418J17.1,RP11-420A6.2,RP11-420G6.4,RP11-420L9.5,RP11-421L21.2,RP11-421L21.3,RP11-421M1.8,RP11-422P24.11,RP11-423O2.5,RP11-425D10.10,RP11-425I13.3,RP11-426C22.5,RP11-426L16.3,RP11-427H3.3,RP11-428G5.5,RP11-428J1.5,RP11-429E11.3,RP11-429J17.2,RP11-430B1.2,RP11-430C7.4,RP11-430H10.1,RP11-432I5.1,RP11-432J22.2,RP11-434B12.1,RP11-434D11.4,RP11-434H6.6,RP11-434H6.7,RP11-438L19.1,RP11-440D17.3,RP11-440L14.1,RP11-440L14.4,RP11-441O15.3,RP11-442H21.2,RP11-443B7.1,RP11-445H22.3,RP11-446E9.2,RP11-446N19.1,RP11-448A19.1,RP11-449H11.1,RP11-449P15.2,RP11-44F14.2,RP11-44N11.2,RP11-452F19.3,RP11-452H21.4,RP11-452L6.5,RP11-452L6.7,RP11-455F5.3,RP11-455F5.5,RP11-455J20.3,RP11-459I19.1,RP11-45M22.5,RP11-462L8.1,RP11-464F9.20,RP11-466P24.6,RP11-467D6.1,RP11-467P9.1,RP11-468E2.4,RP11-468E2.5,RP11-469M7.1,RP11-46A10.4,RP11-46A10.5,RP11-46C24.7,RP11-46H11.12,RP11-46H11.3,RP11-46J23.1,RP11-472G21.2,RP11-473I1.5,RP11-473M20.14,RP11-473M20.16,RP11-473M20.7,RP11-473M20.9,RP11-474G23.2,RP11-474G23.3,RP11-475I24.3,RP11-475O6.1,RP11-477D19.2,RP11-479G22.8,RP11-47A8.5,RP11-47I22.2,RP11-47I22.3,RP11-47L3.1,RP11-480C16.1,RP11-481A20.11,RP11-483I13.5,RP11-483P21.2,RP11-484K9.4,RP11-485G4.2,RP11-486G15.2,RP11-486O12.2,RP11-488C13.5,RP11-488L18.10,RP11-489E7.4,RP11-490M8.1,RP11-490O6.2,RP11-492E3.2,RP11-492E3.51,RP11-493L12.3,RP11-493L12.4,RP11-498C9.15,RP11-498D10.6,RP11-498P14.5,RP11-499E18.1,RP11-499P20.2,RP11-4F5.2,RP11-500C11.3,RP11-502I4.3,RP11-505K9.1,RP11-505P4.6,RP11-506M13.3,RP11-50E11.3,RP11-513I15.6,RP11-513M16.8,RP11-514O12.4,RP11-517P14.2,RP11-518L10.5,RP11-519G16.3,RP11-519G16.5,RP11-51J9.5,RP11-522I20.3,RP11-526I2.5,RP11-527L4.5,RP11-528I4.2,RP11-532F12.5,RP11-532F6.3,RP11-533E19.5,RP11-533E19.7,RP11-534L20.5,RP11-535A5.1,RP11-536C5.7,RP11-536K7.3,RP11-538P18.2,RP11-539G18.2,RP11-539L10.2,RP11-539L10.3,RP11-53B2.3,RP11-53I6.3,RP11-53O19.3,RP11-542K23.7,RP11-542M13.3,RP11-544A12.8,RP11-544I20.2,RP11-545E17.3,RP11-545I5.3,RP11-546J1.1,RP11-548P2.2,RP11-54O7.17,RP11-553L6.2,RP11-553L6.5,RP11-554J4.1,RP11-556E13.1,RP11-557H15.3,RP11-558F24.4,RP11-559M23.1,RP11-55K13.1,RP11-560J1.2,RP11-563N4.1,RP11-568N6.1,RP11-571M6.17,RP11-572O17.1,RP11-573D15.9,RP11-574F21.2,RP11-574K11.5,RP11-575L7.8,RP11-576N17.5,RP11-57G10.8,RP11-580I16.2,RP11-582J16.5,RP11-589C21.6,RP11-589P10.5,RP11-58B17.2,RP11-58E21.3,RP11-58E21.4,RP11-597D13.9,RP11-598F7.3,RP11-5C23.1,RP11-600F24.7,RP11-60I3.5,RP11-611E13.2,RP11-611L7.1,RP11-611O2.1,RP11-617D20.1,RP11-617F23.1,RP11-61A14.3,RP11-61J19.3,RP11-61J19.4,RP11-620J15.3,RP11-624C23.1,RP11-624L4.1,RP11-624L4.2,RP11-626G11.5,RP11-629G13.1,RP11-629O1.2,RP11-631N16.2,RP11-634H22.1,RP11-635N19.1,RP11-637A17.2,RP11-63P12.6,RP11-640M9.1,RP11-641A6.2,RP11-644F5.11,RP11-645C24.5,RP11-646I6.5,RP11-649A18.12,RP11-649E7.5,RP11-656D10.3,RP11-658F2.8,RP11-65I12.1,RP11-65J3.1,RP11-65L3.4,RP11-660L16.2,RP11-661A12.14,RP11-661A12.5,RP11-664D1.1,RP11-666A1.5,RP11-66N11.8,RP11-66N24.3,RP11-66N24.4,RP11-670E13.2,RP11-672A2.6,RP11-672L10.6,RP11-673E1.1,RP11-677M14.3,RP11-679C8.2,RP11-67L2.2,RP11-680A11.5,RP11-680G24.5,RP11-682N22.1,RP11-686D22.4,RP11-686O6.2,RP11-690G19.3,RP11-693J15.5,RP11-69E11.4,RP11-6J24.6,RP11-701P16.5,RP11-705O1.8,RP11-706O15.1,RP11-706O15.3,RP11-706O15.5,RP11-707M3.3,RP11-708J19.1,RP11-70C1.1,RP11-70P17.1,RP11-712B9.2,RP11-712L6.5,RP11-713M15.2,RP11-713P17.3,RP11-71G12.1,RP11-727A23.11,RP11-727F15.11,RP11-727F15.12,RP11-727F15.9,RP11-728F11.4,RP11-72M17.1,RP11-730K11.1,RP11-732M18.3,RP11-734K2.4,RP11-737O24.1,RP11-737O24.5,RP11-73E17.2,RP11-73G16.2,RP11-73K9.2,RP11-73M7.9,RP11-73O6.3,RP11-74E22.3,RP11-752G15.6,RP11-755B10.4,RP11-755F10.1,RP11-760N9.1,RP11-764K9.1,RP11-76E17.3,RP11-76N22.2,RP11-770J1.4,RP11-773D16.1,RP11-774O3.3,RP11-778D9.13,RP11-77H9.2,RP11-77P6.2,RP11-782C8.1,RP11-783K16.14,RP11-783K16.5,RP11-78A19.3,RP11-78O7.2,RP11-791G15.2,RP11-792A8.4,RP11-793H13.11,RP11-796E2.4,RP11-796G6.2,RP11-798G7.6,RP11-798M19.6,RP11-799D4.4,RP11-79H23.3,RP11-79O8.1,RP11-79P5.9,RP11-7F17.7,RP11-7O11.3,RP11-802E16.3,RP11-802O23.3,RP11-803B1.8,RP11-803D5.4,RP11-804H8.6,RP11-806H10.4,RP11-809N8.4,RP11-809O17.1,RP11-810P12.5,RP11-814H16.2,RP11-817O13.8,RP11-819C21.1,RP11-81H14.2,RP11-81N13.1,RP11-820I16.4,RP11-830F9.6,RP11-834C11.4,RP11-83A24.2,RP11-83N9.5,RP11-841O20.2,RP11-849H4.2,RP11-849H4.4,RP11-84C10.2,RP11-856M7.6,RP11-85A1.3,RP11-85F14.5,RP11-861A13.4,RP11-862L9.3,RP11-867G23.8,RP11-869B15.1,RP11-872J21.3,RP11-874J12.4,RP11-875O11.1,RP11-879F14.2,RP11-87C12.2,RP11-884K10.7,RP11-890B15.3,RP11-894J14.5,RP11-894P9.1,RP11-89F3.2,RP11-89K11.1,RP11-8L8.2,RP11-91G21.1,RP11-927P21.1,RP11-92K15.3,RP11-930P14.2,RP11-936I5.1,RP11-94L15.2,RP11-950C14.3,RP11-95D17.1,RP11-95P2.1,RP11-960L18.1,RP11-96K19.2,RP11-96K19.5,RP11-973H7.1,RP11-973H7.4,RP11-977G19.11,RP11-977G19.12,RP11-97C16.1,RP11-98D18.9,RP11-98I9.4,RP13-1032I1.7,RP13-131K19.6,RP13-188A5.1,RP13-20L14.6,RP13-270P17.1,RP13-270P17.3,RP13-314C10.5,RP13-467H17.1,RP13-516M14.4,RP13-580F15.2,RP13-582O9.5,RP13-582O9.7,RP13-638C3.4,RP13-753N3.1,RP13-977J11.2,RP3-322G13.5,RP3-325F22.5,RP3-329A5.8,RP3-329E20.2,RP3-369A17.4,RP3-395M20.12,RP3-395M20.9,RP3-399L15.3,RP3-400N23.6,RP3-402G11.25,RP3-402G11.26,RP3-428L16.2,RP3-465N24.6,RP3-467K16.4,RP3-467N11.1,RP3-475N16.1,RP3-477O4.14,RP3-486I3.7,RP3-508I15.14,RP3-508I15.21,RP3-508I15.22,RP3-512B11.3,RP4-539M6.22,RP4-548D19.3,RP4-549L20.3,RP4-553F4.6,RP4-561L24.3,RP4-575N6.4,RP4-583P15.15,RP4-594I10.3,RP4-610C12.3,RP4-612B15.3,RP4-614O4.12,RP4-635E18.6,RP4-635E18.8,RP4-647C14.2,RP4-647C14.3,RP4-666F24.3,RP4-673M15.1,RP4-718J7.4,RP4-728D4.2,RP4-756G23.5,RP4-758J18.2,RP4-758J24.5,RP4-773N10.4,RP4-794H19.4,RP4-798A10.2,RP4-798A10.7,RP4-800J21.3,RP4-813F11.4,RP4-816N1.6,RP5-1007M22.2,RP5-1021I20.1,RP5-1021I20.2,RP5-1024G6.5,RP5-1024G6.7,RP5-1028K7.2,RP5-1042K10.10,RP5-1050D4.3,RP5-1057J7.6,RP5-1065J22.8,RP5-1068E13.7,RP5-1071N3.1,RP5-1073O3.7,RP5-1074L1.4,RP5-1085F17.3,RP5-1086K13.1,RP5-1091N2.9,RP5-1092A3.4,RP5-1103G7.4,RP5-1112D6.4,RP5-1115A15.1,RP5-1120P11.1,RP5-1125A11.1,RP5-1136G13.2,RP5-1148A21.3,RP5-1157M23.2,RP5-1159O4.1,RP5-1180C10.2,RP5-1180E21.5,RP5-821D11.7,RP5-827C21.4,RP5-864K19.4,RP5-882C2.2,RP5-886K2.3,RP5-887A10.1,RP5-892K4.1,RP5-894A10.2,RP5-894D12.3,RP5-899E9.1,RP5-903G2.2,RP5-914P20.5,RP5-935K16.1,RP5-940J5.6,RP5-940J5.9,RP5-943J3.2,RP5-997D24.3,RP6-109B7.3,RP6-109B7.5,RP6-206I17.2,RP6-65G23.3,RP6-91H8.3,RP6-99M1.2,RPA3-AS1,RPS17L,RPSAP58,RQCD1,RRNAD1,RSBN1L-AS1,RSG1,RTFDC1,SARS,SDCCAG3,SDPR,SELK,SELM,SELO,SELT,SEP15,SEPN1,SEPT1,SEPT10,SEPT11,SEPT2,SEPT4,SEPT5,SEPT6,SEPT7,SEPT8,SEPT9,SEPW1,SETD5-AS1,SETD8,SF3B14,SGK223,SGOL1,SGOL2,SHFM1,SHPK.1,SKIV2L,SKIV2L2,SLC25A10.1,SLC35E2,SLMO2,SMC5-AS1,SMEK1,SMEK2,SMIM4,SNORA40.1,SNORA76,SNURF.1,SOGA2,SPG20,SQRDL,SRP14-AS1,SRPR,SSFA2,SSSCA1,SSSCA1-AS1,STRA13,SUV420H1,SUV420H2,TAP2.1,TAPSAR1,TARS,TARSL2,TAZ,TCEB1,TCEB2,TCEB3,TCTEX1D2,TCTEX1D4,TLDC1,TM4SF19.1,TMBIM4.1,TMCC1-AS1,TMEM110,TMEM110-MUSTN1,TMEM136,TMEM159,TMEM161B-AS1,TMEM173,TMEM180,TMEM189,TMEM194A,TMEM194B,TMEM2,TMEM206,TMEM251,TMEM261,TMEM27,TMEM5,TMEM55A,TMEM55B,TMEM56,TMEM57,TMEM66,TMEM8A,TMEM99,TNKS2-AS1,TOLLIP-AS1,TOMM70A,TOPORS-AS1,TRAPPC2P1,TROVE2,TSSC1,TSTA3,TTC18,TTC37,TWISTNB,U47924.31,U73166.2,U91328.19,U91328.20,UBA6-AS1,UBL7-AS1,UBXN10-AS1,UFD1L,UHRF1BP1,UHRF1BP1L,UPK3BL,UQCR11.1,USMG5,USP27X-AS1,USP46-AS1,UTP11L,VARS,VIMP,VNN3,VPRBP,VWA9,WAPAL,WARS,WBP5,WBSCR16,WBSCR22,WBSCR27,WDR11-AS1,WDR34,WDR52,WDR60,WDR61,WDR66,WDR78,WDR92,WDYHV1,WHSC1,WHSC1L1,WIBG,WRB,XRCC6BP1,XXbac-B135H6.15,XXbac-B135H6.18,XXbac-B444P24.14,XXbac-B476C20.9,XXbac-BPG181B23.7,XXbac-BPG252P9.10,XXbac-BPG252P9.9,XXbac-BPG299F13.17,XXbac-BPG308K3.5,XXbac-BPGBPG55C20.2,XXcos-LUCA11.4,Y-RNA,YAE1D1,YARS,YTHDF3-AS1,Z83851.4,ZADH2,ZBED6CL,ZCCHC11,ZCCHC6,ZFYVE20,ZHX1-C8ORF76,ZNF223.1,ZNF259,ZNF582-AS1,ZNF720,ZNF724P,ZNRD1,ZNRD1-AS1,ZRSR1,ZUFSP,hsa-mir-1199,hsa-mir-8072
β
linked: FeatureSet(id='4ygLm9YMmSGEda6DLsXA', n=13439, type='float', registry='bionty.Gene', hash='gaUdPsDPTCmmEpfj_TY0', created_by_id='DzTjkKse')
π‘ parsing feature names of slot 'obs'
β did not validate 5 Feature records for names: orig.ident, nCount_RNA, nFeature_RNA, stim, seurat_annotations
β ignoring non-validated features: orig.ident,nCount_RNA,nFeature_RNA,stim,seurat_annotations
β no validated features, skip creating feature set
ln.save(file)
β
saved 1 feature set for slot: ['var']
β
storing file 'MEXvOFrrAMjeQ2TMGg7c' at '.lamindb/MEXvOFrrAMjeQ2TMGg7c.h5ad'
We further create two feature sets for degs_up
and degs_dw
which we can later associate with the associated pathways:
degs_up_featureset = ln.FeatureSet.from_values(degs_up.names, lb.Gene.symbol)
Show code cell output
π‘ using global setting species = human
β
validated 127 Gene records on symbol: TMSB10, ISG20, PSMB8, PSMB9, CLIC1, IRF8, XRN1, PSMB8, ISG15, PSMB8, ELF1, TAP1, TNFSF10, NT5C3A, PSMB9, HLA-B, TAP1, TAP1, PSMB8, IRF7, ...
β did not validate 4 Gene records for symbols: DDX58, H3F3B, PSMA2.1, WARS
β ignoring non-validated features: DDX58,H3F3B,PSMA2.1,WARS
degs_dw_featureset = ln.FeatureSet.from_values(degs_dw.names, lb.Gene.symbol)
Show code cell output
π‘ using global setting species = human
β
validated 72 Gene records on symbol: RPSA, HLA-DRA, RPL13A, ZFP36L1, FTH1, CYBA, RPL14, HLA-DRB1, OAZ1, EEF1A1, RPL11, RPL7A, GSTP1, RPS2, ID3, HLA-DRA, EEF2, RPL4, GAPDH, HLA-DRA, ...
β did not validate 2 Gene records for symbols: GNB2L1, TMEM66
β ignoring non-validated features: GNB2L1,TMEM66
Link the top 10 pathways to the corresponding differentially expressed genes:
# get ontology ids for the top 10 pathways
enr_up_top10 = [
pw_id[0] for pw_id in enr_up.head(10).Term.apply(parse_ontology_id_from_keys)
]
enr_dw_top10 = [
pw_id[0] for pw_id in enr_dw.head(10).Term.apply(parse_ontology_id_from_keys)
]
# get pathway records
enr_up_top10_pathways = lb.Pathway.from_values(enr_up_top10, lb.Pathway.ontology_id)
enr_dw_top10_pathways = lb.Pathway.from_values(enr_dw_top10, lb.Pathway.ontology_id)
β
validated 10 Pathway records on ontology_id: GO:0032728, GO:0034341, GO:0035456, GO:0045069, GO:0045071, GO:0048525, GO:0051607, GO:0070106, GO:0140374, GO:0140546
β
validated 10 Pathway records on ontology_id: GO:0002181, GO:0006364, GO:0006412, GO:0009059, GO:0010467, GO:0016072, GO:0022613, GO:0042254, GO:0042274, GO:0043043
Link feature sets to file:
file.features.add_feature_set(degs_up_featureset, slot="up-DEGs")
file.features.add_feature_set(degs_dw_featureset, slot="down-DEGs")
Associate the pathways to the differentially expressed genes:
degs_up_featureset.pathways.set(enr_up_top10_pathways)
degs_dw_featureset.pathways.set(enr_dw_top10_pathways)
degs_up_featureset.pathways.list("name")
['interleukin-27-mediated signaling pathway',
'response to type II interferon',
'response to interferon-beta',
'defense response to symbiont',
'positive regulation of interferon-beta production',
'negative regulation of viral genome replication',
'negative regulation of viral process',
'antiviral innate immune response',
'defense response to virus',
'regulation of viral genome replication']
Querying for pathways#
Querying for pathways is now simple with .filter
:
lb.Pathway.filter(name__contains="interferon-beta").df()
name | ontology_id | abbr | synonyms | description | bionty_source_id | updated_at | created_by_id | |
---|---|---|---|---|---|---|---|---|
id | ||||||||
l06ZujxW | cellular response to interferon-beta | GO:0035458 | None | cellular response to fibroblast interferon|cel... | Any Process That Results In A Change In State ... | pj5J | 2023-08-17 17:34:51 | DzTjkKse |
GD9xCHBK | regulation of interferon-beta production | GO:0032648 | None | regulation of IFN-beta production | Any Process That Modulates The Frequency, Rate... | pj5J | 2023-08-17 17:34:51 | DzTjkKse |
uu9GYFx2 | negative regulation of interferon-beta production | GO:0032688 | None | down regulation of interferon-beta production|... | Any Process That Stops, Prevents, Or Reduces T... | pj5J | 2023-08-17 17:34:51 | DzTjkKse |
mCgM7JYR | response to interferon-beta | GO:0035456 | None | response to fiblaferon|response to fibroblast ... | Any Process That Results In A Change In State ... | pj5J | 2023-08-17 17:34:51 | DzTjkKse |
SGYMKD7O | positive regulation of interferon-beta production | GO:0032728 | None | positive regulation of IFN-beta production|up-... | Any Process That Activates Or Increases The Fr... | pj5J | 2023-08-17 17:34:51 | DzTjkKse |
Query pathways from a gene:
lb.Pathway.filter(genes__symbol="KIR2DL1").df()
name | ontology_id | abbr | synonyms | description | bionty_source_id | updated_at | created_by_id | |
---|---|---|---|---|---|---|---|---|
id | ||||||||
TSXmNUbN | immune response-inhibiting cell surface recept... | GO:0002767 | None | immune response-inhibiting cell surface recept... | The Series Of Molecular Signals Initiated By A... | pj5J | 2023-08-17 17:34:51 | DzTjkKse |
Query files from a pathway:
ln.File.filter(feature_sets__pathways__name__icontains="interferon-beta").first()
File(id='MEXvOFrrAMjeQ2TMGg7c', suffix='.h5ad', accessor='AnnData', description='seurat_ifnb_activated_Bcells', size=5896640, hash='vAWd5emmLj0nv0E0x5LOSA', hash_type='md5', updated_at=2023-08-17 17:35:45, storage_id='EJlRAlPa', transform_id='6oxEIEduvo6wz8', run_id='aTaG4cz17LGPqTetRLw0', created_by_id='DzTjkKse')
Query featuresets from a pathway to learn from which geneset this pathway was computed:
pathway = lb.Pathway.filter(ontology_id="GO:0035456").one()
pathway
Pathway(id='mCgM7JYR', name='response to interferon-beta', ontology_id='GO:0035456', synonyms='response to fiblaferon|response to fibroblast interferon|response to interferon beta', description='Any Process That Results In A Change In State Or Activity Of A Cell Or An Organism (In Terms Of Movement, Secretion, Enzyme Production, Gene Expression, Etc.) As A Result Of An Interferon-Beta Stimulus. Interferon-Beta Is A Type I Interferon.', updated_at=2023-08-17 17:34:51, bionty_source_id='pj5J', created_by_id='DzTjkKse')
degs = ln.FeatureSet.filter(pathways__ontology_id=pathway.ontology_id).one()
Now we can get the list of genes that are differentially expressed and belong to this pathway:
pathway_genes = set(pathway.genes.list("symbol"))
degs_genes = set(degs.genes.list("symbol"))
pathway_genes.intersection(degs_genes)
{'BST2',
'IFI16',
'IFITM2',
'IFITM3',
'IRF1',
'OAS1',
'PLSCR1',
'STAT1',
'XAF1'}
Conclusion#
Registering pathways and associated gene sets is made simple with .from_values
that ensures that all parsed objects are linked to ontology IDs.Linking both sets is possible with FeatureSet to facilitate simple querying for datasets that contain specific pathways. Since the pathways are linked to genes, Lamin also enables fetching the associated genes of a registered pathway to, for usecase, retrieve sets of differentially expressed genes that are a part of a specific pathway.
Try it yourself#
This notebook is available at laminlabs/lamin-usecases.
Show code cell content
!lamin delete --force enrichr
!rm -r ./enrichr
π‘ deleting instance testuser1/enrichr
β
deleted instance settings file: /home/runner/.lamin/instance--testuser1--enrichr.env
β
instance cache deleted
β
deleted '.lndb' sqlite file
β consider manually deleting your stored data: /home/runner/work/lamin-usecases/lamin-usecases/docs/enrichr