Monarch geneset OGS2.0

DPOGS203704
Transcript: DPOGS203704-TA (2808 bp)
Protein: DPOGS203704-PA (935 aa)
Genomic position: DPSCF300010 - 1562778-1566654
RNAseq coverage: 93x (Rank: top 62%)
Annotation
Heliconius        HMEL012511         0.0      64.57%
Bombyx            BGIBMGA003496-TA   5e-132   63.05%
Drosophila        CG2292-PA          2e-122   31.64%
EBI UniRef50      UniRef50_O95427    3e-179   38.03%   GPI ethanolamine phosphate transferase 1 n=70 Tax=Eumetazoa RepID=PIGN_HUMAN
NCBI RefSeq       XP_001639662.1     2e-177   36.59%   predicted protein [Nematostella vectensis]
NCBI nr blastp    gi|317419815       0.0      39.15%   GPI ethanolamine phosphate transferase 1 [Dicentrarchus labrax]
NCBI nr blastx    gi|300797485       0.0      38.02%   GPI ethanolamine phosphate transferase 1 [Bos taurus]
Group
Gene Ontology
GO:0006506   1.7e-242   GPI anchor biosynthetic process
GO:0016740   1.7e-242   transferase activity
GO:0005789   1.7e-242   endoplasmic reticulum membrane
GO:0008152   1.7e-30    metabolic process
GO:0003824   1.7e-30    catalytic activity
KEGG pathway
gga:420908   0.0
K05285 (PIGN) maps-> Glycosylphosphatidylinositol (GPI)-anchor biosynthesis
InterPro domain
[1-916]     IPR007070   1.7e-242   GPI ethanolamine phosphate transferase 1
[418-884]   IPR017852   8.5e-83    GPI ethanolamine phosphate transferase 1, C-terminal
[40-335]    IPR017850   1.7e-30    Alkaline-phosphatase-like, core domain
[42-331]    IPR017849   7.7e-30    Alkaline phosphatase-like, alpha/beta/alpha
[161-272]   IPR002591   9.1e-08    Type I phosphodiesterase/nucleotide pyrophosphatase/phosphate transferase
Orthology group: MCL13414 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS203704-TA
ATGTTTTTGTTGGGGTTGATTGTGCATATAGTTTTTTTGTTTTCTATATTTGATATTTATTTTAAATCGCCTATTGTAAGTCATGTAAAACCATATCAACCTATTCACGAAGCTCCGGCGGACCGTCTAGTATTATTTGTTGTTGATGGCTTAAGAGCAGAATCTTTCTTAAATTATACAACGATGCCATATCTTCGATCGGTCGCAAATACCAATGGTCTGTGGGGTATATCACGCACTAGGGTGCCCACGGAGTCAAGACCAGGTCATGTAGCTATATTAGCAGGATTCTATGAAGACCCCTCAGCTGTTGCAAAAGGCTGGAAACAAAATCCTGTTGACTTTGATTCAGTGCTCAATCAAAGTGTTTATAGTTGGTGCTGGGGCACATATGATATATTAGAAATATTTGCTAAGGATGACTTAAGTGGGCACATATACACTGAAAAAATGGATCCTTATGATGAAACATACAGCCCAAATAGAAATACCACCACTTTGGATAAATGGGTATTTGATAGAGTAAATTATTTTTTTAATAGACAAGAATTGGACAGTGAAATTTACAAAAAGTTACAACATGATAAAATTTTATTTTTTTTGCATCTGTTAGGAACAGATAGCTCTGGACACATGCATAAACCAAAGTCACAAAATTTCTTAACAACTATTAAATTTGTTGATGAGAATATTCAAGAAATTGAACAAATCATAAGGAAATTCTACAAAGATGATGGACGAACAGCATTTTTAATGACATCTGATCATGGGATGACTGATTGGGGTTCTCATGGTACAGGGGATGATCATGAAACTGAAACCCCTTATGTCTTATGGGGTGCTGGCGTAACACAGATTGAAAGTGAATCAATACAGCTTGATAACAACTATGAAATGTCTCTTGATAACCGACATGATATCAATCAGGCCGATCTAACTCCTTTGATGTCTACATTGCTTTCTATTCCCGTTCCCGTTAACTCCATTGGACAGTTGCCAAGTGAACTACTGAACATGACACTTCCAAATAAGGCTAAAGCTATATACAGTAACTGCATACAGATGATATCACAATACAATAAAAAGAGAATGGATATTGAATCCAGCGCAATATCATTCCTCTACCATCCATATGAACCACTCAGTAGTGAAAAAGTAGAAGAAATACTTCATGTAATGGAAATGTTATTAGCTGAAGAAAAATACAACAGTCTCATTAGTTTGTGCGAAGACATTATGCATTTAAGCTTAAACGGCCTCTCATATTATCATAACTATTATCAGAAGCCGTTGTTGATCACTATATCACTATCATTTATGGGCTGGATTATGTTTTTATTGAAAGTATTACTAAAACAAAGAATAAATACTCAAGCTGAATATTCTATCGCAAGTAAAGGACTGCTTAGGCCCAATGGAGGAGTGGACATAGTTATTAGAACAATTTCTATTCTTATGGCTATATTACCTTTTTATTTGATATACGCTCAAAACTTACCAATGCAGTATTATATTTACTTTGTGATGCCAATAGTTATGTGGATGTATGCTGTGACACCAGTCAAATTATGGATGGTTACTTTAAAATCTATTAGAAATAAAAAAACGCTGTTGATCTTATGGTTTGAAATAGTATGTTATACTTTGGGATCACTTGCTATGGGGTTTGCTTTTAGTCAGAGATGGGTGTTGAGCATCCCTCTACTAGGTATGAGCTTGTGGCCATTTTTATCATCAAAACAAAACTCAAGATCCACTTACATTGCATGGGTTGCTGGATGTATAATGCTAAGTGTTTTTTCATTCCTGCCAGTAGTTGGTAAAGATGTGTGTATTGAATTAGTAATTCTTGCAGGTTTAATTTGGATAATAGCGGTGAGTTTCTATGTATGGAACACATTGATGCCATACTACGACAAAAATAACGAAATTAAACGGGAGGCGATTATGAGTATCATACAAATAACAATCTTAGCAATATCATTGGTAATCATCTTTTT
GCAGTCGAAAAGGTTTATAGACGAAATGCCAATATCAAGGGTCTTTCAAATATTATGTTGGAATTTTTCTGTTATGTTCCCTCTACTGCCACTGTTCTATACAAAGAAATGTATAAATAGACTTCTTGGTATAAACACATCAATTCTCAATTTTTACTTGCTATTATCTGTGTCTCACGAGGGCCTTTTTATGGTGACACTAATATTTATTTTGAACTGTTGGATGATTATTGAATATAAACTAATAGATGTGGGAAAAGCAAAGATTGACGATTTAACTTTTGAAAGTGATGACACTGATAATGAAAGGAACATATCGTATATAGAAAGAGGTATCAATAAACAAGATTTCAGAAGGGCTTTCTTTTTTACATTATTTATAATACTAGCTTACTTTGGAACAGGAAACATAGCTTCATTGAATTCATTTGAAGTCAGGTGGGTGTTGTGTTTCACAACATCGTTTCAGCCATTCGTAATAACAACATTAATTCTGCTGAAAACTTTGGCTCCGTTTTTAAGTGTTGCCTGCACATTTAGGGCCATACAAATATTTACAAAGGCTCCCGTTGGCTGTTTGAACATAATTGTGCTGATATTTTCAAATATAATGGGAATCCAGATGTTATTTCAAGTTAGAAACACTGGAAGTTGGCTTGAGATAGGAACATCAATATCTCAGTTTGTCATAGTACAAACAATTACATTATTTATTGTTTTAATAAGTCAAATAGCCAAAATATTCACCGGAACGGATCTGTATGGTACTTTACTTAAGTTATTAATGACTAAAAAAAAATACGCTTAA

Protein sequence:

>DPOGS203704-PA
MFLLGLIVHIVFLFSIFDIYFKSPIVSHVKPYQPIHEAPADRLVLFVVDGLRAESFLNYTTMPYLRSVANTNGLWGISRTRVPTESRPGHVAILAGFYEDPSAVAKGWKQNPVDFDSVLNQSVYSWCWGTYDILEIFAKDDLSGHIYTEKMDPYDETYSPNRNTTTLDKWVFDRVNYFFNRQELDSEIYKKLQHDKILFFLHLLGTDSSGHMHKPKSQNFLTTIKFVDENIQEIEQIIRKFYKDDGRTAFLMTSDHGMTDWGSHGTGDDHETETPYVLWGAGVTQIESESIQLDNNYEMSLDNRHDINQADLTPLMSTLLSIPVPVNSIGQLPSELLNMTLPNKAKAIYSNCIQMISQYNKKRMDIESSAISFLYHPYEPLSSEKVEEILHVMEMLLAEEKYNSLISLCEDIMHLSLNGLSYYHNYYQKPLLITISLSFMGWIMFLLKVLLKQRINTQAEYSIASKGLLRPNGGVDIVIRTISILMAILPFYLIYAQNLPMQYYIYFVMPIVMWMYAVTPVKLWMVTLKSIRNKKTLLILWFEIVCYTLGSLAMGFAFSQRWVLSIPLLGMSLWPFLSSKQNSRSTYIAWVAGCIMLSVFSFLPVVGKDVCIELVILAGLIWIIAVSFYVWNTLMPYYDKNNEIKREAIMSIIQITILAISLVIIFLQSKRFIDEMPISRVFQILCWNFSVMFPLLPLFYTKKCINRLLGINTSILNFYLLLSVSHEGLFMVTLIFILNCWMIIEYKLIDVGKAKIDDLTFESDDTDNERNISYIERGINKQDFRRAFFFTLFIILAYFGTGNIASLNSFEVRWVLCFTTSFQPFVITTLILLKTLAPFLSVACTFRAIQIFTKAPVGCLNIIVLIFSNIMGIQMLFQVRNTGSWLEIGTSISQFVIVQTITLFIVLISQIAKIFTGTDLYGTLLKLLMTKKKYA-
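
Length check:

The transcript and protein lengths reported in this record agree with each other if the transcript is read as a coding sequence with no untranslated regions: 2808 bp / 3 = 936 codons, which is 935 amino acids plus one stop codon (the nucleotide sequence starts with ATG and ends with TAA, and the protein sequence ends with "-"). The sketch below only illustrates that arithmetic. The function name `expected_protein_length` is hypothetical and not part of any database tool, and the "CDS only, stop codon included" reading is an assumption.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Residue count implied by a CDS length that includes the stop codon.

    Assumes the transcript is coding sequence only (no UTRs, no frameshifts).
    """
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_bp // 3 - 1


# Lengths as reported in this record.
transcript_bp = 2808
protein_aa = 935

print(expected_protein_length(transcript_bp) == protein_aa)  # prints True
```

A mismatch here would suggest untranslated sequence in the transcript, a partial gene model, or a truncated copy of either sequence.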