Monarch geneset OGS2.0

DPOGS205070
TranscriptDPOGS205070-TA3960 bp
ProteinDPOGS205070-PA1319 aa
Genomic positionDPSCF300074 - 70425-78981
RNAseq coverage586x (Rank: top 22%)
Annotation
HeliconiusHMEL0121230.086.44% 
BombyxBGIBMGA006813-TA0.076.40% 
DrosophilaNup154-PA0.035.81% 
EBI UniRef50UniRef50_E2BSQ00.041.65%Nuclear pore complex protein Nup155 n=5 Tax=Formicidae RepID=E2BSQ0_HARSA
NCBI RefSeqXP_001605127.10.043.19%PREDICTED: similar to nuclear pore complex protein nup154 [Nasonia vitripennis]
NCBI nr blastpgi|3454853770.043.30%PREDICTED: nuclear pore complex protein Nup155-like [Nasonia vitripennis]
NCBI nr blastxgi|3214674070.037.10%hypothetical protein DAPPUDRAFT_320530 [Daphnia pulex]
Group
Gene OntologyGO:00056430nuclear pore
GO:00069130nucleocytoplasmic transport
GO:00170560structural constituent of nuclear pore
KEGG pathway 
InterPro domain[1-1320] IPR0048700Nucleoporin, Nup155-like
[63-419] IPR0149081.3e-78Nucleoporin, Nup133/Nup155-like, N-terminal
[687-1198] IPR0071871.9e-65Nucleoporin, Nup133/Nup155-like, C-terminal
Orthology groupMCL13676 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205070-TA
ATGCAACTCGAATGCTTGGAACTTGCGGGTTGCACTCTCGATCGGTATATCAGTGCCGACAATTCCCGCCCTTCGTTTCTTGAGATAACTGGTATAGCAGCCCAAGGAAGTCCAACATGCAGTGGTCTGAATGCAGGGGACTACAGGAATTTAGCATCATTGCTAAATCATCCAAATTTATCTCTTTTGAAAATTTTAAACAAAGTTCCTCTGCCCCCAGAGATAATGGAACACTTTGCTCATATGCAATGTCACTGTTTAATGGGAGTGTTTCCAGAAATTAGCCGAGTATGGCTTGCTATAGACAGTAATATATATGTATGGGCCTTTGAGCATGGGAGTGATGTGGCCTATTTTGATGGTCTAGGTGAAACTATTGTTAGTGTTGGCCTCGTCAAGCCTAAATCTGGTGTCTTCCAGAATTTTGTAAAATATTTACTAGTTCTTACAACTACAGTTGAAATTGTTGTATTAGGGGTCACATTTTCAAGTAGTAAACAAGATGGCACTGCAGAACTTGAAGAAATTCATCTTGTCCCAGAACCTGTATTTGTTTTGCCTACTGATGGTGTTTCTATGCTATGTGTAAAGTCTACATCGAAAGGTAGAATATTTATGGGAGGAAAGGATGGCTGCTTATATGAGATAACTTACCAGGCACAATTAGGTTGGTTCGGAAAACATTGCAAGAAAGTGAATCATTCAACCAGTGCATTGTCATTCCTAGTCCCTTCATTCCTTAATGCTGCACTATATGATGAAGATTCCATAGTTAAAATTGAAGTTGATAACTCTCGTCACATTCTGTATACGTTAAGTGAGAAAGGATGTATAGAAGTGTTCGATTTAGGCAGTGACGGTGAGGGATTTAGTAAAGTGGTGAGATTGAATCAAGGGAAAATTGTATCATTGTCTGTGGATATTGTGAAAACGCTCGAGCCTAATAATTTTAAGCCAGTTATAGCTATATCAGCTGTAGATGAGTCAGAATCGGAACATTTGAATTTGGTTGCCGTCACACAAACAGGGGCAAGATTATATTTTAGTGCTGGAACTGGAGACTCCAGTCAAGGTGGACCACAGAGGCCTCAGTACTTGACATTGCTACATGTAAGACTCCCTCCTGGTTTTACCCCAAATGCATCAGTATTAAAACCCAAACAAGTGCACAGCGCAGTATACGACAATGGTACCCTAGTGATGGTATGTTCGTCCAGCGGTGGGGAAGAGGAATCATTGTGGTGTTTATCTCGAGTCCTATCTGCTGCTGGCTTCAGCGAGGCTCACACAGCTCTACCATTAGACGGACCGGCCTGGGCTCTCACACCACTACCGCCAACACATTCAGACTTTTTATCTCCAGCGCTGCTTGCTAAAAGAGAGGTGTGGTCGTCGTCTCGTTGGGCGGTGGTGAGTGCTTGGGGCGCCGCGGTGCTAGCGACGGGGGCGGCGCCGGATGTATTACGATCTCTGTTAAGGGACTACCGCGGACCGGACGCACAACCTGTCAAGGATATGTTTCAGCTCCACGGTATCGACCAGGCTTGTGCGTGTGCACTTTACTTGGCGTGCGAGGACACTAGTGACATGACAGTGAGCGAGTGGGCGGCGCGGGCGTTCTTCCTATACGGCAGTCAAGTAGCACCTTCACCGCTATACCAACAGACATCACAAATGCCTTCTATTAATTCACCGAAATTCTCACCGCGCCAAGTATCAACACCGATGTCAACGAGACCCCAGGACCAAATGCGTATGAACCAAATGAACCAGTCGTACCAACAGCAGCAGATGAACCAATCGATGGCAGTCCAGAACCAGACCATGATGCCCTCCTACCAACAGAATATGATGAGCCCCACACCATTCAATCAATCGTCGTTTAACATGACCCAGCAGCCGAGAGATAATTTCCCCCTACAAATAGAATACAGTGCAAAACACAACGGAGTATACATTTACGTCGGTAGAATCCTGAGTCCTATATGGAATTTGAAATGTGTAACGAAGTCAATGACGCCGGATAATAAGGAGTTTATGTCGAGTAATGTAACGGGCGAAGACTGCGCGTATATAGCGAAGAAGTTGCAACGAGTGTCCGCTTTCATGCAACGAGTGGGCGCGCCGCCTCATACAGAAGAACAGGCGTCTTTGCACGCGTTGAAAATGTTCATTACTATGGCAATTGAGATGTTGAGCTTATGGAAAGTTTTGTGCGAGCACCAATTCCATGTGATAGCTGCGAGTCTGCCGTCTGAACAACAAACGGCTTTACAAGCGGCAAGTTTCAGAGAACTGCTAGTTGGTGGACAGGAGGTGGCGTGTCTTCTCCTCGGATCTCTCGTGGCTGGTTATTTGAGAGATAATGCATCAGTGGATGGCATCTCACAGAAACTGAGGCAGCTCTGCCCAACGCTATATAGACAGGAGGATGCTACATGTTCTAAAGCCAATGAATTGTTAATATTCGCTAAACAACAGAAGAATCCCGAAGAACGTGAGGAAATGTTACATCAGGCATTAAAACTTTGCAAGGATGTGGCGCCAAATGTAAACTTACCACTGGTCTGTTCGAAACTGGTATCAGCAGGGTTCTATACTGGCGTAGTAGAACTGTGCGAGGCCTGTGCCAGTAAGTTAGACCCGCAGGATAAAGCCGTTTATTACTACAAGAGCGACCAACCCTCCCAAGATAGGGAGGGACATTTGGCTTATTATAGAAGAATGGAGATTTATCGCGAAGTGTGCTGTGCGCTGGAGCGACTGTACGAGCGTAGTGCTGAAGGTACCGGCACTCCGCCCTCAGACACACACGCTTTGTCACCAGCAGACGCCAACTACCAGGGTAGGAAATTGGTGTGGGACTGTTTATGCCGAGACGATGAGCTATTACACGTCGCAGTGTACGAGTGGCTGGTTTCTAAAAATTTGGGTTCAGAACTGTTGTCTTTACCGGGTGCTCCGCCCGCGTCGCTTCGTACATACCTGACTGCGGCGGCCCGTAGTGCGCCTGCGCCCGCTGCCCTACTTCTTCTCGACCTTCTGTGGAAGGTACTAGAAAAAGCTGGGGATCACCTGGCAGCGGCAAACGTTCTGGAGGGACTTGCTACTAAACCTGGCACGGGCGCGACTCTAAGTCAGCGTATGTCGTGGGTGTCAGCGGGTGTAGTGTGCGTCCGCGGTGCCGGCGGGCGGGCGGCGGGCGGCTCGGAGCTGCTCCGTCGTCTGGAGGAGGCGGCGGAGGTGGCGCGAGTGCAGGCGGCCGTGAGAGCTGCCATGCGCTCAGTCCACTCTTCCGCCCCACCTCATTTGCTTCAACGGTTGGACGATGAGCTGCTGGAGATCACTCAGCTCTACGAAGAGTATGCCGACCGTTATAACCTGTGGGAGTGTAAGCTGGCAATAGTACAATGTTCCGGTCACAATGACGCCCTACTCGTAGAAAATATTTGGAGCAACATTCTAGCTGAAGCGGAAGCCGCTACTCGTAACCTGCCGACGCCAGATGAAAGACTGGCTTCAGTACTGAGCAAGCTGACTACCCTCGGCCGGGAGTATGTTAATACTGGTCATTGCTTCCCACTGTATTTCATCGTCCGACAATTAGAAATAATGAACTGCAAGTTGCAAGCCGACAAGAGCATGGTATTTAAAGCTATATTGAGTATCGGAGTGTCGTTGGAACAGGTTTTAGATATTTATATCAAATTGGTGAGTGTGAACGAGCGTGTATGGCTTGGCTGTGGAGACGAATCGCACGTGTGCGCCTGTGCCGCTCTACTCCTGAACGCTGCTCGTGCCGACCTGCTGCCTCTGCCGCCAGCGCCGCGACGAAGAGCCCTAACTCGCTGTAAGGACCTGCATGAAGCGGCTTTATCCGCGCTACAGAGCCGTCCGAATACACAACACCTCATTGATAAGTTAACAGTTGCGCAGGCGCATCTTGACCGGATGGATTGA

Protein sequence:

>DPOGS205070-PA
MQLECLELAGCTLDRYISADNSRPSFLEITGIAAQGSPTCSGLNAGDYRNLASLLNHPNLSLLKILNKVPLPPEIMEHFAHMQCHCLMGVFPEISRVWLAIDSNIYVWAFEHGSDVAYFDGLGETIVSVGLVKPKSGVFQNFVKYLLVLTTTVEIVVLGVTFSSSKQDGTAELEEIHLVPEPVFVLPTDGVSMLCVKSTSKGRIFMGGKDGCLYEITYQAQLGWFGKHCKKVNHSTSALSFLVPSFLNAALYDEDSIVKIEVDNSRHILYTLSEKGCIEVFDLGSDGEGFSKVVRLNQGKIVSLSVDIVKTLEPNNFKPVIAISAVDESESEHLNLVAVTQTGARLYFSAGTGDSSQGGPQRPQYLTLLHVRLPPGFTPNASVLKPKQVHSAVYDNGTLVMVCSSSGGEEESLWCLSRVLSAAGFSEAHTALPLDGPAWALTPLPPTHSDFLSPALLAKREVWSSSRWAVVSAWGAAVLATGAAPDVLRSLLRDYRGPDAQPVKDMFQLHGIDQACACALYLACEDTSDMTVSEWAARAFFLYGSQVAPSPLYQQTSQMPSINSPKFSPRQVSTPMSTRPQDQMRMNQMNQSYQQQQMNQSMAVQNQTMMPSYQQNMMSPTPFNQSSFNMTQQPRDNFPLQIEYSAKHNGVYIYVGRILSPIWNLKCVTKSMTPDNKEFMSSNVTGEDCAYIAKKLQRVSAFMQRVGAPPHTEEQASLHALKMFITMAIEMLSLWKVLCEHQFHVIAASLPSEQQTALQAASFRELLVGGQEVACLLLGSLVAGYLRDNASVDGISQKLRQLCPTLYRQEDATCSKANELLIFAKQQKNPEEREEMLHQALKLCKDVAPNVNLPLVCSKLVSAGFYTGVVELCEACASKLDPQDKAVYYYKSDQPSQDREGHLAYYRRMEIYREVCCALERLYERSAEGTGTPPSDTHALSPADANYQGRKLVWDCLCRDDELLHVAVYEWLVSKNLGSELLSLPGAPPASLRTYLTAAARSAPAPAALLLLDLLWKVLEKAGDHLAAANVLEGLATKPGTGATLSQRMSWVSAGVVCVRGAGGRAAGGSELLRRLEEAAEVARVQAAVRAAMRSVHSSAPPHLLQRLDDELLEITQLYEEYADRYNLWECKLAIVQCSGHNDALLVENIWSNILAEAEAATRNLPTPDERLASVLSKLTTLGREYVNTGHCFPLYFIVRQLEIMNCKLQADKSMVFKAILSIGVSLEQVLDIYIKLVSVNERVWLGCGDESHVCACAALLLNAARADLLPLPPAPRRRALTRCKDLHEAALSALQSRPNTQHLIDKLTVAQAHLDRMD-