Monarch geneset OGS2.0

DPOGS200783
Transcript: DPOGS200783-TA, 4725 bp
Protein: DPOGS200783-PA, 1574 aa
Genomic position: DPSCF300370 + 2297-14654
RNAseq coverage: 357x (Rank: top 33%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL004924        2e-40    65.60%
Bombyx          BGIBMGA001818-TA  6e-178   43.91%
Drosophila      Nup98-PA          2e-110   28.86%
EBI UniRef50    UniRef50_E2CA02   4e-147   28.78%    Nuclear pore complex protein Nup98-Nup96 n=15 Tax=Formicidae RepID=E2CA02_HARSA
NCBI RefSeq     XP_001649036.1    1e-142   30.76%    nuclear pore complex protein nup98 [Aedes aegypti]
NCBI nr blastp  gi|332021975      9e-150   29.16%    Nuclear pore complex protein Nup98-Nup96 [Acromyrmex echinatior]
NCBI nr blastx  gi|157167277      1e-161   31.01%    nuclear pore complex protein nup98 [Aedes aegypti]
Group:
Gene Ontology:
  GO:0006810  1.1e-53  transport
  GO:0005643  1.1e-53  nuclear pore
KEGG pathway:
InterPro domain:
  [516-674]    IPR007230  1.1e-53  Peptidase S59, nucleoporin
  [1129-1394]  IPR021967  4.2e-42  Nuclear protein 96
Orthology group: MCL10889, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS200783-TA
ATGTCTCCAGCGCGTCTGGCCCCGAGTCTCTTCGGCTCGTCCAGCATCGGCTCCGCGCCGGCCTTCGGACAGACCAATACCTTCTCGTTCGGGGGCGGCGCGCAGCCGGGGGCGCAGACCACGGGCCTGTTCGGAGCCAACAAGCCGGCCTTCGGCGCCACCACCAACACCGGCACGAGTCTTTTCTCAACAACGACGACCCAAGCCCCGGCCTTCGGCACCGCCACATCCACATTCGGCTTCGGAGCCAACACACAGAACCAGCCGGGAGGGTTGTTCAACAGCAAGCCGGCGACCACGGGCTTCGGCTCCACCCCCAGCTCGGGCTTCGGGGGCTTCGGCACCGGCATGTTCGCCAAAACTAACACCACCACCGCCAGCTTCGGCACCACCACGCCCGCGTTCGGCTCCACGGGCTTCGGCACCAACACCTCGGGCTCCAGCCTGTTCGGGAACAACACGTTCGGGAAGCCGGCGACAACGACGCCCTCGTTCGGATTCAACACACAACCTACCCTCGGTCTAGGCGGCGGCCTGGGCTCATCGTTCCAGTCCAAGCCGGCCAACTCGGGCTTCGGCACACTGGGAGGGACGGGCTCTCTGTTCCAGCAGCCGGCTCAGAACACCTTCAGAACCGACTCGGGCGTGGGCGGAGGTCTGTTCAACAACACCTTGGGCTCGCTGGGCGCCACGCAAGTCAACAACGCCAGCGTCGCGGGTGGGAATGGCTGGGCCCCGCTCGCTGGTCACGTGACATCCGCCAGCGACAGGATCCCGTATATTCCTACCCTCGCATTGATCACAGTACATATAGCTGTCTGTATGTGCAGGGCTCCGCTGGGCGGCTCGTCCAACGTGCACGAACAAATACTCACTCTAGTAGCGAGACCCTACGACACTCCGCTCTTCAAAGACCTCGCTCCGGACACAGCGACATCTACCGAGGAGGTGCTGAAGCCGACCAACCCCTCGGCGGTGAGGGCGGTGCTGGACTCGTACAAGGTGTCGCCCAACAACAGGACCAGGGTCACGGTCCGGCCCGGGCCTCATAAGCACGACAAGAAGTCTCTGTTCGACGGCCTGGAGGAGGGAGACGCCAGCGTGGAGGACAAGCTGACGACACGACCCAGCAGGAAGAGGCTCGTGCTGAGGAACAGGCCGCCAGCCGACAGGTCGCTGGAGGAGAGTCAACAGAACGGCGGCGAGGAGCGCCCGGCGGCGGAACACGAGAAGGCCGACGGCGAACACACGCAGAGCGACGCCGCGGCGAATCGGCACGGCAGCTGGCTCGCGTCTCCGAAGAACTCAAACTCCTGGAAGGAAAATGAGAAGCCGGCTGACAGTGAGCCGGCTGCGAGGCTGTACCCCGACCTGGAGAAGGAGCTCCCGCCGCAGGTGCCGGACAGACGGGCCAGCTGGCTGTCGTCTCTGCCCCTGCGCCCTCTGCCGGGGTCTCTGGACGCGGAGAGCTCCGTCAGGGAGCTGGTCCGGGGAGGCAGGGACAAAGTATCCGAGGAGGAGAATATTCCCCCCCGGGAGGTCGCGCCCCACCCGGCCGGGGTCAAGCTGACGAGGCCGGGATACTACACCATACCCAGCCTGGAGGAGATGACTGAGTATCTCCGACCCGATGGTTCCTGCCGCGTGCCTCACCTCACCATCGGCAGGAAGAACTACGGGAACGTGTTCTACGACTGTGAGATCGACGTGGCCGGCCTCGACCTGGACCACCTGGTCCACTTCCTGAACAAGGAGGTGATCATCTACCCCGAGGACGAGGGGAAGCCGCCGGTCGGCTCGGGTCTCAACCGCCGGGCCGTGGTCACGCTCGAGAGGGTCTGGCCCCGCGACAAGACCGAGAGGAGGCCCGTCACGGAACCCGACAGGTTACTGAAGATGGACTACGAGGGCAAGCTGAGGCGGGTGTGTGACAAGCACGACACCAAGTTCATAGAGTACAGGCCGCAGACGGGCAGCTGGGTGTTCAGGGTGGAACACTT
CAGCAAGTACGGCCTGACTGACTCGGATGAGGAGGACGACATCACACCCAACATACTTAAGAGACAGCTGGTGGACCAGAACTTGCAGCAATCTGCAGCACCTCCTAAGCCGCCGCCACCGTCTGCCGGCCAGCAGCCAGGTCTAGGAGGTCTCGGAGGTCCGGTCGCTCCGGCTACATCGGGGCCGGGACTGAGCGGGTCCGGCGCGGGTCCAGCGGGCCCCGGCCTCACCCTCGGCCTGTCAACAACCTCCGTAGGGAGAGACGAGTACATGGAGCAGACATCACTCAACCTGTTGAACGGAACTAATAAAGGCTTCACTATGGACTTCACTGAGGACGGGGACCAGAACAGTCTGTACCAGGACGGAGGTGTGTGTGTGAAGAGCCCCACTAGTGAGCTGGCCCGCCTGGAACACCGCGGCAGCCACCGCCTGCAGCTGATGAAGGCCAGCCTGTACGCCGACGGCGCCGCGGACATGATGGAGGAGGTGTCCTCGTTCTCCGGGGACCAGCTGGTGCCTCACGCGGCGATGACGTCACCACACCCCACCAGCGACACCATCCGGGAGGTGGTGCAGACCGTGGACTCTACACAAGTGCAGCCGGAAGTGTCGGAGGTGATGGCGCGTCCCATAACAGTTCACCCTCACACTGTGGTGCTGAAGTACCACAAGAAGATACCACCCTTCAGGGAGACCATAGCCGGTCGTATGTCAGCATCTTCCCTGGTGGACCTGTCCGTGTCCCGGGCCCGCCTGTCTCGCTGGTGTGGCCGGCCGGGGGTCATGGTGGTACACTCCACCACAGCCGCAGCCGACCACCTCCCACCAGCGTCGGACCTGGGAGACCTAGGGCTGTACGTGTCGGGGCGGGCGGAGCACGACTGGAGCGACCAAGTGCTGCTGAGGGTGGCCTTCGGAGCGCCGGAGGGGACTGCGGCCATGGTCGGTGACGAACGTGCATACAGCACAGAACCTGCTGAGTGCCTGTCACGCCAGCTGAACAGCCTGCTGGAGTGTTCCGACACGGATACTACACGACCGATCTGTCCTCGCCTCGTCGTGCGACAGGAACCTCTCCAGAGGAGGAGACTCCTCGCTAAGCTGCTGGAGCACGCCAAGGTGGCTGAGGACTATAAACCCAAGTTCGGCGTGTCAGGACAGTACTGCGTCCAGGTGTGGAAGCTGTGTGAGGCGCTGTGGGGGCTGGACCTGGAGAATGATGGTGTTCCGGGTAACACGGAGCAGTATGTGGTGTCTCAGCACCAGCGGCTGGTGGAGTGGCTGAAGGAGGTGGTGCAGCGGTCCACGGACGAGGAGCTGGAGCAGCCCGGAGACCTGGAAGATATTGATGAGTACGACGCTCACAGCAGCAAGGTGTGGTGGCTGGTGGTAGGTGGTAGGATCCTGGAGGCCTGCAAGCTCGCCAGGGACAAGGGAGACCTCAACATGGCCGCTATCATAACACAGGCCAGCGGTGACCCGGCCTTCAAGAGCCTCTTGGAGCGCCAGCTGCGAGTGTGGAGGGAGTGCGGGGCGGAGGGGCTGGTGGCGGAGTCGCGGCGGGCCACCATCAGTCTCCTGGCGGGACTACGCCCCCGAGGGTACTTCGAGAAGTGCGACTGGCTAAGAGTACTGCTGGCGGTCGCCTGCTACATCTGCCCGCAGGTGCCGACCCTGGAACAGATCCTGCACACGTACGAGAGCTACCTCACGGACGACGAGATGAGCGTGCAGCATCCGCGACCCGCCTACCAGGACCGATACGACGCTCGGCTGGATAACAACCCCTCTCCGTGGCGCGACCTCCGCTACGAGCTGCTGAAGGCCAGGGCCACCCAGAGCCGGCCGCGGACCGACACGCACACCTACACACCGGACACGATGGACTGTTCCCTGAGCTTCCTGGTGGGGTCGTGGCTGGGCGTGTCGTCTTCCAGCAGTGTCACGGGAACAGCTGAACAGCTGGAGGCCGCCGGCAGCTGGCACCTGGCCGTGC
AGGTGTTGGCCTACCTTCCCGACGACACACTGCGAGGTCATCTCATCCGTCAAGTGCTGAACCGCAACGCGCCCTCCAAATTCGAGGGTCCAGTGGACGAGGAGCGCCTGGCTCTGATGAAGAGGCTGAGGATACCTCACACCTGGCTGTTGGCGGCACAAGCCTGGAGAGCCAAGTATGAGCACAAGCCGCTGCTTCAAGCTGAACACCTGGTGGCCGCCCAGGAGTGGACGGAGGCTCACCGTGTGCTGGTCGAGGAACTGCTCACTGGTGCCGTGCTCTCCGACAAGCTGTCGAGCATCTCGGGCGTGGTGTGCGAGCTGAGCGCCGCGTCGCGCCAGCACCTGGTCCGCGGCTGGAGCCCCGCCGCGGACGCGCTCACACACTACCTGCGCCTGTGCGAGGAGATCCGAGAAGTGGTAAACGCGGGGGGAGCCGCCGCCGGGGACGGGCTGAAGGCGCTCCGCCCCTCCCTGACAGCCGCATGTAGGGCTCTGGCACAGTTTGAACCCAGGAGTCCGAAGCAGGCGGCGGCTCACTCGGAGATGGGCGCCCGGCTGGTGCAGCTGTCTCTCGCGGGAGGCCGGCCCGCGCCCCACATGGCGGCCCTGCTGGCGGCCCTCAGACTGCCCCCGGACTGCGCCGCCGCCGCCGCCACTAAGATAGCGACCGACCTCGCGGAACGAGCCTCGGAGGTGTGTCTGGAGTCGGCCGTGAGGAACTAG

Protein sequence:

>DPOGS200783-PA
MSPARLAPSLFGSSSIGSAPAFGQTNTFSFGGGAQPGAQTTGLFGANKPAFGATTNTGTSLFSTTTTQAPAFGTATSTFGFGANTQNQPGGLFNSKPATTGFGSTPSSGFGGFGTGMFAKTNTTTASFGTTTPAFGSTGFGTNTSGSSLFGNNTFGKPATTTPSFGFNTQPTLGLGGGLGSSFQSKPANSGFGTLGGTGSLFQQPAQNTFRTDSGVGGGLFNNTLGSLGATQVNNASVAGGNGWAPLAGHVTSASDRIPYIPTLALITVHIAVCMCRAPLGGSSNVHEQILTLVARPYDTPLFKDLAPDTATSTEEVLKPTNPSAVRAVLDSYKVSPNNRTRVTVRPGPHKHDKKSLFDGLEEGDASVEDKLTTRPSRKRLVLRNRPPADRSLEESQQNGGEERPAAEHEKADGEHTQSDAAANRHGSWLASPKNSNSWKENEKPADSEPAARLYPDLEKELPPQVPDRRASWLSSLPLRPLPGSLDAESSVRELVRGGRDKVSEEENIPPREVAPHPAGVKLTRPGYYTIPSLEEMTEYLRPDGSCRVPHLTIGRKNYGNVFYDCEIDVAGLDLDHLVHFLNKEVIIYPEDEGKPPVGSGLNRRAVVTLERVWPRDKTERRPVTEPDRLLKMDYEGKLRRVCDKHDTKFIEYRPQTGSWVFRVEHFSKYGLTDSDEEDDITPNILKRQLVDQNLQQSAAPPKPPPPSAGQQPGLGGLGGPVAPATSGPGLSGSGAGPAGPGLTLGLSTTSVGRDEYMEQTSLNLLNGTNKGFTMDFTEDGDQNSLYQDGGVCVKSPTSELARLEHRGSHRLQLMKASLYADGAADMMEEVSSFSGDQLVPHAAMTSPHPTSDTIREVVQTVDSTQVQPEVSEVMARPITVHPHTVVLKYHKKIPPFRETIAGRMSASSLVDLSVSRARLSRWCGRPGVMVVHSTTAAADHLPPASDLGDLGLYVSGRAEHDWSDQVLLRVAFGAPEGTAAMVGDERAYSTEPAECLSRQLNSLLECSDTDTTRPICPRLVVRQEPLQRRRLLAKLLEHAKVAEDYKPKFGVSGQYCVQVWKLCEALWGLDLENDGVPGNTEQYVVSQHQRLVEWLKEVVQRSTDEELEQPGDLEDIDEYDAHSSKVWWLVVGGRILEACKLARDKGDLNMAAIITQASGDPAFKSLLERQLRVWRECGAEGLVAESRRATISLLAGLRPRGYFEKCDWLRVLLAVACYICPQVPTLEQILHTYESYLTDDEMSVQHPRPAYQDRYDARLDNNPSPWRDLRYELLKARATQSRPRTDTHTYTPDTMDCSLSFLVGSWLGVSSSSSVTGTAEQLEAAGSWHLAVQVLAYLPDDTLRGHLIRQVLNRNAPSKFEGPVDEERLALMKRLRIPHTWLLAAQAWRAKYEHKPLLQAEHLVAAQEWTEAHRVLVEELLTGAVLSDKLSSISGVVCELSAASRQHLVRGWSPAADALTHYLRLCEEIREVVNAGGAAAGDGLKALRPSLTAACRALAQFEPRSPKQAAAHSEMGARLVQLSLAGGRPAPHMAALLAALRLPPDCAAAAATKIATDLAERASEVCLESAVRN-
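
The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes that the transcript is entirely coding sequence and ends in a stop codon (the nucleotide sequence ends in TAG, and the trailing "-" in the protein sequence is taken here to mark that stop). The function name `cds_length_bp` is hypothetical and used only for illustration.

```python
def cds_length_bp(protein_length_aa: int, has_stop_codon: bool = True) -> int:
    """Return the coding-sequence length (bp) implied by a protein length (aa).

    Assumption: one codon (3 bp) per amino acid, plus one extra codon
    when the sequence includes a terminal stop codon.
    """
    codons = protein_length_aa + (1 if has_stop_codon else 0)
    return codons * 3


# Values copied from the record above.
transcript_bp = 4725  # DPOGS200783-TA
protein_aa = 1574     # DPOGS200783-PA

# (1574 + 1) * 3 = 4725, so the two listed lengths agree.
lengths_agree = cds_length_bp(protein_aa) == transcript_bp
print(lengths_agree)
```

Under these assumptions the 1574 aa protein plus one stop codon accounts for all 4725 bp of the transcript, which suggests the annotated transcript carries no untranslated regions.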