Monarch geneset OGS2.0

DPOGS208101
Transcript: DPOGS208101-TA; 1992 bp
Protein: DPOGS208101-PA; 663 aa
Genomic position: DPSCF300395 - 669-5665
RNAseq coverage: 2x (Rank: top 91%)
Annotation
Heliconius: HMEL004924; 4e-40; 65.60%
Bombyx: BGIBMGA001818-TA; 2e-69; 58.11%
Drosophila: Nup98-PA; 5e-71; 35.82%
EBI UniRef50: UniRef50_Q9VCH5; 7e-69; 35.82%; Nup98 n=48 Tax=Drosophila RepID=Q9VCH5_DROME
NCBI RefSeq: XP_001994256.1; 3e-74; 43.41%; GH10992 [Drosophila grimshawi]
NCBI nr blastp: gi|195054687; 5e-73; 43.41%; GH10992 [Drosophila grimshawi]
NCBI nr blastx: gi|312374010; 3e-75; 36.53%; hypothetical protein AND_16639 [Anopheles darlingi]
Group
Gene Ontology: GO:0006810; 1.1e-53; transport
               GO:0005643; 1.1e-53; nuclear pore
KEGG pathway:
InterPro domain: [172-330] IPR007230; 1.1e-53; Peptidase S59, nucleoporin
Orthology group: MCL10889; Multiple-copy universal gene

Nucleotide sequence:

>DPOGS208101-TA
ATGACGAAGGGTGACGGCCTGTGGTTCCAGAAGTCTCTGTTCGACGGCCTGGAGGAGGGAGACGCCAGTGTGGAGGACAAGCTGACGACACGACCCAGCAGGAAGAGGCTCGTGCTGAGGAACAGGCCGCCAGCCGACAGGTCGCTGGAGGAGAGTCAACAGAACGGCGGCGAGGAGCGCCCGGCGGCGGAACACGACAAGGTCGACGGCGAACACACGCAGAGCGACGCCGCGGCGAATCGGCACGGCAGCTGGCTCGCGTCTCCGAAGAACTCAAACTCCTGGAAGGAAAATGAGAAGCCGGCTGACAGTGAGCCGGCTGCGAGGCTGTACCCCGACCTGGAGAAGGAGCTCCCGCCGCAGGTGCCGGACAGACGGGCCAGCTGGCTGTCGTCTCTGCCCCTGCGCCCTCTGCCGGGGTCTCTGGATGCGGAGAGTTCCGTCAGGGAGCTGGTCCGGGGAGGCAGGGACAAAGTATCCGAGGAGGAGAATATTCCCCCCCGGGAGGTCGCGCCCCACCCGGCCGGGGTCAAGCTGACGAGGCCGGGATACTACACCATACCCAGCCTGGAGGAGATGACTGAGTATCTCCGACCCGATGGTTCCTGCCGCGTGCCTCACCTCACCATCGGCAGGAAGAACTACGGGAACGTGTTCTACGACTGTGAGATCGACGTGGCCGGCCTCGACCTGGACCACCTGGTCCACTTCCTGAACAAGGAGGTGATCATCTACCCCGAGGACGAGGGGAAGCCGCCGGTCGGCTCGGGTCTCAACCGCCGGGCCGTGGTCACGCTCGAGAGGGTCTGGCCCCGCGACAAGACCGAGAGGAGGCCCGTCACGGAACCCGACAGGTTACTGAAGATGGACTACGAGGGCAAGCTGAGGCGGGTGTGTGACAAGCACGACACCAAGTTCATAGAGTACAGGCCGCAGACGGGCAGCTGGGTGTTCAGGGTGGAACACTTCAGCAAGTACGGCCTGACTGACTCGGATGAGGAGGACGACATCACACCCAACATACTCAAGAGACAGCTGGTGGACCAGAACTTGCAGCAATCTGCAGCACCTCCTAAGCCGCCGCCACCGTCTGCCGGCCAGCAGCCAGGTCTAGGAGGTCTCGGAGGTCCGGTCGCTCCGGCTACATCGGGGCCGGGTCTGAGCGGGTCTGGCGCGGGTCCAGCGGGCCCCGGCCTCACGCTCGGCCTGTCAACAACCTCCGTAGGGAGAGACGAGTACATGGAGCAGACATCACTCAACCTGTTGAACGGAACTAATAAAGGCTTCACTATGGACTTCACTGAGGACGGGGACCAGAACAGTCTGTACCAGGACGGAGGTGTGTGTGTGAAGAGCCCCACTAGTGAGCTGGCCCGCCTGGAACACCGCGGCAGCCACCGCCTGCAGCTGATGAAGGCCAGCCTGTACGCCGACGGCGCCGGTACATACCCACATACACACTCACATACACACACACATACACGCATATATTTATACTATAGTATTATCTGTTCCGATCCGTCGCCAGCGGACATGATGGAGGAGGTGTCCTCGTTCTCCGGGGACCAGCTGGTGCCTCACGCGGCGATGACGTCACCACACCCCACCAGCGACACCATCCGGGAGGTGGTGCAGACCGTGGACTCTACACAAGTGCAGCCGGAAGTGTCGGAGGTGATGGCGCGTCCCATAACAGTTCACCCTCACACTGTGGTGCTGAAGTACCACAAGAAGATACCACCCTTCAGGGAGACCATAGCCGGTGAGCTGAGACACACACGCACACAGTCGTATGTCAGCATCTTCCCTGGTGGACCTGTCCGTGTCCCGGGCCCGCCTGTCTCGCTGGTGTGGCCGGCCAGGGGTCATGGTGGTACACTCCACCACAGCCGCAGCCGACCACCTCCCACCAGGTTAGGCCCGCACACACATACTGTGTGGATGACTGTTCTCCTGAACCTCACACACAGAGCACTCATAGGAAACGAATGA

Protein sequence:

>DPOGS208101-PA
MTKGDGLWFQKSLFDGLEEGDASVEDKLTTRPSRKRLVLRNRPPADRSLEESQQNGGEERPAAEHDKVDGEHTQSDAAANRHGSWLASPKNSNSWKENEKPADSEPAARLYPDLEKELPPQVPDRRASWLSSLPLRPLPGSLDAESSVRELVRGGRDKVSEEENIPPREVAPHPAGVKLTRPGYYTIPSLEEMTEYLRPDGSCRVPHLTIGRKNYGNVFYDCEIDVAGLDLDHLVHFLNKEVIIYPEDEGKPPVGSGLNRRAVVTLERVWPRDKTERRPVTEPDRLLKMDYEGKLRRVCDKHDTKFIEYRPQTGSWVFRVEHFSKYGLTDSDEEDDITPNILKRQLVDQNLQQSAAPPKPPPPSAGQQPGLGGLGGPVAPATSGPGLSGSGAGPAGPGLTLGLSTTSVGRDEYMEQTSLNLLNGTNKGFTMDFTEDGDQNSLYQDGGVCVKSPTSELARLEHRGSHRLQLMKASLYADGAGTYPHTHSHTHTHTRIYLYYSIICSDPSPADMMEEVSSFSGDQLVPHAAMTSPHPTSDTIREVVQTVDSTQVQPEVSEVMARPITVHPHTVVLKYHKKIPPFRETIAGELRHTRTQSYVSIFPGGPVRVPGPPVSLVWPARGHGGTLHHSRSRPPPTRLGPHTHTVWMTVLLNLTHRALIGNE-