Monarch geneset OGS2.0

DPOGS203512
Transcript: DPOGS203512-TA (2904 bp)
Protein: DPOGS203512-PA (967 aa)
Genomic position: DPSCF300055 - 541029-552184
RNAseq coverage: 737x (Rank: top 18%)
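
The lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the transcript is coding sequence only (it begins with ATG and ends with TGA in the sequence given further down) and that the genomic coordinates are 1-based and inclusive; neither assumption is stated in the record, and the function names are illustrative.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 2904
PROTEIN_AA = 967
GENOMIC_START, GENOMIC_END = 541029, 552184


def codon_count(transcript_bp: int) -> int:
    """Number of complete codons, assuming the transcript is CDS only."""
    if transcript_bp % 3 != 0:
        raise ValueError("length is not a multiple of three")
    return transcript_bp // 3


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


codons = codon_count(TRANSCRIPT_BP)
# 968 codons = 967 amino acids + 1 stop codon
print(codons, codons - 1 == PROTEIN_AA)
print(genomic_span(GENOMIC_START, GENOMIC_END))
```

Under these assumptions the transcript holds 968 codons, matching 967 amino acids plus a stop codon, and the genomic span is 11156 bp. A span this much longer than the transcript would be consistent with introns in the gene model, though the record does not list the exon structure.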
Annotation
Heliconius       HMEL002750        0.0  81.32%
Bombyx           BGIBMGA004241-TA  0.0  75.28%
Drosophila       Cas-PA            0.0  53.16%
EBI UniRef50     UniRef50_Q9XZU1   0.0  53.05%  Exportin-2 n=15 Tax=Coelomata RepID=XPO2_DROME
NCBI RefSeq      XP_968023.1       0.0  57.64%  PREDICTED: similar to importin (ran-binding protein) [Tribolium castaneum]
NCBI nr blastp   gi|91094115       0.0  57.64%  PREDICTED: similar to importin (ran-binding protein) [Tribolium castaneum]
NCBI nr blastx   gi|91094115       0.0  57.64%  PREDICTED: similar to importin (ran-binding protein) [Tribolium castaneum]
Group
Gene Ontology    GO:0006886  2.3e-145  intracellular protein transport
                 GO:0005488  3.4e-140  binding
                 GO:0008262  2.2e-136  importin-alpha export receptor activity
                 GO:0005634  2.2e-136  nucleus
                 GO:0008283  2.2e-136  cell proliferation
                 GO:0005737  2.2e-136  cytoplasm
                 GO:0008565  8.8e-13   protein transporter activity
KEGG pathway 
InterPro domain  [157-526]  IPR013713  2.3e-145  Exportin/Importin, Cse1-like
                 [1-890]    IPR016024  3.4e-140  Armadillo-type fold
                 [534-958]  IPR005043  2.2e-136  CAS/CSE, C-terminal
                 [921-922]  IPR011989  8.9e-104  Armadillo-like helical
                 [29-102]   IPR001494  8.8e-13   Importin-beta, N-terminal
Orthology group  MCL13460  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203512-TA
ATGGAAGTTACTGAAGACAATCTTTCAACTCTGGCTACCTATTTACAACAGACATTGAATCCTGATCCAAATATCAGAAGACCCGCTGAGAAATTTCTAGAGGGTGTAGAGGTTAATCAGAACTATGCGATACTACTCCTCCACCTTATTGACAAAGATACAGCAGACCCAACCATCAGGGTTGCCGCTGCGGTGACATTCAAGAATTATATCAAAAGGAATTGGCCGGTGGAGGAAGATGGAGTTGATCGTATACATGCATCAGATCGTGCTACCATCAAGACTCTCATTGTGTCACTAATGTTGAAGTCTCCTGAGGCGATACAACGACAGTTCAGTGATGCTGTGTCTATAATCGGAAAGTCAGATTTTCCAGAAAAATGGCCAGGACTGATCAGCGAAATGGTTGAAAAGTTTGCAACAGGCGATTTCCATGTCATCAACGGCGTGCTACGTACAGCTCACTCACTCTTCAAGCGGTATCGTTACGAGTTCAAGTCGCAGAAATTATGGGAAGAAATAAAGCATGTTCTCGAAAACATAGCCAAGCCTCTTACGGATCTGTTTGTGGTGACAATAGACTTAACGAACAAGCACGCGGGTAACGCTCAAGCTCTGAAAGTCATCTACGGCTCGCTGGTATTAATATGCAAGGTATTCTACTCACTGAACTATCAAGACCTGCCCGAGTTCTTTGAAGACAACATGCCGATATGGATGCCCAATCTACTAAATCTCCTACAAGTTACAGTGCCGTGTCTGGCCGATGATGAGGACGACAAGCCGGGTGTAATAGAGTGTCTCCGTACTGAGGTGTGTGAGTGTGCATCGCTGTATGCCCTCAAGTATGAGGAGGAGTTTGCACCACACGCACCTGGCTTCGTGACCGCTGTATGGCATGTGCTGTTGCATACAGGGGCAAAGGAGAAATACGACTCTTTGGTGTCCAACGCTCTAACATTTTTGGCGAAGGTCGCTGAGAAGAACAGCTACAAAAATCTGTTCGAAGATCCGGCCACACTCAGCCAGATATGCGAGAAAGTCGTCATACCCAACATGGAGTTCAGAGAGTCGGATATCGAGTTGTTTGAAGATAATCCCGAGGAATATGTCAGGCGTGATATCGAAGGGTCGGATGTGGAGACCAGACGCCGCGCCGCCTGCGACCTGGTGCGAGCCCTCGCCACACACTATGAGGATAAAATGATGGCCATCTTCGGACAGTATGTAGAGATGATGCTGTCAAAGTACTCCTCGAGCGGAGGGACCGCCTGGATCAATAAAGACACGGCGCTGTTTCTGGTGACTTCGCTCGCGTCCCGGGGCTCCACACAGGCAGCGGGCGTGACTAGGGCTTCGCCCTTGGTGGATTTAACGTCGTTCGCCGCTACACATGTATTGCCGGAGCTTCAGCGGCCCGACGTGAATGAGCTGCCGGTGTTGAAGGCTGATGCTATCAAATACATTATAACGTTCAGGTCGCTGCTCCCCAAGGACCTGATCGCGTCGGCTCTACCGCTACTGATCCAGCACATACGTGGCGGTGGCGTAGTGTGTACGTACGCGTCGTGTGCGGTGGAGAAGCTGCTCGCTGGAGGCCTAGCGACCAAGACGCAGCTGGAGCCCTACGCTGGGAACCTGATGGGAGCCCTACTGACCGCGATCGATGGAGGACCTACCGCCCACAACGAATACGTCATGAAAGCGTTACTCCGCACCCTGGCCAGTCTTCGCGAGGCTGCGCTGCCCTATCTCGGGGAGGCGCTGCCGAAACTTGCATCCATACTGTCTCTGGTTGCCAAGAACCCTTCCAAGCCCCACTTCAACCACTACCTGTTCGAGAGTCTCTCTTTGGCTGTGTCTTTAGTAGTCAAAGCAAACCCGAGCGCCATCACAGCTTTCGAAGACGCCCTCTTCCCCATATTCCAGGATATATTACAGAACGATGTATTAGAGTTCATGCCATACGTGTTTCAAATGCTGTCGTTGCTCTTGGAGCT
GCGCGGTTCGTGTTCGGGGGCTGGGGACGGGGGGGGTGATACATACGCCGCCCTCCTGCCCTGCCTGGTCGCGCCCCCGCTGTGGGAGCGACCAGCTAATGTCCGTCCTCTAGTGAGGCTACTGTGCGCCTTCGTTGCCGTCCGATCGGACCTCGTGCTCAGCTCGGGGAAGCTGAACGCCATGTTGGGTGTATTCCAAAAGCTAATAGCCTCCAAGACGAACGATCACGAAGGTTTCTTCCTCATACAGACAATGCTCTTCAAGTTTGGCGAGTCTGTGATGCAGCAGTACACGAAACAGATAATAACTCTACTGTTCCAACGTCTCTCATCATCGAAGACAACAAAGTATGTCCGAGGTCTGATAGCGTTCCTCGGCTTCTATTCTGCACATTTCGGAGCCGATCCTCTGATAGAGCTCGTGGATTCCGTACAGGCTAATATGTTCGCTATGTACGTGGAGAGGGTTTTGGTACCGGATGTGCAGAAGGTTTCCGGGGCTCTGGAACGGAAGGCGGCGGCTGTCGGGTGCGTGAAGCTGCTGTGTCACAGCGCACACTTCCGGGAGGGGAGATTGGCTGGTCTGTGGACGAATCTGTTGCAGGCTCTCATCGCGTTGTTCGAGTTGCCGCCCGACGAAACCCAACTGCCTGATGACCACTTCATAGAGGTCGACGACACCCCGGGCTACCAGCCGGTGTACGCACAGCTAGCCTGCGCCAAGACCGGAAGCGATGATCCCTTGGCTGCCATAGAAGATCCGAAACGTTACCTGGCGGAGAGTCTAGCGGCCATGTCGCGGGACTTCCCCGGCGTCCTGCCGCCGAGGGTCGGTTCCCTTGACGAACCTCATCGCGGCGTGCTGCAGAACTACCTCCAGAGTTACGGGGTAGCGATCTGCTGA

Protein sequence:

>DPOGS203512-PA
MEVTEDNLSTLATYLQQTLNPDPNIRRPAEKFLEGVEVNQNYAILLLHLIDKDTADPTIRVAAAVTFKNYIKRNWPVEEDGVDRIHASDRATIKTLIVSLMLKSPEAIQRQFSDAVSIIGKSDFPEKWPGLISEMVEKFATGDFHVINGVLRTAHSLFKRYRYEFKSQKLWEEIKHVLENIAKPLTDLFVVTIDLTNKHAGNAQALKVIYGSLVLICKVFYSLNYQDLPEFFEDNMPIWMPNLLNLLQVTVPCLADDEDDKPGVIECLRTEVCECASLYALKYEEEFAPHAPGFVTAVWHVLLHTGAKEKYDSLVSNALTFLAKVAEKNSYKNLFEDPATLSQICEKVVIPNMEFRESDIELFEDNPEEYVRRDIEGSDVETRRRAACDLVRALATHYEDKMMAIFGQYVEMMLSKYSSSGGTAWINKDTALFLVTSLASRGSTQAAGVTRASPLVDLTSFAATHVLPELQRPDVNELPVLKADAIKYIITFRSLLPKDLIASALPLLIQHIRGGGVVCTYASCAVEKLLAGGLATKTQLEPYAGNLMGALLTAIDGGPTAHNEYVMKALLRTLASLREAALPYLGEALPKLASILSLVAKNPSKPHFNHYLFESLSLAVSLVVKANPSAITAFEDALFPIFQDILQNDVLEFMPYVFQMLSLLLELRGSCSGAGDGGGDTYAALLPCLVAPPLWERPANVRPLVRLLCAFVAVRSDLVLSSGKLNAMLGVFQKLIASKTNDHEGFFLIQTMLFKFGESVMQQYTKQIITLLFQRLSSSKTTKYVRGLIAFLGFYSAHFGADPLIELVDSVQANMFAMYVERVLVPDVQKVSGALERKAAAVGCVKLLCHSAHFREGRLAGLWTNLLQALIALFELPPDETQLPDDHFIEVDDTPGYQPVYAQLACAKTGSDDPLAAIEDPKRYLAESLAAMSRDFPGVLPPRVGSLDEPHRGVLQNYLQSYGVAIC-
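
A coding sequence like the one above can be sanity-checked before translation. The following is a minimal sketch, not part of the record: `check_cds` is a hypothetical helper, and the demonstration input is a toy sequence made from the first four codons of DPOGS203512-TA plus a stop codon, not the full transcript.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(seq: str) -> dict:
    """Basic sanity checks on a coding sequence (whitespace is ignored)."""
    s = "".join(seq.split()).upper()
    in_frame = len(s) % 3 == 0
    # Split into complete codons, dropping any trailing partial codon.
    codons = [s[i:i + 3] for i in range(0, len(s) - len(s) % 3, 3)]
    return {
        "length": len(s),
        "multiple_of_three": in_frame,
        "starts_with_atg": s.startswith("ATG"),
        "ends_with_stop": in_frame and bool(codons) and codons[-1] in STOP_CODONS,
        "internal_stops": sum(c in STOP_CODONS for c in codons[:-1]),
    }


# Toy input for illustration only; line breaks are tolerated.
print(check_cds("ATGGAAGTT\nACTTGA"))
```

Because whitespace is stripped first, a sequence that is wrapped across several lines, as the nucleotide sequence is here, can be passed in as-is.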