Monarch geneset OGS2.0

DPOGS208474
TranscriptDPOGS208474-TA3183 bp
ProteinDPOGS208474-PA1060 aa
Genomic positionDPSCF300064 - 1498962-1502483
RNAseq coverage748x (Rank: top 17%)
Annotation
HeliconiusHMEL0043040.095.57% 
BombyxBGIBMGA010657-TA0.092.55% 
Drosophilaemb-PA0.073.26% 
EBI UniRef50UniRef50_F4WNS60.082.44%Exportin-1 n=16 Tax=Opisthokonta RepID=F4WNS6_ACREC
NCBI RefSeqXP_001604619.10.082.22%PREDICTED: similar to nuclear export factor CRM1 [Nasonia vitripennis]
NCBI nr blastpgi|3320240510.082.44%Exportin-1 [Acromyrmex echinatior]
NCBI nr blastxgi|1565433080.082.22%PREDICTED: exportin-1-like [Nasonia vitripennis]
Group
Gene OntologyGO:00054881.2e-141binding
GO:00068863.5e-11intracellular protein transport
GO:00085653.5e-11protein transporter activity
KEGG pathway 
InterPro domain[698-1015] IPR0119891.2e-141Armadillo-like helical
[699-1015] IPR0148771.3e-129Exportin 1, C-terminal
[697-1016] IPR0160243.4e-118Armadillo-type fold
[119-263] IPR0135985.4e-40Exportin-1/Importin-beta-like
[41-107] IPR0014943.5e-11Importin-beta, N-terminal
Orthology groupMCL13973 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208474-TA
ATGGCAACTTTAGAGCAGCAAGCTTCTAAACTTCTTGATTTCAACCAAAAATTGGACATAACACTTCTTGATAATATCGTTGGATGTTTATATTCCACTGTTGGAGAACAGCAACGTGTTGCACAAGATATTTTAACAGCACTCAAAGAACATCCCGATGCTTGGACCCGAGTTGATACCATACTTGAGTATTCTCAGAATCAGGAAACAAAATATTATGCTCTGCAAATTTTGGAACAAGTGATTAAAACTAGGTGGAAAGTATTACCAAGAAACCAGTGTGAAGGTATTAAAAAATACATTGTGGGTCTGATTATAAAGAACTCGTCGGATCCTGCCACTATGGAAAGCAATAAGGTTTATTTAAAAAAATTAAATATGATACTTATTCAGGTCTTGAAAAGAGAATGGCCTCATAATTGGGAGACATTTATAAGTGACATTGTTGGAGCATCTAAAACTAATGAAAGCCTGTGCCAGAATAATATGGTTATCTTAAAACTTCTCAGCGAAGAAGTGTTTGTATTCAGTACAGGTGAACTAACTCAGACAAAAGCAAAACATCTAAAAGATAACATGTGTTCTGAATTCAGTCAAATTTTTAATCTATGTCAATTTGTATTAGAGAATTCACAAAATGCACCCCTCGTTGATGCAACACTTCACACACTTCTGAGATTTTTAAATTGGATTCCTCTTGGCTACATATTTGAAATGAAATTAATAAGTACACTTATTTTCAAGTTTCTGAATGTTCCCATGTTCCGCAATGTTACCCTTAGCTGCCTTACCGAAATAGCTGGTGTCACAGTAAGTAATTATGAAGAGCAGTTTGTTGCTTTACTTGTTCAAACAATGGAGCAATTGGAAGTTATGCTTCCCCTATCAACTAACATACGAGAAGCTTATGCAGCAGGTCGAGATCAAGAACAGGTTTTTATTCAAAACCTTGCTCTATTTCTATGTACTTATTTAAAAGAGCATGGACAATTAATTGAAAGGAAGGGCCTTACTAACACTCTTATGAATGCCCTCAGATACCTTGTATTAATATCTGAAGTAGAAGATGTAGAGATTTTTAAGATATGTTTAGAATTTTGGAATGCCTTAGCTGCTGATTTGTATAAGATAACGCCATGTTCCCATTCAGTAGGATTTTATAGTTTAGGGAAAAATGTTGGACGAAAAGCATTGTATGCTGATGTTCTGAGCAGTGTTCGATATATTATGATTTCAAGAATGGCAAAACCTGAAGAGGTTTTGGTTGTCGAAAATGAAAATGGTGAAGTTGTAAGAGAATTCATGAAGGACACAGACACTATAAATTTATACAAAAATATGAGAGAAACTTTGGTATATTTAACACATTTGGATTATCAAGATACTGAAAGGATTATGACTGAGAAACTTCAAAATCAAGTGAATGGCACTGAGTGGTCCTGGAAAAATCTGAATACCCTTTGTTGGGCAATTGGTTCGATATCAGGTGCCTTGACAGAAGAAGATGAAAAGAGATTTTTGGTTATTGTTATTAAAGAGCTTTTGGGGTTGTGTGAGCAAAAAAAGGGAAAAGACAATAAAGCTATTATAGCTAGTAATATCATGTATGTTGTGGGTCAGTACCCCCGCTTCCTAAGAGCTCATTGGAAATTTTTGAAAACTGTTGTTAATAAACTTTTTGAATTCATGCATGAAACCCACGATGGCGTTCAGGATATGGCATGTGATACATTTATAAAAATCGCTTTAAAGTGTCGCCGTCACTTTGTAACTACCCAAGTTGGAGAAGCTTGTCCTTTTATTGAAGAAATTTTAAGTACCATCAGTTCTATCATCTGTGATCTCCAGACACTACAAGTTCACACCTTTTACGAAGCTGTAGGGTACATGATCAGTGCACAAGTAGACCAGGTTGCACAAGAACAGCTCATTGAGAAGTACATGTTGCTTCCGAATCAGGTATGGGATGATATAATATCCCAAGCCTCTCATAATGTAGACATCTTGAAAGATGCTGAGGCTGTTAAACAACTTGTTAGTATTCTTAAAACAAATGGTCGAGCTTGTCGTGCCCTGGGCCACCCATATGTTGTCCAATTAGGCAGAATATATCTAGATATGCTCAATGTGTATAAGGTCATGTCGGAGAATATCAGCCAAGCTATAGCGTTAAACGGTGTAGTTGTGACTAAGCAGCCACTCATAAAGAATATGAGGATAATAAAAAAAGAAACTCTTAAACTAATTTCAAGTTGGGTTTCTCGTTCAACAGATAACAGTATGGTATTGGAGAATTTCATTCCTCCCTTACTTGATGCTGTTCTATTGGATTACCAAAGAACTGCAGTGCCTGATGCAAGAGAATCAGAAGTTTTATCCTGTATGGCAGCAATTGTTTATAAACTCGGAGGGCATATAACATCGGAAGTACCGAAAATCTTTGACGCGATCTTTGAATGTACTTTGGAAATGATCAATAAAGATTTTGAAGAATATCCAGAGCACAGAACCGAGTTCTTCTTGTTGTTACAGGCTGTGAACACACACTGTTTCAAAGCATTCCTAAGCATACCTCCGGCGCAATTTAAATTAGTGCTTGATTCCATTATATGGGCATTTAAACATACAATGAGAAATGTTGCTGACACTGGTCTTCAGATATTATATCGGCTGCTTCAAAACGTCGAACAACATCCCCAGGCCGCGCAGAGTTTCTACCAGACCTATTTATGTGATATACTGGAACACGTTTTTAGTGTGGTAACAGATACTTCTCACGGCGCAGGTTTGACGATGCATGCTACCATTCTAGCACATATATTCTCCTTAGTAGAAACGGGTAGAGTGACTGTTCCTCTGGGTCTCACTCCAGATAATATTCTTTATATACAGGAATATGTTGCCCGTCTTCTCAAAACAGCATTCCCTCATTTGAATGATAATCAAATTAAGATTACCGTACAGGGACTGTTCAATTTGGATCAAGACATACCTGCCTTCAAAGATCATCTCAGAGATTTCTTAGTTCAAATTAGAGAGTACACGGGTGAGGACGACAGCGATCTATTTTTGGAAGAACGGCAGTTTGCTCTCAGCAAGGCACAAGAGGAGAAGAGAAGGGTCCAGTTATCTGTGCCGGGGATTATCAATCCGCATGAATTGCCGGAGGAAATGCAGGATTAA

Protein sequence:

>DPOGS208474-PA
MATLEQQASKLLDFNQKLDITLLDNIVGCLYSTVGEQQRVAQDILTALKEHPDAWTRVDTILEYSQNQETKYYALQILEQVIKTRWKVLPRNQCEGIKKYIVGLIIKNSSDPATMESNKVYLKKLNMILIQVLKREWPHNWETFISDIVGASKTNESLCQNNMVILKLLSEEVFVFSTGELTQTKAKHLKDNMCSEFSQIFNLCQFVLENSQNAPLVDATLHTLLRFLNWIPLGYIFEMKLISTLIFKFLNVPMFRNVTLSCLTEIAGVTVSNYEEQFVALLVQTMEQLEVMLPLSTNIREAYAAGRDQEQVFIQNLALFLCTYLKEHGQLIERKGLTNTLMNALRYLVLISEVEDVEIFKICLEFWNALAADLYKITPCSHSVGFYSLGKNVGRKALYADVLSSVRYIMISRMAKPEEVLVVENENGEVVREFMKDTDTINLYKNMRETLVYLTHLDYQDTERIMTEKLQNQVNGTEWSWKNLNTLCWAIGSISGALTEEDEKRFLVIVIKELLGLCEQKKGKDNKAIIASNIMYVVGQYPRFLRAHWKFLKTVVNKLFEFMHETHDGVQDMACDTFIKIALKCRRHFVTTQVGEACPFIEEILSTISSIICDLQTLQVHTFYEAVGYMISAQVDQVAQEQLIEKYMLLPNQVWDDIISQASHNVDILKDAEAVKQLVSILKTNGRACRALGHPYVVQLGRIYLDMLNVYKVMSENISQAIALNGVVVTKQPLIKNMRIIKKETLKLISSWVSRSTDNSMVLENFIPPLLDAVLLDYQRTAVPDARESEVLSCMAAIVYKLGGHITSEVPKIFDAIFECTLEMINKDFEEYPEHRTEFFLLLQAVNTHCFKAFLSIPPAQFKLVLDSIIWAFKHTMRNVADTGLQILYRLLQNVEQHPQAAQSFYQTYLCDILEHVFSVVTDTSHGAGLTMHATILAHIFSLVETGRVTVPLGLTPDNILYIQEYVARLLKTAFPHLNDNQIKITVQGLFNLDQDIPAFKDHLRDFLVQIREYTGEDDSDLFLEERQFALSKAQEEKRRVQLSVPGIINPHELPEEMQD-