Monarch geneset OGS2.0

DPOGS200725
TranscriptDPOGS200725-TA3429 bp
ProteinDPOGS200725-PA1142 aa
Genomic positionDPSCF300030 - 146897-154933
RNAseq coverage572x (Rank: top 22%)
Annotation
HeliconiusHMEL0089610.065.45% 
BombyxBGIBMGA001123-TA0.076.88% 
DrosophilaRanbp21-PA0.042.29% 
EBI UniRef50UniRef50_E2B3730.046.11%Exportin-5 n=10 Tax=Endopterygota RepID=E2B373_HARSA
NCBI RefSeqXP_320637.30.045.85%AGAP011888-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1583008090.045.85%AGAP011888-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3838539080.046.31%PREDICTED: exportin-5 [Megachile rotundata]
Group
Gene OntologyGO:00054881e-46binding
GO:00068863.3e-06intracellular protein transport
GO:00085653.3e-06protein transporter activity
KEGG pathway 
InterPro domain[16-645] IPR0160241e-46Armadillo-type fold
[113-273] IPR0135983.8e-31Exportin-1/Importin-beta-like
[508-569] IPR0119891.3e-17Armadillo-like helical
[39-102] IPR0014943.3e-06Importin-beta, N-terminal
Orthology groupMCL14302 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200725-TA
ATGAACTCAGTAGGATTTGGAGAAAGGGAAGTGGCTTTAATAGCTGATGAATTAAGTCATGCTGTAGAGCTCACCCTTAATCCGACAGCCAATCAAGACGCCAGGAAACAGGCCTACACAGTATGTGAAAGCTTCAAAGAAAATTCACCATGGTGTGCCCAAGCAGGTTTACTTCTGGCAAGTGGAACTCAATACTCACCAGTTGTAAAGCATTTTGGTCTTCAACTCTTGGAGCACACCGTTAAATATAGATGGACCCAGATTACACAACCAGAAAAGATATTTATTAAGGAAAATTCCATGAAACTTCTCTCAATGGGCGGCTGGGAGACAGGTCACTTGAATGATGCATTAGCTCGTGTGATAGTCGAGATGATAAAGAGGGAATGGCCACAACAATGGCCAACACTTCTAGCTGAACTTAGTGACGCCTGTACTAGAGGACATTTGCACACGCAAATAGTCTTACATGTATTCCTGAGGCTAGTTGAGGATGTAGCAACATTACAGACTTTAGAACAACACCAACGTCGTAAGGATATATACCAGGCTCTGACGAGCAATATGGCGGAAATATTCTCATTCTTCATGAGACTCATAGAACTACACGTACAAGAGTTTAGAGAGAAGACTGCAGCTGGTGATTATGCTGCAGCGGCAAGCAATGGCAGGGTTGTTCAGGTTGTACTACTGACTCTCACAGGATTTGTGGAATGGGTTTCGACAAACCATGTAGTGACAAATAATGGAAGGTTACTACAAATCTTATGTATATTGCTCAGCGATGATGTCTTCCAACTACCAGCTGCTGAATGCTTACTACAAATTGTTAATAGAAAAGGCTCGGTTCTCGCTGGTCTTGCGCAACAACTGACCTCAATGTGGAACTGTGCCACAAACGTGGACTCGTGGTTGCCCCTTCTCCTTGAGACGATGCTGCTTCTAACGTCACACCCCTCGCACACATTGGCTCACACAGCTAACTCAGTGTGGCTCGCCTTCCTCAAACATGATCAGATATCAAAACTGCCACATGTACTGGCCATGGTGCCGAGGTGGTTACAAGCGGCCACACCTAAGATATTGAAGGTGTGGTCCCCTCGTGAATCTGTCGGCTGTGCGTGTGACTACGACAGCGATCACGAGTTCGCAGTGTTCTTCAATCGCGCAAGAACTGAGATGCTAGAGAGCTTCAGATACTGTATGACCGCAGCGCCTTTGGTGACTTGGTCTTATGTTGAGCAATGGACGAACACGGCGTTAGACAAGGTGGACTCGTGTCCGCTACAGTGTGACGTCATGCACCCGCTACACGTGGAGTGGGAGGCTCTCGCACAGGTCCTGGACGTGGTCCTATCTCGGCTGCTCCTGGCGGAGCCTCGGCCCAATGTGTCGGAGGGTCTTCGCGTCCTCCAGCGCTGTGTGTCTTGTGAGCCCCGGGCGCCTCTCATTCTCTCGCTGCTGTTGTCTTTCATATCAGCCCTGTTTGTGTTCCTGAGCTGCGCGTACAGTCAGATGGCAGGTCCGGGGGTCGGGTCAGCGGGCGCGGAGCTTCTACCACGTGTTTTGGACAAGATATTCGCGTCGCTCGTGTACGAGGGGACGCCGCCTGAAGATCGTAGTTCACGGAACGTAAAGAACGTGAGGCGACACGCCGCGGGGTTGTTGGTCAAACTCGGTTCCAAGTACCCGCTGCTGCTGCTGCCGGTGTTCGGTCGTCTCCACGAGCTGACCCGCGCCGCGCTCTCCCGCCCCGACCTCAGCGCCATGGAGAGCGTCACGCTGCAGGAGGCACTGCTGTTGGTCTCCAACCATTTCTGTTGCTACGAACGACAGAGTGCTCTTGTCGCTCAAGTGATGGGCGACTGTCGCGACCGGTGGGCCGCTTTAGCCGAACACATCCAGTCCGCGGCGGGGCTGGCGCGGCTCATAGGGCTGGACGCGCCGCCCACGGACGACCCTGAGCGCGCGACCGCTCGCCGGACCTTACTTCACGCCCTCACCCTCACGCTCGGTGTCGTAAAAAGAACTCAAGTGCCGGCCGATCCGGACAAGGCGGCTCGCGGGGGCTTTGGCGCTGGCGTGACGGCTTCGGGGAATCCAGTGTGGAGGAACCCGTGCGGGGTGCACGTGCTGCCTCTGTTCCCCGGAGTGCTGGCGTTGGCTCGCTCGCTGCACGAGCTGCACTCGCCTCACGCGGCCTCCCTGCGGCACGCGGGTCACGCCCGGGCGTTGTTGCCGCCGCCGGCGGAGCGCCGCAACCTCCTAGGGCTGCGGGACGACGCGCCGTCCTCGCCCACCGGACAGCCGGCCTCACCGACCTCACCACAGGACCGTATGCAGAATCTTCTTCATACGCTACATGACAATGTGTGTCAGCTGGTGGGCGCGGCGGCCTCGTCTCTGGGTCGTGAGTTGTACAGCTGTGCAGGTCTGGCGCGCGTGGTGGCGGGGTCTCTGCTGGGGGCTCCGGAGCGTCTGCCCGAGCACTGGCTCCGCCGGGTGCTGCGAGCCGCTCTCAGACCACTCCTCCTGCACTGCCCGCCGGCTCACTACAGGGACGTGGCGCTGCCGCTGTTACAACATCTAGCTCCTATCATGTTGGTACATTTAAATCAGCGCTGGGACTACGTGACGTCGTTATATGAATCAGGGAAGTTAGAAGAAGAGGGGGGCAGTGAGTCACAGGAGGTGCTGGAAGATATGCTGGTTAGACATCTCACTAGGGAACACCTTGATCTACTGAAGGTTTGTTTAGTGGAGGGCGGTGTCACCGCTGAGAGTACGGATATGGAGGATGAAACCGTAACACCAGTGAATCCTGGCCGCGCTCCGGAACAAGTTTCCGAGTTAGGAGGGATCATTATATCGGATCCCATAGCGGGACCCGCGGTCTTACACACAATTCTCCGCGCCCTCACTTGGAACGATTCCACTTCGTCTCTTCGCGCGTGTTCGCTGGCACTACCCGCTCTACGAGCGGCCTTGGCCGCTCTAGGCGCCGGCGAGGCTGCAGGCGCTTTGGCCGCCGTTCTACACGCGCTCAGACTACACGGACAGCATGAAGCGAATCAGGCTGCTCTACTGGCCCTCGCTGTCCAGACCTATGAGGCGCTCCGCCCCGTGTACCCCGAGGTGTCGTCAGTGTTGTGTGCTATCCCCGGAGTGGAGCCGGTGGACGTGCAACGCCTGGACGACAAACTGGCCGCTCCCCCTCTCAGGCCCGCCAAGGTCGACAAGAGCAAGAGGGACCTGTTCAAGAAGATCGCCTCCAGGTTGATAGGAAGAAACGTCGGACAACTGTTCAAGAAGGAGGTCTTCATACTGGATCTACCTACGATGAACATACAGAGAGACAAACCTCGCAGCGCCTTGGACACCGACGGGGCGGGACTAGAAAATCTTTTCAATACAAACGCTCCAACCTAG

Protein sequence:

>DPOGS200725-PA
MNSVGFGEREVALIADELSHAVELTLNPTANQDARKQAYTVCESFKENSPWCAQAGLLLASGTQYSPVVKHFGLQLLEHTVKYRWTQITQPEKIFIKENSMKLLSMGGWETGHLNDALARVIVEMIKREWPQQWPTLLAELSDACTRGHLHTQIVLHVFLRLVEDVATLQTLEQHQRRKDIYQALTSNMAEIFSFFMRLIELHVQEFREKTAAGDYAAAASNGRVVQVVLLTLTGFVEWVSTNHVVTNNGRLLQILCILLSDDVFQLPAAECLLQIVNRKGSVLAGLAQQLTSMWNCATNVDSWLPLLLETMLLLTSHPSHTLAHTANSVWLAFLKHDQISKLPHVLAMVPRWLQAATPKILKVWSPRESVGCACDYDSDHEFAVFFNRARTEMLESFRYCMTAAPLVTWSYVEQWTNTALDKVDSCPLQCDVMHPLHVEWEALAQVLDVVLSRLLLAEPRPNVSEGLRVLQRCVSCEPRAPLILSLLLSFISALFVFLSCAYSQMAGPGVGSAGAELLPRVLDKIFASLVYEGTPPEDRSSRNVKNVRRHAAGLLVKLGSKYPLLLLPVFGRLHELTRAALSRPDLSAMESVTLQEALLLVSNHFCCYERQSALVAQVMGDCRDRWAALAEHIQSAAGLARLIGLDAPPTDDPERATARRTLLHALTLTLGVVKRTQVPADPDKAARGGFGAGVTASGNPVWRNPCGVHVLPLFPGVLALARSLHELHSPHAASLRHAGHARALLPPPAERRNLLGLRDDAPSSPTGQPASPTSPQDRMQNLLHTLHDNVCQLVGAAASSLGRELYSCAGLARVVAGSLLGAPERLPEHWLRRVLRAALRPLLLHCPPAHYRDVALPLLQHLAPIMLVHLNQRWDYVTSLYESGKLEEEGGSESQEVLEDMLVRHLTREHLDLLKVCLVEGGVTAESTDMEDETVTPVNPGRAPEQVSELGGIIISDPIAGPAVLHTILRALTWNDSTSSLRACSLALPALRAALAALGAGEAAGALAAVLHALRLHGQHEANQAALLALAVQTYEALRPVYPEVSSVLCAIPGVEPVDVQRLDDKLAAPPLRPAKVDKSKRDLFKKIASRLIGRNVGQLFKKEVFILDLPTMNIQRDKPRSALDTDGAGLENLFNTNAPT-