Monarch geneset OGS2.0

DPOGS207582
TranscriptDPOGS207582-TA2841 bp
ProteinDPOGS207582-PA946 aa
Genomic positionDPSCF300072 + 785376-793993
RNAseq coverage236x (Rank: top 43%)
Annotation
HeliconiusHMEL0171410.057.05% 
BombyxBGIBMGA004690-TA9e-15250.24% 
Drosophilacdm-PA5e-7324.98% 
EBI UniRef50UniRef50_D6WZG32e-8126.86%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WZG3_TRICA
NCBI RefSeqXP_970544.18e-8126.81%PREDICTED: similar to GA20183-PA [Tribolium castaneum]
NCBI nr blastpgi|2700132616e-8126.86%hypothetical protein TcasGA2_TC011842 [Tribolium castaneum]
NCBI nr blastxgi|2700132613e-7126.02%hypothetical protein TcasGA2_TC011842 [Tribolium castaneum]
Group
Gene OntologyGO:00054883e-47binding
GO:00068867.1e-07intracellular protein transport
GO:00085657.1e-07protein transporter activity
KEGG pathway 
InterPro domain[1-860] IPR0160243e-47Armadillo-type fold
[810-859] IPR0119892.8e-15Armadillo-like helical
[26-92] IPR0014947.1e-07Importin-beta, N-terminal
Orthology groupMCL13484 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207582-TA
ATGGAGTATACAGCTCACAATTTAGAATATGCAGTGACTGTATTTTATAATGGAAACGATGAAGATAGATCTAAAGCCCATACTTGGCTGTCAGCAGCCCAACGAGTTCCTGAAGCCTGGAACTTTGTTTGGGAGCTGCTCCAGTCCAACAAGGGTACGGAAGTACAATTTTATGCTGCCACCACATTGCATACAAAAATTCTTCGATGTTGGAATGAAGTGCCAGAGGAAAGTTACACTGAACTGAAAGAAAAGCTATTACAAGCTATGATGGCTTATTCTAATGGTCCAAAAATTGTAACCAACAGATTATGTATAAGTCTAGCAGCTTTTATCCTACAACAAGGCTCTACAGACATAGCAGATATATTGAGGCCATTGTCAACTACCGCCACAACTTCTTTACTTCTTGAAGTACTCACTGTGATACCAGAAGAATTTAATAGTATGACTATGGGGACTGCTCTTCGTGCTAGAAACAGAGCTGCTTTGAATCAGGCTTGCAGTATGGTCCTTGATGATATGCTGAGATATTTACAAGATGTCTTCAATGACTACAGTAATTCACCGCCGAGTGAGGCTTCCATTCAACTGTGGACATCGGCTGCGTCATGCGCCAGTAATTGGTTGGCACTTAGTGAGGATACATTGGAAAGTACAACCACTCTGCCAGAGCGCGCTCCGCTGTGCCGCGCCTTGTATACCGCCGTGCGATTGCTTTGCACTTGGAACGAGGCGGTGAGTGGTAGTGCTTTAGAAGCATGCGAGGCGTTCTTATCAACGCTGCGGGCGGCGGGAACAGCTGGTGGTGCGACGCGGCATCCCACCTCCGCGCGGTCACTGCTCACTGACCTCGCAACACTCGCTGAACCTCTCCTTGCAGCACACAACCAGCCCAATTCAATCAATGAGGAGTTGCTGGCGGCTTTGATTATATGCTGCGTGGCGGTGGCTGAGTGTCACGCGCGCGTGTTGGTGGAGGCTGTCGAGGAAGGGAGAGAGGGGAAGGAGGAGGAGGGGGGTGCCAGGAGGATTCTACAAGTGTTGCTGGCGGCACAGGCTGCACCCGGACACTACCCTCTACACGAGACACGTTCCAACCTCGTGTTCGGACTGTGGTATACGTTACAAGATCAGATTCTCAACATGTCGGATAGCTCGAACAAAGTGAGCCCGGTGTGGTATGAGGTGTTCACTCAACTGCTCATCACGCTCATAAAGAAGTCGGAGATGCCGCCGGAGTCCATGCTGTCTAGAGATGAACAGGAACTACTACGCTGCTACAGACAAGACATCGCGGATATTGTGATGTACTGCTACAGTATACTCGGTGAGAATTGTTGGTCGCTGGTAGAGAGCGCTTTCACTGGCGCGGAGAGCGCTCGTCAGCGTGAGGCAGCGCTGCATGTTTTCGCCGCACTAGCCGACGCAGCGCCGGCAGGACGCGCACCTACAGCGCTTCCAGCGCTGCTCCAGCACGCGCTACGTCTAGCCGCAGATCCAGTTAACAACGATACACGGCTTCTGCATACAGCGCTAGATTGTCTAGGTTTCTACGCTTCCTGGCTCAGTTCTATGGAGGGACCGCAGGGCACGGAGCTTGGTCGGGAATGTATGCGAGCCGCCGGAGCAGCGCTGCAGCGCTGTCCAGCGCCCGCTGCTCTCGCCCTACGCAAACTATGCTTAGACTGTGCAGCACCAGCAGCGGAACTCGTCGCTGATATTGTACAACTGGCTCAAAACGTAGAAACTACATCTGAGGGTTGGACTCGTCGTCAGCTGGTGAGCGCTGCGGGCGCGGCGCTGGCGGCCAGTGACCCCGAGCGCGCCGCCCCTCTCCTGGCCTCACTCGCTCACCGCTTACACGACCTACTGACTGCACAGGCTCAGTCCGCGGCGTCCGCTCGCTCGTCGGCGGGCGGGGCGGCGGAGTGCGTGTCCTTGATGTGCGCTCTGGGCGCTCGACCCGCGCTCGCCGCGGAACTGTTCCGTATACTGCGGCCGGCGCTTGCCTTACTACCGGCAAATCAAGACCTCACGCAGGCAATGTTTCAAATCTTAAAGCATACGGTGTCTGCTCTGATGGGGGATTGTATAAATGTTATTGAAGACATCGCAGTATTAATAATCGCCGGCTTCGAATCTCAGCCATGTCCGGTTGGTCTGGATGTAATAAAACTGGTGGCGGTGATGGTGGGTACCGAGTGGGAAGGCACGGCGGGTATAGTCCGAGCAGGCGTGGTGGCGAGCGCGCGGGCGGTAGCCCCACAGACCAGCATGCACGCGCTCGACCCGCAGTACCTACTGACGGTAGAGCACACCACGGGACATCCCTGTCCCGAACTAGCGGAAGCTCTGTTTGTGTTGCTCGACGCGCTGACCAAGAAACAGCCGCGGGCCATCGAGTGGATTGAAGATATACTGTCTGATCTTATCGCTTTAGCGTGCGAGTTCGTCCGCGCGTGGGAGGCTCGTGCGGCGAGCGCGGCGTGTTCGTGGCTGGGGTCCCTGGCGGCCGTCCGTCCCGCCTCCCTGGAGCCCCGAGCGCCGCTCCTCACACACACAGCACTGCGGTGCATCGGCGGCGGGGCGGGGGCGGGGGGAGGAGGTGGAGGTGGAGGACCTCTCGGGGACTGGCTGCGAGCCTCACTGTCAGCTCCCGGTTTCCCCACCGTACACTCCACTGAAGCTCACAAACAAAAGTTCATTGCAGCCGTTCTCAGAGAGAAGACCAGCAAACGACGACTATTGGAGGCGGTTCAAGAGTTCTCATTAGGGTGTCGAGGTCTCATAGGAACGGAGTACGCACGCCAGACGCTATCAGCTAAACAAATGGTCTAG

Protein sequence:

>DPOGS207582-PA
MEYTAHNLEYAVTVFYNGNDEDRSKAHTWLSAAQRVPEAWNFVWELLQSNKGTEVQFYAATTLHTKILRCWNEVPEESYTELKEKLLQAMMAYSNGPKIVTNRLCISLAAFILQQGSTDIADILRPLSTTATTSLLLEVLTVIPEEFNSMTMGTALRARNRAALNQACSMVLDDMLRYLQDVFNDYSNSPPSEASIQLWTSAASCASNWLALSEDTLESTTTLPERAPLCRALYTAVRLLCTWNEAVSGSALEACEAFLSTLRAAGTAGGATRHPTSARSLLTDLATLAEPLLAAHNQPNSINEELLAALIICCVAVAECHARVLVEAVEEGREGKEEEGGARRILQVLLAAQAAPGHYPLHETRSNLVFGLWYTLQDQILNMSDSSNKVSPVWYEVFTQLLITLIKKSEMPPESMLSRDEQELLRCYRQDIADIVMYCYSILGENCWSLVESAFTGAESARQREAALHVFAALADAAPAGRAPTALPALLQHALRLAADPVNNDTRLLHTALDCLGFYASWLSSMEGPQGTELGRECMRAAGAALQRCPAPAALALRKLCLDCAAPAAELVADIVQLAQNVETTSEGWTRRQLVSAAGAALAASDPERAAPLLASLAHRLHDLLTAQAQSAASARSSAGGAAECVSLMCALGARPALAAELFRILRPALALLPANQDLTQAMFQILKHTVSALMGDCINVIEDIAVLIIAGFESQPCPVGLDVIKLVAVMVGTEWEGTAGIVRAGVVASARAVAPQTSMHALDPQYLLTVEHTTGHPCPELAEALFVLLDALTKKQPRAIEWIEDILSDLIALACEFVRAWEARAASAACSWLGSLAAVRPASLEPRAPLLTHTALRCIGGGAGAGGGGGGGGPLGDWLRASLSAPGFPTVHSTEAHKQKFIAAVLREKTSKRRLLEAVQEFSLGCRGLIGTEYARQTLSAKQMV-