Monarch geneset OGS2.0

DPOGS204769
TranscriptDPOGS204769-TA3432 bp
ProteinDPOGS204769-PA1143 aa
Genomic positionDPSCF300231 + 95633-105431
RNAseq coverage110x (Rank: top 59%)
Annotation
HeliconiusHMEL0034933e-16350.80% 
BombyxBGIBMGA002849-TA2e-17739.30% 
DrosophilaCG17211-PA2e-3928.87% 
EBI UniRef50UniRef50_D6X1E03e-4137.70%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X1E0_TRICA
NCBI RefSeqXP_973867.18e-4237.70%PREDICTED: similar to AGAP009450-PA [Tribolium castaneum]
NCBI nr blastpgi|2700130541e-4037.70%hypothetical protein TcasGA2_TC011603 [Tribolium castaneum]
NCBI nr blastxgi|2700130544e-6231.10%hypothetical protein TcasGA2_TC011603 [Tribolium castaneum]
Group
Gene OntologyGO:00055158e-05protein binding
KEGG pathway 
Orthology groupMCL26575 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204769-TA
ATGCCGTGTTCCCATGGGGCTGCTCTGACTGTCTCGACCGGAAGCGGCTCGTTACTCCGTACTATAACTGTGCTACTCGTTCCTGTGCCCGGTCTCAGTTCCCACGCTCGCCCCGCTCACCACCTCACCTCACGAGATGAGCGCTTATCACTCAAGTGGGGTTGTCGATACAATAGATTTGACCATCTCAATGAAAGCTTCATTATTATTCTTACGGTGTCTTTTGTTGTAGTTGGTTTTGGAGACATCGAGCGATCCCATTATGATACCAGAAGTGTGAGAGCGTCGGGTAAGTCGTTAGCGGAGACAGAGTCTGGTTGTTACTACGAAGGCTCCTGGTTCGTGGCGGGGGCGGCGGTGAGGACCCGTGAAGCCTGCTTGTCCTGTGCGTGTGCTCGGGGCGCTCTCTCCTGTCGCCGGCGCTCCTGTGCTACCCCGCCAGAACCTCCAGTACAGTGCTCCGTGGTTCACCGACGAGGAGAGTGCTGTCCTGAACTCCACTGTCCAAATAGAATAACTTACTTGGATGATGCAGCATCAACAAGATCTGAGAATTCATATATTGATATGCCGTCTTCCATCGGTCATGCCTGCGTGGAAGGAGGCACAGTGTACGCGGCGGGGTCTGCGATGACCTCCAGCCTGGCCTGCGAGCAGTGCTTCTGTTTGGGCGGCAGGAGACGCTGCGTGAGACCCCGCTGCCTGCCGCCGCCCCCCGGCTGCTCGGCCCGCCCTTCCTCAGGAGCCTGCTGTCCACAACGATATTATTGTCTACATGGAGATAATAAGCCTCCCATTGAAACTGATTCTCATGATTGTTTAACCCCTACTGGTAAATGGATTCAGGAAGGCGGCCGAGTCAAGGAATCTGAAAGCTCAGATTGTGTCCAATGCTTTTGTCTTCATGGTTCCATCCGTTGTCAGCATCTGTCATGTGCTCCGATGTTACACGGCTGTAAACCCCTGGTCCAACCTGGACAATGTTGTGCCCATCAATACATGTGCGATCATAACAGAAATGGAACAACGTTAAAAAATTTGAGTCAAGTCCGACAAAATGTACTTTTACCTTTGAAAGAGGAAATTCGATCGTTTCATACTTCAAAATCAGCTAAGAGAGAGACCACTCTCAGAGCAATGGAGTCTGTAAGCGAAAAAAGTACCGCTGCTGTGACGGTTACTAAAGAAGAACCTATTAAAAGTACTGTAAAAGTAAAAAGAAAAATAGATGAAAATTTGAGTAATGACACAGTGGCATCAGAAAGAGTTGATACAACCAGAGTGATGGGAGTGGAGACCACCCAAAATAATAATGACATTACGTCAACTACCACTCCCGAACCCACTACTGAGGCGACAGAAGAATTAATGACAGAACAACCCGAAGGATCAGTGAGAGTTATAATAAATGGCACCATCAACTGTACCGCGGAACTATCGTCAACGTCTTTGTTCCTAAATGTTACTAACACGAATGACACGATCTGGATACACAACGATGTGCAGCCTCGCATACCACTCATAGATCAAATAGAGAACTTGGACCACACATTCCCCCCAAGTGATATTATTACGGAGGGGAACCACGATGCTAACTTTGACGAAAACGAAACCTTTACAGTAAATGTAACTTCTTCTCTTCATACTAACAGCACTCGTACAACACTATCGACCGTATCGTCGACAGCTCTCCCTAAAACGATCCTCCCAGCCTTAATCGGAGCAGTTAATATTTCTAAAACAAAGAAAAACGAGTATGATTACGATTATACGGAACCTACGCTGCCGCCGTCGTTACCTAATTTGAAGATCATTCCATTTGTAGCTGCTGACGCGGTCGTTGATAGTGATATGTCACCTAAAGAAATGTCCAACTATCCACCTTTAACGAAAGATGACAAATTTCCAGTTTACTACCCGTCTAAAGACTCGAAAGAGACAGTTTATGAAACGCACAGAGAGGACGTTTACAATTTAACGCATTATTCAATGTTTGCTCCTGCAAAAAATGAACCTCAATATCCGGGCATAGTACAGGATGAAGATGACGTCAATATCGCTTATCCGAGATTATCTGATGACGTAGGCTCAGACGTCCACGAGTACACAGTATCTGCATCTTTCGGGAATCCCAGCAAAATAAGTTCCAGCATCGTCAGCGAACCGAGAGGTTTCGTTCCCAAAGACCCGGTGCTCATTGATGACTACTACACGCCATATCAAAGTACCTCATCTGCAATCATACCTCACTTAACAACGTCCATGCCATTGAATACGAAAGATGAATGCCTATCAGACGATGGTCGACCGGTGTCTGAGGGTGAGTCAATCAATATTGAATGTTCCATCTGCTCCTGTATGTGGGGAGAACTTCACTGCTCTCCGCGACCATGTCACACGCCACCTGGCTGCAAAAGACTGATGATTAAAAAGAACAATGTTGACTCATGCTGCGGAAAATTAATTTGTGATCAAGATGACAATATTCCAAAGGTTTCTAATACATCTACTCCAGGTTATCAAGAGAGTTTTAACGAAACATTTAGCACAATATTTGAAGATAAAAACAACATAAATGTCACCCAAAAAGAACATCCAACAGTTGATGAGGACGTAAATAATGACAATAAAACAATACACAATGTGACGGAAAACCACACGTTAATAGATACTTTATTTGACAAAACAACGAAATCATCAGTTATAAGTGGGATTACTTTAAATGCTACAACAACGGAAACAGTAAACAAATTCTCTACGAGTAGCAGTCACCCTTCAGACTATGAAGAAGAAGATGATGAGGACGAAGGATTCTCTCTGGGAAGTGTTTTAAAACTCTTGTTGAGTGAAACGTACGACACGACGACATCTACCCCTAAGAAGAAACCGCCGACGACAACTCATATACCAAAAGTAGTTTCAACTACAGCAGCGACCACGACAACGAGGAAACAGTCAAATATTCCACCGTTTGTTCCTCTAACACAACATTTCCCTTACATTCCTCCGAAACAATCACTTCATCGATTCAATACAGTTGATAGAATTGATCATTTGGTGTTGGGAGAAGCAAGCGCCATCAGGACGACCACCACACAGGCACCTGTCACTTACAAACCGGTCACTTACACCACAACCAAAAAGATTTTCACTACCAAACCCACTCAGAGGCCGGTCGTCACAAAGTCCGTTGAAATTAACAGCAAGGAAGGCTTTGTAAGTCAATCAGTAGAAAAGCGTCCTACCTCTAATCTATTCTCCGGGTTCGGTCTGGGTTTACCGAAACTAGCCGGCTGTAATATCTATGGACGAATGTACCGGGTTGGTAGGATCATCGCCGAGTTGTCTTCGCCGTGCCAAGAATGTAAATGCACCGAGGATAAAGGAGACGTCATCTTCCTGTCCAAAGATGCAGAAGTTGTTTCTTCGTAG

Protein sequence:

>DPOGS204769-PA
MPCSHGAALTVSTGSGSLLRTITVLLVPVPGLSSHARPAHHLTSRDERLSLKWGCRYNRFDHLNESFIIILTVSFVVVGFGDIERSHYDTRSVRASGKSLAETESGCYYEGSWFVAGAAVRTREACLSCACARGALSCRRRSCATPPEPPVQCSVVHRRGECCPELHCPNRITYLDDAASTRSENSYIDMPSSIGHACVEGGTVYAAGSAMTSSLACEQCFCLGGRRRCVRPRCLPPPPGCSARPSSGACCPQRYYCLHGDNKPPIETDSHDCLTPTGKWIQEGGRVKESESSDCVQCFCLHGSIRCQHLSCAPMLHGCKPLVQPGQCCAHQYMCDHNRNGTTLKNLSQVRQNVLLPLKEEIRSFHTSKSAKRETTLRAMESVSEKSTAAVTVTKEEPIKSTVKVKRKIDENLSNDTVASERVDTTRVMGVETTQNNNDITSTTTPEPTTEATEELMTEQPEGSVRVIINGTINCTAELSSTSLFLNVTNTNDTIWIHNDVQPRIPLIDQIENLDHTFPPSDIITEGNHDANFDENETFTVNVTSSLHTNSTRTTLSTVSSTALPKTILPALIGAVNISKTKKNEYDYDYTEPTLPPSLPNLKIIPFVAADAVVDSDMSPKEMSNYPPLTKDDKFPVYYPSKDSKETVYETHREDVYNLTHYSMFAPAKNEPQYPGIVQDEDDVNIAYPRLSDDVGSDVHEYTVSASFGNPSKISSSIVSEPRGFVPKDPVLIDDYYTPYQSTSSAIIPHLTTSMPLNTKDECLSDDGRPVSEGESINIECSICSCMWGELHCSPRPCHTPPGCKRLMIKKNNVDSCCGKLICDQDDNIPKVSNTSTPGYQESFNETFSTIFEDKNNINVTQKEHPTVDEDVNNDNKTIHNVTENHTLIDTLFDKTTKSSVISGITLNATTTETVNKFSTSSSHPSDYEEEDDEDEGFSLGSVLKLLLSETYDTTTSTPKKKPPTTTHIPKVVSTTAATTTTRKQSNIPPFVPLTQHFPYIPPKQSLHRFNTVDRIDHLVLGEASAIRTTTTQAPVTYKPVTYTTTKKIFTTKPTQRPVVTKSVEINSKEGFVSQSVEKRPTSNLFSGFGLGLPKLAGCNIYGRMYRVGRIIAELSSPCQECKCTEDKGDVIFLSKDAEVVSS-