Monarch geneset OGS2.0

DPOGS207937
TranscriptDPOGS207937-TA2577 bp
ProteinDPOGS207937-PA858 aa
Genomic positionDPSCF300090 - 406586-433743
RNAseq coverage55x (Rank: top 69%)
Annotation
HeliconiusHMEL0215202e-11677.33% 
BombyxBGIBMGA000378-TA0.074.37% 
Drosophila% 
EBI UniRef50UniRef50_B4QRN00.055.20%GD13896 n=10 Tax=Neoptera RepID=B4QRN0_DROSI
NCBI RefSeqXP_001815363.10.060.82%PREDICTED: similar to AGAP006590-PD [Tribolium castaneum]
NCBI nr blastpgi|1892374160.060.82%PREDICTED: similar to AGAP006590-PD [Tribolium castaneum]
NCBI nr blastxgi|1582961130.059.82%AGAP006590-PB [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00055151.9e-38protein binding
KEGG pathway 
InterPro domain[37-142] IPR0006971.9e-38EVH1
[44-143] IPR0119933.7e-18Pleckstrin homology-type
Orthology groupMCL18469 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207937-TA
ATGGGCAACAAACTATCGTCTTGCTCGTGCGCGCCCATCCTCCGCAAGGCGTACCGCTATGAAGATAGCCCGTGGCAGAACTCCCGACGACGTGATGGACATCTATTAAGGCTGTGGGCGGAGGTATTCCACGTGTCAGCCAGTGGAGCTGGTACAGTTAAATGGCAGCAAGTATCCGAGGACCTGGTGCCAGTCAACATCACCTGCATCCAGGACTCACCGGAATGTGTGTTCCACATCACCGCGTATAACTCACAGGTCGACAAAATTTTAGACGTCAGACTACTCCAGCCTGGCACTCGTATCGGTCAGGCCTCCGAATGCTTCGTGTACTGGAAGGATCCTATGACTAACGACACCTGGGGCCTCAACTTCACTTCGCCCATCGATGCAAAACAATTTAGGGAATGCTGCTCACCATCATTCAAATTCTCACGTAAGGCATCTTCGAGCTACAGCCTGAAACTCGAGCCACCGGGCAGCAAGCAGAAGATGAAAACTAAGAGGAAACCAGTATCTACACCAGCCAGCCCTAACCGATCGAGTCTGACATATGGCCGCGAACCACAGTGCACTTGCATGACTCAAGAACAATATTCCAGGCTACGAGCTCAAGACCCTCGCTATCGCTCGTCAACTTTGCCGCGAACAGCTACTCGTGCTATTGAGACGGACGCAGCCGCTGGTGCAGGTCGCTCCGACAAAGTCGCCGCTGCAACATCTTCCACATCGCTTTACGACAATGTCACCAATACACAGCCGCAAAACCAGGCGCCACCTAAACCTGCAACCCGCCAGAGTGAGAGTCAAGCTCCATCGCGTCCTCCAAAATCTCAAGAGACCTCCACCATGACTACAAACACTTCAACTGCTCCGAAAACGGTGACAGCATCGGTTGGAACACATGGCACTAGTACCAGCGAGGAATCACAAACCTCTACGGGCACTACAGTTCAGCATGCTCAAGACCTGAAGTCTGAAGGGGTACAGGCTGGCGGAACGCTTACCTCCAAATCTTCCTCAACATCCACTCGATCAAAGGACCACCTGCAGCACATGCCTAAAAGTGTGGACTACGGAGATGGAAATGAATCTTCTCGTGAATCTGACAGACACTCTATGCACAACCACAACGTAATCAATAACAACACATCTGGGTCACGACGTACAAAGTCAAAAAGCACAGAAGATATGAATATGGACTCGAGCACTCTCAAGCGTATGCTCAAGCCGATGCCATCTACTGAGAGCCCAGTAACGTCACCGGAAATGGGGCGCCGGCGGTACGGCGGTGCCTGCCCACCGACCTGCGGACCCCACGGACATCGCCACCAACCACACGGTCATGGACATTACGCCCATATCGTCAACAATAACAGCAGACAGGGATCTCAAAGATACTCATCATGCCGTGGCGTTGGTGTGGGTGCAGCACCAAGTGGCTACCCCGGCCGTGGCTTATATTTGGAGCTAGGAGGTGGTGAAAGGGATTTATCCCCTCCGTCTGATAATGTGATGTTCGATAATCAGTGCTATGCCACGACTCCGTCATCATCCAACGGCAACTCTGATCAGGAACCCTGTCGCCGCGACCGGGAACAAGCAATGCATCAGAGGCAGCATCATCACGGTCAGAAACCCTGCCTGTCTCGGCAGGCATCTCAGTCTAGTGCGACACCAGCCCCAGGCTCCCCTACATCTCGTTTGCTGTTAGAATATGAAATGCATCTCCGGAATACTCTCGCTAAGGGCATGGACGCTGAGAGCTACAGCCTGCATACATTTGAGGCCCTGTTGAGTCAGAGTATGGAAGACTTAGAATACAACGACAGCATGCCGCCTTCAAATCAGCGCAGTCCATATCCGTCTCGTAGAAGACCAGCATCTCAATGTTCCGGCGGTGGCAGCCGCTCCTCCACTTTGCCCTTGCCACATCGCCTCGGTGTTGAGAGACAGCACAGCGCACGCTCTGACCGCGATGGCTATTACAGTTTTGTAAGTGCGTCTCGTTGCGCTTCTTGCATCGGCGAGTCAGCTCGGTCGGCATGGTACCGTCATTCTGATGGATGGCGAGGCGCTCCACCACCAGGCCCTCGTCGTTCACCCTGGGACTCCTTACCAAGTCTACGACATGAAGGCAGCCTTAACGACTCTGGTTACAGAAGCAATCGAGCTGATAGCTTTGAACAACGTTTTGTAAGTGCGTCTCGTTGCGCTTCTTGCATCGGCGAGTCAGCTCGGTCGGCATGGTACCGTCATTCTGATGGGTGGCGAGGCGCTCCACCACCAGGCCCTCGTCGTTCACCCTGGGACTCCTTACCAAGTCTACGTCATGAAGGCAGCCTTAACGACTCCGGTTACAGAAGCAATCGAGCTGATAGCTTTGAACAACGTGGTGTGTTCGACCGTCAAGACAGCGTGCGCTCAGAGTACACAAGTGACCGGGAATCCTCACGTTATGGCATCGTACAGCAAGCCTCTATAGATAGCACTGACTCCAGAATTTGTTACCTGACTTCTAGCGAGGCCCTATTCGTCGTTTACATTAACAGGTATAATAGAATAGCAGATGACTAA

Protein sequence:

>DPOGS207937-PA
MGNKLSSCSCAPILRKAYRYEDSPWQNSRRRDGHLLRLWAEVFHVSASGAGTVKWQQVSEDLVPVNITCIQDSPECVFHITAYNSQVDKILDVRLLQPGTRIGQASECFVYWKDPMTNDTWGLNFTSPIDAKQFRECCSPSFKFSRKASSSYSLKLEPPGSKQKMKTKRKPVSTPASPNRSSLTYGREPQCTCMTQEQYSRLRAQDPRYRSSTLPRTATRAIETDAAAGAGRSDKVAAATSSTSLYDNVTNTQPQNQAPPKPATRQSESQAPSRPPKSQETSTMTTNTSTAPKTVTASVGTHGTSTSEESQTSTGTTVQHAQDLKSEGVQAGGTLTSKSSSTSTRSKDHLQHMPKSVDYGDGNESSRESDRHSMHNHNVINNNTSGSRRTKSKSTEDMNMDSSTLKRMLKPMPSTESPVTSPEMGRRRYGGACPPTCGPHGHRHQPHGHGHYAHIVNNNSRQGSQRYSSCRGVGVGAAPSGYPGRGLYLELGGGERDLSPPSDNVMFDNQCYATTPSSSNGNSDQEPCRRDREQAMHQRQHHHGQKPCLSRQASQSSATPAPGSPTSRLLLEYEMHLRNTLAKGMDAESYSLHTFEALLSQSMEDLEYNDSMPPSNQRSPYPSRRRPASQCSGGGSRSSTLPLPHRLGVERQHSARSDRDGYYSFVSASRCASCIGESARSAWYRHSDGWRGAPPPGPRRSPWDSLPSLRHEGSLNDSGYRSNRADSFEQRFVSASRCASCIGESARSAWYRHSDGWRGAPPPGPRRSPWDSLPSLRHEGSLNDSGYRSNRADSFEQRGVFDRQDSVRSEYTSDRESSRYGIVQQASIDSTDSRICYLTSSEALFVVYINRYNRIADD-