Monarch geneset OGS2.0

DPOGS211310
TranscriptDPOGS211310-TA3462 bp
ProteinDPOGS211310-PA1153 aa
Genomic positionDPSCF300125 - 108533-126210
RNAseq coverage234x (Rank: top 43%)
Annotation
HeliconiusHMEL0093710.067.72% 
BombyxBGIBMGA004963-TA0.076.46% 
DrosophilaCG31368-PD7e-14971.39% 
EBI UniRef50UniRef50_O603060.057.77%Intron-binding protein aquarius n=117 Tax=Metazoa RepID=AQR_HUMAN
NCBI RefSeqXP_002429122.10.061.53%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420172850.061.53%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|3800248660.061.62%PREDICTED: intron-binding protein aquarius [Apis florea]
Group
KEGG pathwayphu:Phum_PHUM4207200.0 
 K12874 (AQR)maps-> Spliceosome
Orthology groupMCL14984 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211310-TA
ATGCTGAAATTTTATGCTCGTTTTGAGATCAGCGACGAAACCGGTGATCCCATGACCGACCGCGACATGACCTTACAGCACTATTCCAGAATAACATCACTACAGAAAGCTGCGTTTACAAAGTTTCCCGATCTAAGATTGTTCTCTCTAGCAAACGTAGCAAGCGTTGATACCAGGGAATCTCTTCAGAAACACTTTGGGAATCTCAGTGATAAGGCATTAAGGGCCATTGCCACTTATCTGAATTTAGTTCCCACGGAAGGCAAGGAAGATGAAGCGCCTTGGCACAGACTGGACAAAGATTTCCTCAGGGAACTTTTGATATCAAGACACGAGCGAAGAATTTCTCAGCTGGAAGAATTGAATTCAATGCCCCTATATCCCACCGAGAAGGTTGTGTGGGACGAGCACGTGGTGCCGACCGAGGTTTACAGCGGGGAGCGTTGCCTCGCCTTACCGAAACTTAATCTCCAATTCCTGACACTTCACGATTACCTGTTGAGGAACTTCAACCTTTTCCGCCTAGAGAGTACATATGAGATCCGTCAGGACATTGAGGATGCTGTTTATCGCCTGTCACCATGGAAATCTGAAGACGGTACTGTGATATTCGGAGGCTGGGCACGTATGGCTCATCCTATCCAAAGCTTCGCTGTGGTTGAGGTGGCGAAACCGAATATAGGAGAGAAGGCGCCTTCAAGGGTCCGTGCTGACGTCACAGTGACCCTCAGCGTCAGGAACGAGATCAAGCACGAGTGGGAGAGTCTCAGGAAGCACGATGTATGCTTCCTTATAACCGTACGGCCTAGCGAGGGTATAGGGACGAAATACGATTACAAGAAAAGTATGGTCGACCAGGCTGGTATAGTCTACATCCGAGGTTGTGAGGTCGAGGGGATGTTGGACGCCGGCGGGAGGGTCATAGAGGACGGGCCAGAACCTCGACCAGAACTAGAGGGAGATTCCAGAACATTCAGGCTGCTGCTAGACCCTAACCAGTATAGGTTGGACCTTGACGAAGCCAGCAAAGGAAAAGAGGCTGGTATAGTCTACATCCGAGGTTGTGAGGTCGAGGGGATGTTGGACGCCGGCGGGAGGGTCATAGAGGACGGGCCAGAACCTCGACCAGAACTAGAGGGAGATTCCAGAACATTCAGGCTGCTGCTAGACCCTAACCAGTATAGGTTGGACCTTGACGAAGCCAGCAAAGGAAAAGAGGATGTGTACGAGACATTCAATATCGTTGTCCGACGGAAGCCTAAAGAGAACAACTTTAAGGCTGTTCTGGAGACGATACGAGAGCTGATGAACACGGAGTGCGTGGTGCCTGAGTGGCTTCATGACATAGTGCTGGGCTATGGCGACCCTGGGCAGGCGCACTACACCAGGATGCCCAACGAAATCCCTACCCTGGATTTCAACGACACGTTCCTGGATATGGAACATCTACGGAACAGTTTCCCGGGACACGAGATAAAGGTACAGACGGACGATCCGCGGAAACTCGTCCGACCGTTCAAATTGACTTTCGAGAACGTTCTACGTAAACAGCGAGGCGAAACGGATATGGATGAAGAGGAACCCAAGAAGGTTATAGTTGTAGAACCCCACGTGCTGCCCAAGAGAGGGCCGTACCTGTACAATGAACCTAAAAAGAACAACATACTGTTCACGCCGACCCAGGTGGAAGCGATCCGTTCAGGAATGCAGCCGGGGCTGACGGTCGTGGTGGGACCTCCCGGCACGGGTAAAACTGATGTCGCAGTCCAGATAATATCGAACTTGTACCACAACTTCCCGTCCCAGAGGACGTTAGTTGTGACGCACAGTAATCAAGCTCTTAACCAGCTGTTCGAGAAGGTTGCTGAGCTGGATGTGGACGAAAGGCACCTGCTGCGTCTTGGACACGGCGAGGAGGCTTTGCAGACGGACAAGGACTTCTCCAGGTATGGACGTGTGAATTACGTGCTGGCAAAGCGTTTGGAACTCCTCGGCCAGGTGTCGCGTCTTCAGACCACGCTGGGGGCGGGGGGAGAGGCGGGTGGTTTTATTATTATTGGGCACGTTATTATTGAGGGTAGTTGTGAGAATATTTGCATATTCTTACCTAGATCTGGCTCTTCCCTGAGCATCCAACTCCACATACGGCACACCGAGTCTCACCATCCTCGTGAAAAGGCTTTGTTCCATATTACAGTACTTCTGGAAAGCCATTATAATTTTAATAATTATCCAGTCTTAATATTAATTTCATTGAAAATAAAATGCAAACTAATTTCCGTTGTGTTTGTCGTTTTAGAACGTTCCTTGGCACATTGTTTTGTATGTTTCAAATACGACAACATTCTGATGGAGGAGTCCGCACAAATTCTTGAAATAGAGACCTTCATACCACTGCTGCTGCAGAACCCTCAAGATGGTAGGTCCCGGTTGAAGAGGTGGATAATGATCGGCGACCACCACCAGCTACCGCCGGTGGTGAAGAACATGGCTTTCCAGAAGTACTGTAATATGGAACAAAGCCTTTTCACGAGGATGGTGAGACTCGGTGTGCCGTATGTGGAGTTGGATGCTCAGGGAAGAGCCAGATCTAGCATATGCAACCTGTACCGCTGGCGCTATCGTAACCTGGGAGACCTGCGACACGTCTGCCAGCTGCCAGAGTACCGCGCGGCCAATGCCGGCCTCAGGCACGATATACAACTCATCAATGTAGACGACTTTAATGGAGCTGGAGAGACGGAACCCAGCCCGTACTTCTATCAGAATTTGGCAGAAGCGGAATATGTCGTGGCCGTGTTTATGTACATGCGTCTGATAGGCTGGCCAGCTGAGAAGATCTCGATCCTCACCACTTACAACGGACAGAAACATCTCATTAGGGACGTTATTAACAAACGGTGCGCCGACAACCCGCTCATTGGGAGACCACATAAGGTGACGACAGTAGACAAGTATCAGGGTCAGCAGAACGACATCGCTCTCATATCGCTGGTGCGGACGAAGGCGGTGGGTCACGTGAGAGATCTGAGACGTCTTATAGTAGCGACCTCTAGGGCTCGCCTCGGACTGTACATCTTCGCCAGAGCCAGCCTCTTCAGGAACTGCTTCGAATTGCAGCCGACATTTAATCAGTTGTTAGAGCGGCCGTTACAGCTGGAGTTGATCCCGGGTGAGTCATACCCGGCCCAGAGGACGCTCAGTGCTGCCGTGCCCGAGGAGCTAGTGCTGCGTGTAATGGACATGCCGCACATGGCGCGATACGTTTACGATATGTACATACAGAGAGTCAGAGACTCAGCTCAGGATTCCACATGGAGCGCCCCCGGATCTGATCGTTCAGCTCGTTCCAAGGAGGCGGATCATCACGTGGCGGTGCACCCGGGGGGTGACAGCGACGAGGACGACGCCACCGCCTTCCAACCCACGGATATAGTGAACGAGATCGAGGAACAGGAGTGA

Protein sequence:

>DPOGS211310-PA
MLKFYARFEISDETGDPMTDRDMTLQHYSRITSLQKAAFTKFPDLRLFSLANVASVDTRESLQKHFGNLSDKALRAIATYLNLVPTEGKEDEAPWHRLDKDFLRELLISRHERRISQLEELNSMPLYPTEKVVWDEHVVPTEVYSGERCLALPKLNLQFLTLHDYLLRNFNLFRLESTYEIRQDIEDAVYRLSPWKSEDGTVIFGGWARMAHPIQSFAVVEVAKPNIGEKAPSRVRADVTVTLSVRNEIKHEWESLRKHDVCFLITVRPSEGIGTKYDYKKSMVDQAGIVYIRGCEVEGMLDAGGRVIEDGPEPRPELEGDSRTFRLLLDPNQYRLDLDEASKGKEAGIVYIRGCEVEGMLDAGGRVIEDGPEPRPELEGDSRTFRLLLDPNQYRLDLDEASKGKEDVYETFNIVVRRKPKENNFKAVLETIRELMNTECVVPEWLHDIVLGYGDPGQAHYTRMPNEIPTLDFNDTFLDMEHLRNSFPGHEIKVQTDDPRKLVRPFKLTFENVLRKQRGETDMDEEEPKKVIVVEPHVLPKRGPYLYNEPKKNNILFTPTQVEAIRSGMQPGLTVVVGPPGTGKTDVAVQIISNLYHNFPSQRTLVVTHSNQALNQLFEKVAELDVDERHLLRLGHGEEALQTDKDFSRYGRVNYVLAKRLELLGQVSRLQTTLGAGGEAGGFIIIGHVIIEGSCENICIFLPRSGSSLSIQLHIRHTESHHPREKALFHITVLLESHYNFNNYPVLILISLKIKCKLISVVFVVLERSLAHCFVCFKYDNILMEESAQILEIETFIPLLLQNPQDGRSRLKRWIMIGDHHQLPPVVKNMAFQKYCNMEQSLFTRMVRLGVPYVELDAQGRARSSICNLYRWRYRNLGDLRHVCQLPEYRAANAGLRHDIQLINVDDFNGAGETEPSPYFYQNLAEAEYVVAVFMYMRLIGWPAEKISILTTYNGQKHLIRDVINKRCADNPLIGRPHKVTTVDKYQGQQNDIALISLVRTKAVGHVRDLRRLIVATSRARLGLYIFARASLFRNCFELQPTFNQLLERPLQLELIPGESYPAQRTLSAAVPEELVLRVMDMPHMARYVYDMYIQRVRDSAQDSTWSAPGSDRSARSKEADHHVAVHPGGDSDEDDATAFQPTDIVNEIEEQE-