Monarch geneset OGS2.0

DPOGS203275
TranscriptDPOGS203275-TA4956 bp
ProteinDPOGS203275-PA1651 aa
Genomic positionDPSCF300229 + 368802-392893
RNAseq coverage82x (Rank: top 64%)
Annotation
HeliconiusHMEL0034500.091.24% 
BombyxBGIBMGA000456-TA0.088.08% 
DrosophilaCG30116-PA0.045.22% 
EBI UniRef50UniRef50_Q7Q9380.048.19%AGAP004827-PA n=4 Tax=Culicidae RepID=Q7Q938_ANOGA
NCBI RefSeqXP_001652900.10.047.61%hypothetical protein AaeL_AAEL001276 [Aedes aegypti]
NCBI nr blastpgi|1571169020.047.61%hypothetical protein AaeL_AAEL001276 [Aedes aegypti]
NCBI nr blastxgi|1582930970.048.16%AGAP004827-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00055153.4e-60protein binding
KEGG pathway 
InterPro domain[1157-1453] IPR0110463.4e-60WD40 repeat-like-containing domain
[856-1120] IPR0159439.6e-50WD40/YVTN repeat-like-containing domain
[886-925] IPR0016801.4e-09WD40 repeat
[1205-1238] IPR0197811.6e-08WD40 repeat, subgroup
Orthology groupMCL16095 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203275-TA
ATGAGCGCGCCGGAGCCTTTAGTGCTTCAAGCGCTGCAGGGCTTCATCGACCAGAGTGACAAATGGCCTGGTCCGCGTGCCGTCAAAATATTCGTTTCTTCGGTTTACAACGAATTTCGTGAAGAAAGACGTCAAATTCTTGAACTTGTCGGTCCAGAACTACAAGCCACATACGATGATCGATATATCGAGTTCGAGTTCGTGGACATGCACTACGGGACAGATGGTGGGGATGAAAATAATCCAGGTCTTCTGCGATACCATCTGGATGAAATACGCTGCTGCTATCAAACTTCAAGAGCTGGATATTTTCTTTGCTTTATTGGTGGGGACACTTCAAGCTACAATCCTGTTCTACCATTCAAAATCGAAGCTGAAGCATTTGAACGACTGATCAAATCTGATTCTCCGCAAACCGCTCTTATAAAGGCATGCTATAAACTCAATGAAGATGGACTATATCATCTTGAAGGCGATGAAAAATGGTATAGCGATTTATTAGAGCGAGATGAACAACGCAAGCGATTAGCAACAGTACAAAAAGTATTCAATACTTTAGCTCTAGAAGCTTTGGAGAATGGAGGGAATATATCGAATTTACTACGAAGTCCTGCAGAGATACAATGTGAATTAGCAATTGAACTTTTATCACAAGGAAAACACCCAAAAGGTTTTATTGCCGTATTAAGGGACATTCCGGACTTGGAGCCAGATGACACAAAATTAACAGTCTTAGTTCATCAGAGGATAAAAAAGTTGCAAAAGAAATTAGAAGACATTTTACCTGAAACACATATAATTAGATTAGACGCTATATCAGCAGATTCATCACGAACAGACGCAGCATCAGATTCAGAAGAGAAGTTATCACCATTCAGAGATCGTATACAAACCGTAGTAAGTTCGTTGGTAGACGACACCTTAAGTACAGAGCCCGATCAAGGCAAGGGTAGAAAAAAGACTGTACAGGAAGTGTTTCTTGAGCACATTACTCATCTTAGAATATGTATAGAACATAATAAACTGTATGAAGTTAATAATAAGGAAATCGAAAATACAGCAACAACTATTCTTAATAATGCTAAGGAAAACTATGATAGTCGATCACGCCATCCACCGGTACTGATATATGGGCCCGACGCTTCCGGAAAAAGTACTTTGTTGACACATCTTTACTTTAAATTCGAGGAAATCTTCCCCAAACCTGTTTTGAGAATAATAAGATTTTCAGCTTCCACGCCAAGATCGGCATACAATTTGGAGTTACTCAGAGTTATGTGCCAACAAATTTCCATTATCCTCAACATCCCTGAGGGCTATCTACCAAAAGATGCCAGTTTTGATCCCCTTTATATAAACAACTGGTTTCAAAGTTTACTACGAAGATGCGAAGAAATGGAGGACGAAGTTTTATTGATATTTATTGACAATGTACATAGAGTAAATCCTTTAGAGTGTGATATTGTAACGGGGCTCTCCTGGCTGCCTATGTCTTTGCCAAGAAATGTTTTCTTAGTTTGTACAAGCGCTGTGTCTCTAGATCAATTGCAATTGACACCAGCTCAAAAAGAAAAGTTTAAAGTTCAGAATTGTTATTATTTGTTAGACGCTATTGAAGATATTGCTGAGAACAATAGTTCAGGCGACTTCATTGACGGTGCCTTTGATAATTTAGAAGTCGTCTTCGGCTCTAAAGCCTTTAGTAAATTGGCTGGTTATATTACTTGTTCTGAATTTGGACTGACGGAACTTGAACTGTTGGAACTTTTGATGCCAACAAGCAATAGCGAAGCTGTTATTACATTGAAGGAAGCTAACTTCAACTTCTCTACGTTGTGTGTCATCAAGCATATGATGAAATCTATAATTGAGGAAAACGTTGTGTCTGGCCGGAGCACTTGGCGCTGGCGGGCGGCGGCTGCAAGTTCTCGCGCCCGACGTCGCTATGTACGAGTGCAGACTGCCTTACGAGACGCGCATTCGGACCTCGCTGCATTGCATTTTGCTAACTTCCTCAATGACGCTGATGATACAGACACATCGGAAACCCAAGAACCTGGTTGTGTGGATGACGACGATGCCCTTTTAGATTCCACTCCATTCCATTCAGCCAGTAGAACAGCCGCTGCATTTACTCAAAGACACGTGGAAGAGTCTTGGCTGCATTTATTGCTAGCTGGTGATTTCACAAAACTTAAAGATCTCACCGTTTGCAATTTTGATTTCTTGCTTGCTGCTGTACAAACTGTAACGATATCGTATCTCCGTTGCATCCTGGAGCACGTCCGCTGCTACATCCTGGATCGGGATGTCGAGCTTGTGTACGGTGCGGTACGGAAGTCTAGTGACATACTTACCCGTGACCCAATGCAATTAGGAGCCCAGATAATTGCATGGTTACGTCCGGCGGTTGCCCGTCGTGGTGTACTCGCTACATTGGTAACTGCGGCCATGGCCTGGTGCGATGGTTATGACAAACCTCTACTGGTTCCACTTAATGGATGGCTTCATCCGCCCATAGCGTCAACTGTCCGAGTTGTTTCTGTCGGAGGTTCTACTCCAGGAGCTGGTATAAGATTATTACAGCTCGCGCCATCAGGACAGCACTTGGTGTTAGCACCTTCAGCTGGAGATCCACAGCTTTGGCACGTGATGTCGAATTCTAAAGTGCACACTTTTAAAGGTCATTCTGGACGCATTTTATGCATGTGTGTAACGCGGGAATCTCAATATCTTCTAACTGGCTCTGAAGACACGAGCGTCATTGTCTGGGACTTACATACTTTGGCTGTGAAGACAAAAATACTGGAACACATAGCGCCTGTGTTGTGTGTAGCGGCCATAGTAAATCGATCTCTAGTTATCAGCGGAGGTGAGGATTCAGCAGTTATCGTTACCTCACTAGTGGACGGAGCTCTGATAACTAAGCTCGACCACCACCGCGGTCCCGTGACAGCTATCAAAGTAATCCAAGATGGGGAGATTCTGGTGACATGTTCACAAGATGGTACTGTCTGCACCTGGAATGTTGATAACTTCGTACTGCTTAGTACTGTGACGGCAGGTGTACCGATACACGCTATGGAAGTCACCGAGGACAATGTTTTCCTCGTCACACTGCAAGGAGAAAATGAATTACATATTCGTACTTTCATAACGGGGACTCATTTGCATGTACTAAAACGTCATAAGACTAAGGTGAAATGTTTCTGCGTGGGCCATGATTCTTCTCGAGCTGCAGTTGGTTGCGCTGATCAAAGGATATACGTTTATAGCTTACATAGTGCTCAGCTGCTTCGCACATTAGCAGCAGCACATGACCTCTCAGCACTAGCGATTGCTGATAAGGACCACTTTTTATTAGCTGCTGGTGGCAACAGAGTGACAATATATTCATTTCATACTGAAGACAATCTTACTAACTTTAGACCAACGAAACAAATGAAACGACGTCAAACAAAATCTACAACTAACATCACCCTGCTACAGGCCGAACAAAGTGAACTAATCCCAATAAGCTGTCTAGAAGTGTCGAGGGATGGCCAACTTGCGGCTAGTGGTTGCGCGAGAGGCTTAGTTAGAGTATGGCAACTCTCTACCCACAGACTGCAAAATACCCTAAGTGGCCATATGGGTCACATAACCTGCGTTACTTTCAGCCCAAATAATTTAATGGTATTAAGTGGATCCGAGGACAGAACTGTTGTAGTCTGGCAGTTGGCAGATAACTCTGCATCATTGACGTATAAGGGCCACCAATCCGCTCTTCAAGTACTGTTGATGATGTCTGACGGTAGACGCGCAATGTCCGGAGACCGCGCCCGCACTGTACACGTTTGGCTCGTGGACTCAGGCATCGTACTGCTCTCCGCAAACTGTCCAACAACCAGCATTGATGTTACCCTTAATATGAAGTTTGCGGTTTTATCTGATGGTGATAATTCAGTTCGAATCTGGACCTTAGCCGAAGGAGACAGTGGCGAGGAAAAGCGTTCCGTGTCCCACGCAGAGCGTGTTACCTGCTTCGCTCTTACAGCGGATTCACAGCACGTGGTGACTGGATCTATGGACATGTCGCTCAAAGTTTGGCAACTTAATGGTGGAAAGTTATCGCAGGTTTTGGTAGGTCATACCGACATTGTGACTTGTGTTGCTGTCTCGATCACAAACAAGACTCAAGTGGTATCTGGTTCGTGGGATTACAACTTAATTGTATGGGATATAAACACTGGTTCAGATATTCATCTGTTATCTGGTCATTTGGGCAAAGTTACTTGTGTAAAAGTCACTGGCGACGGGACAATTGCAGTTTCGGGTGCGGAAGACAAAACACTCATAATATGGGAAACAAAACGGGGTCTTGCTCTGACGTCACTAGCATTGCACGTACCCGCGCTAACCTTCCAGATTACTAGCGACTGTTCGCGCATCGTTGTCCATCTTTCAGATAGAGGTTGCCTGCCAATAATCTGTCTACACAATACTCCAGCAACGTACGTTAAAATACCAACTTATGCCGCACCCACTAAAAACGTCGATGAATTGCGGCCGCTAGCACCAAAACGTCCTATGAGGAGATTGCTTAAGAAGGAAGTATCACTGGATACTTACACTTGGCAAAAGAAATACGGGCATCTTACATCCGCGGCAATGATGGCACAAGTTGACGAACGATTAAAGAGAAGATTTTCTGTGTCAGCGTCAATGGAAGAGATATCTAAAATCCAAGAAGCTAAGAACAAAGATTTGGGTTCGCAGGTTAGTCTTGGTCCAGAACAAGCAGCAATCGCTCAATCACAGCACTTTGATCAGCTGGAAGCTCTGTGGAATAAAATATCACCCCCGAGGCGACGATCCAATAAGACACTTTCAAAGCAAAGTTCACTCGTTGAGAGACTTGATTCCTCTGACGAGGATCACACCCCAGTTGAAGAACAAGAACACATGGTGGAGTAA

Protein sequence:

>DPOGS203275-PA
MSAPEPLVLQALQGFIDQSDKWPGPRAVKIFVSSVYNEFREERRQILELVGPELQATYDDRYIEFEFVDMHYGTDGGDENNPGLLRYHLDEIRCCYQTSRAGYFLCFIGGDTSSYNPVLPFKIEAEAFERLIKSDSPQTALIKACYKLNEDGLYHLEGDEKWYSDLLERDEQRKRLATVQKVFNTLALEALENGGNISNLLRSPAEIQCELAIELLSQGKHPKGFIAVLRDIPDLEPDDTKLTVLVHQRIKKLQKKLEDILPETHIIRLDAISADSSRTDAASDSEEKLSPFRDRIQTVVSSLVDDTLSTEPDQGKGRKKTVQEVFLEHITHLRICIEHNKLYEVNNKEIENTATTILNNAKENYDSRSRHPPVLIYGPDASGKSTLLTHLYFKFEEIFPKPVLRIIRFSASTPRSAYNLELLRVMCQQISIILNIPEGYLPKDASFDPLYINNWFQSLLRRCEEMEDEVLLIFIDNVHRVNPLECDIVTGLSWLPMSLPRNVFLVCTSAVSLDQLQLTPAQKEKFKVQNCYYLLDAIEDIAENNSSGDFIDGAFDNLEVVFGSKAFSKLAGYITCSEFGLTELELLELLMPTSNSEAVITLKEANFNFSTLCVIKHMMKSIIEENVVSGRSTWRWRAAAASSRARRRYVRVQTALRDAHSDLAALHFANFLNDADDTDTSETQEPGCVDDDDALLDSTPFHSASRTAAAFTQRHVEESWLHLLLAGDFTKLKDLTVCNFDFLLAAVQTVTISYLRCILEHVRCYILDRDVELVYGAVRKSSDILTRDPMQLGAQIIAWLRPAVARRGVLATLVTAAMAWCDGYDKPLLVPLNGWLHPPIASTVRVVSVGGSTPGAGIRLLQLAPSGQHLVLAPSAGDPQLWHVMSNSKVHTFKGHSGRILCMCVTRESQYLLTGSEDTSVIVWDLHTLAVKTKILEHIAPVLCVAAIVNRSLVISGGEDSAVIVTSLVDGALITKLDHHRGPVTAIKVIQDGEILVTCSQDGTVCTWNVDNFVLLSTVTAGVPIHAMEVTEDNVFLVTLQGENELHIRTFITGTHLHVLKRHKTKVKCFCVGHDSSRAAVGCADQRIYVYSLHSAQLLRTLAAAHDLSALAIADKDHFLLAAGGNRVTIYSFHTEDNLTNFRPTKQMKRRQTKSTTNITLLQAEQSELIPISCLEVSRDGQLAASGCARGLVRVWQLSTHRLQNTLSGHMGHITCVTFSPNNLMVLSGSEDRTVVVWQLADNSASLTYKGHQSALQVLLMMSDGRRAMSGDRARTVHVWLVDSGIVLLSANCPTTSIDVTLNMKFAVLSDGDNSVRIWTLAEGDSGEEKRSVSHAERVTCFALTADSQHVVTGSMDMSLKVWQLNGGKLSQVLVGHTDIVTCVAVSITNKTQVVSGSWDYNLIVWDINTGSDIHLLSGHLGKVTCVKVTGDGTIAVSGAEDKTLIIWETKRGLALTSLALHVPALTFQITSDCSRIVVHLSDRGCLPIICLHNTPATYVKIPTYAAPTKNVDELRPLAPKRPMRRLLKKEVSLDTYTWQKKYGHLTSAAMMAQVDERLKRRFSVSASMEEISKIQEAKNKDLGSQVSLGPEQAAIAQSQHFDQLEALWNKISPPRRRSNKTLSKQSSLVERLDSSDEDHTPVEEQEHMVE-