Monarch geneset OGS2.0

DPOGS212135
TranscriptDPOGS212135-TA2964 bp
ProteinDPOGS212135-PA987 aa
Genomic positionDPSCF300038 + 89313-98594
RNAseq coverage57x (Rank: top 69%)
Annotation
HeliconiusHMEL0049880.068.52% 
BombyxBGIBMGA006586-TA5e-16771.28% 
DrosophilaCG30357-PA3e-4432.25% 
EBI UniRef50UniRef50_E2BM202e-12949.89%Kelch-like protein 26 n=4 Tax=Formicidae RepID=E2BM20_HARSA
NCBI RefSeqXP_001600164.14e-12951.02%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3838570287e-12950.33%PREDICTED: uncharacterized protein LOC100877623 [Megachile rotundata]
NCBI nr blastxgi|3838570282e-12450.33%PREDICTED: uncharacterized protein LOC100877623 [Megachile rotundata]
Group
Gene OntologyGO:00055155.1e-48protein binding
KEGG pathway 
InterPro domain[671-908] IPR0159155.1e-48Kelch-type beta propeller
[257-350] IPR0117051.3e-16BTB/Kelch-associated
[124-243] IPR0113339.1e-12BTB/POZ fold
[770-815] IPR0066526.2e-10Kelch repeat type 1
[149-242] IPR0130699.1e-07BTB/POZ
Orthology groupMCL12386 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212135-TA
ATGGAGGAAATGAACGCCCAAAAAGCTGCATTCCCATTTAAACCTACGCCTAGTGGGAATTTAAAAGTACAATCAAATCAGGTAGTGTTTCAATCGTCAACTGTTGGCGTCCTTTTAGCGGATCCAACTCAAGCTAAAGTGAAAGTAAAATTTGAAGGAAAAGCAACAGAAGATCAAACTGACCGCCAAATTCCTGATGACAGCAATGACTCTGAGCCACAACTCCAGTTCGGACCTAAAGTTGCAATCGCAGCACAAGCCAAAATCGCCGGTACTTTTAGAAATTCAAATAAGAAATCACATGGAGTGAAAGAAGAAAAAGGAAGAGATTTAACCAGCCTTGGCTACGTACAAACGGAGACACTTGATTGGACAAGAGTACAATTACCAAAAAAACAAGATCTTTTTCAAGAATTTTATCGAAGAATACATAATTTTATCAATACAGATACAGTTGTACGTATTGGTAAAGATGAATTTCTCTGCCATCGTATAGTTCTCCAGATATATTCGGCCTTTTTTGACGTCAATAACCAACAAGTAATTGAATTGTCCAAAGACAACGTAACATCAGAAGCCTTTCATATCATTTACGAATGGATGATATCAAACGGTCAAGAAAGTAATCGTATTTTTAGAAAAGATAACATTCTGGATTTATTTTGTGCAGCTCAGTTTTTGGCCATAAAAGATTTGGAAGACCAATGTTGGTCATTTATTGTCAACGAACATATTTTTAACGAAGACACGGCCTTTGTTTTGTATAGAAAGGCGAGAGTCAGAGAACTGGTGCCCGTTATGGATATCATGGTACCACGTGTTATGAAATTTTTCTTACCACTAGTGGCATCTAATGATTTCCTTCAATTGGAACCGGATGAAGTGATGACATTCTTGAAATCTAATTATATAAACGTCACCAGTGAAATAGAAGTTCTGATGGCTGGCATCAGGTGGTTGTACGGGGACTGGGCCAATAGAAAACGACATTCCATAGAAGTGATGAAATGCGTTCGCTTTGGACTAATCTCGCCCTGGCAATTGGTTGACATTAAGAGAAACCCAGACAATGCTGAGATTTTGGAAATCATTAACGAAGCCGAGATACAACAGATGGTGGATGATGGGCTCGCGTATGTTATAATTAAATATTGGTATGGTAACAATTCAAAAAATTATTACCATTGGATCGACGTGCTGGGTCTAAGTGAGCCATGCGAAAGAAACTGGATAGGCGAAGAAAAGAATCACGGCACATACAGGGATTTCTTAAAATATCTCGAGCAGTTTTTATTACCAAAGGAACAGATGTATCAACATATGGCCATGATGCAAACCACGAAGCGTGATGATTGTCCGGGTAACATGCGTGGTCTAGACAATACTATGGACATGAAGAGACCGAAAATGGACTTTCCATTACCGGCAACTTTAAAAGGCCTAACTGGCGATAAGGAAAAGTTTCCTACAATGGGTGATTTTTACGAAGTTCGTAATAAGGAAAATCGTACGATACAATCACCGTCATCACAGCAATGTTGCAAGCAAATACATGAAATACACATGTGCAAGTTACAGCCATCTCATATTCCTTTCACCACAAAAATATATGAGCAACCGGAAAAGGATTCTGCGGAGGATCTCAACACTACGAGTGAGGCGAGCAGCGAAATGATGGGTACCAGCACCAGTGATAATATGGGGTACATACATAGGCAAAGCGAAAGGAAACATAGTATCGCGGCCAGTTACCTCGCTGCAGCCACCGCAGCGCTAAGTGGATCTAGGAGAGATTCTCAAAGCCCAACACTTCGTGGTCAAACTACGAACACAAGCAGAACAGAGGCAAATACATCCCCAACAGCTACCAAAAAATTTAGTCCACAGGCTTCTATATTTAACCAATCTAAAGAATCGCTGGCGAGGATAAATATGAATAAATTATCGACTTCGATACTTGGCCCATCGAACAAGAACTACATCGCTGAGGGTTCCCTTTTCAATTGGGATAGGGAAACAGTTTTGGTGTTTGGTGGTATAGATCCACATACCTCCTATGGCGTGGCGGGAAACACTGGCAAAGATATTTATAGATTCGATCCCGTTACAAACACATGGGATTTGGTCGGTGAACTTCCAGAGCCAAGACATCATCATTCAGTCGCTTTTTTGAGAGGAAGAGTCTTTCTCGTTGGTGGAGCAGATCCTCGTGAGGATGATCTTCGAGGTAAATCGGTGGTGGTGTCGACAGTGTGGAGCTTTGAACCGGTAACACGCTCTTGGTACAGTGAATCAGGTTTAGCGACACCCAGGAAGAACTTTGGATTGGTGGTACATCGGATGGCATTATACGCGATTGGTGGTCAGGATAAAAAAGGGAGGGTGCTTCGGTCAGTGGAGCGCTTCGACCCGAAGACCGGTTCTTGGAGCGAAGTCCGCGGGATGTGCGCGGCTCGTATGGCAGCTGCGGCGGCCAAGCACCGTGACTATATCTGGGTCGCTGGCGGCATGACCGGCGAGAAGAGGCGACCGGTCTGCGGCGTCGTTGAATGCTATAACTCTAATACCAACCAGTGGACACAAATTCATAGCCTCCGTTTTCCAAGATGTTTCGCTACGTTGTTCTCTATGAACGATAAGTTATACATCATCGGTGGTGCTGGCAAGATATCAGAAAAGGACAAAACTCCGAGTAGCGTTGGAGCGATAGACGTTTGGGATTGGAAAGACCGCGCTTGGAAACTTGAAACAGAAATGTCTATCCCGCGACACGGCCACGCCCTTGCTTACTTAGGCACTCAACTTATTATTATAGGGGGTGTCACGACGATTTATATGCGTGCCCTTAGCAATGTGGAATCGTTCTGTTGCGAGCGTGGCGCCTGGATCCGAGGGGTTTCGACCCTACCGTCACCTCTATCAGGGCATGGGGCTGTGACACTACCTCCTGCGTCTCTCATGTAG

Protein sequence:

>DPOGS212135-PA
MEEMNAQKAAFPFKPTPSGNLKVQSNQVVFQSSTVGVLLADPTQAKVKVKFEGKATEDQTDRQIPDDSNDSEPQLQFGPKVAIAAQAKIAGTFRNSNKKSHGVKEEKGRDLTSLGYVQTETLDWTRVQLPKKQDLFQEFYRRIHNFINTDTVVRIGKDEFLCHRIVLQIYSAFFDVNNQQVIELSKDNVTSEAFHIIYEWMISNGQESNRIFRKDNILDLFCAAQFLAIKDLEDQCWSFIVNEHIFNEDTAFVLYRKARVRELVPVMDIMVPRVMKFFLPLVASNDFLQLEPDEVMTFLKSNYINVTSEIEVLMAGIRWLYGDWANRKRHSIEVMKCVRFGLISPWQLVDIKRNPDNAEILEIINEAEIQQMVDDGLAYVIIKYWYGNNSKNYYHWIDVLGLSEPCERNWIGEEKNHGTYRDFLKYLEQFLLPKEQMYQHMAMMQTTKRDDCPGNMRGLDNTMDMKRPKMDFPLPATLKGLTGDKEKFPTMGDFYEVRNKENRTIQSPSSQQCCKQIHEIHMCKLQPSHIPFTTKIYEQPEKDSAEDLNTTSEASSEMMGTSTSDNMGYIHRQSERKHSIAASYLAAATAALSGSRRDSQSPTLRGQTTNTSRTEANTSPTATKKFSPQASIFNQSKESLARINMNKLSTSILGPSNKNYIAEGSLFNWDRETVLVFGGIDPHTSYGVAGNTGKDIYRFDPVTNTWDLVGELPEPRHHHSVAFLRGRVFLVGGADPREDDLRGKSVVVSTVWSFEPVTRSWYSESGLATPRKNFGLVVHRMALYAIGGQDKKGRVLRSVERFDPKTGSWSEVRGMCAARMAAAAAKHRDYIWVAGGMTGEKRRPVCGVVECYNSNTNQWTQIHSLRFPRCFATLFSMNDKLYIIGGAGKISEKDKTPSSVGAIDVWDWKDRAWKLETEMSIPRHGHALAYLGTQLIIIGGVTTIYMRALSNVESFCCERGAWIRGVSTLPSPLSGHGAVTLPPASLM-