Monarch geneset OGS2.0

DPOGS213625
Transcript: DPOGS213625-TA, 3225 bp
Protein: DPOGS213625-PA, 1074 aa
Genomic position: DPSCF300033 + 1069131-1076075
RNAseq coverage: 286x (Rank: top 38%)
Annotation
Heliconius: HMEL007789, 0.0, 83.77%
Bombyx: BGIBMGA011688-TA, 0.0, 63.91%
Drosophila: CG10144-PA, 4e-77, 27.47%
EBI UniRef50: UniRef50_B4N5I8, 6e-76, 27.30%, GK20314 n=1 Tax=Drosophila willistoni RepID=B4N5I8_DROWI
NCBI RefSeq: XP_001958397.1, 8e-80, 28.96%, GF10899 [Drosophila ananassae]
NCBI nr blastp: gi|194752173, 1e-78, 28.96%, GF10899 [Drosophila ananassae]
NCBI nr blastx: gi|195126947, 6e-81, 28.00%, GI13210 [Drosophila mojavensis]
Group
Gene Ontology: GO:0005515, 2e-06, protein binding
KEGG pathway:
InterPro domain: [91-238] IPR011046, 2e-06, WD40 repeat-like-containing domain
Orthology group: MCL14052 Single-copy universal gene

Nucleotide sequence:

>DPOGS213625-TA
ATGGATTTATTGAAAACACCTTCAACACAGTCTTTATTGGAATCAGATTTAGAATCAGTTGAGAGTCTTCAGTATGTGGATTTTGAAGAGCTTGATGAAGTCGAATATGCCTTGCCAACAAGTGAAGCACCGAGCCTGGCCGAAATTTTATCATCACAAGAATTGGAAATCAATAAAGGGCCATTAAAAAATGTTGAGGAACCTACATGTTCTGCTCTACATGTTGATTTCTTGCAAGCTATTTCACAGCAGCTATTTCAAGCTGAAGAAAGATCATCAGCTGGCGCAGCGACTACATTAAGTATAGGAACAAATGGAAGACTGACTGTAGGAACGGCTCATGGACATCTACTTTCGTTTCATGATCAAACATTGAGATGGGTCTGTGACGCTAACGGAGACAACGGTGCTGTAACTTGCTTGTCGTACAACCATGATAGCACTCGATTATTGGCGGGTTTCGCCCGAGGGCTTGTTTATCAATACGAGAGTGTACGCGGTGTTATCTTGAGACGGGTCACATTGGGAGGTAATATATGGGGCGCGTTGAGAGTCACATGGGCTGGTACTTCGGGACTAGCTCTCGATACAGGTGGATCCGTGTGGCTCATTAAATTCTCAAGACCACTCGGAGTCCGCTCCGCTCGAGTCTCGTGTCTATTTTCCGGTGCTCGTGGTGAAGTTGTAGCTATGACGGCTAGAGATGCTCGTATCTTAGCGTTGGCCACCCTCTCTAGAGTTATCATTGTTGCGGGTGGCCGCGCAGCCGGAGTGAAATTAGACGGACCAGCAGACGTTCTTCCGGTGTTAGAATGGTATGAGATAGATAACAGGCTGCTCGTATGCGCCAGAGCCAACATCATGCAATGGCTCAGTGTTGTTATAAGTGGACCCTCAATTAGCCTGCAGTCGGTCCAACGTGTTGAGTTAAAGTCGACGCCAATTTGGCTCGGTTGGTTGGCCGGAAGTCTAGCGATATTCGATTCGGATGAAAATCTCCGTTTGTGGGGTGATGATTATGATAAACCATTGGATTTGTCACAAATAGAACCAGTATACGCTTCGGCATTTTTCAAGGGTCATTGGACAGATGGTAACGTATCGAGAGCAATGTGCAAGGCTGGCGAGAGTGCGCTTGGAGGGGCTTGTATATCGGAAGGCACGTTAGCACTATTGGGGCGTCGCGGCGTTGTTAGAGTGAAACCTCGTGATCTTCTTGCCAGATCCCAAGCATTCCTGACCTCGGGGCGATATTCTCAAGCATTGCGACTGCTCTGCTCAGCCCAGGGTCCCGAAGCTAAGAAGCTAGCAAACGAGTTTATCTGCAATTTAGCTGATAGGCCACACATAGTGAATAGCAAAAATGTAGCAGTTCAAGTTGTCAAGTTATGCCTCAAATTTGACATGAGTTATGAGTTGTGGAATGTACTATGGGAGAACTGTTCGAGCGAAGACGCGTTTGTGGAGGCATTAAGCGATGCCGTAGTACGAGGAGAACTTGCAAATTTCGCTCCATCGCCTGATTTTACACAGTCACTAATCGAGCGTCTGGCTGACCTTGAGCCAGAACTCGTGGAGCTGGTGGTGTCGTGTGTACCACTGACCTCCCTGGACCCTCACCGGGCCAGTGTGTTCACGAGGGAGAGGCGTCTGTGGCGCGGCGCGGGGGCCATAGTGGCCGCTCTCGACGGTTGTAGCGGTGCTATACGAGAGTTGGTTAGCTACGTGGATTTGTCGTGTGGGAGAAGCGCGGGGGAGGGCGGTGGGGGGTGCAGGTGTGCGGGGGGCGCGCTACTGTTGACTGCCGCGGACGCGTTGGCGGGCCGAGGGGTGGGGGGTCGGCCGCTACCACCACACGCCCGACCCTCGGCCAGGCATGACGCACTACAGGCCTTGTTGGCTGAAGATCCGGAGGGTAGGTCTCCACTGCGAGCGTTGGTGTTGCACGACGCGAGCGCCAGCGTTCGTCTGTTGGAGCAGTGCGCTCGTGAACCGCCGTT
CGCGGGACCCCTCGCCAAACAGAACAGGCTGCGTGTAGCAAGAGCGCTCCTCACCTACATCAATCAGTTGCAGGTGTCTGACAGCATAGAAATACTAGAGTTTATATGTGGGCAACTACAAACCGGCGCTTTGCCGCTTGACCAGGAGTTGATAAAAAGAGTTCAGGAAGTCATATCGAACACAGATGACGAGCGAGCAGACGTCGCCTGGTTAGCAGTCTTAACACGGATCCGAACGCAGAGAGATCAGATGGTCATGCAATATAAAGATGCCGTCCCCCGACCACGGGTGCTGTGGCGGATTAATGCGATGCTCGACCAGCATAGCGAGGTCCTCAAGGAGTTCTTCAACATCAGCAATCCGTCCAGTCGCGATATAAACGAGCTGTTTGAATATTTGCGATCCCGAATCGAAACTGACCCCGAGGCTAGAGACCATATACGGGATCACCTTCCAGCTCTGATTCAGTTGCGACCGCGATCGGCGGCGGCGCTTCTCAATGAACAGCAAACTAATACGATAGGATCTGTTTACGACACATTAAGTACCGAATGTAGAATAGAATTCGGCGAGTGTCTCCTAGACATGGGGCGCTTGAAGGGGGACATCGCCGCATCTCACCTCCGAACTCTGTGTATCGAGAAGCCGAACGACGTTAAAGAGTTTCTTAAAAAGAATTCGGGAATAATCAGACCCGAGGACGCTTTAAAAATAATCAAAGAACACGGCCCAAAAGACGCGGAGCCGATCTGTCTGGAGGCGAGCGGTGATCACATGGGCGCTCTGGAGTCGTTGCTGCAGTCAGTGGCCGCCGCTGATGACGAGGCGACCAAAGCCAGTCTGATCGAGGAGGCGGGCGCGCTGTGCGTGCGTGTGGGACCCGCCGTCCCGCAGGCCGTGGCCTCCGACATGTGGTCGCGGCTCCTGCGCCACACGGACACGATTCCCGCGACGCTGCTGTTCGAAGCCGTCGCCTATCTTCCTCTCGAAGAACTCGCCACTAAGACTTGCACTACGATAACAATGGCCCGAACCATTTTAGCGAGCGGCGTTAGCGGGCGCGACGCCTGGGAATGTGCTTCCCGGCTAGTGCAGCGCGAGGCGCACGAGGCGCTCGCGCGGGAGTTGAGCACGGCTCGTCGCGGACTGGCGGTTCGCGGTCGCTGCGGGCATTGCGAGCTCCTTCCCCTGGACGCTCCACGCAGGACTACGCTCTATCCCTAG

Protein sequence:

>DPOGS213625-PA
MDLLKTPSTQSLLESDLESVESLQYVDFEELDEVEYALPTSEAPSLAEILSSQELEINKGPLKNVEEPTCSALHVDFLQAISQQLFQAEERSSAGAATTLSIGTNGRLTVGTAHGHLLSFHDQTLRWVCDANGDNGAVTCLSYNHDSTRLLAGFARGLVYQYESVRGVILRRVTLGGNIWGALRVTWAGTSGLALDTGGSVWLIKFSRPLGVRSARVSCLFSGARGEVVAMTARDARILALATLSRVIIVAGGRAAGVKLDGPADVLPVLEWYEIDNRLLVCARANIMQWLSVVISGPSISLQSVQRVELKSTPIWLGWLAGSLAIFDSDENLRLWGDDYDKPLDLSQIEPVYASAFFKGHWTDGNVSRAMCKAGESALGGACISEGTLALLGRRGVVRVKPRDLLARSQAFLTSGRYSQALRLLCSAQGPEAKKLANEFICNLADRPHIVNSKNVAVQVVKLCLKFDMSYELWNVLWENCSSEDAFVEALSDAVVRGELANFAPSPDFTQSLIERLADLEPELVELVVSCVPLTSLDPHRASVFTRERRLWRGAGAIVAALDGCSGAIRELVSYVDLSCGRSAGEGGGGCRCAGGALLLTAADALAGRGVGGRPLPPHARPSARHDALQALLAEDPEGRSPLRALVLHDASASVRLLEQCAREPPFAGPLAKQNRLRVARALLTYINQLQVSDSIEILEFICGQLQTGALPLDQELIKRVQEVISNTDDERADVAWLAVLTRIRTQRDQMVMQYKDAVPRPRVLWRINAMLDQHSEVLKEFFNISNPSSRDINELFEYLRSRIETDPEARDHIRDHLPALIQLRPRSAAALLNEQQTNTIGSVYDTLSTECRIEFGECLLDMGRLKGDIAASHLRTLCIEKPNDVKEFLKKNSGIIRPEDALKIIKEHGPKDAEPICLEASGDHMGALESLLQSVAAADDEATKASLIEEAGALCVRVGPAVPQAVASDMWSRLLRHTDTIPATLLFEAVAYLPLEELATKTCTTITMARTILASGVSGRDAWECASRLVQREAHEALARELSTARRGLAVRGRCGHCELLPLDAPRRTTLYP-
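
The two records above are related by translation: 3225 bp is 1075 codons, which is 1074 amino acids plus one stop codon, matching the 1074 aa listed for the protein. The sketch below shows one way to check that relationship. It assumes the standard genetic code and writes the stop codon as "-", as in the protein record. The helper names `read_fasta` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def read_fasta(text):
    """Return (header, sequence) for the first FASTA record, joining wrapped lines."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                break  # stop at the start of a second record
            header = line[1:]
        elif header is not None:
            chunks.append(line.upper())
    return header, "".join(chunks)


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# Hypothetical usage with the first 30 bases of the transcript, wrapped over two lines.
record = ">DPOGS213625-TA\nATGGATTTATTGAAAACA\nCCTTCAACACAG\n"
header, seq = read_fasta(record)
print(header, translate(seq))  # DPOGS213625-TA MDLLKTPSTQ
```

Applied to the full transcript, the translated string can be compared directly with the DPOGS213625-PA record, including the trailing "-".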