Monarch geneset OGS2.0

DPOGS209055
Transcript: DPOGS209055-TA, 1758 bp
Protein: DPOGS209055-PA, 585 aa
Genomic position: DPSCF300102 + 110528-112390
RNAseq coverage: 355x (Rank: top 33%)
Annotation
Heliconius: HMEL006097, E-value 0.0, identity 62.44%
Bombyx: BGIBMGA009128-TA, E-value 3e-164, identity 57.17%
Drosophila: CG10710-PA, E-value 8e-49, identity 34.05%
EBI UniRef50: UniRef50_E2AYF0, E-value 1e-107, identity 42.63%, Putative uncharacterized protein n=4 Tax=Formicidae RepID=E2AYF0_CAMFO
NCBI RefSeq: XP_001121582.1, E-value 5e-110, identity 41.10%, PREDICTED: similar to CG10710-PA [Apis mellifera]
NCBI nr blastp: gi|383860436, E-value 2e-112, identity 43.28%, PREDICTED: uncharacterized protein LOC100875645 [Megachile rotundata]
NCBI nr blastx: gi|383860436, E-value 2e-115, identity 43.35%, PREDICTED: uncharacterized protein LOC100875645 [Megachile rotundata]
Group
KEGG pathway:
Orthology group: MCL15910 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209055-TA
ATGCGGCGTGCGGCGTGCGGCGACGTCAGCACCGCGCCTTACAACACATCATCAGACAACAAGGAACTACTGCGACGACAGCTATTGTCCTCGAGCAAGGAGCGGAACGTAGACAAGTGTGACTCGACGAAAATTAAACACACGAGAAGCAGACAAAAGAACAGATGGAAGCTGAAGTTCCATCACCAGGCACTGCCCCAGGAGTACCTCGACCATTACGAACAGAGTCTGCAGAAAGCCAACTGCAAGAAGAAAACTCAAGACAAACAGAAGGAGCAGAGCGTCGAAAACAGCTTCACCAACGAAACCTTTAAGATTTGGGCGGAGAGACTGGCCGGCGTACAGACCGAGACCCCGGCGCTGACCCTCGCCTCGCCTGCGGCCAGCGGGACGATCAAGAACAGGGAAGAGAGAAAGAAGAAGAACCCTGCTATAACACCCAGCAAGATCATGACGTACGAAGACCTTCCTTACATGGGAGAAATGACCTTGAACAACTCCAAACCGAGGAGAGGCAGGAAACCGAAAAAGGCCGACATTTGCCACCTCATATACGAGAACTATGGGACCGTGGTGCCCGGCCGGCCGCCTCCCTCCAGGCCCAGCACGTGCCCGCCCTTCAAGACGGACCCCGTAGGAGGCCCGCCGCGAACGGACCTCCAAAACAGGATCATATCCAGTTTGCTGGAGAGAAAACTGAGTCAGGAGAACAGGAGGCGCTTCGAGAGTCAACCCGACGAGCCGCCGGCACCGACATGTGTCGCTCTTGGGCTCGGGCCGGGACTCCTGCACAGCGATCGCGAGCCGCTCAACCTGTGCATCAGGGATTTGAGTCACCTCAAACTGCAAGTACAGAAAAAGTTCGGCGATATTTATCCGGAGGTCAAGGTGGAGCGAGATGAGGCCGACAGCGGCCGCGGGGAGACGCCCAACGCTATCTCCGTCATACAACGGAACGAGAACGTCCCGCCCTCGCCACGGACTCCGCCCGACGCGGACGACAACTTCCCCGGGTACGTGTACTGGCCTGAGGCCGGCGTATACATGCACCCGCTAGCGCTGCAGACACAGTTGCTCTACTGTCAGCAGGCGGAGGCCTCGCAGAAGAAGGAGGGAGGGGCGGGGGGACTCGCGGTCAAAAGGATATCGGAGCTGCTGGAGCCCGAGAGGCAGACGGATGGACCCCGGGAGAAGAGGGTCGCCGCCGCCGCGCCCGCCAAGAGGAAAAGGTCCGCCATCTTCATCCCACCCGTGAGCGCCGGGAGCGGGACGGCGGCGCCCACCGCGGAGGTCAGCATATGCAAGTTCAAGTTCACGGGGGGCTCCAAGCCGTCGCTTCAGGAAAAGAAGATGCTGTCCGTCGATTCCGGGGGGAACTTCCGGTATTACAGCGGCGGAGGCGCCAAAGGCAACAAAGGGTACGACTTCCTACAGGGAGACTCGGCTCGAGACCGAAGCGACCCCGCGCCGGCGACGCGGCAAACCGACCCGGCGGTCGAAGAACAGAAGAAGAAGAGGAAGTCGAGGAAGACGCTGCAGAGGGAGAAGCTGGAGCAGACCTTCAAGGAGAAGGGCTTCCTCATACAGACGCAGCAGCTGCAGTCCGCAGAGGGCGCCACCTACTGCAAATTCCGACAGCTCCGGAAGTTCACGCGCTACCTGTTCCGGAGCTGGAGGGACTACCTGCCGGGGGAGCTGGCGGGAAGAGACGGCGCTAGGGGGGGAGCGGACGACCCCGCTGGCCCGGGGGGGACCTGA

Protein sequence:

>DPOGS209055-PA
MRRAACGDVSTAPYNTSSDNKELLRRQLLSSSKERNVDKCDSTKIKHTRSRQKNRWKLKFHHQALPQEYLDHYEQSLQKANCKKKTQDKQKEQSVENSFTNETFKIWAERLAGVQTETPALTLASPAASGTIKNREERKKKNPAITPSKIMTYEDLPYMGEMTLNNSKPRRGRKPKKADICHLIYENYGTVVPGRPPPSRPSTCPPFKTDPVGGPPRTDLQNRIISSLLERKLSQENRRRFESQPDEPPAPTCVALGLGPGLLHSDREPLNLCIRDLSHLKLQVQKKFGDIYPEVKVERDEADSGRGETPNAISVIQRNENVPPSPRTPPDADDNFPGYVYWPEAGVYMHPLALQTQLLYCQQAEASQKKEGGAGGLAVKRISELLEPERQTDGPREKRVAAAAPAKRKRSAIFIPPVSAGSGTAAPTAEVSICKFKFTGGSKPSLQEKKMLSVDSGGNFRYYSGGGAKGNKGYDFLQGDSARDRSDPAPATRQTDPAVEEQKKKRKSRKTLQREKLEQTFKEKGFLIQTQQLQSAEGATYCKFRQLRKFTRYLFRSWRDYLPGELAGRDGARGGADDPAGPGGT-
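
The reported lengths in this record can be cross-checked with a few lines of Python. The sketch below is only an illustration: the helper names `parse_fasta` and `lengths_consistent` are hypothetical and not part of any database tooling. It also assumes that the trailing `-` in the protein sequence marks the stop codon, which the record itself does not state.

```python
def parse_fasta(text):
    """Return {header: sequence} for FASTA-formatted text."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def lengths_consistent(transcript, protein):
    """True if transcript length equals 3 * (residues + 1 stop codon)."""
    # Assumption: a trailing '-' or '*' on the protein is a stop marker.
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)


# Lengths as reported in the record: 1758 bp transcript, 585 aa protein.
assert 1758 == 3 * (585 + 1)

# Genomic span implied by the reported coordinates (inclusive).
genomic_span = 112390 - 110528 + 1
```

Passing the two FASTA records above through `parse_fasta` and then `lengths_consistent` would apply the same check to the actual sequences rather than to the reported lengths.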