Monarch geneset OGS2.0

DPOGS209915
Transcript        DPOGS209915-TA    1800 bp
Protein           DPOGS209915-PA    599 aa
Genomic position  DPSCF300519 - 17884-22905
RNAseq coverage   34x (Rank: top 74%)
Annotation
Heliconius        HMEL010584        8e-153  57.28%
Bombyx            BGIBMGA007056-TA  0.0     61.50%
Drosophila        CG15128-PA        3e-18   22.80%
EBI UniRef50      UniRef50_D2A2M8   8e-51   30.79%  Putative uncharacterized protein GLEAN_07034 n=1 Tax=Tribolium castaneum RepID=D2A2M8_TRICA
NCBI RefSeq       XP_001814237.1    1e-51   30.79%  PREDICTED: similar to CG7634 CG7634-PA [Tribolium castaneum]
NCBI nr blastp    gi|189236868      3e-50   30.79%  PREDICTED: similar to CG7634 CG7634-PA [Tribolium castaneum]
NCBI nr blastx    gi|189236868      5e-68   29.39%  PREDICTED: similar to CG7634 CG7634-PA [Tribolium castaneum]
Group
Gene Ontology     GO:0005488        5.7e-13  binding
KEGG pathway
InterPro domain   [238-239]  IPR011990  5.7e-13  Tetratricopeptide-like helical
Orthology group   MCL25311          Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209915-TA
ATGAGAAAGAAGTTAGAACTTGAAGAGAAACCTATTTGCAATGTTGTTTCAGTATATAGAGAAAGAGGAAATTACTTGCAACGCTTAGAGCAATTCGAGAAAGCAATTTTGGCCTACAACGAAGCACTGCGATGGAATAAAACAGACGTGCGTTCTCTGCTGGGTAGGAGTTTGGCTCGCGCTAAAGCAACTTATTATACTGGTGCTTTAAGAGATGCCGCTCGAGCAGCTGAATTGGAGCCAGAAAATCTTACGGCTTTACAGATAAAAGCTCAAACGGAATATGAAAAGTGTGCTTTTGAACGATCGCTTCTGCTTTCATATGGAGGACAACGACTCCGTAAACTGCCACCAAATTTTGAGGACTGCGCTAGATGTGCAGAAGAAACGATACGAGAATGCACCGGTCTGAGTTCTTCTAAAATTATGTTAGCTGCTGCAAAATTATCTCCGCCAATCAATTTGCTGCAAGATTCACAAAATGGTATAACAAGAACTACAATCCGAAAAAGTCGGATGCAATCAAAGTCGACTCCACAGGTTCAAGAAATATCTCGAGTGGAGAGGAAAAGAAACGAAAATATCAGTCGTCTAATGGCATCCAAATATTTGGAACAAATGGCTCACGACAAATATTTCTTAACAGCGCTATATAAGGATGAAAGAATTATTTCCGCCAACAAGAAAGGATCTAAAGAATTGCGAGAACTAGCTAGAAGTGCGCTTGCTGACATTGAAAAACGACAAGAAGTCCTTAGGGAACGTAGACCACTATATGCAGCACGAGCTCCAGAATCAGAGGCTCGGGCAAGATTATCGAAAGCTCGAAAACAGAGAATCTTTAATGCTCAACGCCAGCACATAACAGATGCTCGCCGTTTAATAAATACAACACAAGAAATGTACGAAAAACATGATACTTTAAAATGTCTGGAAGCTGCTGAATTTGGAATGGAACAGATAGCAAGAATACCAGCAAGGTTACTACCCGGGAAAGAGAAGCTTTTACAGGAATTACATGAAATCGTTGCGGAAGCTTTTCTGGACCAGAAACGTATTAAGAAAGAAATGAGTGAAGGAGATCGTGAGAAAAGAGCTTTTATACTCTTAGGAATACCGATATCTAGAGAACCGAGTAGAGATTCAATCCTACGTACTCGTCCACCAGCACCTTGTCGGGATGCTAAACGAAGGCTACGCACTCTAGAACGCTGTTTGACTCTAAGTAGTCATGCTTCTGAACGATGTTACGTCCTGCATGAGTTAGCTCGTCTTCAAATTGACATTAAACAAGCGCATAGAGCCCGATTTTATGCTTCTAAGTGCCAATCAGAATCCAGATCTGCAAACCAACGACTGTGGCTGCTCAATGCTACATTCCTATTGGCACGTTGTCATATCTTACAGAACAATCGCCCTGAGAGTCGTGCCACATTATTAGAAGGGGCAGGACTTGCAAAAGCTTTTGGATATCCAGATGTTGCCTCCTTCTTCGATACGTGTGTAGATGTATCTCTTGAGGGTGAAATTGGTTCAAATGATTCAATATTGGAAAAGCGTGAAAAAGCTGTGGTCAACTTAATGCAAGACGAAGACATGCGACAGGCAGCTCAACATTTATTCCGAAGAATGTCTGCCATACCTGCATCGAGACGATTTTCCATAATGCCGGGTGCTCGTGCTGATGACGCTGCTCCGGCAGGAAATCGCCGGGCTTCTATTATGCCGCGAACACAACTTCCCGCAAGGCTAGTGCGTACCAGTCAACATCCACTCGGCTTTCAGGATTTTGATTTATAA

Protein sequence:

>DPOGS209915-PA
MRKKLELEEKPICNVVSVYRERGNYLQRLEQFEKAILAYNEALRWNKTDVRSLLGRSLARAKATYYTGALRDAARAAELEPENLTALQIKAQTEYEKCAFERSLLLSYGGQRLRKLPPNFEDCARCAEETIRECTGLSSSKIMLAAAKLSPPINLLQDSQNGITRTTIRKSRMQSKSTPQVQEISRVERKRNENISRLMASKYLEQMAHDKYFLTALYKDERIISANKKGSKELRELARSALADIEKRQEVLRERRPLYAARAPESEARARLSKARKQRIFNAQRQHITDARRLINTTQEMYEKHDTLKCLEAAEFGMEQIARIPARLLPGKEKLLQELHEIVAEAFLDQKRIKKEMSEGDREKRAFILLGIPISREPSRDSILRTRPPAPCRDAKRRLRTLERCLTLSSHASERCYVLHELARLQIDIKQAHRARFYASKCQSESRSANQRLWLLNATFLLARCHILQNNRPESRATLLEGAGLAKAFGYPDVASFFDTCVDVSLEGEIGSNDSILEKREKAVVNLMQDEDMRQAAQHLFRRMSAIPASRRFSIMPGARADDAAPAGNRRASIMPRTQLPARLVRTSQHPLGFQDFDL-
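
The transcript and protein lengths listed above are consistent with each other: 599 codons plus one stop codon give 600 codons, or 1800 bp. The sketch below shows one way to check this kind of consistency by translating a coding sequence. It assumes the standard genetic code, which this record does not state explicitly. The function name translate and the "-" stop symbol (chosen to mirror the trailing "-" in the protein sequence above) are illustrative and not part of the original record. Only the first ten and last three codons of DPOGS209915-TA are used as input.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (assumption: the record does not say which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol so the output matches the
    convention used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First ten codons of DPOGS209915-TA
print(translate("ATGAGAAAGAAGTTAGAACTTGAAGAGAAA"))  # MRKKLELEEK
# Last three codons, ending in the TAA stop codon
print(translate("GATTTATAA"))  # DL-
# 599 residues plus one stop codon, three bases each
print(599 * 3 + 3)  # 1800
```

Passing the full nucleotide sequence to the same function and comparing the result with the DPOGS209915-PA entry would extend the check to the whole record.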