Monarch geneset OGS2.0

DPOGS204978
TranscriptDPOGS204978-TA1782 bp
ProteinDPOGS204978-PA593 aa
Genomic positionDPSCF300123 - 209458-211239
RNAseq coverage11x (Rank: top 84%)
Annotation
HeliconiusHMEL0105844e-2526.97% 
BombyxBGIBMGA010144-TA0.080.50% 
DrosophilaCG13502-PC3e-0926.70% 
EBI UniRef50UniRef50_D2A2M83e-4229.01%Putative uncharacterized protein GLEAN_07034 n=1 Tax=Tribolium castaneum RepID=D2A2M8_TRICA
NCBI RefSeqXP_001814237.16e-4329.01%PREDICTED: similar to CG7634 CG7634-PA [Tribolium castaneum]
NCBI nr blastpgi|1892368681e-4129.01%PREDICTED: similar to CG7634 CG7634-PA [Tribolium castaneum]
NCBI nr blastxgi|1892368682e-6126.85%PREDICTED: similar to CG7634 CG7634-PA [Tribolium castaneum]
Group
Gene OntologyGO:00054885.8e-11binding
KEGG pathway 
InterPro domain[405-432] IPR0119905.8e-11Tetratricopeptide-like helical
Orthology groupMCL30728 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204978-TA
ATGGAAGAAAAACAACTTTATGGGGCATTAACTGTTTATCGAGAAAGAGGGGCATATCTGAGACGCTTAGAACAGTTTGAGAAAGCGAAAGCCTCTTACGATGAAGCCTTTAGAAATGCTCCAGATGATGTTCGCATGCTCACGGGCAGGAGTCAGGTATGCGCGGATGCGGTTCAACCAGTCCAAGCTTATTCCGACGCAGAACTTGCTCTTAAACTTGCTCCAGGCAACATGAACGCTCGAAATATGCAGGCACGAGCTATGTATACAATGTCTGATTTCGAGAGATCGGTAGTTATGAATTATCGAGGTGCTAGACATAGGCGGCAACCACCATACTTTATAGAAGGTATAAATCAAGGTGTGGAAACTATCCAAGACTGCATAGGTGTTAACGCTGGCAATGTAATGATTGAATTTTTGCCACTAATAAAGCAAACAGAAGCGGCGCGGGTTGATGAAGATGGACCTCAAAAACCTTTACACGTATCTAGAATACCGCGACCAGAAAAGAAAAGAAAGCTCACTCAAATGGAAGCACGGAAACATTTAACATTAGCTCGTGTTTTAGCTATGAAGTATTTAGGGCCTATGGCCTACGATAAATTTTTCTTACAACATCTAATGGAAGATCAAAGACTAAAATCTGCTAATTCAGGGGGAAGTATAGAATTAAAGGAAATAGTTAGGGAAGCTTTAAGATCATTAAGTGAAAGGCAAGATATGTTGCGTTCCCAACGTCCTTATTATACCATTAAATTGGCTGAGAAAGCAGAATCCAAATATCAAAACAAGTATAAAGAAGAAGTATTAATAAAAGAACGGGAGATAGGAGCTCGAACAGCTGAGAGACTTCTCAAACAACTTGAAAAAAGTATGCGTGAGGAGCGTATAATGGATTTGATTGCACAAGCCGAACGAATGCAGCTGTTTTTGGACGCTAAAACTCCACGCACTTTACCAGATAAAGAGATTTATACTGATAGGTTGTATCGAGCGGTGGGAGAAGGATATTTGTGTCAACATCGTTTATCATATACGCTTAGTGAACGTGGTAATAGAAGACGAATAGCTTTCCTGATGGGTTTGCCAGTTGGACGTCCAACTTCTTTTGATTCTGTAATGGCTAACTATCCGTACAAGTTTATAGATATAAAACAAGCTACTGCAAAGGTTATAGCCTCTCTTGAAATGTGTGAAAATTCAACAATGAAATGCTGGTTAATGTATGAATTATCACGATTGTTATGTACCCAAAAGAATTATGCACTGGCTAAGTTCTATGCAAAACGTTGTCAACGCGAAGCTCAAGAGATAAGTAATGTTACTTGGTGGCTTAATGGATGTTTTGTTTTGATGAGTGGTGACATGCAGCAAGGAAATTCTAATGAAGTCCGCATACAAGTTGAAGAAGCTTATGAGTTTTCAAAGAAATTACAAGACCCTGAACGCGTTCAGGCATTTTTAGCTAAATGTGCTGAAATGGCCTCTGAGACTGTGACCGCTGACGAAAGAAAAGCCATAATACAACGTGAGAGGCAAATTATAGGAGTTATGGAGGAACAACAAAGTGTTGAAACTCAAGTACTATTCAAGAGAATGTCCACAGTACCTCCTGGAAGAAGATTCTCAGTCCTGCCTCGTAAACCAGAAGCTGGTGAAGATCGTGCCGAACGTAAACGCTACCGCCAAAGAGGATTATCTGTTATCCCCGGTCCAGAGCGGTCTCTTCCCAAACCACCAAAATCTAAAACACTTGGATTCCAACTATTTGATATTTAG

Protein sequence:

>DPOGS204978-PA
MEEKQLYGALTVYRERGAYLRRLEQFEKAKASYDEAFRNAPDDVRMLTGRSQVCADAVQPVQAYSDAELALKLAPGNMNARNMQARAMYTMSDFERSVVMNYRGARHRRQPPYFIEGINQGVETIQDCIGVNAGNVMIEFLPLIKQTEAARVDEDGPQKPLHVSRIPRPEKKRKLTQMEARKHLTLARVLAMKYLGPMAYDKFFLQHLMEDQRLKSANSGGSIELKEIVREALRSLSERQDMLRSQRPYYTIKLAEKAESKYQNKYKEEVLIKEREIGARTAERLLKQLEKSMREERIMDLIAQAERMQLFLDAKTPRTLPDKEIYTDRLYRAVGEGYLCQHRLSYTLSERGNRRRIAFLMGLPVGRPTSFDSVMANYPYKFIDIKQATAKVIASLEMCENSTMKCWLMYELSRLLCTQKNYALAKFYAKRCQREAQEISNVTWWLNGCFVLMSGDMQQGNSNEVRIQVEEAYEFSKKLQDPERVQAFLAKCAEMASETVTADERKAIIQRERQIIGVMEEQQSVETQVLFKRMSTVPPGRRFSVLPRKPEAGEDRAERKRYRQRGLSVIPGPERSLPKPPKSKTLGFQLFDI-