Monarch geneset OGS2.0

DPOGS207071
TranscriptDPOGS207071-TA3321 bp
ProteinDPOGS207071-PA1106 aa
Genomic positionDPSCF300001 + 2454735-2468930
RNAseq coverage2045x (Rank: top 6%)
Annotation
Heliconius% 
BombyxBGIBMGA013025-TA0.079.10% 
DrosophilaeIF3-S10-PA0.053.10% 
EBI UniRef50UniRef50_G6CI190.099.91%Putative eukaryotic translation initiation factor 3, theta subunit n=2 Tax=Obtectomera RepID=G6CI19_DANPL
NCBI RefSeqXP_973312.10.066.45%PREDICTED: similar to eukaryotic translation initiation factor 3, theta subunit [Tribolium castaneum]
NCBI nr blastpgi|910899450.066.45%PREDICTED: similar to eukaryotic translation initiation factor 3, theta subunit [Tribolium castaneum]
NCBI nr blastxgi|910899450.061.43%PREDICTED: similar to eukaryotic translation initiation factor 3, theta subunit [Tribolium castaneum]
Group
Gene OntologyGO:00055153.2e-11protein binding
KEGG pathway 
InterPro domain[369-494] IPR0007173.2e-11Proteasome component (PCI) domain
Orthology groupMCL14160 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207071-TA
ATGGCGAGATACGGTCAGAGACCGGAAAATGCTCTTAAGAGAGCCAATGAGTTTATGGACTTGGAGAAGCCTGCAAGGGCGCTTGATACCTTGCAGGAAGTGTTTCGAAACAAAAAATGGGCTTACAATTGGTCCGAATCTGTCTTAGAACCGATCATGTTCAAATACCTGGAACTATGCGTCGATTTACGCAAGTCTCATATTGCCAAAGAAGGGCTGTTTCAATACAGAAACATGTTTCAATCAGTAAACGTTGGTTCATTAGAACAAGTTATCAGGGGATATCTCCGGATGGCTGAAGAGCGTACAGAGTCTGCTCGAGAACAATCCACCCAGGCAGTAATTGACACAGATGATCTTGATAACCTTGCAACACCTGAGAGTATTTTGCTAAGTGCTGTCTCAGGGGAAGATGCCCAGGACCGCTCTGATAGAACAATATTAACACCATGGGTGAAATTTCTTTGGGAGTCCTACTGCCAGTGTCTGGAGCTCCTGCGGACGAATGCACATGTGGAGACCCTATACCATGACATAGCTCGCATGGCATTCCAGTTCTGCTTGAAGTATTCAAGGAAGACTGAGTTTAGGAAGCTTTGTGATAAACTCAGAAAACATCTTGATGATATTTGCAAATCCGTGTCTCAGCCAGGCAATGTTAGTATCAGTAAGCCTGAAACACAGCAGCTCAACTTGGAAACCAGATTGTTCCAACTGGACAGTGCCATTCAGATGGAGCTGTGGCAAGAAGCCTACAAAGCAATAGAAGACATCCATAATCTCATGAACATGTCCAAGAAGACGCCAGTCGCCAAAACCATGGCTAATTATTATGGCAAGCTGGCTCTCGTCTTTTGGAAGGCTGGACATTGCTTGTTCCATGCAGCTGCCCTGTTAAAGCTTTTCCAACTGTCTAGAGAAATGAAGAAAAATATCACTCAGGAGGAATTGCAAAAGATGGCATGTCGAGTCTTGGTGGCTGTGCTATCGGTCCCCTTACCATCGCTTCATCCTGAGTTCGATCGTTTCGTTGAAACTGACAAGAGTCCTGTTGAGAAGGCACAGAGATTAGCAGTGTTGCTTGGACTCGCTCAACCTCCGACCAGAGCTAGCTTACTGAAAGACGTGGTCCGTATGAACGTGGTGTCGCTGGCGTCGCCACAGCTGCAGCAGCTGTACTCGTGGCTCGAGGTGGAGTTCGATCCCCTGTCCATCTGTCAGAACGTCCAGAGCGTCGTCAGAACACTGCAGGAGGATCCTAACTCCCCGCTGGCACAATACTCGGTGGCTATAACGGATGTAGCGCTGGTACGTCTCATTCGTCAAGTCGCTCAGTGCTACGCCTGCATTCAATTCTCCAGACTGTTGGAGCTGGCTGCCACCGACGACCTCTTCCATATCGAACGTCTGCTCGTCGACTGTGTCCGCCATAATGATATGCAGATACGGGTGGATCATGCTAACAAATGTGTCCACTTCGGCGTGGAAGCTGGAGGCGGTGAATGGTGTTCTACTGCTGACGAGGCGTGCGGCGGGGCCATACTCCAGGCAACGCCCGCTGAACAGGTTCGCGAGCAGCTCGTCCGTGCTGCGGAGGTAGTTTCTCGTGCTGCTCAGACATTGTTCCCAGCTCGTCGTCGTGCCGATCGCGAGCGTGCTAGGGCCGCCATGGTGCAGCACTATCACGAGAACAAACACGCTGAACATCATCGCGTTCTACAAAGACATAAGATCATAGAGGAGAGGAAGGAGTACATTGAGAGACTCAACACTGTCAGGGAGGAAGAGGAGTTGCGCCGTCAAGAAGAGCAGTTGCGCGCAGCAGCGGCAGCGGAGGCACGCCGTCAAGAACAGGAAAGAGAAGAGAGGGAGAAGAGGAGACACGCCTCGGAACTAGCAGCAATGAAGGAGAGGAATCTGAGGGAGAGAATCGCTCACATCTCACAGACTATGCACGGGAAGAAGGTGCTGCAAAAGTTGGATGAGGAGGATTTGAAGAAAATGGACGCCGAAGCCATTGCTCAACGCGAGGCCGAAGAACTGATGAAGGAACGTCGGGAGCTTGCAGCTCGTCTTAAGTCTCAAGAGAAGAAGGTTGATTACTTCGAGCGGGCCAAGCGTCTGGAGGAGATTCCACTCCTACAGAAGAGTTTGGAAGAGAAGCAAGTGCAGGATAAAGCATTCTGGGAACAGCAGGAGAAGGAACGCATCGCCCAACTCATCGAGGCGCGTGGCCGTGATGTAGCTACAGCAACACGTCTGTCTCGTATGTCGGTGCACCGCGAACAGTTCACGACTCGACTGAACAGCGAGCGTGGCGCATTGTACCACAGCAGGCTGGCAGAGTTCACCGAAACCATCACCAGGGAGAGGGAGGCGCGACTCGCTCATAGGAGACAGCAGAGGATCGAGAAGAGACGAACAGAGTGGCTGACGGAGAAGCGTCGCATGGAAGAGCGTGCTGCGGAGGAAGCTCGCAAGGCACAGGAGGAACATGAGAGGAGGGAGAAGGAGAGAAAGCAGGCGGAGGAACTGGCCGCCCTCAAGGAGAAGAAGGAGAAGTCACTCAAGGAGCATCAAGAAATGTTGGCTAGGGCTGAAGCAAAGGCTCGCGCTATGGAAGCTGAAGTCACCCGCAAGCTAGAGGAACAAAAAGCGGCCGCGTTGTCCAGCTGGAGGAGACCCGGACCTCCGGCCAAGGAACCAGAGAAGAAGGAACCCTGGCGGCCCAGTCGCCTTCGCGAGCCCGTTGCTGATGAACGTCCACGCTCTCCAGGACGTAGAGATGAAGAGAAACGCGAGGAAAGACCTCGCGATATTAGCTTCAAGGATGACAGGCCAAGAGAAGAACGCAGCTACAGGGACGATAAGAACAGGGATGATCGGCCGAGAGATGACTCTGGATGGCGTTCAGCTAACCGGGATGCAGACCGGGATAGGGATAGAGAAAGACCGCGCTACACCGGTCGTTCAAGCGGCCCCGAGTCCGGCAGCTGGCGTCGTGGCCCTTCAGACCCAGCGCCCTCCGCCGAGCGTTCGTCAACTTGGCGCACCAAGGAGGCGTCTCGTGACGACCGTCGTGATGACCGCCGTGATGACCGCCGTGATGATCGTCGTGATGACCGCCGTGATGACCGCTACCGTGATCCGCCGCGTGATCGTGATGGATATCGTGATAGACCGCCGCCACGACGCGACGAGCGTGATCCGCCTCGTAGAGATGATCGTGATCGCCGCGATGATCGCGAACGTCGCGTGCCCCCGCGCAGAGAGGACAAACCCCGTGACCCGGACGACTTCCAAACCGTCTCCAAACGTTAA

Protein sequence:

>DPOGS207071-PA
MARYGQRPENALKRANEFMDLEKPARALDTLQEVFRNKKWAYNWSESVLEPIMFKYLELCVDLRKSHIAKEGLFQYRNMFQSVNVGSLEQVIRGYLRMAEERTESAREQSTQAVIDTDDLDNLATPESILLSAVSGEDAQDRSDRTILTPWVKFLWESYCQCLELLRTNAHVETLYHDIARMAFQFCLKYSRKTEFRKLCDKLRKHLDDICKSVSQPGNVSISKPETQQLNLETRLFQLDSAIQMELWQEAYKAIEDIHNLMNMSKKTPVAKTMANYYGKLALVFWKAGHCLFHAAALLKLFQLSREMKKNITQEELQKMACRVLVAVLSVPLPSLHPEFDRFVETDKSPVEKAQRLAVLLGLAQPPTRASLLKDVVRMNVVSLASPQLQQLYSWLEVEFDPLSICQNVQSVVRTLQEDPNSPLAQYSVAITDVALVRLIRQVAQCYACIQFSRLLELAATDDLFHIERLLVDCVRHNDMQIRVDHANKCVHFGVEAGGGEWCSTADEACGGAILQATPAEQVREQLVRAAEVVSRAAQTLFPARRRADRERARAAMVQHYHENKHAEHHRVLQRHKIIEERKEYIERLNTVREEEELRRQEEQLRAAAAAEARRQEQEREEREKRRHASELAAMKERNLRERIAHISQTMHGKKVLQKLDEEDLKKMDAEAIAQREAEELMKERRELAARLKSQEKKVDYFERAKRLEEIPLLQKSLEEKQVQDKAFWEQQEKERIAQLIEARGRDVATATRLSRMSVHREQFTTRLNSERGALYHSRLAEFTETITREREARLAHRRQQRIEKRRTEWLTEKRRMEERAAEEARKAQEEHERREKERKQAEELAALKEKKEKSLKEHQEMLARAEAKARAMEAEVTRKLEEQKAAALSSWRRPGPPAKEPEKKEPWRPSRLREPVADERPRSPGRRDEEKREERPRDISFKDDRPREERSYRDDKNRDDRPRDDSGWRSANRDADRDRDRERPRYTGRSSGPESGSWRRGPSDPAPSAERSSTWRTKEASRDDRRDDRRDDRRDDRRDDRRDDRYRDPPRDRDGYRDRPPPRRDERDPPRRDDRDRRDDRERRVPPRREDKPRDPDDFQTVSKR-