Monarch geneset OGS2.0

DPOGS207477
TranscriptDPOGS207477-TA4173 bp
ProteinDPOGS207477-PA1390 aa
Genomic positionDPSCF300051 + 281156-293883
RNAseq coverage579x (Rank: top 22%)
Annotation
HeliconiusHMEL0225430.088.67% 
BombyxBGIBMGA000950-TA1e-2964.29% 
Drosophilatho2-PB0.047.68% 
EBI UniRef50UniRef50_E2QCS80.047.68%Tho2, isoform B n=27 Tax=Neoptera RepID=E2QCS8_DROME
NCBI RefSeqXP_393587.30.053.44%PREDICTED: similar to tho2 CG31671-PA [Apis mellifera]
NCBI nr blastpgi|3838619500.052.98%PREDICTED: THO complex subunit 2-like [Megachile rotundata]
NCBI nr blastxgi|3072002050.053.56%THO complex subunit 2 [Harpegnathos saltator]
Group
KEGG pathwayame:4100990.0 
 K12879 (THOC2)maps-> Spliceosome
InterPro domain[868-1170] IPR0214181.2e-93THO complex, subunitTHOC2, C-terminal
[556-612] IPR0217263.6e-18THO complex, subunitTHOC2, N-terminal
Orthology groupMCL12157 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207477-TA
ATGGGATCGTTTAATAAATTTGTATCTGATTATTGTAAAGCATGGGAAAAATCTGGACGGGAGCAATTCTTAAAAGCAATTACCCAGTTTATAAAGGATGAAGCAAAGAGTCCCTTGTTTTCCAAGTCAAACAAGCTATCAGGATTGTCACAAACGATCTATGATCTTTTACTTTGTGGTCTTCGTGGTGTCTTGAAAAAGGATTCTGTAATATCTGTGTTGAAAGATATTGTTGGTGTACATGCAGACATACCATCAATATTACTAGATGTAGTTTGTGTTCTGGATTCAGAAACATCTCTTGATGTCCAGAATGAAGAGAGAAGTAATTTTTGCTATTTAGTTAGGGAGTTGGAGTCGTTTATATCAGATAAACTCCTGAAGGAGCGTTTAGAGATTGACACCCTGCAGGATGTTGGTACACTGAAGAATAAGAATTTTTATACTAAGTTTATTAAAATCAAAACTAAACTATATTATAAGCAACGTAAGTTCAACCTTTTTAGAGAAGAAAGTGAAGGCTATTCAAAACTAATAGTTGAATTAAACCAAGAAATATCCGAAGATACAGATTGGAAGACAATATTAGAAATCATTCAGTCTCTCATAGGTTGTTTCAACTTGGACCCAAACAGAGTCTTAGATATAATTTTAGAATCATTTGAAGCTCGACCTCACTTAGACAAACTATTTATTTCATTAATAAAAAATTACATGGGCGATGCTCAAGTGATTTGTGAAGTATTAGGATTTAAGCTCGGCGATATGGAAGTATTAGAAAATTGTAAAAGCCCACCATCATTAATGACTGTCATTGCACTGCTTTTACAACATGAAGTTATATCTCTGGACGATATTTACCCCTGGCTACGTCCAGATGATACGGTCATGGCCAAGGAAGCTGACAAAGAATTTAAAGCTGTGCAGGATTATATTCGTAGACTTAATATTGTATCTACAAAGGGACCACAGAGTAATGCACCTGCAGAGTTTATCGAAGAAAAAGCTGATCCACAGGAATACTGGAACAATCAGAAACTAGTACTCTGTGAAGAGCTCCTAAATGTGAGAGCATGGAAGGAATTTTCATCACTTTTTTCAAGATTGTCAGTTACTTGCGTACCGCAAAGGCCTGCTATAGCTTTGTGCAGCATGCTTCACGCTTTGATTGAACCTTTGTACAGAATACATTGTCGAGTAGCTCCTAAAATAATAGGTAAGCCTATACCACCTTTGAAGTCTCCTTTGGCACCGCCAGCGTGCAAGACTTTTGAAGATATGAAGGAAACTGTCATACCAGCTCTGATGATGTTGGGTCCATCCCTTCATTATGATCCTATTTTAATGTACAAAATAATTCGTGTTCTGAGAACTGCTCGATCTCTGAAAGAGGATCCTTTGCATCATGAAGCACTTACAGTGCTGGATACAGCAATACTACCAGCATTAAGTCTGATGGAGGGAAATTGTTGCATGGCTGAAGAAGTTTACACCTTGCTTAAGTTATACCCTTACCAATGCAGATACTGTTTATACGCGAGGTGGAAAAACGAATCCGGTGAGAAGATCCCGTCGCTGATGCGTGTTCGGGGCAACTCCTTGCAGCGTATAAAACATATTATGAAGCGAGTGTCCAAGGAGAACATCAAACCCCAGGGACGTCTCATAGGCAAGCTGTCACACGCTGCACCGGCTTTCCTGTTCGATTACATGCTGCTACAAATACAAACCTATGACAACCTCATTGGTCCGGTGGTGGAATCTCTGAAGTATTTAACATCCCTCTCCTTGGACATTCTGGGCTATTGTCTCGTTGAAGCTCTTGCGGCCCGTAAGGGTACCGTGGGAGCCGCACATCCACCAGCTCTTCAAGCGCTCGCGGCATTCGCTGCAGCGGCTTTCAAGAAACATAATATAGAATTGACGGCATTGCTGCAATTTGTAGCAAATAGGCTTAAAGCGCAGCAGAGTCACGATCTTCTGATTCTGAAGGAAATAGTGCAAAAAATGGCGGGAATAGAAGCCGCTGAGGAAATGACTCCGGAACAACTCGATGCCATGGCCGGCGGAGAGCTGCTGAAAGGAGAGGCCGGTTATTTCTCTCAAGTACGTAACACGAGAAGATCGTCGGCGAGACTGAAAGAGGCCGTGGTGGGAAATAATTTGGATATTGCTCTATGTATACTGTCCGCTCAGCAGAGACATTGTTGTGTATGGAAAGAGTACGCTGAGGATAGTCCATCTAGTGGTGAGCCACGCGGGTCTCAGCTTAAAGTGGTCGGTCGTCTTGCGGACCAGTGTCAAGACGCGCTTGTCCAACTGGGTACCTTCCTCGCTTCCTCGCACGCGCCTGATGAATACGCCGCTAGACTTCCACTTCTACAAGAACTACTCCGAGACTATCACGTAGACGCCGATGTGGCGTTCTTCCTCCACCGTCCGGTGCTCAGTCAAAAAATAGCAGCCAAGGCTGAAGCTCTACGAAAAAGCTCCGACAGCAGAAGCGAGTCATTAGAGAGAAGTATAGAGAGATACAACATAGCTTCTAAAGAGGCGCTGGAACCTATCGTGACGTCGATAACTCCCCTACTACCGTCCAGAGTCTGGGAAGATATATCTCCCGAGTTCTATGTGACTTTTTGGTCCTTGTCCATGTACGACCTTCGCGTGCCCGTCGAGAGTTACGAGAGGGAGATAGATCGCTTGAAAACGGCCGCTGCTAATGTAGCCAAAGACAGCTCACAAGGTACCAAAGGAAAAAAGGAACAGGAACGGTTTAACACTCTCATTGATAAGTTGCAAGAAGAGCGTCGTCGTCAAGAAGAGCACGTGGCGCGGGTCCGCGGTCGCTTGCAGCGCGAGTGCGTCGCTTGGTTCCCAGCTCGTGCGGCGAAATCAGCCAAGAACGAGACTGTGACGCGTTTGATGCAACTCTGCATCTTCCCTCGCTGCATCTTCACGGCCCCGGACGCCTTGTACTGCGCCGAGTTCGTCCACACAGTCCACGCACTCAAGACGCCTAATTTCTCAACGCTCCTGTGCTATGACCGGTTGTTCTGCGACATCACGTACTCGGTGATGTCGTGTACGGAGGGCGAGGCAGCTCGCTACGGTCAGTTCCTGTGCCGTGTGATGAGGACGGCCATGCGCTGGCACAGAGACCGTACGGCCTTCCACGAGGAGTGCGCGCACTACCCGGGCTTCGTCACCAAGTACAGAGTGTCCAATCAGTTCACTGAAGCCAACGATCACGTCGGATACGAGAACTACCGGCACGTGTGTCACAAGTGGCACTACAAGATCACCAAAGCGATGGTGGTGTGTCTCGACTCCGGGGACTACGTGCAGATAAGAAACGCTCTGATAGTACTCATACGAGTGTTGCCGCACTTCCCCGTGCTAGAGAAACTCGCACAGATCATTGAGAAGAAAGTTGAAAAGGTCAAAGAGGAAGAGAAAACACAACGACAGGACCTGTACGTGCTCGCGACGGGTTACAGCGGCCAACTGAGGAACAAGGTGCCTCATATGATGAAGGAGAGCGACTTCCATCAGATCGTTCATCTCACGACCGGGGAAGTTAAACCCAGGGAGCAGACGACCGACGTGCCCGCACCAGATAATGAGAAGAAAGAATCGAGAACAAGCGAGAGACGCCGCGACGATACTGATCGTGAGAAGGAGGTCAAGCGCGAATCTCGTTCAAACGCCAAGGAGAGAAACAAAGAAGATGGCAGGACTAAAGACAGATCACCGAGAGAGAGGTCGCACAGAGAGGAACGCTACCTGGACACGGTGTCGCCGCCTCATGAACACCGTCATCCGCCCGATGACATAGATCGTGATGTGAAACGTCGTAAAGTCGAAAGCAGCGGTAACGGCAAGGGAAAGGAAATCGAAGAGCGTTCCCCCGAGAAGGAGAAAAGGAAAACGAAACTGAGGGGAGACGAAAGGAAAGAGCGTAAGATGAGTCGCAAGAGGGACCGAGCTGAAGAAACAGCTTTACTCGAACAAAAAAGACGCCGGGACGAACAAAAAGCTGTAGCTAAAATGAGCAGTCACCAGAACGGGTCTCAAGAGGATCACCACTATGAGAAGTATCACAAACGAGCGGTCCAACGAGTCGAAGATAAGAAGATAACTTTGGAAAGAAAACGGGCTTCAGAAGAATGA

Protein sequence:

>DPOGS207477-PA
MGSFNKFVSDYCKAWEKSGREQFLKAITQFIKDEAKSPLFSKSNKLSGLSQTIYDLLLCGLRGVLKKDSVISVLKDIVGVHADIPSILLDVVCVLDSETSLDVQNEERSNFCYLVRELESFISDKLLKERLEIDTLQDVGTLKNKNFYTKFIKIKTKLYYKQRKFNLFREESEGYSKLIVELNQEISEDTDWKTILEIIQSLIGCFNLDPNRVLDIILESFEARPHLDKLFISLIKNYMGDAQVICEVLGFKLGDMEVLENCKSPPSLMTVIALLLQHEVISLDDIYPWLRPDDTVMAKEADKEFKAVQDYIRRLNIVSTKGPQSNAPAEFIEEKADPQEYWNNQKLVLCEELLNVRAWKEFSSLFSRLSVTCVPQRPAIALCSMLHALIEPLYRIHCRVAPKIIGKPIPPLKSPLAPPACKTFEDMKETVIPALMMLGPSLHYDPILMYKIIRVLRTARSLKEDPLHHEALTVLDTAILPALSLMEGNCCMAEEVYTLLKLYPYQCRYCLYARWKNESGEKIPSLMRVRGNSLQRIKHIMKRVSKENIKPQGRLIGKLSHAAPAFLFDYMLLQIQTYDNLIGPVVESLKYLTSLSLDILGYCLVEALAARKGTVGAAHPPALQALAAFAAAAFKKHNIELTALLQFVANRLKAQQSHDLLILKEIVQKMAGIEAAEEMTPEQLDAMAGGELLKGEAGYFSQVRNTRRSSARLKEAVVGNNLDIALCILSAQQRHCCVWKEYAEDSPSSGEPRGSQLKVVGRLADQCQDALVQLGTFLASSHAPDEYAARLPLLQELLRDYHVDADVAFFLHRPVLSQKIAAKAEALRKSSDSRSESLERSIERYNIASKEALEPIVTSITPLLPSRVWEDISPEFYVTFWSLSMYDLRVPVESYEREIDRLKTAAANVAKDSSQGTKGKKEQERFNTLIDKLQEERRRQEEHVARVRGRLQRECVAWFPARAAKSAKNETVTRLMQLCIFPRCIFTAPDALYCAEFVHTVHALKTPNFSTLLCYDRLFCDITYSVMSCTEGEAARYGQFLCRVMRTAMRWHRDRTAFHEECAHYPGFVTKYRVSNQFTEANDHVGYENYRHVCHKWHYKITKAMVVCLDSGDYVQIRNALIVLIRVLPHFPVLEKLAQIIEKKVEKVKEEEKTQRQDLYVLATGYSGQLRNKVPHMMKESDFHQIVHLTTGEVKPREQTTDVPAPDNEKKESRTSERRRDDTDREKEVKRESRSNAKERNKEDGRTKDRSPRERSHREERYLDTVSPPHEHRHPPDDIDRDVKRRKVESSGNGKGKEIEERSPEKEKRKTKLRGDERKERKMSRKRDRAEETALLEQKRRRDEQKAVAKMSSHQNGSQEDHHYEKYHKRAVQRVEDKKITLERKRASEE-