Monarch geneset OGS2.0

DPOGS207405
Transcript: DPOGS207405-TA; 4917 bp
Protein: DPOGS207405-PA; 1638 aa
Genomic position: DPSCF300087 - 313732-324613
RNAseq coverage: 468x (Rank: top 27%)
Annotation
Heliconius: HMEL002073; 0.0; 68.31%
Bombyx: BGIBMGA009372-TA; 0.0; 64.48%
Drosophila: Pcf11-PD; 7e-109; 38.32%
EBI UniRef50: UniRef50_D2A0S7; 5e-176; 34.22%; Putative uncharacterized protein GLEAN_08263 n=2 Tax=Tribolium castaneum RepID=D2A0S7_TRICA
NCBI RefSeq: XP_970852.2; 1e-176; 34.22%; PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|383853622; 0.0; 35.60%; PREDICTED: uncharacterized protein LOC100875741 [Megachile rotundata]
NCBI nr blastx: gi|383853622; 0.0; 32.97%; PREDICTED: uncharacterized protein LOC100875741 [Megachile rotundata]
Group
KEGG pathway: tad:TRIADDRAFT_56575; 2e-24
 K12879 (THOC2) maps-> Spliceosome
InterPro domain: [2-152] IPR008942; 4.5e-47; ENTH/VHS
 [4-138] IPR006569; 1.4e-29; RNA polymerase II, large subunit, CTD
Orthology group: MCL14679; Insect specific
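
The lengths and coordinates listed in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database itself. It assumes that the transcript length covers the coding sequence including the stop codon, and that the genomic coordinates are 1-based and inclusive; neither convention is stated in the record. The function names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 4917
PROTEIN_AA = 1638
SCAFFOLD_START, SCAFFOLD_END = 313732, 324613


def codon_count(n_bp):
    """Number of complete codons in a sequence of n_bp bases."""
    codons, remainder = divmod(n_bp, 3)
    if remainder:
        raise ValueError(f"{n_bp} bp is not a multiple of 3")
    return codons


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# If the transcript is a complete coding sequence, the codon count
# should equal the protein length plus one stop codon.
codons = codon_count(TRANSCRIPT_BP)
print(codons == PROTEIN_AA + 1)

# Length of the gene's footprint on scaffold DPSCF300087.
print(span_length(SCAFFOLD_START, SCAFFOLD_END))
```

Under these assumptions the transcript and protein lengths agree (1639 codons for 1638 residues plus a stop).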
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207405-TA
ATGGCGAAGGAAATTGCGGACGAGTACGCATCAAGCCTGGCGGATTTGACTGTAAACAGCAAACCCCTTATTAATATGCTGACTATACTCGCGGAAGAGAACATCGAGCATGCAGGAGTTATCGTTGAGACAGTGGAGAAACATTTGGAGAAGGTGCCCCCGGACATCAAGCTGCCTGTACTGTATTTAGTAGATTCCATTATCAAGAATGTTGGTGGTGCATACACACAGAAGTTCTCACAGATCATCGTTAATATGTTCACAAAGACCTTCAAACAGTTACCTGAAGAATCTATAATGAGAGAATGTTCACAGGTCGATGAAAAGATTCGTTCCCAAATGTTCAAGTTACGGGAGACGTGGCATGAAGTATTCCCATCCACTAAGTTGTACCAACTGGATGTGAAGGTGAACCTGATTGATCCCGCTTGGCCCATCCAGGCTCAACCACATCAGTCTAACATCCATCCAACTCCGACAACCAGTGCTGCAGCCCTATCAGAAGAAGAGGAGAAAATGCGTGCCATACTAGCTAAGAAAGAACAAGAACTACTAATGCTGCAGAGGAAGAAGGTCGAGATGGAGCTGGAGCAGACTCGGAGACAGTTACAGATGGCTGAAAAAGTTCACAAGAAGCCGCCCTTGGTGCATCCCCCTGGGCTCCCCGCGCCCCAAGCGGAGGCCCCGCCGGTCCCTCTGGCGCCTACATCTCTTCCTCCTCAAGCGCCCATATCGTCGCCAGTACTGTCCATCAAGCAACGGCTTGGACCCCCTGTTAACAAGTCGGCGGGTCGTATCGCTCCCGTCACGGGTCCGTTGAGCGGAGCGCGGCGTGACCCGCGGCTCGCAAGGAGACAGCCCCCACCACAACGCCAACCACCTGCCACCGCCCACACCACCCCCACCACCGCCGCTACCACAATCGCCCCCAACGTTTTCGATATCAAGCCACTCCACCGACCGGCCAAGCGGAGCGCCGTCATCACCATCGAGGTGGCGACCGACTCGCGCCGCAAGGACAGAAAGGAAACACGCACCAAAATAGAGAACGGCAACGGGGACTCCGTACACACATACATCGACAGACTCAGAATACCCGACCCCAAGAAGATAAACAAGCTGCCACCCATACCCAAGATACGGCGGGACAGAGACGATCCGCCCTCCAAGAAAAAGAAGGAACTCAGAGAGAGGAAGAAGCGACGGGACAAGGACGGGTCGGCCTCCTCCTCGCCCGACAAGAAGCGAGGCAAGGACAAAGACAAGAAAGTTTATAATAAAGAAAGGATGGACGTCGACGACCAGCAGTACGCGCCCGAGACGGTCTCCTTCAAGGAATTGAAGAACTTCAACAAGAAGCACTACATGCGGAGAAATAAGGAAAAGTCGGAGAGTCCAGAGAGATTAGACAAAGAGGAAAAGAAAACGTCAGAAGAGGCTACAAATCCGCCTGAGATTATCCCTGAGACTAAAGATGTAGATCTTCGTGTTTTGCATCCCGTTATTCCAGAAACTGCCAAGGCGATCGCCCAGAAAAGACCGTCCACTGAGATGCTCGAGGGCAAACCTAAGAAAAATAAACTCGATAAATTCGATATCCTTTTTGGAAACGAGGACGTAGATCTTCGTCAGTTGCCTCAAGTTGAAGAGGCGAACGCACCACCGCCACCGTCTATATCTGAACCAAATTCTGTATCTAAGGAGGAAGTAGATCCAGAAGAATCTGACGATGTAATTGCGTCGCCCGTACGTTCTCCCAAAAAAGATTGGCAAGAAGTCAAAGAGAAGGAAGAAACTAAAAAGACACCTTCGAAGTTGGATCTCGTGAGAGCAAAGTTGGCGGAAGCTACAAAAGTTAAAGATGGCTTAGGACGTCCGTTACTGTTCAGCATAGAAAGAGAAAGACGTCGAACATTCAGCTCTGATGAAACTGAAGTTAAAACAGATAGCAACGAAGAATTTGATGCCGATGATCACAAGAAAACTATCTCTATTATCAT
GAATCAGGCCAAGGAACAGTTCAGTGACGGTCAGCTGGATAAAAATCAGTACAACACATTGATGTACCAAGTGTTGCAGCTCAATGAGAAGCTCAAATTAAAGGAAGCTAAGCAAAGGGAATCCCTAGAAATATCAAAACGAAAATTGAAAGCCCACGTCTCAGAAGATAAAAAGGTCCCTTCACCGAAATCTTCCCCGTCGGAGAGGAATAAATTCGGGGACATCGATGAACGAGTTCCCGTCGGTGGATTTTCTGACTCAGAAAACAACAATCAAGACTCTGATATGCGTCAAAATCATCACGACAATATAGATGGGAAGAGTGGTTTTCCAATGCCGCCTATGATGCCGCCGATGTACATGGGGCCTTTCCCGATGTGGAGAGGTCAACCGCGGCCTAGAATGGAAGAGTTCGGTCCTCGAAGGTTTAGAGGTCCATATTTTAGAGGAAAGTTTGATAAAAGAGGCCCAAGACCACCTTTCGATATGAGAATGCCTCTACTTCCCACTCCTAAACTCGGTATGTGCCAAGGCGAATGTCCTCTAAAGCCTTATGAGCGGTCAATTTCCCCTCCACCACTCGGCGCTCCCGGGTATACATTGCCTCCCACTGATTACAAAATATTGGAATATATCGATCAGGATCCGGTCAAAACTATCCAAATAGACGGTATTCCCAGAGAAATAAGATTCTACGGCGAGACGGCGATCATCATGCTCGACTGGGACGATCCTAGAGTTATCAAATTCTTACCAGGTTGTCGGAGAGTCACATTCGATAATAAAGATTCAGTGGTATTGACTTTCAACGAGGGTTACAAGAAGGTGGAAATAGACGATCAAGTCTTCGATATCAGGTTCGGTGCGCCGACCAGGGAACTGTTCATAAATGGGAGGTGGTATGAATGTTTCTTCGGGGGTCAGCCATTGGGCGTCATAATAGACGGCAAACCGCGATTAGTGCATCTAGAGGGGCCCCTGCCACAGGTGGATATAGGGAAGACGAAACGAACAGACTTAGTAGCGGGTAAAATAAATGTTATAATAAACGCGACAAACATTTGTCCGGTTTACTTGGATGCTAAAGTTCAGAAGTTCCACGTAAATGGACATTTCTTTACGATACGTTTTGTCGATTCCTTGAAGACCGTTCTAATTAATGAGCAGCCGTTCAAAGTGGAGTATGGGGATCTTCCGAAGCCGATATTCGTAAATGGGGAGAAATACTTTGTACGATTTTCTGCACTGCCCAAGAACATAAAACCGGGCCAAGTAGAGATCGCTGACATGGAAGGCTGTAAACCATCGACGGAATCCGAGAAGCTGCCCACAGTGCCGGAGAACGAAGACGTTCCGATGGAAACTGATTCTGAACAACCGCTCGAAGCTCCAGTGAAATCCCCAAGTCCTGAAGGTGAAATGAAAGGTTTGGATATGCTAGCAAATTTCATGCCGAGTGATATGGCCCCGGCTTCCAGCTCAGAATACAGTTCTGCTGAGCCGCTGTTCACAAAACCGGAAGTGATCCCAGGATTGGAAACTCCCGCTGAAGAGAAGCCAGCCAGTTCCCTGCCTCTTTTGGGCGGTATCAATGTCAATGACCTGTACGCCAAATTAGTTGCGACAGGTATAGTGCCAATGTTGAACGAAGTGAAACCAGAGAAGAAAACTGAAGTTCCTGAGATCGAGGAGACGAAAATGAAACCCAAAGATGATAAGAACGTCATCCACAAAGTCGATATACTGAAACCGGAAACACTTCGGATTAAGCAGCGCGGTCTGGTGCTGAAACTGTACAGCGGTATGCAGTGCAGCGGTTGCGGGGCGCGGTTCCCTCCCGAGCACACGGTCCGGTACTCGCAGCATCTCGACTGGCACTTCAGACAGAACAGGCGGGAGAGGGACTCTGCGCGGCGGGCTCACTCGCGACACTGGCACTACGATCTGTCTGACTGGCTGCTGTACGAGGAGCTAGAAGACCTAGAAGAACGAGAGAAGA
GTTGGTTTGAGACGGGAGGTTCTGAAGAAACACCAGCTCAGGTAGAAGCTGTTGTAGAAGAAAGTCCAAGCACCGCGGCTGGTGGAGCTCCGCAACATAACTGTGCCTTGTGTGGAGACAGATTCCATCAGTTCTACAATGAGGACCAAGAGGAGTGGCATCTCAGGAATGCTGTCAAACATCAGGACAGTTACTATCATCCACTGTGTCTGCAGGATTATAAGGCTTCTCTAACAAAGGAAGAGCCACAGGCTGAGGAAGCGGCGGTGGATGTAGACGAGGAACCTCCGGCTGCGATAGAGATAGGAGACACCGCTGAGCTGTCGGACACTGAGTCTGTGGTTGAAGTGTTGGAGACTGAACCCTTGGAACCTGTTGAGATTGAAGCTGATGACGGTGATGATGATGTGGTCCTGAACGCTGAGCCCGTGGAACAGCTGGAGGTGGACGACGGAGACACAGACGATGAAACTACTGAGACGAGGAGGCAGAGAGATCACCTCGCACAAGTTGATTTTGCTAACATAAAGATCAAACAGGAACCTATCGATCCAGACGATGAACCAATTATAACAGCAGAAGTAGAAAGCATTCCGCCGACAATCGACACAACACATACGACTGTTACATCATCCATAGACGGGAACGTTCAGCTCGACGACGCCACGCTGACTCCAGCTCTACCCATCGGTGGCATCAGAATCAACATATCCAAAACCATAACCAGCTTTGCTACCAATCAAGATAGTCCGGATAAGTCCCTCGAAGACATCAGCACCGAGGACGAGCCCTTGCCTCCCGGAGAGGAACCAGAAATGGAGTACGAGTTGAAGCCTTCAATGAAAGACGTGAAGTTCAGCAGACAACCTCCCGTCCAAAAAGGAAGTGAATTGTCGGGATTGTGTTCTATAATGTGA

Protein sequence:

>DPOGS207405-PA
MAKEIADEYASSLADLTVNSKPLINMLTILAEENIEHAGVIVETVEKHLEKVPPDIKLPVLYLVDSIIKNVGGAYTQKFSQIIVNMFTKTFKQLPEESIMRECSQVDEKIRSQMFKLRETWHEVFPSTKLYQLDVKVNLIDPAWPIQAQPHQSNIHPTPTTSAAALSEEEEKMRAILAKKEQELLMLQRKKVEMELEQTRRQLQMAEKVHKKPPLVHPPGLPAPQAEAPPVPLAPTSLPPQAPISSPVLSIKQRLGPPVNKSAGRIAPVTGPLSGARRDPRLARRQPPPQRQPPATAHTTPTTAATTIAPNVFDIKPLHRPAKRSAVITIEVATDSRRKDRKETRTKIENGNGDSVHTYIDRLRIPDPKKINKLPPIPKIRRDRDDPPSKKKKELRERKKRRDKDGSASSSPDKKRGKDKDKKVYNKERMDVDDQQYAPETVSFKELKNFNKKHYMRRNKEKSESPERLDKEEKKTSEEATNPPEIIPETKDVDLRVLHPVIPETAKAIAQKRPSTEMLEGKPKKNKLDKFDILFGNEDVDLRQLPQVEEANAPPPPSISEPNSVSKEEVDPEESDDVIASPVRSPKKDWQEVKEKEETKKTPSKLDLVRAKLAEATKVKDGLGRPLLFSIERERRRTFSSDETEVKTDSNEEFDADDHKKTISIIMNQAKEQFSDGQLDKNQYNTLMYQVLQLNEKLKLKEAKQRESLEISKRKLKAHVSEDKKVPSPKSSPSERNKFGDIDERVPVGGFSDSENNNQDSDMRQNHHDNIDGKSGFPMPPMMPPMYMGPFPMWRGQPRPRMEEFGPRRFRGPYFRGKFDKRGPRPPFDMRMPLLPTPKLGMCQGECPLKPYERSISPPPLGAPGYTLPPTDYKILEYIDQDPVKTIQIDGIPREIRFYGETAIIMLDWDDPRVIKFLPGCRRVTFDNKDSVVLTFNEGYKKVEIDDQVFDIRFGAPTRELFINGRWYECFFGGQPLGVIIDGKPRLVHLEGPLPQVDIGKTKRTDLVAGKINVIINATNICPVYLDAKVQKFHVNGHFFTIRFVDSLKTVLINEQPFKVEYGDLPKPIFVNGEKYFVRFSALPKNIKPGQVEIADMEGCKPSTESEKLPTVPENEDVPMETDSEQPLEAPVKSPSPEGEMKGLDMLANFMPSDMAPASSSEYSSAEPLFTKPEVIPGLETPAEEKPASSLPLLGGINVNDLYAKLVATGIVPMLNEVKPEKKTEVPEIEETKMKPKDDKNVIHKVDILKPETLRIKQRGLVLKLYSGMQCSGCGARFPPEHTVRYSQHLDWHFRQNRRERDSARRAHSRHWHYDLSDWLLYEELEDLEEREKSWFETGGSEETPAQVEAVVEESPSTAAGGAPQHNCALCGDRFHQFYNEDQEEWHLRNAVKHQDSYYHPLCLQDYKASLTKEEPQAEEAAVDVDEEPPAAIEIGDTAELSDTESVVEVLETEPLEPVEIEADDGDDDVVLNAEPVEQLEVDDGDTDDETTETRRQRDHLAQVDFANIKIKQEPIDPDDEPIITAEVESIPPTIDTTHTTVTSSIDGNVQLDDATLTPALPIGGIRINISKTITSFATNQDSPDKSLEDISTEDEPLPPGEEPEMEYELKPSMKDVKFSRQPPVQKGSELSGLCSIM-
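
The nucleotide record above is wrapped over several lines, and the protein record marks the stop codon with "-". A minimal sketch of how the two records could be compared is given below. It assumes the standard genetic code applies to this transcript, and the helper names (read_fasta, translate) are hypothetical, not part of any database tool. The short example at the end uses only the first eight codons of DPOGS207405-TA.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# ASSUMPTION: the standard (nuclear) code is the right one for this gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(text):
    """Parse FASTA text into {name: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {n: "".join(parts) for n, parts in records.items()}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stops are written as stop_symbol
    to match the convention used in the protein record above."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First eight codons of the transcript, wrapped over two lines on purpose.
example = ">DPOGS207405-TA\nATGGCGAAGGAA\nATTGCGGACGAG\n"
seqs = read_fasta(example)
print(translate(seqs["DPOGS207405-TA"]))
```

Run on the full records, translate(transcript) could be compared directly with the DPOGS207405-PA sequence, including its trailing "-".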