Monarch geneset OGS2.0

DPOGS204026
TranscriptDPOGS204026-TA3570 bp
ProteinDPOGS204026-PA1189 aa
Genomic positionDPSCF300138 - 54637-68117
RNAseq coverage430x (Rank: top 28%)
Annotation
HeliconiusHMEL0049520.062.81% 
BombyxBGIBMGA004879-TA0.051.93% 
Drosophilalds-PA0.040.24% 
EBI UniRef50UniRef50_P347391e-17840.24%Transcription termination factor 2 n=14 Tax=Drosophila RepID=TTF2_DROME
NCBI RefSeqXP_002056599.10.040.32%GJ10137 [Drosophila virilis]
NCBI nr blastpgi|1953959550.040.32%GJ10137 [Drosophila virilis]
NCBI nr blastxgi|1953959550.038.61%GJ10137 [Drosophila virilis]
Group
Gene OntologyGO:00036772.7e-69DNA binding
GO:00055242.7e-69ATP binding
GO:00043864e-12helicase activity
GO:00036764e-12nucleic acid binding
KEGG pathway 
InterPro domain[637-963] IPR0003302.7e-69SNF2-related
[630-842] IPR0140011.9e-29DEAD-like helicase
[1042-1126] IPR0016504e-12Helicase, C-terminal
Orthology groupMCL11474 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204026-TA
ATGGAGAATTCATATGTTGAATACCGTGATACAACTGGTGTCGAAAGTGATTCGGATACTGAAATTATTGACAATAGTTATACAGACGAATCTTTCGCTAAATTAAAAACCCCTGGAAAAAAGAACCATACAGCATTTGTGCCTGAATCGGATGACACAGTATCAGAAGACGATGATGTTTCAAAAATCAATCTATCTAAGTCAGTGCAAAGCAGAAGCAGTGTTAATCGTCATGTTGTTCTAAGTAGTGACGATGAAAAAAGTAAAAGGGGCAGTATACAAAAACAGTCTGATGCTTCAAATAACATTGTTCTAAGTTCTTCAGAAGATGAGGCTGAAGATTCACCTGAAATAAAACCGAGACGGCGTCCCAAAACCTCTCCTCGGCTGTCAGAGTCGTTTATTGGACGTAAAACAAAAAAACGTATATTTCTCATAGACTCGGACACTGAAAACTCTATAATAGTTGAACATGACAATAACCGGAAACTGAAATGTGTTAAGGACACTCCGGCTAAGATCGATAAGAACGATTTACGAAAAAGTATACTGGATATAAAGAATATGAGTATACAAAGTAATGATAGTGTACGGACTGATGATGGCGACGGAACTGATGATGAACAGATAAGTGGTGATGAAAAGACTGAGGGAAGAGATGGAACTGATCAGGACGAGGGAAGCCACAGTGATAGTGTTCACAGCAATGATGATCAGAACATTGAACATAGTAGACAGAGAGTTAGTGACAGCGGACACTCTGATGACAACACTGATCTTCAGGACACTGAAACGAGACGTGATATATCATCAAGAGGTGATGATTTAGATGAAAGTGATGGAAATGACAAAAACGAAGGTTACTCCAGTGACGACGGTCCTGATGAAGATCAGCTAGTGATGTCCAGAGCTACGAGGATGAGTATAATGGGTATACTGCCGAAGGAGAACGATAGTGATGACTCAGATTACCTGCAGTCTGATGATACCAACCAGACATCACGAGGTAGTTCTTTGGACCTCCCGACTGACCCACCAGCCGGCAATGAAACACCCAAGAAAGATGGAGACACATCAAGATTGACGTGCTCACCTTTTCAGAGTCCACTCCATGATATCACGAATGAAGTCAACTCACCAAATAATTCCAAGAACACTCCGGATATCTGTGATCTGACCAGATCTGAGCCTTGTGATCTACGGAACAGGGTTTTAGATAAGTTGAACAGTACCCAGACCAAATATGTGGAGAAGGTTATTGACGATGACGTCACCATCATAGATGCGAAACCAGAGGTCATAGCCCTCAGCAGCGATGAAGATGAGGTGAGAGATGAGAAGAAATCCCCCAATACAAAGGGTAACCTGAAAGCGGAGCCGACATCGGTTAGGAAGGACAACACCATCAAGCAGTACCTCCTGCCGCCCAGTTATCCCAACCAGGTGGTGTACGTCAAGAAGAATGTTCGTGAAAACGAACTCTCCAAGCTCAACGGACTCAAAGAAGACTTGCAGAATATCAGATATCTCCTGGAGAATATGGATATGAACTCGCTACCTGACGGAGGGCTCAAGCTGATAGAACGACTCACGACCCTGGAGGCGGAAGTCAGGAAACAGGGGGACAAAGTGGCCAACATGGTGATAGAGCCAGATGAACCTACTCGCGCGGATGTAGCGAGGGATGGCTTCGACAAAGAGAACAAGGGTCTGTCCTGGGACGACATACAGAAGGCGAGTAATGCGGTCCAGCCCAGGATGTTCGGCAAACAGGCGATGGCCACCCACATGGCGGAACGTAACCTGATCCTGGAGCGTCTCCGCGACCTGTACGAGTCCTTGGCTTCCCGTCCGTCGGAACAGCACCACCACCACCAGCCGGCGCCGCTCGTCACCTCTCTCATGGACCACCAGCTACACGCCCTCGCCTGGCTGCACTGGAGGGAGACGCAGAAACCACGGGGAGGGATACTGGCTGACGACATGGGCCTGGGCAAGACGATCACCATGATAGCTCTGGTAGTGAGCGACAAGGAGAAGAACATCGACCACCAGCCAGACGATGACGATCATGGAGGGAGGTCCAGATTGGCTCGCGGCGGCACGCTGGTGGTGTGTCCGGCGTCGCTGATGCAGCAGTGGGCGGGCGAGGTGGCGAAGCACTGCCGGCCGCACGCCGTGTCCGTGTGTCACCACCACGGAGCCGCCCGCGCCACGCAGCCCCACCGCCTCGCCAGCTACGACCTCGTCATCACCACCTACAACATCCTGCAGAGGGAGAGCGAGAAGGGCGGGGTGTTGACCCGCGTCCGCTGGCGCCGCGTCATCCTGGACGAGGCGCACGTGGTCCGTAACCACAAGTCGTCGACGTCGCTGGGCGTGTGCAGCCTGTCCTCCTGGGCTCGCTGGGCGTTGACCGGGACCCCGCTACATAACAAGGACCTGGACCTGTTCGCCTTGCTGAAGTTCCTCAAATGTACACCCTTCGACGACCTCGCGATGTGGAAAAAGTGGATCGATAACAAATCTCTCGGCGGCCAAGAACGACTGAGCACCATCATGAGGTGCATCATGCTGAGGAGGACCAAGCAGCTGCTGCAGGAGAGGGGCCAACTCACCTGTCTGCCGGAGCGGAGCGCGCACCACGTGGACGTCACGCTGCACAAGGACGAGATGAACGTGTACCAGAAGGTGTTAGTGTTCTCCAAGACCCTGTTCGCTCAGTTCCTCCAGCAGCGCGCCGAGCGTATCGGGGACTCCGCCCCCGGGAAGGACTCCGAGTACCATAAGATGCATAAGAAAATGATCGCTTTACAAGGAGCGAAACCAGTGAAATCTCACGAGATCTTAGTCCTTTTGCTGCGTCTCCGTCAAGTGTGTTGTCACTGTGGCCTGATAGCGGCCATGTTGGATCCAGACGACACGGCGGACGTGGTCGAGGACCAGGGAGGAGCCGACCTCATGGAAGAACTCAACAAACTGTCGCTAGAGGACTCGCGCTCTAAGAGAAAGATATTCAGCTCTTATACTGACTTAGATTTTAAAATTATTGAAAGAAAGAATTGTCGTTTTTCATTTCCAAGTGAGAAGGCGGTGGTGGTGTCTCAGTGGACGTCCGTGCTGCGCCTGGTGGAGCGCGCCCTGACCGCGCTGGGCGTGAGCAGCGTCACGCTCAGCGGCGCCGTGCCCGTCACCGCGCGGGCGGCGCTCGTTAACGCCGTTAATGATGCCAAATCAGATGTCAAGGTGATGTTGTTGTCGCTGTGTGCGGGCGGTGTGGGTCTCAACCTGTGTGGGGCAAACCACCTTCTCCTGCTGGACCCTCACTGGAACCCGCAGCTGGAGGAACAGGCCCAGGACAGGATATACCGAGTGGGACAGACTAAACACGTGCATATATACAGGTTCATGTGCGTGGGTACAGTGGAGCAAACGATCAGACAGCTGCAGGACGTCAAGCTAAAGATGGCGGACAGCGTGCTCACCGGCGCCAGGAACACGAACGCCTCCAAACTCACCATAGAGGACCTCAAGATGCTGTTCAACATGGGACCGCAGAGCGACTCATAG

Protein sequence:

>DPOGS204026-PA
MENSYVEYRDTTGVESDSDTEIIDNSYTDESFAKLKTPGKKNHTAFVPESDDTVSEDDDVSKINLSKSVQSRSSVNRHVVLSSDDEKSKRGSIQKQSDASNNIVLSSSEDEAEDSPEIKPRRRPKTSPRLSESFIGRKTKKRIFLIDSDTENSIIVEHDNNRKLKCVKDTPAKIDKNDLRKSILDIKNMSIQSNDSVRTDDGDGTDDEQISGDEKTEGRDGTDQDEGSHSDSVHSNDDQNIEHSRQRVSDSGHSDDNTDLQDTETRRDISSRGDDLDESDGNDKNEGYSSDDGPDEDQLVMSRATRMSIMGILPKENDSDDSDYLQSDDTNQTSRGSSLDLPTDPPAGNETPKKDGDTSRLTCSPFQSPLHDITNEVNSPNNSKNTPDICDLTRSEPCDLRNRVLDKLNSTQTKYVEKVIDDDVTIIDAKPEVIALSSDEDEVRDEKKSPNTKGNLKAEPTSVRKDNTIKQYLLPPSYPNQVVYVKKNVRENELSKLNGLKEDLQNIRYLLENMDMNSLPDGGLKLIERLTTLEAEVRKQGDKVANMVIEPDEPTRADVARDGFDKENKGLSWDDIQKASNAVQPRMFGKQAMATHMAERNLILERLRDLYESLASRPSEQHHHHQPAPLVTSLMDHQLHALAWLHWRETQKPRGGILADDMGLGKTITMIALVVSDKEKNIDHQPDDDDHGGRSRLARGGTLVVCPASLMQQWAGEVAKHCRPHAVSVCHHHGAARATQPHRLASYDLVITTYNILQRESEKGGVLTRVRWRRVILDEAHVVRNHKSSTSLGVCSLSSWARWALTGTPLHNKDLDLFALLKFLKCTPFDDLAMWKKWIDNKSLGGQERLSTIMRCIMLRRTKQLLQERGQLTCLPERSAHHVDVTLHKDEMNVYQKVLVFSKTLFAQFLQQRAERIGDSAPGKDSEYHKMHKKMIALQGAKPVKSHEILVLLLRLRQVCCHCGLIAAMLDPDDTADVVEDQGGADLMEELNKLSLEDSRSKRKIFSSYTDLDFKIIERKNCRFSFPSEKAVVVSQWTSVLRLVERALTALGVSSVTLSGAVPVTARAALVNAVNDAKSDVKVMLLSLCAGGVGLNLCGANHLLLLDPHWNPQLEEQAQDRIYRVGQTKHVHIYRFMCVGTVEQTIRQLQDVKLKMADSVLTGARNTNASKLTIEDLKMLFNMGPQSDS-