Monarch geneset OGS2.0

DPOGS215771
TranscriptDPOGS215771-TA3111 bp
ProteinDPOGS215771-PA1036 aa
Genomic positionDPSCF300041 + 1631640-1638463
RNAseq coverage1040x (Rank: top 12%)
Annotation
HeliconiusHMEL0141010.071.80% 
BombyxBGIBMGA003539-TA0.086.17% 
Drosophilal(2)35Df-PA0.068.24% 
EBI UniRef50UniRef50_Q9Y1340.068.24%L.2.35Df n=33 Tax=Eukaryota RepID=Q9Y134_DROME
NCBI RefSeqXP_624031.10.071.23%PREDICTED: similar to lethal (2) 35Df CG4152-PA [Apis mellifera]
NCBI nr blastpgi|3287923780.071.23%PREDICTED: superkiller viralicidic activity 2-like 2-like isoform 1 [Apis mellifera]
NCBI nr blastxgi|1565537290.071.50%PREDICTED: superkiller viralicidic activity 2-like 2-like isoform 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00168170hydrolase activity, acting on acid anhydrides
GO:00055242e-62ATP binding
GO:00168182e-62hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
GO:00080269.6e-21ATP-dependent helicase activity
GO:00036769.6e-21nucleic acid binding
GO:00043865.4e-15helicase activity
KEGG pathwayame:5516370.0 
 K12598 (MTR4, SKIV2L2)maps-> RNA degradation
InterPro domain[1-1036] IPR0164380RNA helicase, ATP-dependent, SK12/DOB1
[861-1036] IPR0129612e-62DSH, C-terminal
[128-311] IPR0140011.3e-34DEAD-like helicase
[134-280] IPR0115459.6e-21DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[432-521] IPR0016505.4e-15Helicase, C-terminal
Orthology groupMCL13416 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215771-TA
ATGTCAGATATAAATAGTTTATTTGATTGCTTCGAAGAACCAGCTTTAAACGAAGCTGCCACCCAATTGCCAAATGTTAAAAGTGAAGAAGAAGCGCCTGTAACCAAGACGGAAGATGTTAAAAAAGAAGAAACTATAGAAGCTTCGCCCAGTAATAAGCGTCCACATGAAGAAGTTAAGTATGCAGATACTTCTAAGAAGCCCAGACAAGAAGAGGAAGACGACGCCGATATTATAAGCGACATTAACCTCAACAATCTTGGTGCAAGGATACTTATACACACTCTTGATACACACGAAGGCTGTACACATGAAGTGGCGATACCTCCCAATCAGGAATATGCTCAACTAATGCCAATTACTTCAGAACCAGCTAAGCAATACAGTTTTATCCTGGATCCGTTCCAAAAGGAAGCTATCATGTGCATTGACAACTTACAATCAGTACTTGTATCAGCACACACATCTGCCGGAAAGACTGTTGTTGCAGAATATGCTATAGCTCTGTCACTAAAAAACAAACAAAGGGTTATTTACACAACACCCATCAAGGCTCTCTCTAATCAGAAATACAGGGAATTCTCTGAGGAGTTTCATGATGTGGGTCTGATTACTGGAGATGTTACTATCAATCCATCCGCTTCCTGTTTGATAATGACAACTGAGATTCTAAGAAATATGTTATATAGAGGTTCAGAGATAATGAGGGAAGTTGGTTGGGTTGTGTTCGACGAGATTCATTACATGAGGGACAAGGAAAGAGGTGTTGTTTGGGAAGAAACACTTATCTTGCTACCCGACAATGTTCACTATGTATTTTTATCGGCTACTATACCCAATGCTCGTCAGTTTGCTGAGTGGGTGTGTCGACTTCACTCTCAGCCGTGTCATGTTATATACACTGAATACAGACCCACACCCCTCCAGCATTATATATTCCCTGCTAGCGGAGACGGGATTCATCTTGTTGTGGATGAAAAGGGTCAATTCAAAGAGGACAACTTCAATACAGCTATGACGGTGTTGAGTAACGCGGGCGGGGCGTCGGCGGGGGGTGAGCGCGGCCGGAGGGGGGGACTCAAGGGGGGGAGCAGTAGTATCTTTAATATAGTCAAAATGATCATGGAGAGAAACTTCGCACCGGTGATTATATTCAGTTTCAGTAAGAAAGACTGCGAGCTGTATGCTATGCAGATGGCTAAATTGGATTTTAATACAATTGAAGAGAAAAAACTTGTAGACGAGGTTTTCAACAACGCGATGGACGTTCTATCTGAAGACGATCGTAAGTTACCGCAGGTTGAAAACGTGATACCCTTGTTGAGGAGAGGCATCGGTATACATCACGGAGGACTGCTGCCCATACTGAAAGAAACCATAGAAATATTGTTCGGCCTGGGGCTTATCAAGGCGCTGTTCGCCACCGAGACCTTCGCCATGGGGCTCAACATGCCCGCTAGGACTGTTGTGTTCACAAATTGCCAGAAGTTTGACGGCAAGGACTTCAGATTTATAACTTCCGGTGAATACATCCAGATGTCAGGTAGAGCTGGTCGTCGAGGGTTGGACGATAAAGGTATCGTCATACTGATGATCGATCAGAAGGTTACTCCCAGTGTCGTTAAGTCCATGGTACAGGGCAAAGCTGATCCTATAAATTCCGCCTTCCATCTTACATACAACATGGTCCTGAATTTATTAAGAGTTGAAGAGATAAACCCGGAGTACATGTTGGAGAGGAGTTTTTATCAGTTCCAAAACCAAGCTGTTATCCCAGACCTCATCGACAAGGTGAAAGCTAAGCAAAAGGAATATAGCGCGTTGTCAATAGAGGAGGAGCACTCTATAGCTTCATACTGTAATATAAGGTCACAGTTGGAGCTGCTGGGGTCACAGTTCAGGTCGTTCATCACGAAGCCGGAGTATATCAAGCCGTTCCTCCAGCCCGGTAGACTTGTTAAGGTGAAAACGGAAAAATACGAGTACGATTGGGGCATTATAGTGAACTTTAAACACAAAACCGGCAAAAGTAAGAAAGACGAGAACCCCCTGACCGCGGACACCGTCATAGTGGTGGACGTGTTGCTGCATGTTAAGAAATCAAAAGCCGACGAGGCCGACACGAACGTGCCTTGTCCTCCTGGAGAGACCGGCGACGTAGAGGTGGTGCCGATCCTACACACGTTAATATATCAGATAAGTTCGCTGCGGGTGTACTATCCCAAAGACCTGCGACCGCCCGACAACAGGAAGTCGGTGCTGAAAACTATAGGGGAGGTCAAGAAGCGGTTCCCGGAAGGACCGCCGCTACTGAATCCCATCAAGGACATGAAAATTGAGGACTCTGTGTTCAAGGAATGCGTCGAGAGAATCAAGTTGTTAGAGGAAAGATTATACTCTCACCCCCTCCACAACGACAAGAACCGTGGCGCCCTGACGGCGGCTTACGACGCCAAACAAGAAATATACGAAGAGCTGACGTTAGCCAAGTCCGAGTTGAGGAGGGCGAAGAGCATCTTACAGATGGACGAACTGAAGAAGAGGAAGCGAGTGCTGAGGCGACTCGGGTACTGCACGCTGTCAGACGTCATAGAGCTCAAGGGCAGGATAGCCTGCGAACTCAGCAGTGCGGACGAACTGCTTCTGACCGAGTTGATCTTCAACGGTGTGTTTAACAATCTGTCCGCGGAGCAGAGCGCGGCGCTAGTGAGCTGCTTCGTGTGTGACGAGAACAGCACTCAGACGTCCGCCACGGGCGAGGAGCTGAGAGGCGTCCTGAGACAACTACAGGAATACGCGCGTAGAATAGCGAAAGTATCAATCGACGCGAAGATGGATCTCGACGAGGACGAGTACGTTGGAAAATTCAAATGTACCCTCATGGACGTAGTACTCGCGTGGGCGAAGGGCGCCTCCTTCCTACAGATATGCAAGATGACTGACGTCTTTGAAGGTTCAATAATTCGTTGTATGCGTCGCCTGGAGGAGGTACTCCGGCAGTTGTGTCAGGCCGCCAAGAACATCGGGAACACGGACTTGGAGAATAAGTTCAGCGACGCCATCAAAATGCTGAAGAGAGACATAGTGTTCGCGGCCAGCCTTTACATGTAG

Protein sequence:

>DPOGS215771-PA
MSDINSLFDCFEEPALNEAATQLPNVKSEEEAPVTKTEDVKKEETIEASPSNKRPHEEVKYADTSKKPRQEEEDDADIISDINLNNLGARILIHTLDTHEGCTHEVAIPPNQEYAQLMPITSEPAKQYSFILDPFQKEAIMCIDNLQSVLVSAHTSAGKTVVAEYAIALSLKNKQRVIYTTPIKALSNQKYREFSEEFHDVGLITGDVTINPSASCLIMTTEILRNMLYRGSEIMREVGWVVFDEIHYMRDKERGVVWEETLILLPDNVHYVFLSATIPNARQFAEWVCRLHSQPCHVIYTEYRPTPLQHYIFPASGDGIHLVVDEKGQFKEDNFNTAMTVLSNAGGASAGGERGRRGGLKGGSSSIFNIVKMIMERNFAPVIIFSFSKKDCELYAMQMAKLDFNTIEEKKLVDEVFNNAMDVLSEDDRKLPQVENVIPLLRRGIGIHHGGLLPILKETIEILFGLGLIKALFATETFAMGLNMPARTVVFTNCQKFDGKDFRFITSGEYIQMSGRAGRRGLDDKGIVILMIDQKVTPSVVKSMVQGKADPINSAFHLTYNMVLNLLRVEEINPEYMLERSFYQFQNQAVIPDLIDKVKAKQKEYSALSIEEEHSIASYCNIRSQLELLGSQFRSFITKPEYIKPFLQPGRLVKVKTEKYEYDWGIIVNFKHKTGKSKKDENPLTADTVIVVDVLLHVKKSKADEADTNVPCPPGETGDVEVVPILHTLIYQISSLRVYYPKDLRPPDNRKSVLKTIGEVKKRFPEGPPLLNPIKDMKIEDSVFKECVERIKLLEERLYSHPLHNDKNRGALTAAYDAKQEIYEELTLAKSELRRAKSILQMDELKKRKRVLRRLGYCTLSDVIELKGRIACELSSADELLLTELIFNGVFNNLSAEQSAALVSCFVCDENSTQTSATGEELRGVLRQLQEYARRIAKVSIDAKMDLDEDEYVGKFKCTLMDVVLAWAKGASFLQICKMTDVFEGSIIRCMRRLEEVLRQLCQAAKNIGNTDLENKFSDAIKMLKRDIVFAASLYM-